An embodiment of the invention is generally related to audio signal processing techniques that are intended to enhance the quality of an audio signal in terms of how it sounds during playback. Other embodiments are also described.
The search for audio signal processing techniques and sound systems that can reproduce sound with higher quality continues in the modern age. An original, high quality audio signal often becomes degraded due to manipulations applied to it for purposes of storage and transmission. The problem of poor quality sound during playback is especially acute when the original audio signal has undergone lossy compression to reduce its bit rate, for purposes of either reduced storage requirements or to meet reduced transmission bandwidth over the Internet.
The quality of an encoded and then decoded (codec processed) audio signal may be improved by digital signal processing of the codec-processed signal, seeking to harmonically enhance and frequency equalize the signal. A digital filter can be designed to reshape the phase and frequency content of the codec processed audio signal that is passed through it, in hopes of recovering the lost realism (as experienced during its playback.) In another approach, the use of an all-pass filter has been suggested, which is a signal-processing block that passes all frequencies equally in terms of gain or magnitude, but changes the phase relationship between various frequencies. In an all-pass filter, the phase shift between the output and the input varies as a function of frequency. An all-pass filter may be described by the frequency at which its phase shift crosses 90° or when the input and output signals are described as going into quadrature or when there is a quarter wavelength of delay between the output and the input. All-pass filters are often used to compensate for undesired phase shifts that have arisen in an audio system. An all-pass filter may be implemented in a myriad of ways, as a digital infinite impulse response (IIR) filter whose difference equation has the well-known general form
The efficacy of any codec-processed audio signal enhancement technique may be judged by comparing the spectral content of the enhanced audio signal to the original audio signal, or it may be judged in view of the improvement in how the enhanced audio signal sounds during playback.
An embodiment of the invention is a digital audio signal enhancement technique, for processing an input audio signal so as to produce an output audio signal that may have improved quality in how it sounds during playback, in terms of a fuller or richer sound, or manifestly as a more realistic sound. The output audio signal may then be encoded using a lossy compression algorithm (for bit rate reduction), and then subsequently decoded in preparation for playback. The enhancement technique in that case is a pre-processing operation, which enables the subsequent, codec processed version of the audio signal to maintain a more complete audio spectrum (as compared to the situation where the codec processed signal had been produced without the enhancement technique having been applied as a pre-processing operation.) In particular, the pre-processing may advantageously prevent an upper frequency content of the codec processed audio signal from being suppressed.
In another embodiment of the invention, an input audio signal is split into its low and high frequency portions, before being fed to respective, enhancement processing blocks, in parallel. The outputs of these two enhancement processing blocks may then be combined, by a summing unit.
The enhancement processing may be duplicated for each of the typical left and right channels of a stereo audio signal, where each channel is enhanced separately without contributions from the other channel.
The above summary does not include an exhaustive list of all aspects of the present invention. It is contemplated that the invention includes all systems and methods that can be practiced from all suitable combinations of the various aspects summarized above, as well as those disclosed in the Detailed Description below and particularly pointed out in the claims filed with the application. Such combinations have particular advantages not specifically recited in the above summary.
The embodiments of the invention are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” embodiment of the invention in this disclosure are not necessarily to the same embodiment, and they mean at least one. Also, in the interest of conciseness and reducing the total number of figures, a given figure may be used to illustrate the features of more than one embodiment of the invention, and not all elements in the figure may be required for a given embodiment.
Several embodiments of the invention with reference to the appended drawings are now explained. Whenever the shapes, relative positions and other aspects of the parts described in the embodiments are not explicitly defined, the scope of the invention is not limited only to the parts shown, which are meant merely for the purpose of illustration. Also, while numerous details are set forth, it is understood that some embodiments of the invention may be practiced without these details. In other instances, well-known circuits, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
The input audio signal may be produced, as a digital audio signal, by an audio source 7, such as a digital media player program that is stored in memory and is being executed by a processor; the processor and memory may be part of a server, or they may be part of a consumer electronics end user device such as a smartphone, laptop computer, or in-vehicle infotainment system. A link between the audio source 7 and the all-pass block 1 may include a digital communications path through the Internet or through a cellular telephone network, for example. In another embodiment, the connection between the audio source 7 and the all-pass block 1 may lie entirely within a server, for example as part of a media server that is producing the output audio signal which is being streamed over the Internet; in that case, the output audio signal may be encoded by an audio encoder 8 for purposes of bit rate reduction, e.g., the audio encoder 8 may implement a lossy compression algorithm. Subsequent to the encoding, there is a corresponding decoding operation performed by an audio decoder 10 that undoes the encoding performed by the audio encoder 8. The decoder 10 may be part of a consumer electronics end user device (a client or playback device) such as a smartphone, laptop computer, or in-vehicle infotainment system. The “channel” between the output of the audio encoder 8 and the input of the audio decoder 10 may include a path through the Internet or other digital communications network, including for example a cellular telephone network. It may also, or instead, be a storage device such as a cloud-based mass storage. In another embodiment, the output audio signal may be converted by a sound system 9 into sound, e.g., as part of a consumer electronics end user (client or playback) device, where in that case the all-pass block 1 and the summing unit 5 could be implemented within the same consumer electronics device of which the sound system 9 is a part, e.g., a smartphone, laptop computer, or in-vehicle infotainment system.
Still referring to
The detected envelope (by the modulation generator 6) is used to alter the all-pass filter 2 “dynamically” or in real-time. The all-pass filter 2 is thus a time-varying digital filter that is being updated by the modulation generator 6, for example at a rate of every sample, or a little slower, e.g., up to every ten (10) samples, of the input audio signal. An example of the all-pass filter 2 is depicted by a signal flow diagram in
In one embodiment, those two time-varying elements, namely the delay element 12 and the gain element 15, are the only time-varying elements of the all-pass filter 2 that are being updated dynamically or in real-time in accordance with the detected envelope; the scalar gain g1 of the feedforward gain element 13 and the second delay d2 presented by the feedback delay element 14 may remain fixed or static (in relation to the dynamically variable first delay d1 and scalar gain g2). In other words, the phase response of the all-pass filter 2 is changing due to variations in d1 and g2 which are dynamically controlled by the modulation generator 6 (see
The first delay d1, which may refer to the number of samples by which the filter input x[n] is delayed by the delay element 12, is variable between a minimum delay (lower bound) and a maximum delay (upper bound), in proportion to the detected envelope of the input audio signal x[n]. Said another way, the all-pass filter 2 is modulated in a dynamic manner, by the modulation generator 6 which is detecting the envelope of x[n], so that the first delay d1 becomes longer in proportion to or in response to the detected envelope increasing, shorter in proportion to or in response to the detected envelope decreasing. For example, if the minimum delay is set to “0” and the maximum delay to “10”, and the envelope or level of the input signal is at 50% of the maximum level allowed, then the first delay d1 is set to be “5”. This number representing the first delay d1 varies in real time, according to the input signal x[n]. In one embodiment, the modulation generator 6 is designed to set the first delay d1 to “0” (the minimum delay) when the level of the input audio signal is “lowest”, which may be when the input audio signal is at some minimum threshold level above a noise floor. In addition, in one embodiment, the modulation generator 6 is designed to be independent of the user volume setting (that may be “manually” changed by a user of a playback device to change the loudness of the sound produced by the sound system 9—see
In one embodiment, the feedback gain g2 that is applied to the delayed version of the filter output (at the third input of the summing junction 11) is also variable (between a minimum gain and a maximum gain), in proportion to the detected envelope of the input audio signal. For example, the feedback gain g2 increases responsive to the detected envelope increasing, and decreases responsive to the detected envelope decreasing. In one embodiment, the same detected envelope may trigger both a change in the feedback gain g2 and a change in the first delay d1.
Turning now to
In
Turning now to
As explained above, an embodiment of the invention may be a digital signal processing method that includes operations such as all-pass filtering, scaling, combining (e.g., summing), envelope detection, and time-variation of the all-pass filtering. Such operations may be performed entirely by a programmed processor, programmed in accordance with the structural algorithms or procedures described above. Another embodiment of the invention is a machine-readable medium (such as microelectronic memory) having stored therein instructions that program one or more data processing components (generically referred to here as “a processor”) to perform the structural digital signal processing operations described above. Such instructions may be part of a media server application program, or a media client/player application program. In other embodiments, some of those operations might be performed by specific hardwired circuit components that contain hardwired logic (e.g., dedicated digital filter blocks, state machines). Those operations might alternatively be performed by any combination of programmed data processing components and hardwired circuit components.
While certain embodiments have been described and shown in the accompanying drawings, it is to be understood that such embodiments are merely illustrative of and not restrictive on the broad invention, and that the invention is not limited to the specific constructions and arrangements shown and described, since various other modifications may occur to those of ordinary skill in the art. For example, in
Number | Name | Date | Kind |
---|---|---|---|
4184047 | Langford | Jan 1980 | A |
4594730 | Rosen | Jun 1986 | A |
4873722 | Tominari | Oct 1989 | A |
4908858 | Ohno | Mar 1990 | A |
4947133 | Thomas | Aug 1990 | A |
5027410 | Williamson et al. | Jun 1991 | A |
5230022 | Sakata | Jul 1993 | A |
5668884 | Clair, Jr. et al. | Sep 1997 | A |
5771299 | Melanson | Jun 1998 | A |
5822440 | Oltman et al. | Oct 1998 | A |
6507657 | Kamada et al. | Jan 2003 | B1 |
6865430 | Runton et al. | Mar 2005 | B1 |
8229135 | Sun et al. | Jul 2012 | B2 |
8538037 | Herre et al. | Sep 2013 | B2 |
8705764 | Baritkar et al. | Apr 2014 | B2 |
8934641 | Avendano et al. | Jan 2015 | B2 |
20060008101 | Kates | Jan 2006 | A1 |
20080044038 | Houle | Feb 2008 | A1 |
20150003633 | Trammell | Jan 2015 | A1 |
Entry |
---|
Ambede, A., “Design and Implementation of High-Speed All-Pass Transformation-Based Variable Digital Filters by Breaking the Dependence of Operating Frequency on Filter Order.” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, May 2016, pp. 2008-2012, vol. 24, No. 5. USA. |
Darak, Sumit J., Efficient implementation of reconfigurable warped digital filters with variable low-pass, high-pass, bandpass, and bandstop responses. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2013, pp. 1165-1169, vol. 21, No. 6, USA. |
Intersil Corporation, “An Introduction to Digital Filters—Application Note (AN9603.2).” Jan. 1999, 10 pages. |
Puckette, M., “The Theory and Technique of Electronic Music.” World Scientific Publishing Co. Pte. Ltd., Draft—Dec. 30, 2006, 337 pages. |
Regalia, et al., “The Digital All-Pass Filter: A Versatile Signal Processing Building Block.” Proceedings of the IEEE, Jan. 1988, pp. 19-37, vol. 76, No. 1, USA. |
Number | Date | Country | |
---|---|---|---|
20180270574 A1 | Sep 2018 | US |
Number | Date | Country | |
---|---|---|---|
62473963 | Mar 2017 | US |