The present invention relates to acoustic equalization and in particular to filters used for acoustic equalization.
Loudspeaker-room acoustic equalization is a challenging problem to solve with realizable digital equalization filters, especially at lower frequencies (for example, less than 300 Hz). A typical room is an acoustic enclosure which may be modeled as a linear system. When a loudspeaker is placed in the room, the resulting response is the convolution of the room linear response and the loudspeaker response and may be denoted as h(n); nε{0, 1, 2, . . . }. This loudspeaker-room impulse response has an associated frequency response, H(ejω) (i.e., H(z)), which is a function of frequency. Generally, H(ejω) is also referred to as the Loudspeaker-Room Transfer Function (LRTF). In the frequency domain, the LRTF shows significant spectral peaks and dips in the human range of hearing (for example, 20 Hz to 20 kHz), in the magnitude response, causing audible sound degradation at a listener position.
An equalization filter may be applied to correct such response variations in the frequency domain (i.e., minimize the deviations in the magnitude response to obtain a flat response) and ideally also minimize the energy of the reflections in the time domain. Known approaches include using psychoacoustic warping where the equalization filter is designed on a warped frequency axis (i.e., the perceptual Bark scale) of the room response function with a lower order model (for example, linear predictive coding). Other similar approaches using low-order spectral modeling and warping are described in:
The present invention addresses the above and other needs by providing a combined multirate-based Finite Impulse Response (FIR) filter equalization technique combining a low-order FIR equalization filter operating at a lower rate for equalization of a loudspeaker-room response at low frequencies, and a complementary low-order minimum-phase FIR equalization filter operating at a higher rate for equalization of the loudspeaker-room response at higher frequencies. The design of two complementary band filters for separately performing low and high frequency equalization keeps the system delay at a minimum while maintaining excellent equalization performance. The two equalization filters are separately applied to two parallel equalization paths with splicing of outputs of the two equalization paths. Level adjustment of one equalization path relative to the other is performed before splicing for maintaining a flat magnitude response in the transition region of the two complementary filters. The present invention achieves excellent equalization at low filter orders and hence reduced computational complexity and signal processing requirements.
In accordance with one aspect of the invention, there is provided a method for equalizing audio signals. The method includes parallel processing of an input signal through a low frequency equalization path and a high frequency equalization path. The low frequency equalization path includes steps of: low pass filtering the input signal to obtain a low pass filtered signal; sub sampling the low pass filtered signal to obtain a sub-sampled signal; equalizing the sub-sampled signal with a low frequency equalization filter to obtain an equalized low frequency sub-sampled signal; up sampling the equalized low frequency sub-sampled signal to obtain an up-sampled low frequency equalized signal; and low pass filtering the up-sampled low frequency equalized signal to obtain a low frequency equalized signal. The high frequency equalization path includes steps of: high pass filtering the input signal to obtain a high pass filtered signal and equalizing the high pass filtered signal to obtain a high frequency equalized signal. The low frequency equalized signal and the high frequency equalized signal are summed to obtain an equalized signal. The high frequency equalized signal may further be leveled if desired to maintain a flat magnitude response in the transition region of the two equalization paths.
In accordance with another aspect of the invention, there is provided a method for computing low frequency and high frequency equalization filters. Computing the low frequency equalization filter includes steps of: low pass filtering H(z) the z transform of the room response h(n); sub sampling the filtered H(z); computing F(z) from the sub sampled filtered H(z); up sampling F(z) to obtain F′(z); computing C(z) as the product of F′(z) and H(z); computing a magnitude response of C(z); and computing a mean level L1 of the magnitude response. Computing the high frequency equalization filter includes steps of: high pass filtering H(z); computing an initial G(z); computing D(z) as the product of the initial G(z) and H(z); constraining FFT bins below Fs/2M to 0 dB; computing the magnitude response of D(z); computing a mean level L2 of the magnitude response at step 110; and applying a level adjustment of 10((L1-L2)/20) to the initial G(z) to obtain G(z).
The above and other aspects, features and advantages of the present invention will be more apparent from the following more particular description thereof, presented in conjunction with the following drawings wherein:
Corresponding reference characters indicate corresponding components throughout the several views of the drawings.
The following description is of the best mode presently contemplated for carrying out the invention. This description is not to be taken in a limiting sense, but is made merely for the purpose of describing one or more preferred embodiments of the invention. The scope of the invention should be determined with reference to the claims.
The present invention comprises the formation of an equalization (or inverse) filter, heq(n), which compensates for the effects of the loudspeaker and room which cause sound quality degradation at a listener position. In other words, the goal is to satisfy heq(n){circle around (x)}h(n)=δ(n), where {circle around (x)} denotes the convolution operator and δ(n) is the Kronecker delta function.
In practice, an ideal delta function is not achievable with low filter orders as room responses are non-minimum phase. Furthermore, from a psychoacoustic standpoint, a target curve, such as a low-pass filter having a reasonably high cutoff frequency is generally applied to the equalization filter (and hence the equalized response) to prevent the played back audio from sounding exceedingly “bright”. An example of a low-pass cutoff frequency is the frequency where the loudspeaker begins its high-frequency roll-off in the magnitude response. Additionally, the target curve may also be customized according to the size and/or the reverberation time of the room. Additionally, a high pass filter may be applied to the equalized response, depending on the loudspeaker size and characteristics (for example, a satellite channel loudspeaker), in order to minimize distortions at low frequencies. Examples of environments where multiple listener room response equalization is used are in home theater (for example, a multi-channel 5.1 system), automobile, movie theaters, etc.
In audio playback applications, where a general goal is to enhance the quality of speech/audio reproduction, a typical setup process includes measuring the loudspeaker room impulse response at least one measurement position (generally an expected listener position), and designing the equalization filter heq(n) based on the measurements. The equalization filter heq(n) is designed to compensate for spectral deviations in the magnitude domain and/or to minimize the energy of reflections in the time domain. The equalization filter heq(n) is generated based on a model which fits the measured response for real-time applications.
A generalized diagram of a playback chain comprising an unfiltered signal 14, an equalization filter 16, an equalized signal 18 produced by the equalizer 16, a loud speaker 20 receiving the equalized signal 18, sound waves 22 generated by the speaker 20, and a listener 24 hearing the sound waves 22 are shown in
Unfortunately, it is difficult to achieve effective low-frequency equalization below 300 Hz with low-order and realizable Finite Impulse Response (FIR) equalization filters. The present invention comprises a combined multirate-based and FIR-based filtering technique shown in
The input signal 30 is processed in a low frequency path A by a low pass filter Hlp(z) 32 to generate a low pass filtered signal 34, and the low pass filtered signal 34 is down sampled by M (typically 24) in down sampler 36 to generate a sub-sampled signal 38. The sub-sampled signal 38 is processed by the low frequency equalization filter F(z) 40 to generate an equalized low frequency sub-sampled signal 42. The equalized low frequency sub-sampled signal 42 is up sampled by M (typically 24) in up-sampler 44 to generate an up-sampled low frequency equalized signal 46. The up-sampler 44 is preferably an interpolation. The up-sampled low frequency equalized signal 46 is filtered by a second low pass filter 48 to generate a filtered low frequency equalized signal 50.
The input signal 30 is processed in parallel in a high frequency path B by a high pass filter Hhp(z) 52 to generate a high pass filtered signal 54. The high pass filtered signal 54 is processed by the high frequency equalization filter G(z) 56 to generate a high frequency equalized signal 57. The high frequency equalized signal 57 may be leveled by level 58 to generate a leveled high frequency signal 59. The filtered low frequency equalized signal 50 and the leveled signal 59 are summed by the summer 60 to generate an equalized signal 62.
The level 58 is preferably a leveling of 10((L1−L2)/20) described in
A method for equalizing an audio signal according to the present invention is described in
The low rate equalization filter F(z) 40 is preferably designed using linear predictive coefficients, for example, using the Linear Predictive Coding (LPC) method, where the room response is sub-sampled before the LPC method is applied. The filter F(z) 40 is thus obtained as the inverse of an estimate of the loudspeaker room transfer function at low frequencies, Ĥ1, where the coefficients fm of the LPC are selected as the coefficients of the low rate equalization filter F(z) 40. Specifically, since the LPC polynomial is minimum-phase, the low frequency equalization filter F(z) 40 may be expressed as:
where fm is the mth FIR filter coefficient of F(z) 40 and the length of the filter F(z) 40, N1, was set as N1=2 fs/fs′=48.
Similarly, the high frequency equalization filter G(z) 56, is preferably designed using the LPC method where the room response is high-pass filtered by the high pass filter Hhp(z) 52 before applying an LPC fit to the room response. Specifically:
where Ĥ2 is an estimate of the loudspeaker room transfer function above the low frequencies, gm is the mth FIR filter coefficient of G(z) 56, and the length N2=48 is selected so as to offer a good fit to the room response at the lowest bin frequency fc=1 kHz and keep computational requirements for real-time filtering low. The length of N2=48 was based on the following relation:
A method for computing the low frequency equalization filter F(z) 40 and the high frequency equalization filter G(z) 56 is shown in
To compute G(z): H(z) is high pass filtered, preferably using the high pass filter Hhp(Z) 52 described above, to obtain H3(z) (for example, multiply H(z) times the z domain representation of the high pass filter) at step 100; an initial G(z) is computed based on H3(z) at step 102; a second complex response D(z) is computed as the product of the initial G(z) and H(z) at step 104; the FFT bins of D(z) below fs/2M are constrained to 0 dB, where M is the sub-sampling (or decimation) rate, at step 106; the magnitude ID(Z)I of D(z) is computed at step 108; ID(Z)I is smoothed to obtain ID(Z)I′ at step 109, and a mean level L2 of ID(Z)I′ computed at step 110. A level adjustment of 10((L1-L2)/20) is applied to the initial G(z) to obtain G(z) at step 114.
The smoothing in steps 97 and 109 may be, for example, ⅓ octave resolution or 1/12 octave resolution for both low and high frequency paths, Equivalent Rectangular Bandwidth (ERB) smoothing, critical-band rate scale. The low-frequency octave band for performing level matching is preferably [400, 800] Hz, whereas the high-frequency octave band is preferably [3, 6] kHz.
To better understand the processing in
While the methods of the present invention contemplate the use of an LPC model, any method for obtaining a low frequency equalization filter and a high frequency equalization filter, which method includes first low pass filtering and sub-sampling the LRTF steps, and processing the result to obtain the low frequency equalization filter, and a first high pass filtering the LRTF step, and processing the result to obtain the high frequency equalization filter, is intended to come within the scope of the present invention.
The room responses were obtained in a reverberant room having a Schroeder reverberation time T60 (computed using the backward integration method) of approximately 0.5 seconds. The responses were measured roughly on-axis at a distance of about six meters from the loudspeaker. An unequalized loudspeaker and room response 120 for a first loudspeaker is shown in
No target curves, such as ones used for limiting the loudspeakers from being overdriven, are shown as the goal was to demonstrate the improvements obtained with this technique. As is clearly evident a substantial equalization is achieved with short FIR filter lengths in both bands. Listening tests after applying specific speaker dependent target curves revealed dramatic and audible improvement in playback audio quality (speech as well as music).
The present invention has described a dual-rate based equalization technique where a low-order FIR filter operates at a lower rate for equalization of a loudspeaker-room response at low frequencies, and a low-order minimum-phase FIR filter operates at a higher rate for higher frequency equalization. Due to the design of two complementary band filters for separately performing low and high frequency equalization, the system delay is kept at a minimum while maintaining excellent equalization performance as demonstrated in the paper. The splicing between the two equalization filters, operating at different rates, for maintaining a flat magnitude response in the transition region of the two complementary filters is done automatically through level adjustment of one equalization filter relative to the other. The present invention may be expanded to include this technique for multi-position (that is, multi-listener) equalization.
While the invention herein disclosed has been described by means of specific embodiments and applications thereof, numerous modifications and variations could be made thereto by those skilled in the art without departing from the scope of the invention set forth in the claims.
Number | Name | Date | Kind |
---|---|---|---|
4888808 | Ishikawa et al. | Dec 1989 | A |
5572443 | Emoto et al. | Nov 1996 | A |
5805715 | Rhee | Sep 1998 | A |
20040264718 | Arai | Dec 2004 | A1 |
20050244012 | Asada | Nov 2005 | A1 |
Number | Date | Country | |
---|---|---|---|
20080279318 A1 | Nov 2008 | US |