The present invention relates to improving the performance of audio equipment and in particular to adapting equalization to a speaker and room combination.
Low frequency room acoustic response modeling and equalization is a challenging problem. Traditionally, Infinite-duration Impulse Response (IIR) or Finite-duration Impulse Response (FIR) filters have been used for acoustic response modeling and equalization. The IIR filter, also called a parametric filter; has a bell-shaped magnitude response and is characterized by its center frequency Fc, the gain G at the center frequency, and a Q factor (which is inversely related to the bandwidth of the filter) and is easily implemented as a cascade for purposes of room response modeling and equalization.
Room response modeling, and hence equalization or correction, has traditionally been approached as an inverse filter problem, where the resulting equalization filter is the inverse of the room response (or the minimumphase part). Such response modeling is especially challenging at low frequencies where standing waves often cause significant variations in the frequency response at a listening position. Typical filter structures for realizable equalization filter design include IIR filters or warped FIR filters.
A typical room is an acoustic enclosure which may be modeled as a linear system. When a loudspeaker is placed in the room, the resulting time domain response is the convolution of the room linear response and the loudspeaker response, and is denoted as a loudspeaker-room impulse response h(n); nε{O, 1, 2, . . . }. The loudspeaker-room impulse response has an associated frequency response, H(ejw), which is a function of frequency. Generally, H(ejw) is also referred to as the Loudspeaker-Room Transfer Function (LRTF). In the frequency domain, the LRTF shows significant spectral peaks and dips in the human range of hearing (i.e., 20 Hz to 20 kHz) in the magnitude response, causing audible sound degradation at a listener position.
Known methods of equalization include modeling the room responses (either via time domain or magnitude domain or jointly) and subsequently inverting the model to obtain an equalization filter. Unfortunately, traditional search based parametric filter design approaches (such as described in “Direct Method with Random Optimization for Parametric IIR Audio Equalization” by Ramos and Lopez, Proc. 116 AES Conv., Berlin May 2004) involve a search strategy which is susceptible to being stuck in a local minima, thereby effectively limiting the amount of correction at low frequencies.
The present invention addresses the above and other needs by providing a frequency domain approach for modeling the low frequency magnitude response for equalization with a cascade of parametric IIR filters. Each of the cascaded parametric IIR filters may be described by filter parameters comprising the center frequency Fc, the gain G, and the bandwidth term Q (or quality factor). The filter parameters may be determined by first modeling the response using a high-order Linear Predictive Coding (LPC) model to capture the peaks and valleys in the magnitude response, especially at low frequencies, and then inverting the model. Parameters of the IIR parametric filters are then determined from the inverted model. As few as three to four cascaded parametric IIR filters may be used to achieve real-time room response equalization at low frequencies.
In accordance with one aspect of the invention, there is provided a method for equalizing audio signals. The method includes measuring loudspeaker-room acoustics to obtain time domain room response data and forming an equalization filter based on the time domain room response data Steps in the method include processing the time domain room response data with a Linear Predictive Coding (LPC) model to obtain smoothed time domain room response data, computing parameters for a plurality of parametric Infinite-duration Impulse Response (IIR) filters from the smoothed time domain room response data, cascading the plurality of parametric IIR filters and forming an equalizing filter, and equalizing the loudspeaker-room response with the equalization filter.
In accordance with another aspect of the invention, there is provided a first method for computing parameters of cascaded parametric IIR filters. Unprocessed time domain room response data is collected. An FFT is performed on the time domain room response data to obtain a frequency domain room response. The frequency domain room response is normalized in a frequency range of interest to obtain a normalized frequency domain room response. An inverse FFT is performed on the normalized frequency domain room response to obtain normalized time domain room response data. The normalized time domain room response data is represented using an LPC model to obtain smoothed time domain room response data. An FFT is performed on the smoothed time domain room response data to obtain smoothed frequency domain room response data. The smoothed frequency domain room response data is inverted to obtain equalization frequency response. The magnitude of the equalization frequency response is computed. The peaks and valleys of the magnitude of the equalization frequency response are found. The gains, center frequencies, bandwidths and Q factors of each peak are computed. The gains and Qs are optimized. The parametric filter coefficients are then computed from the optimized gains and Qs.
In accordance with yet another aspect of the invention, there is provided a second method for computing parameters of cascaded parametric IIR filters. The second method includes collecting unprocessed time domain room response data. Performing an FFT on the time domain room response data to obtain a frequency domain room response. Normalizing the frequency domain room response in a frequency range of interest to obtain a normalized frequency domain room response. Performing an inverse FFT on the normalized frequency domain room response to obtain a normalized time domain room response data. Representing the normalized time domain room response data using an LPC model to obtain smoothed time domain room response data. Performing an FFT on the smoothed time domain room response data to obtain smoothed frequency domain room response data. Computing the magnitude of the smoothed frequency domain room response. Detecting peaks and valleys of the magnitude of the smoothed frequency domain room response. Computing gains, center frequencies, bandwidths and Q factors of each of the peaks. Optimizing the gains and the Q factors. Computing parametric filter coefficients from the optimized gains and the optimized Q factors. Determining poles and zeros of each of the parametric IIR filters based on the parametric filter coefficients. Computing minimum-phase zeroes from the zeros of each of the parametric filters. Reflecting each minimum-phase zero as a reflected pole and reflecting each pole as a reflected zero for each parametric filter. And expanding each reflected zero and its complex conjugate into a real second order numerator polynomial and expanding each reflected pole and its complex conjugate into a real second order denominator polynomial for each cascaded parametric filter.
In accordance with yet another aspect of the invention, there is provided a third method for computing parameters of cascaded parametric IIR filters. The third method includes collecting unprocessed time domain room response data. Performing an FFT on the time domain room response data to obtain a frequency domain room response. Normalizing the frequency domain room response in a frequency range of interest to obtain a normalized frequency domain room response. Performing an inverse FFT on the normalized frequency domain room response to obtain a normalized time domain room response data. Representing the normalized time domain room response data using an LPC model to obtain smoothed time domain room response data. Performing an FFT on the smoothed time domain room response data to obtain smoothed frequency domain room response data. Computing the magnitude of the smoothed frequency domain room response to obtain a magnitude response. Inverting the magnitude response. Detecting peaks and valleys of the inverted magnitude response. Computing gains, center frequencies, bandwidths and Q factors of each of the peaks. Optimizing the gains and the Q factors. And computing parametric filter coefficients from the optimized center frequencies, the optimized gains, and the optimized Q factors.
The above and other aspects, features and advantages of the present invention will be more apparent from the following more particular description thereof, presented in conjunction with the following drawings wherein:
Corresponding reference characters indicate corresponding components throughout the several views of the drawings.
The following description is of the best mode presently contemplated for carrying out the invention. This description is not to be taken in a limiting sense, but is made merely for the purpose of describing one or more preferred embodiments of the invention. The scope of the invention should be determined with reference to the claims.
Low frequency room acoustic response modeling and equalization is a challenging problem. Traditionally, Infinite-duration Impulse Response (IIR) or Finite-duration Impulse Response (FIR) filters have been used for acoustic response modeling and equalization. The parametric IIR filter, also called a parametric filter, has a bell-shaped frequency domain magnitude response and is characterized by its center frequency Fc, gain G at the center frequency Fc, and a Q factor (which is inversely related to the bandwidth of the filter). Such an IIR filter is easily implemented as a cascade of lower order IIR filters for purposes of room response modeling and equalization.
The present invention includes a method for determining the coefficients of each of a family of cascaded second order IIR parametric filters using a high-order Linear Predictive Coding (LPC) model, where the poles (or roots) of the LPC model are used to obtain the parameters of the parametric IIR filters. The LPC model is used to solve the normal equations which arise from a least squares formulation, and a moderately high-order LPC model is able to accurately model the low-frequency room response modes. Due to the band interactions between the IIR filters which are cascaded to model the room response, the method includes optimizing the Q values to better characterize the room response. The LPC model allows for better equalization for correcting the loudspeaker and room acoustics, particularly at low frequencies. Advantages of the present method include fast computation of the IIR parametric filter parameters using the LPC model, because the LPC model may be efficiently computed using the Levinson-Durbin recursion to solve normal equations which arise from the least squares formulation, and because a moderately high-order LPC model is able to accurately model the low-frequency room response modes.
Multichannel audio is aimed at rendering spatial audio in an immersive and convincing manner to people involved in listening to music at home and in cars, viewing movies in home-theaters, movie-theaters, etc. Examples of multichannel audio formats include Philips/Sony's SACD (Super Audio CD) and the DVD-Audio format. Examples of movie formats include Dolby Digital 5.1 and DTS.
An example system level description of a 5.1 multi-channel audio system, with equalization filters in each channel for correcting loudspeaker-room acoustics, is shown in
The equalization filters 22a-22e and 32 are obtained by measuring the loudspeaker-room response and determining the equalization filter parameters (center frequency “Fc”, gain “G”, and Q factor “Q”) from the measured loudspeaker-room response using the LPC model. The resulting equalization filters 22a-22e and 32 do not change unless the multi-channel audio system is physically moved to another location in which case the loudspeaker room responses may need to be measured and new equalization filter parameters generated. Further, the loudspeaker-room responses vary with listening position and the method of the present invention may be adapted for multiple listener applications.
The equalization filters 22a-22e and 32 are preferably a set of cascaded second order parametric IIR filters and more preferably the equalization filters 22a-22e and 32 comprising as few as three or four cascaded second order parametric IIR filters. The second order parametric IIR filters are specified in terms of the Laplace transform, in terms of the center frequency Fc, gain G, and Q, as
where, if G≧then QD=Q, and QN=10|G|/20 Q. If G<0 then QN=Q, and QD=10−|G|/20 Q
In the digital domain, with a sampling frequency fs, the equivalent transfer function may be expressed as:
and the filter parameters are:
α0=4fs2+4πfcfs/QN+(2πfc)2
α1=−8fs2+2(2πfc)2
α2=4fs2−4πfcfs/QN
β0=4fs2+4πfcfsQD+(2πfc)2
β1=−8fs2+2(2πfc)2
β2=4fs2−4πfcfs/QD+(2πfc)2
{circumflex over (Q)}=f
c
/BW
G/√{square root over (2.5)}(fc)
For example, if:
BW=
4/√{square root over (2.5)}(fc)=253−158 Hz
then,
Q=200/95=2.1
which is close to the true Q.
The above described method for computing Q is preferred because {circumflex over (Q)} based on other bandwidth criteria may yield inaccurate filters, and in effect, band interactions between a cascade of inaccurately Q-estimated parametric filters may lead to poor response modeling. This is shown in
To model the important low-frequency magnitude response with parametric filters, it is required that the Fc, G and Q of each of the filters be estimated in a manner that the cascade of such parametric IIR filters yields a magnitude response with low error in the low-frequency region. Instead of performing an exhaustive search for maximas in the magnitude response, which can be computationally intensive and make the technique susceptible to local minimas, a sufficiently high-order all-pole model such as a Linear Predictive Coding (LPC) model may be used to track the dominant peaks in the magnitude response. Such LPC model is described in “Linear Prediction of Speech” authored by Markel and Gray and published by Springer-Verlag. The LPC is efficiently implemented using the Levinson-Durbin recursion, which is well known in the art. An LPC model of order 512 of the response in
Because the LPC model is characterized by an all-pole transfer function with denominator polynomial of order 512 in this example, it is necessary to find the frequency locations of the peaks using a root finding technique. The frequency locations of the peaks then yield the center frequency for the corresponding parametric IIR filters. A computationally efficient and accurate rootfinding technique for finding roots of high-order polynomials, is described in “Factoring Very-High Degree Polynomials” by Sutton and Burrus and published in the IEEE Signal Processing Magazine pages 27-43, November 2003. Thus, the root-finding method, such as described by Sutton, yields the poles of the LPC transfer function. That is,
The angle, θk of the poles Zk=rkejθk yields the center frequencies associated with the LPC spectrum of
In the next step the gain, Gi(I=1, 2, 3, and 4) at each of the center frequencies Fc1, Fc2, Fc3, and Fc4 (or at the corresponding nearest bins from the Fast Fourier Transform (FFT)) is computed. Finally, the two frequency points, for each parametric filter I, corresponding to Gi/√{square root over (2.5)} are determined to yield {circumflex over (Q)}i=fci/BWG
Due to band interactions between adjacent (cascaded) parametric IIR filters, the resulting frequency response may not match the modeled room frequency response beyond the center frequencies. As a result, the parameter Qi and Gi for each of the parametric IIR filters are annealed (or optimized) from the initial values, independent of each other, such that the errors:
E=(1/N)Σk=1N|HLPC(ωk)−Hf
are minimized at a finite number of discrete frequency points, N, in the neighborhood of each of the fci's Typically, the cascaded amplitude response will be greater than the LPC model spectrum and the {circumflex over (Q)}i values are gradually increased such that the average error, E, is minimized for each of the three highest fci parametric filters. Alternatively, a gradient descent scheme may be used to optimize the {circumflex over (Q)}i values.
After annealing (i.e., optimizing), the cascaded response is shown in
Another example of a room response to be modeled below 400 Hz is shown in
An overview of a method according to the present invention for computing parameters of cascaded parametric IIR filters is described in
A high level diagram of a second embodiment of a method according to the present invention is described in
A high level diagram of a third embodiment of a method according to the present invention is described in
A detailed method for computing the parameters of the parametric IIR filters from the coefficients of the LPC model is described in
Continuing the Q factor computation with
The calculation of the bandwidth and Q factor for n equal to 2 through num_peaks-1 is described in
Continuing with
Still continuing with
Completing the computation of Q in
Gain G and Q factor Q optimization is described in
Continuing with
A method for performing a detailed optimization is described in
Continuing with the detailed optimization in
Table 1 describes the variables used above:
A method for modeling the low-frequency room acoustical response using a cascade of parametric filters has been described above. The LPC model is first used to generate an all-pole model of the room response. Subsequently, the roots of the denominator polynomial are determined and used to determine the parameters of the parametric filter. Additional annealing of the Q values permit better modeling of the LPC response and subsequent equalization of the room response. Alternative methods include adapting the Q1 parameters using gradient descent techniques as well as modeling using frequency warping. Results may be extended also for multiple listener applications (viz., multiple positions).
While the invention herein disclosed has been described by means of specific embodiments and applications thereof, numerous modifications and variations could be made thereto by those skilled in the art without departing from the scope of the invention set forth in the claims.