The embodiments of the invention relates to a method for designing a modal equalizer for a low audio frequency range.
Traditional magnitude equalization attempts to achieve a flat frequency response at the listening location either for the steady state or early arriving sound. Both approaches achieve an improvement in audio quality for poor loudspeaker-room systems, but colorations of the reverberant sound field cannot be handled with traditional magnitude equalization. Colorations in the reverberant sound field produced by room modes deteriorate sound clarity and definition.
U.S. Pat. No. 5,815,580 describes this kind of compensating filters for correcting amplitude response of a room.
M. Karjalainen, P. Antsalo, A. Mäkivirta, T. Peltonen, and V. Välimäki, “Estimation of Modal Decay Parameters from Noisy Response Measurements”, presented at the AES 110th Convention, Amsterdam, The Netherlands, 2001 May 12-15, preprint 5290 (12), describes methods for modelling modal parameters. This publication does not present any methods for eliminating or equalizing these modes in audio systems.
Embodiments of the present invention differ from the prior art at least in that a discrete time description of the modes is created and with this information digital filter coefficients are formed.
Modal equalization can specifically address problematic modal resonances, decreasing their Q-value and bringing the decay rate in line with other frequencies.
Modal equalization also decreases the gain of modal resonances thereby affecting an amount of magnitude equalization. It is important to note that traditional magnitude equalization does not achieve modal equalization as a byproduct. There is no guarantee that zeros in a traditional equalizer transfer function are placed correctly to achieve control of modal resonance decay time. In fact, this is rather improbable. A sensible aim for modal equalization is not to achieve either zero decay time or flat magnitude response. Modal equalization can be a good companion of traditional magnitude equalization. A modal equalizer can take care of differences in the reverberation time while a traditional equalizer can then decrease frequency response deviations to achieve acceptable flatness of magnitude response.
Modal equalization is a method to control reverberation in a room when conventional passive means are not possible, do not exist or would present a prohibitively high cost. Modal equalization is an interesting design option particularly for low-frequency room reverberation control.
In the following, the invention will be described in more detail with reference to the exemplifying embodiments illustrated in the attached drawings in which
a shows a block diagram of type I modal equalizer in accordance with the invention using the primary sound source.
b shows a block diagram of type II modal equalizer in accordance with the invention using a secondary radiator.
a shows an impulse response of a real room.
b shows a frequency response of the same room as
c shows a three dimensional graph of case III, real room 1 in accordance with
A loudspeaker installed in a room acts as a coupled system where the room properties typically dominate the rate of energy decay. At high frequencies, typically above a few hundred Hertz, passive methods of controlling the rate and properties of this energy decay are straightforward and well established. Individual strong reflections are broken up by diffusing elements in the room or trapped in absorbers. The resulting energy decay is controlled to a desired level by introducing the necessary amount of absorbance in the acoustical space. This is generally feasible as long as the wavelength of sound is small compared to dimensions of the space.
As we move toward low frequencies, passive means of controlling reverberant decay time become more difficult because the physical size of necessary absorbers increases and may become prohibitively large compared to the volume of the space, or absorbers have to be made narrow-band. Related to this, the cost of passive control of reverberant decay greatly increases at low frequencies. Methods for optimizing the response at a listening position by finding suitable locations for loudspeakers have been proposed [1] but cannot fully solve the problem. Because of these reasons there has been an increasing interest in active methods of sound field control at low frequencies, where active control becomes feasible as the wavelengths become long and the sound field develops less diffuse [2-6].
Modal resonances in a room can be audible because they modify the magnitude response of the primary sound or, when the primary sound ends, because they are no longer masked by the primary sound [7,8]. Detection of a modal resonance appears to be very dependent on the signal content. Olive et al. report that low-Q resonances are more readily audible with continuous signals containing a broad frequency spectrum while high-Q resonances become more audible with transient discontinuous signals [8].
Olive et al. report detection thresholds for resonances both for continuous broadband sound and transient discontinuous sound. At low Q values antiresonances (notches) are as audible as resonances. As the Q value becomes high, audibility of antiresonances reduces dramatically for wideband continuous signals [8]. Detectability of resonances reduces approximately 3 dB for each doubling of the Q value [7,8] and low Q resonances are more readily heard with zero or minimal time delay relative to the direct sound [7]. Duration of the reverberant decay in itself appears an unreliable indicator of the audibility of the resonance [7] as audibility seems to be more determined by frequency domain characteristics of the resonance.
In this patent application we present methods to actively control low-frequency reverberation. We will first present the concept and two basic types of modal equalization. A target for modal decay time versus frequency will be discussed based on existing recommendations for high quality audio monitoring rooms. Methods to identify and parametrize modes in an impulse response are introduced. Modal equalizer design for an individual mode is discussed with examples. Several case studies of both synthetic modes and modes of real rooms are presented. Finally, synthesis of IIR modal equalizer filters is discussed.
The Concept of Modal Equalization
The embodiments of the invention is especially advantageous for frequencies below 200 Hz and environments where sound wavelength relative to dimensions of a room is not very small. A global control in a room is not of main interest, but reasonable correction at the primary listening position.
These limitations lead into a problem formulation where the modal behaviour of the listening space can be modeled by a distinct number of modes such that they can be individually controlled. Each mode is modeled by an exponential decay function
hm(t)=Ame−τ
Here Am is the initial envelope amplitude of the decaying sinusoid, τm is a coefficient that denotes the decay rate, ωm is the angular frequency of the mode, and φm is the initial phase of the oscillation.
We define modal equalization as a process that can modify the rate of a modal decay. The concept of modal decay can be viewed as a case of parametric equalization, operating individually on selected modes in a room. A modal resonance is represented in the z-domain transfer function as a pole pair with pole radius r and pole angle θ
The closer a pole pair is to the unit circle the longer is the decay time of a mode. To shorten the decay time the Q-value of a resonance needs to be decreased by shifting poles toward the origin. We refer to this process of shifting pole locations as modal equalization.
Modal decay time modification can be implemented in several ways—either the sound going into a room through the primary radiator is modified or additional sound is introduced in the room with one or more secondary radiators to interact with the primary sound. The first method has the advantage that the transfer function from a sound source to a listening position does not affect modal equalization. In the second case differing locations of primary and secondary radiators lead to different transfer functions to the listening position, and this must be considered when calculating a corrective filter. We will now discuss these two cases in more detail, drawing some conclusions on necessary conditions for control in both cases.
Type I Modal Equalization
In accordance with
Type I implementation modifies the audio signal fed into the primary loudspeaker 3 to compensate for room modes. The total transfer function from the primary radiator to the listening position represented in z-domain is
H(z)=G(z)Hm(z) (3)
where G(z) is the transfer function of the primary radiator from the electrical input to acoustical output and Hm(z)=B(z)/A(z) is the transfer function of the path from the primary radiator to the listening position. The primary radiator has essentially flat magnitude response and small delay in our frequency band of interest, or the primary radiator can be equalized by conventional means and can therefore be neglected in the following discussion,
G(z)=1 (4)
We now design a pole-zero filter Hc(z) having zero pairs at the identified pole locations of the modal resonances in Hm(z). This cancels out existing room 1 response pole pairs in A(z) replacing them with new pole pairs A′(z) producing the desired decay time in the modified transfer function H′m(z)
This leads to a correcting filter
The new pole pair A′(z) is chosen on the same resonant frequency but closer to the origin, thereby effecting a resonance with a decreased Q value. In this way the modal resonance poles have been moved toward the origin, and the Q value of the mode has been decreased. The sensitivity of this approach will be discussed later with example designs.
Type II Modal Equalization
In accordance with
This leads to a correcting filter Hc(z) where Hm(z) and H′m(z) differ by modified pole radii
Note that if the primary and secondary radiators are the same source, Equation 8 reduces into a parallel formulation of a cascaded correction filter equivalent to the Type I method presented above
H′m(z)=Hm(z)(1+Hc(z)) (10)
A necessary but not sufficient condition for a solution to exist is that the secondary radiator can produce sound level at the listening location in frequencies where the primary radiator can, within the frequency band of interest
|H1(f)|≠0, for |Hm(f)|≠0 (11)
At low frequencies where the size of a radiator becomes small relative to the wavelength it is possible for a radiator to be located such that there is a frequency where the radiator does not couple well into the room. At such frequencies the condition of Equation 11 may not be fulfilled, and a secondary radiator placed in such location will not be able to affect modal equalization at that frequency. Because of this it may be advantageous to have multiple secondary radiators in the room. In the case of multiple secondary radiators, Equation 7 is modified into form
where N is the number of secondary radiators.
After the decay times of individual modes have been equalized in this way, the magnitude response of the resulting system may be corrected to achieve flat overall response. This correction can be implemented with any of the magnitude response equalization methods.
In this patent application we will discuss identification and parametrization of modes and review some case examples of applying the proposed modal equalization to various synthetic and real rooms, mainly using the first modal equalization method proposed above. The use of one or more secondary radiators will be left to future study.
Target of Modal Equalization
The in-situ impulse response at the primary listening position is measured using any standard technique. The process of modal equalization starts with the estimation of octave band reverberation times between 31.5 Hz-4 kHz. The mean reverberation time at mid frequencies (500 Hz-2 kHz) and the rise in reverberation time is used as the basis for determining the target for maximum low-frequency reverberation time.
The target allows the reverberation time to increase at low frequencies. Current recommendations [9-11] give a requirement for average reverberation time Tm in seconds for mid frequencies (200 Hz to 4 kHz) that depends on the volume V of the room
where the reference room volume Vo of 100 m3 yields a reverberation time of 0.25 s. Below 200 Hz the reverberation time may linearly increase by 0.3 s as the frequency decreases to 63 Hz. Also a maximum relative increase of 25% between adjacent ⅓-octave bands as the frequency decreases has been suggested [10,11]. Below 63 Hz there is no requirement. This is motivated by the goal to achieve natural sounding environment for monitoring [11]. An increase in reverberation time at low frequencies is typical particularly in rooms where passive control of reverberation time by absorption is compromised, and these rooms are likely to have isolated modes with long decay times.
We can define the target decay time relative for example to the mean T60 in mid-frequencies (500 Hz-2 kHz), increasing (on a log frequency scale) linearly by 0.2 s as the frequency decreases from 300 Hz down to 50 Hz.
Mode Identification and Parameter Estimation
After setting the reverberation time target, transfer function of the room to the listening position is estimated using Fourier transform techniques. Potential modes are identified in the frequency response by assuming that modes produce an increase in gain at the modal resonance. The frequencies within the chosen frequency range (f<200 Hz) where level exceeds the average mid-frequencies level (500 Hz to 2 kHz) are considered as potential mode frequencies.
The short-term Fourier transform presentation of the transfer function is employed in estimating modal parameters from frequency response data. The decay rate for each detected potential room mode is calculated using nonlinear fitting of an exponential decay+noise model into the time series data formed by a particular short-term Fourier transform frequency bin. A modal decay is modeled by an exponentially decaying sinusoid (Equation 1 reproduced here for convenience)
hm(t)=Ame−τ
where Am is the initial envelope amplitude of the decaying sinusoid, τm is a coefficient defining the decay rate, ωm is the angular frequency of the mode, and φm is the initial phase of modal oscillation. We assume that this decay is in practical measurements corrupted by an amount of noise nb(t)
nb(t)=Ann(t) (15)
and that this noise is uncorrelated with the decay. Statistically the decay envelope of this system is
The optimal values An, τm and Am are found by least-squares fitting this model to the measured time series of values obtained with a short-term Fourier transform measurement. The method of nonlinear modeling is detailed in [12]. Sufficient dynamic range of measurement is required to allow reliable detection of room mode parameters although the least-squares fitting method has been shown to be rather resilient to high noise levels. Noise level estimates with the least-squares fitting method across the frequency range provide a measurement of frequency-dependent noise level A(f) and this information is later used to check data validity.
Modal Parameters
The estimated decay parameters τm(f) across the frequency range are used in identifying modes exceeding the target criterion and in calculating modal equalizing filters. It can be shown that the spectral peak of a Gaussian-windowed stationary sinusoid calculated using Fourier transform has the form of a parabolic function [13]. Therefore the precise center frequency of a mode is calculated by fitting a second-order parabolic function into three Fourier transform bin values around the local maximum indicated by decay parameters τm(f) in the short-term Fourier transform data
G(f)=af2+bf+c (17)
The frequency where the second-order function derivative assumes value zero is taken as the center frequency of the mode
In this way it is possible to determine modal frequencies more precisely than the frequency bin spacing of the Fourier transform presentation would allow.
Estimation of modal pole radius can be based on two parameters, the Q-value of the steady-state resonance or the actual measurement of the decay time T60. While the Q-value can be estimated for isolated modes it may be difficult or impossible to define a Q-value for modes closely spaced in frequency. On the other hand the decay time is the parameter we try to control. Because of these reasons we are using the decay time to estimate the pole location.
The 60-dB decay time T60 of a mode is related to the decay time constant τ by
The modal parameter estimation method employed in this work [12] provides us an estimate of the time constant τ. This enables us to calculate T60 to obtain a representation of the decay time in a form more readily related to the concept of reverberation time.
Discrete-Time Representation of a Mode
Consider now a second-order all-pole transfer function having pole radius r and pole angle θ
Taking the inverse z-transform yields the impulse response of this system as
where u(n) is a unit step function.
The envelope of this sequence is determined by the term rn. To obtain a matching decay rate to achieve T60 we require that the decay of 60 dB is accomplished in N60 steps given a sample rate fs,
20 log(rN
We can now solve for the pole radius r
Using the same approach we can also determine the desired pole location, by selecting the same frequency but a modified decay time T60 and hence a new radius for the pole. Some error checking of the identified modes is necessary in order to discard obvious measurement artifacts. A potential mode is rejected if the estimated noise level at that modal frequency is too high, implying insufficient signal-to-noise ratio for reliable measurement. Also, candidate modes that show unrealistically slow decay or no decay at all are rejected because they usually represent technical problems in the measurement such as mains hum, ventilation noise or other unrelated stationary error signals, and not true modal resonances.
Modal Equalizer Design
For sake of simplicity the design of Type I modal equalizer is presented here. This is the case where a single radiator is reproducing both the primary sound and necessary compensation for the modal behavior of a room. Another way of viewing this would be to say that the primary sound is modified such that target modes decay faster.
A pole pair z=F(r,θ) models a resonance in the z-domain based on measured short-term Fourier transform data while the desired resonance Q-value is produced by a modified pole pair zc=F(rc,θc). The correction filter for an individual mode presented in Equation 5 becomes
To give an example of the correction filter function, consider a system defined by a pole pair (at radius r=0.95, angular frequency ω=±0.18π) and a zero pair (at r=1.9, ω=±0.09π). We want to shift the location of the poles to radius r=0.8. To effect this we use the Type I filter of Equation 24 with the given pole locations, having a notch-type magnitude response (
The quality of a modal pole location estimate determines the success of modal equalization. The estimated center frequency determines the pole angle while the decay rate determines the pole distance from the origin. Error in these estimates will displace the compensating zero and reduce the accuracy of control. For example, an estimation error of 5% in the modal pole radius (
The before specified method is described as a flow chart in
In step 10 the decay rate target is set. In this step normal decay rate is defined and as a consequence an upper limit for this rate is defined.
In step 11 peaks or notches are defined for the specific room 1 and especially for a predefined listening position 2.
In step 12 accurate decay rates for each peak and notch exceeding the set limit are defined by nonlinear fitting.
The modes to be equalized are selected in step 13.
In step 14 accurate center frequencies for the modes are defined.
In step 15 a discrete-time description of the modes is formed and consequently the discrete-time poles are defined and in step 16 an equalizer is designed on the basis of this information.
Case Studies
Case studies in this section demonstrate the modal equalization process. These cases contain artificially added modes and responses of real rooms equalized with the proposed method.
The waterfall plots in
Cases I and II use an impulse response of a two-way loudspeaker measured in an anechoic room. The waterfall plot of the anechoic impulse response of the loudspeaker (
Cases with Artificial Modes
Case 1 attempts to demonstrate the effect of the developed mode equalizer calculation algorithm. It is based on the free field response of a compact two-way loudspeaker measured in an anechoic room. An artificial mode with T60=1 second has been added to the data at f=100 Hz and an equalizer has been designed to shorten the T60 to 0.26 seconds. The room mode increases the level at the resonant frequency considerably (about 30 dB) and the long decay rate is evident (
Case II uses the same anechoic two-way loudspeaker measurement. In this case five artificial modes with slightly differing decay times have been added. See Table I for original and target decay times and center frequencies of added modes. For real room responses, the target decay time is determined by mean T60 in mid-frequencies, increasing linearly (on linear frequency scale) by 0.2 s as the frequency decreases from 300 Hz down to 50 Hz. For the synthetic Case II the target decay time was arbitrarily chosen as 0.2 seconds. Again we note that the magnitude gain of modal resonances (
Table 1. Case II artificial modes center frequency f, decay time T60, and target decay time T′60.
Cases with Real Room Responses
Case III is a real room response. It is a measurement in a hard-walled approximately rectangular meeting room with about 50 m2 floor area. The target decay time specification is the same as in Case II.
In Case III the mean T60 in mid frequencies is 0.75 s. 20 modes were identified with decay time longer than the target decay time. The mode frequency fm, estimated decay time T60 and target decay time T′60 are given in Table 2.
a shows an impulse response of an example room.
b shows a frequency response of the same room. In figure arrows pointing upwards show the peaks in the response and the only arrow downwards shows a notch (antiresonance).
The waterfall plot of the original impulse response of
Table 2. Case III, equalized mode frequency fm, original T60 and target decay rate T′60.
Implementation of Modal Equalizers
Type I Filter Implementation
To correct N modes with a Type I modal equalizer, we need an order-2N IIR transfer function. The most immediate method is to optimize a second-order filter, defined by Equation 24, for each mode identified. The final order-2N filter is then formed as a cascade of these second-order subfilters
Hc(z)=Hc,1(z)·Hc,2(z)· . . . ·Hc,N(z) (26)
Another formulation allowing design for individual modes is served by the formulation in Equation 10. This leads naturally into a parallel structure where the total filter is implemented as
Asymmetry in Type I Equalizers
At low angular frequencies the maximum gain of a resonant system may no longer coincide with the pole angle [14]. Similar effects also happen with modal equalizers, and must be compensated for in the design of an equalizer.
Basic Type I modal equalizer (see Equation 24) becomes increasingly unsymmetrical as angular frequency approaches ω=0. A case example in
It is possible to avoid asymmetry by decreasing the sampling frequency in order to bring the modal resonances higher on the discrete frequency scale.
If sample rate alteration is not possible, we can symmetrize a modal equalizer by moving the pole slightly downwards in frequency (
Type II Filter Implementation
Type II modal equalizer requires a solution of Equation 8 for each secondary radiator. The correcting filter Hc(z) can be implemented by direct application of Equation 8 as a difference of two transfer functions convolved by the inverse of the secondary radiator transfer function, bearing in mind the requirement of Equation 11. A more optimized implementation can be found by calculating the correcting filter transfer function Hc(z) based on measurements, and then fitting an FIR or IIR filter to approximate this transfer function. This filter can then be used as the correcting filter. Any filter design technique can be used to design this filter.
In the case of multiple secondary radiators the solution becomes slightly more convoluted as the contribution of all secondary radiators must be considered. For example, solution of Equation 12 for the correction filter of the first secondary radiator is
It is evident that all secondary radiators interact to form the correction. Therefore the design process of these secondary filters becomes a multidimensional optimization task where all correction filters must be optimized together. A suboptimal solution is to optimize for one secondary source at a time, such that the subsequent secondary sources will only handle those frequencies not controllable by the previous secondary sources for instance because of poor radiator location in the room.
We have presented two different types of modal equalization approaches, Type I modifying the sound input into the room using the primary speakers, and Type II using separate speakers to input the mode compensating sound into a room. Type I systems are typically minimum phase. Type II systems, because the secondary radiator is separate from the primary radiator, may have an excess phase component because of differing times-of-flight. As long as this is compensated in the modal equalizer for the listening location, Type II systems also conform closely to the minimum phase requirement.
There are several reasons why modal equalization is particularly interesting at low frequencies. At low frequencies passive means to control decay rate by room absorption may become prohibitively expensive or fail because of constructional faults. Also, modal equalization becomes technically feasible at low frequencies where the wavelength of sound becomes large relative to room size and to objects in the room, and the sound field is no longer diffuse. Local control of the sound field at the main listening position becomes progressively easier under these conditions.
Recommendations [9-11] suggest that it is desirable to have approximately equal reverberant decay rate over the audio range of frequencies with possibly a modest increase toward low frequencies. We have used this as the starting point to define a target for modal equalization, allowing the reverberation time to increase by 0.2 s as the frequency decreases from 300 Hz to 50 Hz. This target may serve as a starting point, but further study is needed to determine a psychoacoustically proven decay rate target.
In this patent the principle of modal equalization application is introduced, with formulations for Type I and Type II correction filters. Type I system implements modal equalization by a filter in series with the main sound source, i.e. by modifying the sound input into the room. Type II system does not modify the primary sound, but implements modal equalization by one or more secondary sources in the room, requiring a correction filter for each secondary source. Methods for identifying and modeling modes in an impulse response measurement were presented and precision requirements for modeling and implementation of system transfer function poles were discussed. Several examples of mode equalizers were given of both simulated and real rooms. Finally, implementations of the mode equalizer filter for both Type I and Type II systems were described.
Number | Date | Country | Kind |
---|---|---|---|
20012313 | Nov 2001 | FI | national |
Number | Name | Date | Kind |
---|---|---|---|
4947360 | Dyer | Aug 1990 | A |
5226057 | Boren | Jul 1993 | A |
5263019 | Chu | Nov 1993 | A |
5325204 | Scarpa | Jun 1994 | A |
5400084 | Scarpa | Mar 1995 | A |
5491754 | Jot et al. | Feb 1996 | A |
5559891 | Kuusama et al. | Sep 1996 | A |
5815580 | Craven et al. | Sep 1998 | A |
5949894 | Nelson et al. | Sep 1999 | A |
Number | Date | Country |
---|---|---|
0 505 949 | Sep 1992 | EP |
0 535 737 | Jun 1997 | EP |
1 017 166 | Jul 2000 | EP |
1130608 | May 1989 | JP |
02142300 | May 1990 | JP |
02150197 | Jun 1990 | JP |
03106208 | May 1991 | JP |
04194996 | Jul 1992 | JP |
11262095 | Sep 1999 | JP |
Number | Date | Country | |
---|---|---|---|
20030099365 A1 | May 2003 | US |