1. Field of the Invention Technology
The present invention relates to microphone output signal levels, and more specifically, to the calibration thereof to a desired level. When output levels of different microphones are compared, it is assumed that the acoustical excitations thereof are identical. Manufacturers supply microphones having output levels varying around a specified mean value. For the often-used back-electret microphones, such tolerances are ±4 dB. Consequently, the output levels of such microphones may show a difference of up to 8 dB. Microphones with tolerances of ±2 dB are sometimes available. These, however, are more expensive.
2. Description of the Related Art
A usual approach for gain calibration of a microphone is carried out in an anechoic chamber, i.e., a chamber without reflections or reverberation. A loudspeaker is placed in front of the microphone (at an angle of 0°) inside the anechoic chamber. The loudspeaker plays a noise sequence at a known power level and the power of the microphone response is measured. Subsequently, an adjustable gain is set.
Further an audio processing arrangement is disclosed in International Patent Application No. WO 99/27522. According to this prior art reference, filtered sum and weighted sum beamforming are developed for maximizing power at the output. Filtered sum beamforming (FSB) makes the direct contributions maximally coherent upon adding thereof.
With multi-microphone algorithms such as beamforming, it is very important to sort the microphones during production to obtain sets with level differences within the required tolerances.
Moreover, with some multi-microphones systems, the consumer may buy additional microphones later in time, which will also have to be calibrated before installation.
The present invention provides a device for calibration of a microphone, comprising:
a loudspeaker for converting a loudspeaker input signal into sound;
a microphone for converting received sound into a microphone output signal, and
calibration means for calibrating the output power of the microphone relative to a desired power level, said calibration means comprising impulse response estimating means for estimating an acoustic impulse response of the microphone and/or the environment at the microphone by correlating the microphone output signal and the loudspeaker input signal when the microphone receives sound from the loudspeaker, whereby the output power of the microphone is estimated.
As indicated above, calibration of microphones is often of crucial importance for good performance of multi-microphone systems. The present invention is concerned with the adaptive calibration (in software) of microphones under reverberant room conditions. An advantage of the present invention is that the microphones need not be selected or calibrated when manufacturing an audio system, saving production time and, sometimes, additional hardware. The present invention can be applied in all speech communication systems where one or more microphones and a loudspeaker are available. One can think of hands-free telecommunication systems, but also of hands-free speech recognition systems for voice control of, e.g., a television set.
Non-uniformly ageing of microphones, which can also lead to output level differences, will also be neutralized by this invention.
In a preferred embodiment of the invention, direct part removal means are provided for removing the direct part of the so-called acoustic impulse response (a.i.r.) in order to use, especially, the diffuse part of the a.i.r. An advantage hereof is that calibration can be executed during use in a normal environment, e.g., a room of a microphone, and without the need for additional hardware. Calibration during the actual use also allows for either absolute calibration or relative calibration.
Another preferred embodiment comprises high- and low-pass filter means for filtering low and high frequencies, allowing for better calibration by using frequency ranges where signal quality is best suitable for processing.
Another preferred embodiment comprises squaring and summation means for creating a representation of the current power level of the diffuse sound-field response of the microphone, in order to create a value that can be related to a desired level.
The invention further preferably comprises relating means for relating the power level of the (diffuse) microphone response with a desired power level.
Although it may be possible to obtain an absolute value for the desired power level, this desired power level is preferably available from a reference microphone.
Further advantages, features, and details of the present invention will become clear when reading the following description with reference to the annexed drawings, in which:
Another example of use of a device according to present invention (not shown), relates to voice-based commanding of a television set, e.g., for switching channels or controlling the volume, by using microphone input. This can also be embodied in a form with one or several microphones. In order for a system to use the microphone output signal, calibration may be necessary.
For clarification, some acoustical concepts are explained that are relevant for understanding the detailed description of the drawings.
An acoustic impulse response (a.i.r.) can be estimated from the loudspeaker excitation signal and the microphone response by correlation techniques. An a.i.r. is the response on an impulsive acoustic excitation. An example of such an estimated a.i.r. is depicted in FIG. 3. During the first few milliseconds, the response is zero due to the delay from the limited speed of sound in air. Next, a large peak can be observed, which is due to the response to the direct acoustic propagation of the sound from the speaker towards the microphone, and is called the direct sound field contribution. This peak has a normalized value of 1.0. The tail relates to this value as depicted in this graph. The tail of the a.i.r. is due to reflections against room boundaries, and is called the diffuse sound field contribution. These reflections have a random character and increase statistically in density and decrease exponentially in amplitude over time. The combined effects of the reflections are called reverberation.
An important function of the a.i.r. is the energy decay. In discrete time, with n the sample index, the energy decay at index n amounts to the energy left in the tail of the a.i.r. In
Microphones can have unidirectional beam patterns. Unidirectional microphones only pick up acoustic signals from a certain range of angles around 0°, i.e., they more or less block acoustic signals arriving at 180°. This means that the direct field contribution of an a.i.r. measured at 180° will be almost zero.
In
In the following, it is assumed that the energy in the diffuse tail of the a.i.r. does not depend on the microphone or loudspeaker orientation and location in the room. In practice, some variation are found depending on orientation and location, but these variations are small when the acoustic absorption pattern in the room is more or less homogenous and the reverberation over time is not to small (T60>100 ms). It is worth mentioning that a typical room has a reverberation larger than 300 ms. A general rule is that the bigger a room, the longer the reverberation time.
The present invention uses, as input, not only the microphone response, but also the excitation signal of the loudspeaker (FIG. 2). First, the a.i.r. is estimated from the loudspeaker to the microphone using a well-known correlation method in the estimating means. When acoustic cancellation is performed, this adaptive filter is already available. The diffuse part of the a.i.r. is selected in the direct part removal means. At low frequencies, the loudspeaker output and/or the microphone sensitivity is low, which leads to unreliable a.i.r. coefficients. Therefore, a high-pass filter is applied to the diffuse part of the a.i.r. At the highest frequencies, near the Nyquist frequency, the signal levels will also be low due to anti-aliasing filters. Thus, to deal with unreliable a.i.r. coefficients at high frequencies, a low-pass filter is applied.
In
In the preferred embodiment, this calibration method can be applied each time the adaptive filter comes up with a new estimation of the a.i.r. For increased robustness of an acoustic echo canceller, a programmable filter is sometimes used (as described in U.S. Pat. No. 4,903,247). The adaptive filter runs in the background and the programmable filter, which takes its coefficients conditionally from the adaptive filter, is used for the actual echo removal. In this case, it is best to take the coefficients of the programmable filter and apply the calibration procedure after each coefficient transfer.
The loudspeaker 3 (
The estimated actual power level (P) 14 is fed to a relating program 15 as is an (external) desired power level (Q) 20. From here, the calibration gain factor 16 is fed to the averaging means 17. An adjusted calibration gain factor 18 is fed back to the microphone output signal in order to form the calibrated signal 19.
Especially when combined with an adaptive filter for acoustic echo cancellation, the proposed microphone calibration method can be applied all the time that the system is active. In
Below, the process of the embodiment of
The present invention is not limited to the above preferred embodiments; the rights applied for are defined in the annexed claims.
Number | Date | Country | Kind |
---|---|---|---|
00202298 | Jun 2000 | EP | regional |
Number | Name | Date | Kind |
---|---|---|---|
5029215 | Miller, II | Jul 1991 | A |
5841876 | Gifford et al. | Nov 1998 | A |
5844994 | Graumann | Dec 1998 | A |
Number | Date | Country |
---|---|---|
WO9927522 | Jun 1999 | WO |
Number | Date | Country | |
---|---|---|---|
20030076965 A1 | Apr 2003 | US |