The invention relates a microphone system, in particular to a steerable superdirectional microphone system.
Beyond this, the invention relates to a method operating a microphone system.
Moreover, the invention relates to a computer readable medium.
Furthermore, the invention relates to a program element.
First-order superdirectional microphones or microphone systems may be constructed out of a linear combination of an omni-directional response and a dipole-response. For a steerable first-order superdirectional microphone, the same method can be applied, but the arbitrary steered dipole is constructed out of two orthogonal dipoles with the main-lobes on the 2D plane. Such a steerable microphone system is commonly constructed with multiple (e.g. MEMS) microphones (e.g. 4 or 8) to increase the SNR. Additionally, echo cancellation may be introduced to further improve the performance of the microphone system to remove echoes originating from a loudspeaker. However, providing each microphone with an echo canceller increases the complexity and the costs of the microphone system.
Thus, there may be a need to provide an alternative microphone system and a method of operating the same, a computer readable element, and a program element which may exhibit high performance by reduced complexity.
In order to meet the need defined above, a microphone system, a method of operating a microphone system, a computer readable medium and a program element according to the independent claims are provided. Further improvements are disclosed in the dependent claims.
According to an aspect of the invention a microphone system is provided, wherein the microphone system comprises a microphone array comprising a plurality of microphone units each adapted to generate a primary signal indicative of an acoustic wave received from the respective microphone unit, a first echo cancellation unit, an integrator unit, and a combination unit, wherein the microphone system is adapted to generate a first dipole response and a monopole response from the primary signals, wherein the integrator unit is adapted to generate a first integrated dipole response by integrating the first dipole response, wherein the first echo cancellation unit is adapted to generate a first echo cancelled integrated dipole response from the first integrated dipole response, and wherein the combination unit is adapted to combine the monopole response and the first echo cancelled integrated dipole response.
In particular, the microphone array may comprise at least two microphone units, e.g. two, three, four or eight microphone units. The combination unit may be an adding unit which adds the monopole response and the processed dipole response, i.e. the echo cancelled integrated dipole response. In particular, the combining may be a weighted adding, i.e. the monopole response and/or the echo cancelled dipole response may be multiplied by a weighting factor before combining. Furthermore, the compensated monopole signal and/or the monopole response and/or the dipole response may be amplified before the respective signals are combined. Therefore, one or several amplifiers may be included into the microphone system. By providing an array having at least three microphone units uniformly or non-uniformly arranged on a circle, it may be possible to provide a steerable microphone system, e.g. a steerable superdirectional microphone system, where the maximum/main-lobe of the superdirectional response can be pointed in any azimuthal direction on the 2D plane.
According to an aspect of the invention a method of operating a microphone system comprising a microphone array is provided, wherein the method comprises generating a first dipole response from primary signals of the microphone array, generating a monopole response from primary signals of the microphone array, generating a first integrated dipole response by integrating the first dipole response, generating a first echo cancelled integrated dipole response from the first integrated dipole response, and combining the monopole response and the first echo cancelled integrated dipole response.
According to an aspect of the invention a program element is provided, which, when being executed by a processor, is adapted to control or carry out a method according to an aspect of the invention.
According to an aspect of the invention a computer-readable medium is provided, in which a computer program is stored which, when being executed by a processor, is adapted to control or carry out a method according to an aspect of the invention.
The term “microphone array” may particularly denote any kind of spatial arrangement of a plurality of microphone units wherein each of the plurality of microphone units generate a primary signal. The minimum number of microphone units may be two, while every higher number may be suitable. In particular, it may be necessary to provide at least three microphone units in order to achieve a steerable superdirectional microphone system. The microphone units may be arranged in a regular pattern on a 2D plane, e.g. uniformly on a circular array or may be arranged in an irregular pattern, e.g. non-uniformly on a circular array. In case of four microphone units the microphone units may be arranged in a rectangular or square pattern, for example. In particular, the microphone array may be a small microphone array, wherein the term “small” may particular denote the case that the distance between adjacent microphone units is smaller than the typical wavelengths of the acoustic waves or sound waves which are measured or detected by the microphone units.
By providing a microphone system which is adapted in such a way that an echo cancellation takes place after a performed integration of dipole responses but before the combining of the dipole responses and the monopole responses on the one hand it may be possible to reduce the amount of necessary echo cancellation units, since not for every primary signal, i.e. for each microphone unit, a separate echo cancellation unit is necessary. Further, this may also lead to a reduced degradation of the output signal since possible misadjustments or adaptation errors of the echo-canceller are not increased by the integrator. On the other hand the microphone system may be still steerable since the echo cancellation is performed before the combination. Thus, a steerable microphone system having low complexity and having a good performance may be provided. This may also help to reduce costs in producing the steerable microphone system.
Next, further embodiments of the microphone system are described. However, these embodiments also apply to the method of operating a microphone system, the computer-readable medium, and the program element.
According to another embodiment the microphone system further comprises a second echo cancellation unit which is adapted to generate an echo cancelled monopole response.
In particular, the echo cancellation may be performed by the second echo cancellation unit before the echo cancelled monopole response is combined with the first echo cancelled integrated dipole response.
According to another embodiment the microphone system further comprises a third echo cancellation unit, wherein the microphone system is further adapted to generate a second dipole response, wherein the integrator unit is further adapted to generate a second integrated dipole response by integrating the second dipole response, wherein the third echo cancellation unit is adapted to generate a second echo cancelled integrated dipole response from the second integrated dipole response, and wherein the combination unit is adapted to combine the monopole response, the first echo cancelled integrated dipole response and the second echo cancelled integrated dipole response.
In particular, the integrator unit may be formed by two subunits wherein each subunit is adapted to generate one of the integrated dipole responses, or may be formed by two separated integrator units. The first and third echo cancellation units may be formed by one or by two separate units. Furthermore, the first and the second cancelled integrated dipole responses may be combined before the combining result and is then combined by the combination unit with the monopole response. In particular, the microphone system may be adapted to generate exactly two dipole responses for further processing and exactly one monopole response for further processing.
According to another embodiment of the microphone system the first dipole response and the second dipole response are orthogonal to each other.
That is, the first and second dipole response have an orientation-difference of the main-lobe of π/2 radians.
According to another embodiment of the microphone system the first dipole response and the second dipole response are normalized dipole responses.
By performing the echo cancellation on the normalized and integrated versions of the orthogonal dipole responses, it may be possible to ensure that independent misadjustments/adaptation errors in the echo-reduction for lower frequencies are not degraded by the integrator unit.
According to another embodiment of the microphone system the first echo cancellation unit comprises an adaptive filter.
In particular, the first echo cancellation unit may be formed by or may consist of an adaptive filter. In case more than one echo cancellation units are included in the microphone system several or all echo cancellation units may comprise an adaptive filter.
According to another embodiment of the microphone system the adaptive filter is adapted to receive a reference signal.
In particular, the reference signal may be an output signal of a loudspeaker which may be the cause of background sounds and thus of the echo to be cancelled.
According to another embodiment the microphone system further comprises a compensation unit, wherein the compensation unit is adapted to generate a compensated monopole response, and wherein the combination unit is adapted to combine the compensated monopole response and the first echo cancelled integrated dipole response.
In particular, the compensation unit may be a compensation filter, e.g. a recursive compensation filter. The recursive filter may be formed by:
wherein j denotes the imaginary unit, CN(α1,γ) represents the compensation filter, α1 represents the weighting factor of the monopole response, θ is given by θ=2πf/fs wherein fs is the sampling frequency, γ is the leakage factor of a N'th order leaky integrator, and γ2 is given by:
The compensation filter may be a linear combination of two compensation filters. In particular, the two compensation filters may be a so called Turin integrator and a so called Simpson integrator and/or the compensation filter may be a so called Al-Alaoui integrator.
According to another embodiment of the microphone system the compensation unit is further adapted to generate the compensated monopole signal in such a way that at low frequencies a flat output signal is achievable for the angle where the superdirectional response has its maximum/main-lobe.
In particular, the compensation unit may be defined in such a way that for lower frequencies, e.g. between 10 Hz and 1000 Hz or between 100 Hz and 1000 Hz, a unity-response or a constant response is obtained.
According to another embodiment the microphone system further comprises a noise suppression unit, wherein the noise suppression unit is adapted to continuously estimate a noise-floor based on the monopole response and the dipole response.
In particular, the estimation may depend on the monopole response and two dipole responses, e.g. the first and second echo cancelled dipole responses and the echo cancelled monopole response. This estimated nose-floor may be used for noise suppression. The estimation of the noise-floor may in particular depend on an angle φ corresponding to the direction of a maximum response, i.e. on the orientation of the dipole, and of a weighting factor α1 characterizing a weighting of the monopole response, e.g. with respect to the dipole response in the combination.
Summarizing, a gist of an aspect of the invention may be seen in providing a steerable microphone system which may exhibit a high performance, in particular in the lower-frequencies range, while still having low complexity. The microphone system may comprise a small microphone array including at least two microphone units, but preferably more than two microphone units to enable a steerable microphone system, each generating a primary signal. From the primary signals a monopole response and at least one dipole response may be generated, preferably exactly two dipole responses are generated. The dipole response or the dipole responses may be integrated by using an integrator. The integrated dipole response(s) may then be echo cancelled and the echo cancelled integrated dipole response(s) may be added to the monopole response, which optionally is also echo cancelled. The monopole response may also be a processed by a compensation filter before adding it to the echo cancelled dipole responses. The compensation filter may be adapted in such a way that a decreasing of the integrated dipole responses at lower frequencies is compensated by an increasing of the compensated monopole signal at lower frequencies so that a flat response may be enabled for the whole range of frequencies of interest, e.g. the range of human hearing. A microphone system according to an aspect of the invention may be applied in car-radio chips of Car Entertainment Systems, for example and may be also beneficial for MEMS microphone technology.
The aspects and embodiments defined above and further aspects of the invention are apparent from the examples of embodiment to be described hereinafter and are explained with reference to these examples of embodiment. It should be noted that features described in connection with a specific embodiment or aspect may be combined with another embodiment or another aspect.
The invention will be described in more detail hereinafter with reference to examples of embodiment but to which the invention is not limited.
The illustration in the drawing is schematic. In different drawings, similar or identical elements are provided with similar or identical reference signs. In connection with
The normalized superdirectional microphone-response (with a maximum response on φ radians) may be formulated as:
Ēs,ideal(φ,φ,α1)=α1+(1−α1)·cos(φ−φ) (1)
where the first-order characteristic is determined by α1.
This ideal response may be approximated by:
Ēs(φ,φ,α1)=α1·Ēm(φ)+(1−α1)·Ēd(φ,φ), (2)
where Ēd(φ,φ) is the normalized dipole-response oriented with its maximum to φ and Ēm(φ) is the normalized monopole response.
The normalized (frequency-independent) dipole-response may be computed as:
and where (for small values of Ω, where the distance d is smaller than the wavelength of the sound):
with φ the angle of incidence of sound, Ei the signal picked up by each of the microphone units Mi, i.e. a primary signal, S the sensitivity of each of the microphones and Ω given by:
with ω the frequency (in radians), d the distance between the microphones and c the speed of sound.
Furthermore Iideal is an ideal integrator, which can be approximated in discrete-time, defined as:
and Υ is an extra compensation term defined as:
The normalized monopole-response Ēm(φ) may be computed as:
The overline indicates a normalized response with a maximum response S (equal to the response of a single sensor or microphone unit).
The integrator is required to remove the jω-dependency in the dipole response.
The method described above may be the simplest way to construct a steerable first-order microphone (via parameter φ) with a variable characteristic (via parameter α1). Although methods like delay-and-subtract, Linear Constrained Minimum Variance (LCMV) and Generalized Sidelobe Canceller (GSC) may also be modified to obtain steerable capabilities, they may require (FIR) filters that need to be recomputed for different values of φ and α1, which is computationally unattractive.
The same method of combined monopole/dipole can be applied for a microphone system 200 comprising eight microphones (also in a square geometry) as shown in
For the microphone system having a microphone array of eight microphone units the normalized dipole-response can be computed as:
The normalized monopole-response Ēm(φ) is computed as:
The main benefit of using 8 microphones (over 4 microphones) may be that the signal-to-noise ratio (SNR) of the resulting superdirectional microphone may be improved by 3 dB.
The monopole response 306 is inputted into a first amplifier 309 the output of which is connected to a first adder 310. A second input of the first adder 310 is an output of a first adaptive filter 311 forming a first echo cancellation unit. An input for the first adaptive filter 311 is formed by a signal x which is the sound outputted by a loudspeaker 312 which sound is the cause of an echo. Furthermore, an output 313 of the first adder 310 forms a feed back for the first adaptive filter 311, i.e. is used to control the first adaptive filter. The output 313, which forms an echo cancelled monopole response, is further inputted into a compensation unit or compensation filter 314 the output of which is inputted into a second amplifier 315. The second amplifier uses a value α1 as a weighting factor of the compensated echo cancelled monopole response which then in turn is inputted into a combination unit 316, e.g. a second adder.
The first 307 of the two dipole responses is inputted into a first integrator unit or integrator 317 to form a first normalized integrated dipole response 318 which is inputted into a third adder 319. A second input of the third adder 319 is an output of a second adaptive filter 320 forming a second echo cancellation unit. An input for the second adaptive filter 320 is formed by the signal x. Furthermore, an output 321 of the third adder 319 forms a feed back for the second adaptive filter 320, i.e. is used to control the second adaptive filter. The output 321, which forms a first echo cancelled integrated dipole response, is further inputted into a third amplifier 322 for obtaining a weighted version of the first echo cancelled integrated dipole response, to provide a first one of two orthogonal dipole responses which is then inputted into a fourth adder 323 to obtain a rotated dipole response with the main-lobe directed to angle φ. The weight of the third amplifier is indicated by the
in
The second 308 of the two dipole responses is inputted into a second integrator unit or integrator 324 to form a second normalized integrated dipole response 325 which is inputted into a fifth adder 326. A second input of the fifth adder 326 is an output of a third adaptive filter 327 forming a third echo cancellation unit. An input for the third adaptive filter 327 is formed by the signal x. Furthermore, an output 328 of the fifth adder 326 forms a feed back for the third adaptive filter 327, i.e. is used to control the third adaptive filter. The output 328, which forms a second echo cancelled integrated dipole response, is further inputted into a fourth amplifier 329 for obtaining a weighted version of the second echo cancelled integrated dipole response, to provide a second one of two orthogonal dipole responses which is then inputted into the fourth adder 323 to obtain a rotated dipole response with the main-lobe directed to angle φ. The weight of the fourth amplifier is indicated by the
in
An output 330 of the fourth adder 323 is then inputted into a fifth amplifier 331 which uses a weighting factor of 1−α1 to generate a normalized echo cancelled integrated dipole response 332 which is then inputted in the combination unit 316. The combination unit 316 adds the two signal inputted to provide a superdirectional output signal Ēs.
Summarizing,
As the echo cancellation is performed on the normalized and integrated versions of the orthogonal dipoles, this solution may also overcome the problem that the independent misadjustments/adaptation-errors in the echo-reduction for lower frequencies is degraded by the integrator.
The embodiment in
As can be seen, the embodiment of
An even further embodiment may be to apply also stationary-noise reduction techniques. When placing a stationary noise suppressor NS at the output Ēs of the system, the most straightforward way to estimate the stationary noise-floor may be by using also the output Ēs. However, a new noise-floor may have to be tracked in this way, which can take up to a few seconds, every time the angle φ and/or the characteristic (via parameter α1) is changed. To prevent these re-adaptations, it may be possible to estimate the stationary noise-floors on the monopole and the two orthogonal dipoles continuously and to construct a combined noise-floor depending on the parameters φ and α1. This constructed noise-floor may then be used for the noise suppression. A respective embodiment is shown in
The second embodiment of a microphone system 400 shown in
Finally, it should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be capable of designing many alternative embodiments without departing from the scope of the invention as defined by the appended claims. In the claims, any reference signs placed in parentheses shall not be construed as limiting the claims. The word “comprising” and “comprises”, and the like, does not exclude the presence of elements or steps other than those listed in any claim or the specification as a whole. The singular reference of an element does not exclude the plural reference of such elements and vice-versa. In a device claim enumerating several means, several of these means may be embodied by one and the same item of software or hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
Number | Date | Country | Kind |
---|---|---|---|
08105590 | Oct 2008 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2009/054343 | 10/5/2009 | WO | 00 | 4/13/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/043998 | 4/22/2010 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5463694 | Bradley et al. | Oct 1995 | A |
6041127 | Elko | Mar 2000 | A |
6546099 | Janse | Apr 2003 | B2 |
7035415 | Belt et al. | Apr 2006 | B2 |
20060280318 | Warren et al. | Dec 2006 | A1 |
20080170716 | Zhang | Jul 2008 | A1 |
20090190769 | Wang et al. | Jul 2009 | A1 |
Number | Date | Country |
---|---|---|
0 869 697 | Oct 1998 | EP |
1 633 121 | Mar 2006 | EP |
2005076663 | Aug 2005 | WO |
2010044002 | Apr 2010 | WO |
Entry |
---|
Martin, Rainer, Stefan Gustafsson, and Mario Moser. “Acoustic Echo Cancellation For Microphone Arrays Using Switched Coefficient Vectors.” IWAENC (1997): 85-88. Web. |
Abed, A., et al. “Digital Integrators Using Optimal FIR Compensators”, IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. ASSP-31, No. 3, pp. 726-729 (Jun. 1983). |
Martin, R., et al. “A Symmetric Two Microphone Speech Enhancement System—Theoretical Limits and Application in a Car Environment”, 5th IEEE Signal Processing Workshop, 2 pgs. (1992). |
Chu, P. “Desktop Mic Array for Teleconferencing”, Proc. IEEE 1995 Int'l. Conf. on Acoustics, Speech, & Signal Processing, vol. 5, pp. 2999-3002 (May 9, 1995). |
Brandstein, M. et al. “Microphone Arrays”, pp. 272-275 (2001). |
Elko, G. “Differential Microphone Arrays”, Ch. 2 of Audio Signal Processing for Next-Generation Multimedia Communication Systems, Huang & Benesty eds., pp. 11-65 (2004). |
International Search Report and Written Opinion for Int'l. Patent Application No. PCT/IB2009/054343 (Mar. 3, 2010). |
Number | Date | Country | |
---|---|---|---|
20110206212 A1 | Aug 2011 | US |