The invention relates generally to microphone arrays, and more particularly to a method for correcting the beam pattern and beamwidth of a microphone array embedded in an obstacle whose shape is not axi-symmetric.
Sensor arrays are known in the art for spatially sampling wave fronts at a given frequency. The most obvious application is a microphone array embedded in a telephone set, to provide conference call functionality. In order to avoid spatial sampling aliasing, the distance, d, between sensors must be lower than λ/2 where λ is the wavelength.
Many publications are available on the subject of sensor arrays, including:
For free-field linear, circular, or non-linear arrays, Ishimaru [1] discusses the issues of constant inter sensor spacing and non-constant inter-sensor spacing.
Meyer [2] discloses arrays embedded in a diffracting obstacle of simple shape, and provides an analytical solution for the wave equation in acoustics. For arrays of simple shape like circular rings embedded in a more complex shape, for which there is no analytical solution of the wave equation, Anciant [3] and Ryan [4] make use of numerical methods, such as Boundary Element methods (BEM) or Finite or Infinite Elements methods (FEM, IFEM).
Most of the literature describes broadband frequency invariant beamforming for circular arrays or linear arrays, but not for microphone arrays in shapes that are not symmetric or axi-symmetric. One example of such an obstacle whose shape is dictated by industrial design constraints resulting in an odd shape, is a telephone incorporating a microphone array. The problem of beamforming with such an array is quite different from that dealt with in the literature since the solution relies on constrained optimisation, with a constraint build using a set of vectors containing the sensor signal for acoustic waves with specific directions of arrival.
In that regard, the following prior art is relevant:
P. Kootssokos [5] proposes a technique intended for rejecting a far-field broadband signal from a given known direction by imposing pattern nulls on broadband array responses. The method consists of generating deep and wide “null” or quiescent areas in given directions. This is achieved by imposing a set of linear constraints.
Henry Cox [6] proposes robust adaptive beamforming by the use of different sets of constraints. The constraints, quadratic and linear, are used to make the beamformer more robust to small errors of sensor amplitude, phase or position.
Feng Qian [7] proposes a quadratically constrained adaptive beamforming technique, but deals only with coherent interfering signals.
In Zhi Tian, K Bell, H. L. Van Trees [8], LCMP beamforming is set forth under quadratic constraints to provide an adaptive beamformer, but is concerned only with the stability of convergence.
Although a number of the methods discussed in the above-referenced prior art use specific vectors to shape the beam they, do not deal with the consequences of non-linear or non axi-symmetric arrays on the beampatterns and the resultant possible loss of “look” direction.
The following prior art relates more specifically to beamforming with constant broadband frequency invariant beamwidth, but not in relation to non axi-symmetric or non-linear arrays:
Frost [9] sets forth an adaptive array with M sensors to produce M constraints on the beam pattern of the array at a single frequency. The author proposes an algorithm for linearly constrained adaptive array processing. A set of linear constraints is introduced to provide an adaptive process in order to build a super directive array. Although this method can produce a constant beam pattern or null in given directions at various frequencies it is not designed to produce an identical beam pattern over a continuous frequency band and for various azimuth angle when the array is “asymmetric”.
Lardies [10] proposes an acoustic multiple ring array with constant beamwidth over a very wide frequency range. To determine the unknown filter function, a linear constraint is imposed at an angle θH corresponding to the half-power beam angle. This procedure is intended to generate a constant beam over a band of frequencies, but is limited to symmetrical free-field arrays.
Berger and Silverman [11] disclose another approach consisting of designing the broadband sensor array by determining sensor gains and inter-sensor spacing as a multidimensional optimisation problem. This method does not use frequency dependant array sensor gains but attempts to find optimal spacing and fixed gains by minimising the array power spectral density over a given frequency band
Pirz [12] uses harmonic nesting, in which the array is composed of several sets of sub-arrays with different inter-sensor spacings adapted for different frequency ranges. It should be noted that lowering the inter-sensor spacing under λ/2 only provides redundant information and directly conflicts with the desire to have as much aperture as possible for a fixed number of sensors.
Ishimaru [1] uses the asymptotic theory of unequally spaced arrays to derive relationships between beam pattern properties (peal response, main lobe width, . . . ) and array design. These relationships are then used to translate beam pattern requirements into functional requirements on the sensor spacing and weighting, thereby deriving a constant broadband design.
The prior art culminates with Ward [13] who finds a more general solution for providing the best possible broadband frequency invariant beam pattern. Ward considers a broadband array with constant beam pattern in the far field. Again, the asymptotic theory of unequally spaced arrays is used to derive relationships between beam pattern properties such as main lobe width, peak response, and array design. These relationships are expressed versus sensor spacing and weightings and Ward uses an ideal continuous sensor that is then “discretised” in an optimal array of point sensors, giving constant broadband beamwidth.
The following prior art relates to arrays embedded in obstacles:
The benefit of an obstacle for a microphone array in terms of directivity and localisation of the source or multiple sources is discussed in Marc Anciant [4]. Anciant describes the “shadow” area induced by an obstacle for a 3D-microphone array around a mock-up of the Ariane IV launcher in detecting and characterising the engine noise sources at takeoff.
Meyer [2] uses the concept of phase mode to generate a desired beam pattern from a circular array embedded in a rigid sphere, taking advantage of the analytical expression of the pressure diffracted by such an obstacle. He describes the benefit of the obstacle in term of broadband performance and noise susceptibility improvement
Elko [14] uses a small sphere with microphone dipoles in order to increase wave-travelling time from one microphone to another and thus achieve better performance in terms of directivity. A sphere is used since it allows for analytical expressions of the pressure field generated by the source and diffracted by the obstacle. The computation of the pressure at various points on the sphere allows the computation of each microphone signal weight.
Jim Ryan et al [4] extend this idea to circular microphone arrays embedded in obstacles with more complex shapes using a super-directive approach and a boundary element method to compute the pressure field diffracted by the obstacle. Emphasis is placed on the low frequency end, to achieve strong directivity with a small obstacle and a specific impedance treatment for allowing air-coupled surface waves to occur. This treatment results in increasing the wave travel time from one microphone to another thereby increasing the “apparent” size of the obstacle for better directivity in the low frequency end. Ryan et al. have shown that using an obstacle improves directivity in the low frequency domain, compared to the same array in free field.
Skolnik [15] is noteworthy for teaching that error occurs when the position of the array sensors are subject to variation, and by extension that this error can be applied to non-uniform arrays.
Except for Anciant and Ryan, none of the techniques described in the prior art can be used when the sensor array is embedded in an obstacle with an odd shape, in the presence of a rigid plane for example, either with or without an acoustic impedance condition on its surface. Numerical methods are required. As they do not give an analytical expression of the pressure field at the sensor vs. frequency, the techniques proposed by most of the above-referenced authors (except Anciant and Ryan) can not be used. None of the prior art deals with or describes variation of the beam pattern in such conditions. It should be noted that Anciant and Ryan deal with circular arrays only, and do not deal with constant beamwidth or any other problem linked to frequency variation and array geometry properties.
According to the present invention, a method is provided for designing a broad band constant directivity beamformer for a non-linear and non-axi-symmetric sensor array embedded in an obstacle having an odd shape (such as a telephone set) where the shape is imposed, for example, by industrial design constraints. In particular, the method of the present invention corrects beam pattern asymmetry and keeps the main lobe reasonably constant over a range of frequencies and for different look direction angles. The invention prevents the loss of “look direction” resulting from a strong beampattern asymmetry for certain applications. The invention is particularly useful for microphone arrays but can be extended to other types of sensors. In fact, the method of the present invention may be applied to any shape of body that can be modelled with FEM/BEM and that is physically realisable.
First, a numerical method such as Boundary Element Method (BEM), Finite or Infinite Elements Method (FEM or IFEM) is applied to the body taking into account a rigid plane and, in one embodiment, acoustic impedance conditions on the surface of the body. Sensors of the array are positioned at selected nodes of the boundary element mesh. A set of potential sources to be detected is defined and modelled as monopoles, and the acoustic pressure (phase and magnitude) is determined at every sensor for each source. It should be noted that the use of acoustic monopoles is not restrictive. Plane Wave or any other source that can be modelled using Numerical Methods can be used (source in an obstacle to reproduce the mouth/head, radiating structure, etc.).
The second step involves defining a noise field, and the associated noise correlation matrix (denoted Rnn) at the sensors. A set of noise sources is defined and the response to each of them at each sensor is also calculated. According to the prior art this is usually a spherical noise diffuse field (e.g. a cylindrical diffuse field is quoted by Bitzer and Simmer in [18]). In this case the noise field consists of a set of un-correlated plane waves. By way of contrast, according to the present invention any variation of noise field may be used, from a diffuse field to one that only originates in a particular sector.
Depending on the size of the array relative to the acoustic wavelength and the number of microphones, the noise cross-correlation matrix (Rnn) can be ill conditioned at the low frequency end. In this case, the prior art proposes making the matrix invertible by a known regularisation technique, generally by adding a small positive number σ2 on the diagonal. Physically, this is the equivalent of adding a white noise field or a quadratic constraint controlling the amplitude of the beamforming optimal weight wopt to the optimisation problem. By increasing σ2 the main lobe beamwidth can be widened. The noise cross-correlation matrix is normalised so that in the limit, as σ2 tends to infinity, Rnn tends to I (i.e. the classical delay and sum method).
According to prior art methods; the next step defines a vector in the look direction at angle θ of interest (dθ). As the method presented herein relates to fixed beamforming, sectors are defined all around the array for detection of potential sources. The beamforming algorithm has fixed weights for each of these sectors and is coupled with a beamsteering algorithm tracking the sector where the source is positioned. According to the present invention, for each sector, with the look direction θ, a set of vectors is defined as follows:
A set of linear or quadratic constraints built with the set of vectors defined in each sector, is then introduced in the optimisation process to obtain the optimal weighting vector wopt for correction of the beamwidth and beampattern asymmetry. The number of linearly independent constraints imposed can be as many as there are sensors.
The method provides a solution to implement a fixed beamformer with a microphone array embedded in a complex obstacle, such as a telephone set for example. The correction of the beampatterns and the loss of look direction are important for the best efficiency possible in terms of noise filtering and source enhancing. Correction of the look direction is important if the beamsteering algorithm is based upon the beamforming weighting coefficients, which is the case here. It allows a more accurate detection.
Embodiments of the present invention will now be described more fully with reference to the accompanying drawings, in which:
The following table contains the different notations used in this specification, from which it will be noted that the frequency dependency for matrices, vectors and scalars, has for the most part been omitted to simplify the notations. Any other specific notations not appearing in Table 1 are defined in the specification.
The impedance condition (i.e. local surface treatment), the distance between sensors (or microphones) and the shape of the obstacle are all variable.
Let dρ,θ,ψ(ω) be the signal vector at the M sensors for a source at position (ρ,θ,ψ) in spherical co-ordinates. Although a point source is assumed in the near field, the method of the present invention can be extended to far-field sources, typically plane waves (wave vector k). Let n be a noise vector due to the environment, where n is not correlated to the signal d, and where n and d are both dependant upon the frequency ω. Let Rnn(ω) be the normalised noise correlation matrix, depending on the nature of the noise field. For an omni-directional noise field (spherical), cylindrical or any other “exotic” field adapted to a specific situation, Rnn(ω) can be calculated using a set of non correlated incident plane waves around the sensor array.
Designing a beamformer consists of finding a weighting vector wopt (complex containing amplitude and phase information), such as the Hermitian product woptHd, for enhancing the signal of the source in the desired direction (i.e. look direction) while attenuating the noise contribution. According to the superdirective method, this is done by minimising the noise power while looking in the direction of the source, or equivalently, maximising the Signal to Noise ratio under a linear constraint.
Design of the Beamformer
A fixed beamforming algorithm is set forth below, although the inventive method may be extended to adaptive beamforming under constraint (e.g. such as in Frost [9]).
The diffuse noise field (3D cylindrical or spherical) is assumed to be modelled by a set of L non-correlated plane waves resulting in L noise vectors nN, N={1, . . . , L}. It is assumed that the vector of look direction d or dθ is not correlated with the vectors of non-look direction nN.
The noise vectors can be computed analytically for a free-field sensor array, a sensor array embedded in a sphere or an infinite cylinder. Since the determination of n requires computation of the noise acoustic pressure at the M sensors, if a sensor array is embedded in any other shape of obstacle, Infinite Element (IFEM) or Boundary Element (BEM) methods must be used.
As an illustration of the applications set forth herein, the noise field is a set of non-correlated plane waves emanating from all directions and Rnn defined in the following way:
In the low frequency end, the matrix Rnn is generally ill conditioned due to size of the array relative to the acoustic wavelength. For an inversion, Rnn must be regularised taking into account the fluctuations of each microphone (white noise). Some authors have introduced amplitude and phase variations to account for microphone errors (e.g. Ryan [4]). The regularisation is equivalent to a quadratic constraint on the weighting vector w amplitude that can tend to infinity when the matrix is ill conditioned. Rnn can be regularised as:
Rnn=Rnn+σ2I (2)
where σ2 is a small number. This regularisation is made at the expense of the directivity.
The signal vector d(ω) contains the signal induced by the acoustic source to be detected, at the M sensors at frequency ω. It depends on the nature of the source (i.e. far field acoustic plane wave, near field, acoustic monopole, or any other type that can be modelled by numerical simulation).
Designing the beamformer requires finding a set of optimal coefficients, wi at each frequency ω such that weighting the signal di at each microphone “orients” the beam towards the source.
According to the superdirective approach, the weighting vector w is the solution of the following optimisation problem:
where the explicit dependence on the frequency ω for each vector and matrix is omitted to simplify the notation. In short, the superdirective approach minimises the noise energy while looking in the direction of the source. Minimising the following functional
gives the optimal weight vector wopt(ω).
This functional is quadratic since the matrix Rnn is Hermitian and positive (defined due to its link to signal energies). A pure diagonal Rnn (=I) makes the superdirective method equivalent to the classical Delay & Sum method (white noise gain array).
Under this condition, a null of gradient of J is a necessary and sufficient condition to generate a unique minimum.
Differentiating J following w, yields:
and the optimal weight vector is:
wopt=λRnn−1d (6)
The Lagrange coefficient λ realising the constraint in equation (3) is such that:
woptHd=1 i.e. λdHRnn−Hd=1 (7)
as Rnn is a Hermitian matrix, Rnn−1 is an Hermitian matrix and Rnn−H=Rnn−1. Thus
and the solution is:
The directivity is highly dependent on frequency for simple geometries such as circular arrays or linear arrays in free field or in simple solid geometry such as a sphere.
An application of the beamforming technique set forth above to a circular microphone array over a plane is shown with reference to
For the array of
Non Axi-symmetric Sensor Arrays
When the array is no longer circular, the beam varies with the azimuth angle of the source at each frequency. Consider the elliptical array illustrated in
When the sensor array is embedded in an obstacle, the results can be worse, due to diffraction of acoustics waves and the geometry of the obstacle rendering the implementation of beamforming and beamsteering critical. It is an object of the present invention to provide a method that overcomes these problems.
Since the fixed beamformer has frozen coefficients wopt, their determination is predictive by nature and any method of determination may be used, provided that the vector wopt has the best possible components for a given signal angle of arrival Θ. To correct the beamwidth and even the symmetry of the main lobe pattern, the minimisation of eq.(3) is realised under constraint. Let d(ρ,Θ,ψ) be the sensor signal vector for a source at position (ρ,Θ,ψ), and d the signal vector of the source to be detected.
woptHd=1 (10)
The Hermitian product woptHdρ,Θ,ψ describes the 3D beampattern of the microphone array for a source moving in 3D space at a radius ρ from the centre of the array and 0≦Θ<2π,
For the example of
Correction of the Beam Pattern
Now let dθ=d be the sensor signal vector at the M microphones for a look direction θ.
In order to modify the beampattern the following vectors are introduced: dθ+θ
The choice of the angles θ1 and their number depends on the beamwidth or the main lobe beampattern asymmetry after unconstrained minimisation, and the required beamwidth or lobe symmetry.
Firstly, it will be noted that for M microphones, a set of M linearly independent constraints can be considered. Secondly, the constrained minimisation process for shaping the beam gives a sub-optimal solution wopt generally at the expense of increased amplitude in the secondary lobes or an increase in beam width.
Beamwidth Correction for a Symmetric Beampattern
The problem of finding the optimal weighting vector wopt for a look direction θ becomes:
and subject to additional constraints using a pair of symmetric vectors dθ+0
where C is a rectangular matrix defined by:
C=[d|dθ+θ
and g is a vector defined by:
The constraint in (14) synthesises the constraints defined in (11), (12) and (13).
The optimal weight vector wopt under these conditions is given by:
wopt=Rnn−1C[CHRnnC]−1g (17)
(ii) or a set of quadratic constraints. In this case dθ+θ
Dθ
and the quadratic constraints are defined in the following way:
wHDθ
where βi is a set of values required for wHDθ
where the Lagrange coefficients λ, λi are dependant on frequency ω.
As discussed above, it is known from the prior art to correct beampattern main lobe beamwidth with a set of “symmetric” vectors [6].
Asymmetry and Beamwidth Correction for a Non Symmetric Beampattern
Since the look direction θ generates a non-symmetric beam after minimisation of the unconstrained superdirective method functional J(w,λ), then the method of the present invention can be applied to modify its beamwidth and correct its asymmetrical aspect. This last operation is particularly useful since very often the beam does not point towards the required look direction even if the maximum wHoptdθ=1 is reached for the correct look direction θ. The strong asymmetric array makes the beam globally “look” in a different direction. This deviation from the look direction depends on the frequency, the geometry of the array and the look direction angle.
According to one aspect of the present invention, this asymmetry is corrected by choosing a convenient set of vectors dθ±θ
In this situation, at least one pair of symmetrical vectors is chosen to adjust the beam width:
wHd0+θ
wHdθ−θ
with either at least a single vector dθ+θ
wHdθ±θ
wH(dθ+θ
These constraints are defined broadband.
A quadratic set of constraints can also be applied. The cross-correlation matrices associated with these vector choices are:
Dθ
for the single vectors,
Dθ
for the pair of symmetric (θj=θi) or asymmetric (θj≠θi) vectors. The optimisation process for determining wopt, consists of minimising a cost function similar to (20).
This key aspect of the present invention allows, among other things, implementation of a non axi-symmetric microphone array in a non axi-symmetric shape, with reasonably symmetric beam shapes. The implementation consists of defining several sectors around the array, and sets of symmetric, asymmetric pairs of vectors or single vectors to correct the beamwidth and the beam lobe asymmetry. The inventive beamforming approach is coupled with a beam-steering algorithm that can be based on the optimal weighting coefficients computed for each sector, in a reduced frequency band.
An illustration of some of the fixed beamforming sectors with associated choice of correction vectors for an elliptic array is shown in
Application: Optimal Beamforming of a Microphone Array Embedded in an Obstacle.
As discussed above, an important application of the present invention is in designing microphone arrays embedded in obstacles having “odd” shapes (non axi-symmetric) and dealing with induced problems such as: beampattern beamwidth variation vs. the look direction angle, loss of look direction, etc. The present method allows for the successful implementation of a microphone array in a telephone set for conferencing purposes or increased efficiency for speech recognition.
The left hand side of
After application of the method according to the present invention, the results on the right hand side of
Modifications and variations of the invention are possible. The method is illustrated for the detection of one source, in a conference context for example, and is more oriented towards fixed beamforming approaches rather than adaptive ones. However, the principles of the invention may be extended to adaptive approaches: n which case the array geometry demands a correction of the beam pattern for each sector, and the storage of the correction vectors dθ+θ
Number | Date | Country | Kind |
---|---|---|---|
0229059.1 | Dec 2002 | GB | national |
Number | Name | Date | Kind |
---|---|---|---|
5592441 | Kuhn | Jan 1997 | A |
6041127 | Elko | Mar 2000 | A |
7068801 | Stinson et al. | Jun 2006 | B1 |
20030072460 | Gonopolskiy et al. | Apr 2003 | A1 |
20030147539 | Elko et al. | Aug 2003 | A1 |
Number | Date | Country |
---|---|---|
2292357 | Dec 1999 | CA |
WO 02058432 | Jul 2002 | WO |
Number | Date | Country | |
---|---|---|---|
20040120532 A1 | Jun 2004 | US |