The present disclosure relates to the estimation of the direction of arrival of a source signal using a microphone array. Applications may include Adaptive Signal Processing schemes for Hearing Aids, car kits, mobile communications, voice controlled devices, and the like.
There are various estimators known for estimating the Direction Of Arrival (“DOA”) of a given source. With respect to a fixed system of coordinates, the DOA is defined as the set of angles that define the location of the source, up to the intrinsic symmetry of the array. Thus, for the 3-microphone array shown in
These and other drawbacks and disadvantages of the prior art are addressed by an apparatus and method for estimating the direction of arrival of a source signal using a microphone array.
An apparatus for estimating the direction of arrival (“DOA”) of a source signal includes microphones for transducing the source signal, amplifiers in signal communication with a corresponding one of the microphones, analog-to-digital converters in signal communication with a corresponding one of the amplifiers, and a digital processor in signal communication with the analog-to-digital converters for estimating the direction of arrival of the source signal.
A corresponding method for estimating the DOA of the source signal at one of several microphones includes estimating the measured signal spectral covariance matrix (“Rx”), and estimating the angle of the DOA responsive to the measured signal spectral covariance matrix. These and other aspects, features and advantages of the present disclosure will become apparent from the following description of exemplary embodiments, which is to be read in connection with the accompanying drawings.
These and other aspects, features and advantages of the present disclosure will become apparent from the following description of exemplary embodiments, which is to be read in connection with the accompanying drawings.
The present disclosure teaches an apparatus and method for estimating the direction of arrival of a source signal using a microphone array in accordance with the following exemplary figures, in which:
The present disclosure presents an array signal processing apparatus and method for estimating the direction of arrival (“DOA”) of a source signal in a noisy environment using a multi-sensor array. Embodiments of this disclosure may be used as a component block in general adaptive algorithms or schemes.
Possible applications include: Adaptive Signal Processing schemes for Hearing Aids, car kits, mobile communication, voice controlled devices, and the like. The present disclosure provides a method and apparatus to estimate the angle theta defining the DOA of a source. The presently disclosed approach makes novel use of statistical assumptions and optimization.
As shown in
The mixing model is assumed as:
x1(t)=s(t)+n1(t) (1)
. . . (2)
xD(t)=s(t−(D−1)δτ)+nD(t) (3)
where D is the number of microphones, δ=cos (θ) is a direct measure of the DOA, τ=dfx/c with d distance between adjacent microphones, fx is the sampling frequency, and c is the sound speed; x1(t), . . . , xD(t) are the signals as measured by the microphones, s(t) are the source signals, and n1(t), . . . , nD(t) are the noise signals.
The distance d between microphones is assumed to be very small (smaller than c/fx, so that τ≦=1 and therefore the noise signals are not independent. The following assumption is made regarding their statistics:
Assumption 1: The joint random variables (N 1 (w), . . . , N D (w)), which are the Fourier transforms of ( n 1 . . . , n D), are Gaussian distributed with zero mean and covariance matrix:
It is also assumed that:
Assumption 2: The source signal Fourier transform S (w) is Gaussian distributed with zero mean and spectral power Rs(w).
The presently disclosed method optimally estimates (in the sense of maximum likelihood) the parameters θ, Rx and ρn. The block diagram of this solution is depicted in
Turning to
Turning now to
Similarly, the second microphone 304 is coupled in signal communication to a second amplifier 324, which, in turn, is coupled in signal communication to a second analog-to-digital converter 334. The output of the analog-to-digital converter 334 is also coupled in signal communication to the digital processor 340. Likewise, the third microphone 306 is coupled in signal communication to a third amplifier 326, which, in turn, is coupled in signal communication to a third analog-to-digital converter 336. The output of the analog-to-digital converter 336 is also coupled in signal communication to the digital processor 340.
Thus, a physical embodiment of the scheme is presented in
Components of the DOA Estimator are as follows: The DOA estimator comprises two blocks: the first block 210 of
Referring to
where w(0), . . . , w(M−1) is the window, and w is the frequency.
where α is a learning rate.
Referring again to
repeate the following steps:
In operation, the exemplary embodiment of
The computations presented in the previous section may be implemented very efficiently so that the total computational load is fairly small. The following features are also pertinent:
Exemplary values for the parameters are summarized as follows:
It shall be understood that the above-mentioned values are merely exemplary. Thus, those of ordinary skill in the pertinent art may contemplate various other values without diminishing the generality of the algorithm or of the apparatus.
In this section we present the rationale for the expressions (7–17). The likelihood of the parameters θ, Rs(·), ρn(w) given the model (1–3) is
where we have omitted the explicit dependency on w of Rsρn, M, v, Rx for the sake of clarity, and
({circumflex over (θ)}, {circumflex over (R)}S(·), {circumflex over (ρ)}n(·))=argmaxθ,R
For sufficiently many samples, the approximate solution to the above estimator can be found by using the Covariance Matching Estimation Technique (“COMET”), together with the Extended Invariance Principle as known in the art. This amounts to solving at every frequency w the problem
argminz,R
where z=[1 z2 z3] is the unknown vector, constraine to be 1 on the first component. This problem is solved, under the additional positivity constraint Rszz*+ρnM≦Rx in 8–11. Once z has been obtained at every frequency, the solution for θ is obtained through the optimization
These and other features and advantages of the present disclosure may be readily ascertained by one of ordinary skill in the pertinent art based on the teachings herein. It is to be understood that the teachings of the present disclosure may be implemented in various forms of hardware, software, firmware, special purpose processors, or combinations thereof.
Most preferably, the teachings of the present disclosure are implemented as a combination of hardware and software. Moreover, the software is preferably implemented as an application program tangibly embodied on a program storage unit. The application program may be uploaded to, and executed by, a machine comprising any suitable architecture. Preferably, the machine is implemented on a computer platform having hardware such as one or more central processing units (“CPU”), a random access memory (“RAM”), and input/output (“I/O”) interfaces. The computer platform may also include an operating system and microinstruction code. The various processes and functions described herein may be either part of the microinstruction code or part of the application program, or any combination thereof, which may be executed by a CPU. In addition, various other peripheral units may be connected to the computer platform such as an additional data storage unit and a printing unit.
It is to be further understood that, because some of the constituent system components and methods depicted in the accompanying drawings are preferably implemented in software, the actual connections between the system components or the process function blocks may differ depending upon the manner in which the present disclosure is programmed. Given the teachings herein, one of ordinary skill in the pertinent art will be able to contemplate these and similar implementations or configurations of the present disclosure.
Although the illustrative embodiments have been described herein with reference to the accompanying drawings, it is to be understood that the present disclosure is not limited to those precise embodiments, and that various changes and modifications may be effected therein by one of ordinary skill in the pertinent art without departing from the scope or spirit of the present disclosure. All such changes and modifications are intended to be included within the scope of the present disclosure as set forth in the appended claims.
This application claims the benefit of U.S. Provisional Application Ser. No. 60/386,760, filed Jun. 5, 2002 and entitled “Apparatus and Method for Estimating the Direction Of Arrival of a Source Signal using a Microphone Array”, which is incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
5459668 | Dogan et al. | Oct 1995 | A |
5659520 | Watson et al. | Aug 1997 | A |
5694474 | Ngo et al. | Dec 1997 | A |
6069961 | Nakazawa | May 2000 | A |
6075484 | Daniel et al. | Jun 2000 | A |
6104345 | Tweg et al. | Aug 2000 | A |
6198693 | Marash | Mar 2001 | B1 |
6525993 | Wake et al. | Feb 2003 | B1 |
6526148 | Jourjine et al. | Feb 2003 | B1 |
6694028 | Matsuo | Feb 2004 | B1 |
6760599 | Uhlik | Jul 2004 | B1 |
20020097885 | Birchfield et al. | Jul 2002 | A1 |
20030033150 | Balan et al. | Feb 2003 | A1 |
Number | Date | Country | |
---|---|---|---|
20040013275 A1 | Jan 2004 | US |
Number | Date | Country | |
---|---|---|---|
60386760 | Jun 2002 | US |