This disclosure is generally directed to direction of arrival estimation and more particularly to identifying sharp nulls in space.
In signal processing, Direction of Arrival (DOA) denotes the direction from which a wave (usually a propagating wave) arrives at a point, where a set of sensors may be located. This set of sensors form what is called a sensor array. DOA estimation methods may rely on a sensor array, and many methods exist with variations in complexity and estimation accuracy.
One type of relatively simple method is based on beamforming. Beamforming may help in estimating the signal from a given direction. In such a method, a steerable beam is formed towards the angle of interest by applying a complex set of weights to each array element. The DOA of the signal can be discovered by steering the beam through all possible angles of interest, and the angle that has the maximum energy output is considered to be the DOA of the signal.
The accuracy of these methods depends on the width of the beam, which is determined by factors such as the number of array elements and the physical size of the entire array. Narrower beam width can be achieved by increasing array elements and/or enlarging the array size. Additionally, the beam width is inversely proportional to the working frequency of the array, i.e., at lower frequencies the beam width is wider and hence poorer estimation accuracy. Such inconsistent performance over frequencies becomes a problem when the signal of interest is broadband.
One or more embodiments provide a device for identifying a direction of a sound. The device includes a controller comprising circuitry. The circuitry is configured to receive a first output from a first input device and a second output from a second input device. The circuitry is also configured to add a delay to the second output. The circuitry is also configured to compare the first output to the delayed second output in a plurality of directions to form a comparison. The circuitry is also configured to identify a number of null directions of the plurality of directions where a set of nulls exists based on the comparison.
One or more embodiments provide a method for identifying a direction of a sound. The method includes receiving a first output from a first input device and a second output from a second input device. The method also includes adding a delay to the second output. The method also includes comparing the first output to the delayed second output in a plurality of directions to form a comparison. The method also includes identifying a number of null directions of the plurality of directions where a set of nulls exists based on the comparison.
In an example embodiment, the distance d 106 is chosen to be less than the wavelength of the highest frequency of interest or threshold frequency.
In an embodiment, the source is in the far field. Due to the distance, the signals received by the two microphones have substantially the same incoming angle. The difference in the traveling path of the two microphone signals is d cos θ0 as can be seen from
The path difference introduces mostly time delay in the microphone signals, while, in an example, the amplitude difference can be ignored when a far-field model is assumed. In one embodiment, the amount of the delay is:
where v is the speed of the sound.
In an embodiment, system 200 is operable to apply, using delay unit 206, the same or substantially the same amount of delay of microphone 204 electrically to the signal from microphone 202. In other words, delay unit 206 anticipates the delay in microphone 204 and adds the anticipated delay to the output signal from microphone 202.
Subtractor unit 208 is operable to subtract the output signal from microphone 204 from the output signal of the modified output signal of microphone 202. In effect, any sounds with a DOA of θ are removed from the final output. In an example, some constraint on the array spacing d can be used to limit possible spatial aliasing, i.e., to prevent nulls from appearing in undesirable locations.
In an example embodiment, a tone signal arrives from an angle of θ, 0<θ<π. The signal received by microphone 202, before the delay unit 206, can be denoted by sin(2πft), which results in:
s1=sin [2πf(t−τx(θ0))], (2)
and
s2=sin [2πf(t−τx(θ))], (3)
where f is the frequency of the tone.
The value of s1−s2 will be zero whenever θ satisfies the following relationship:
τx(θ)=τx(θ0)±2mπ,m=0,±1,±2. (4)
Putting (1) into (4) yields:
where λ is the wavelength of the tone and
is used here.
Equation (5) has a solution θ=±θ0 when m=0. Further, depending on values of θ0, d and m, θ may have other solutions, which create additional nulls in the array beam pattern. As described herein, nulls are used to detect the incoming angle of a signal. The embodiments recognize and take into account that limiting the nulls generated by a microphone pair to θ=±θ0 increases accuracy of detection. Decreasing the nulls can be achieved by letting d satisfy the following condition,
It follows that d only needs to satisfy the following inequality to satisfy (6),
Solving the above inequality yields
Hence it the array spacing is less than half of the wavelength of the working frequency, only two nulls will appear at θ=±θ0. The frequency
will be referred as the highest working frequency for the microphone pair. As long as the incoming signal's frequency is less than fH, the null position is fixed since the delay amount is independent of frequency as can be seen from Equation (1).
As an example, beam pattern 300 of the microphone array 200 in
In an example embodiment, the distance d 410 is chosen to be less than the wavelength of the highest frequency of interest or threshold frequency. The source is in the far field. Due to the distance, the signals received by the two microphones have substantially the same incoming angle.
In an example embodiment, to narrow the DOA between the two nulls of the one dimensional array as shown in beam pattern 300 in
In an embodiment, system 500 is operable to apply, using delay unit 502, the same or substantially the same amount of delay of microphone 514 electrically to the signal from microphone 512. In other words, delay unit 502 anticipates the delay in microphone 514 and adds the anticipated delay to the output signal from microphone 512.
Similarly, system 500 is operable to apply, using delay unit 504, the same or substantially the same amount of delay of microphone 518 electrically to the signal from microphone 516. In other words, delay unit 504 anticipates the delay in microphone 518 and adds the anticipated delay to the output signal from microphone 516. System 500 may be referred to herein as a null generating block.
Subtractor unit 506 is operable to subtract the output signal from microphone 514 from the modified output signal of microphone 512. In effect, any sounds with a DOA of θ are removed from the final output.
Subtractor unit 508 is operable to subtract the output signal from microphone 518 from the modified output signal of microphone 516. In effect, any sounds with a DOA of θ are removed from the final output.
Absolute value units 520 and 522 are operable to obtain the absolute values from the outputs of subtractor units 506 and 508, respectively. Adder unit 510 is operable to add the outputs from absolute value units 520 and 522 to obtain a final output.
In an embodiment, described in operational terminology, system 500 is operable to generate a single null in an x-y plane. The location of the null can be adjusted by changing the amount of delay applied to individual microphone output. For example, system 500 is an example system to detect angles between 0° and 90°. System 500 assumes the signal received by microphones 514 and 518 are lagging behind signals received by microphones 512 and 516, respectively. Accordingly, delay units 502 and 504 are applied after microphones 512 and 516. To detect angles in other ranges, delay units 502 and 504 can be moved to any two of the microphones 512-518.
A direction finding system can be built by implementing a number of such systems in parallel, with each system generating a null at a different direction. One or more embodiments recognize and take into account that position of the null is independent of frequency, so that it is very suitable for broadband applications. Whenever a sound event occurs, the system that has a null nearest to the arriving angle of the sound event will generate a substantially lower level output compared to all other systems. Hence the direction of the sound event can be identified.
In another embodiment, instead of implementing the null-generating system in parallel, a direction finding system can also implement a single system with its delay value changed in a predetermined serial sequence, resulting in a direction finding system that scans all angles of interest in serial.
One or more embodiments provide a system structure that is flexible for either analog or digital implementation. Parallel processing is particular suitable for analog circuit implementation, which can achieve very low power consumption.
One or more embodiments recognize and take into account that a type of conventional DOA estimator is based on using the main lobe of its beam pattern to scan the angles of interest. Such techniques usually require many microphones and a large array size to achieve the sharp beam width necessary for high resolution DOA estimation. Moreover, beam width is inversely proportional to working frequency, so that complex algorithms are required to maintain relatively constant performance over a wide frequency range.
One or more embodiments provide sharp nulls generated by several very compact microphone pairs to scan the angles of interest. Sharp nulls can be generated with a few closely spaced microphones, so that the physical format of the whole system is highly compact. Also, null position is independent of working frequency, so that a direction finding system based on nulls is very suitable for broadband applications.
In one or more embodiments, system 500 as shown in
In an example, delay unit 504 and subtraction unit 508 as shown in
Comparing beam pattern 300 and 600 shows that both beam patterns have a null at 60° and each has a second null at −60° and 120°, respectively. By adding the absolute value of the outputs from the two microphone pairs, a common null will be kept and the other two nulls will be removed.
Beam pattern 700 is a result of the system 500 as shown in
In an embodiment, system 900 is operable to obtain a direction of an audio signal from direction finding unit 904 and position an angle of camera 912 towards the audio signal. Mic 1 and mic 2 may represent microphones 202 and 204 as shown in
Controller 908 can include one or more processors or other processing devices that control the overall operation of system 900. Controller 908 is operable to send a signal to motor driver 910 to move an angle of camera 912. In an example, controller 908 can communicate with control unit 906 through an integrated circuit bus and receive angle information and audio information from direction finding unit 904.
At step 1102, system 200 receives a first output from a first input device and a second output from a second input device. The first input device and second input device are microphones. In other examples, the input devices may be another type of sound sensing device. The first input device and second input device receive an audio signal from a source signal. The source signal may reach each input device at a different time. In an example, the input devices may monitor ambient sound at a particular location. The source signal may be coming from a new element creating a new sound within the range of the input devices.
At step 1104, system 200 adds a delay to the second output. The delay is used to match the first output to the second output for the audio signal from the source signal on each input device.
At step 1106, system 200 compares the first output to the delayed second output in a plurality of directions to form a comparison. By comparing, through subtracting, the two outputs signals, the sound coming from the source signal can create a null in the compared signal.
At step 1108, system 200 identifies a number of null directions of the plurality of directions where a set of nulls exists based on the comparison. When viewing a beam pattern, the nulls created indicate the direction from the source signal. In other examples, more pairs of input devices are combined with this pair to further define the direction.
In an example embodiment, a microphone pair consists of two closely placed microphones that can generate sharp and steerable nulls by first adding an appropriate amount of delay in the microphone outputs and then subtracting the two microphone outputs. The spacing of the microphones is less than the wavelength of the highest frequency of interest. Usually, the spacing is around a few centimeters, resulting in a very compact array structure, and covers a large range of audible frequency range.
One or more embodiments provide a method that combines the two or more microphone pairs in the above example embodiment to form 2-D and 3-D arrays to reduce the DOA ambiguity. This is achieved by adjusting the delay amount in the microphone outputs so that different microphone pairs have a null in common and nulls that are not in common can be removed by adding the absolute value of outputs from the microphone pairs.
One or more embodiments provide a direction finding system based on the microphone pair as described above that continuously monitors the output level of all null-generating blocks and uses the knowledge that the signal entering a certain null will be greatly reduced as a basis to further estimate signal's DOA.
One or more embodiments provide a digital implementation of the system above that uses analog to digital converters to convert the microphone signal to digital samples and then implements the null-generating blocks and direction finding algorithm digitally.
One or more embodiments provide an analog implementation of the method above, which implements the null-generating blocks and direction finding system using analog circuits. A mixed of digital and analog processing can also be used to implement the direction finding system.
Although illustrative embodiments have been shown and described by way of example, a wide range of alternative embodiments is possible within the scope of the foregoing disclosure.
This application claims priority of U.S. Patent Application Ser. No. 61/844,965 entitled “METHOD AND SYSTEM FOR DIRECTION OF ARRIVAL ESTIMATION USING MICROPHONE ARRAY WITH SHARP NULL,” filed Jul. 11, 2013, the contents of which are incorporated herein by reference in their entirety.
Number | Name | Date | Kind |
---|---|---|---|
20010028720 | Hou | Oct 2001 | A1 |
20050213432 | Hoshuyama | Sep 2005 | A1 |
20110200205 | Tokuda | Aug 2011 | A1 |
20140241549 | Stachurski | Aug 2014 | A1 |
Number | Date | Country |
---|---|---|
WO 0037962 | Jun 2000 | WO |
Entry |
---|
H. Krim et al., “Two Decades of Array Signal Processing Research—The Parametric Approach,” Jul. 1996, IEEE Signal Processing Magazine, pp. 67-94. |
Number | Date | Country | |
---|---|---|---|
20150016628 A1 | Jan 2015 | US |
Number | Date | Country | |
---|---|---|---|
61844965 | Jul 2013 | US |