The invention relates to audio beamforming and in particular, but not exclusively, to audio beamforming using microphone arrays substantially smaller than the wavelength of the audio signals being beamformed.
Advanced processing of audio signals has become increasingly important in many areas including e.g. telecommunication, content distribution etc. For example, in some applications, such as hands-free communication and voice control systems, complex processing of inputs from a plurality of microphones has been used to provide a configurable directional sensitivity for a microphone array comprising the microphones. Specifically, the processing of signals from a microphone array can generate an audio beam with a direction that can be changed simply by changing the characteristics of the combination of the individual microphone signals.
Typically, beam form algorithms seek to attenuate interferers while providing a high gain for a desired sound source. For example, a beamforming algorithm can be controlled to provide a strong attenuation (preferably a null) in the direction of a signal received from a main interferer.
For practical reasons it is desirable that the microphone array is relatively small. However, when the wavelength of the sound of interest is much larger than the size of the array, many beamforming algorithms, such as additive delay-and-sum beamforming algorithms, are not able to provide sufficient directivity as the beamwidth deteriorates substantially for such wavelengths.
One approach for achieving an improved directivity is to apply so called superdirective beamforming techniques. Such superdirective beamforming techniques are based on filters with asymmetrical filter coefficients and the approach essentially corresponds to subtraction of signals or determining spatial derivatives of the sound pressure field. However, although this may improve the directivity, it is also known that this is achieved at the expense of robustness, such as increased sensitivity to white (sensor) noise and an increased sensitivity to mismatches in microphones characteristics.
In the article “Optimal Azimuthal Steering of a First-order Superdirectional Microphone Response” by R. M. M. Derkx, International Workshop on Acoustic Echo and Noise Control, September 2008, Seattle, a system is analyzed which generates Eigenbeams for a two dimensional microphone array. The Eigenbeams are then combined to maximize the attenuation of a single point interference source. In particular, a null is located in the direction of a single point interferer while maintaining a suitable gain for the desired direction.
However, although this approach provides improved performance in many scenarios, it provides non optimal performance in some practical scenarios. It also tends to require relatively complex and resource demanding processing.
Hence, an improved approach for audio beamforming would be advantageous and in particular an approach allowing improved adaptation to current conditions and audio environment, increased flexibility, facilitated implementation, improved performance for different operating scenarios and/or improved performance would be advantageous.
Accordingly, the Invention seeks to preferably mitigate, alleviate or eliminate one or more of the above mentioned disadvantages singly or in any combination.
According to an aspect of the invention there is provided an audio beamforming apparatus comprising: a receiving circuit for receiving signals from an at least two-dimensional microphone array comprising at least three microphones; a reference circuit for generating at least three reference beams from the microphone signals; combining circuit for generating an output signal corresponding to a desired beam pattern by combining the reference beams in response to a first direction of a desired sound source and a direction estimate for an interfering sound source; an estimation circuit for generating the direction estimate by: determining a first angle corresponding to a local minimum for a power measure of the output signal in a first angle interval, determining a second angle corresponding to a local minimum for a power measure of the output signal in a second angle interval, and determining the direction estimate as an angle selected from a set of angles corresponding to local minima for a power measure of the output signal, the set of angles comprising at least the first angle and the second angle; and wherein the combining circuit is arranged to determine combination parameters for the combining of the reference beams to provide a notch in an angle corresponding to the direction estimate and a minimization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and an average gain.
The invention may allow improved performance. In particular, an improved and/or facilitated adaptation to a current audio environment can be achieved. The invention may allow a beamforming approach which provides high performance for both directional point interference cancellation and for diffuse noise attenuation. The approach is particularly suitable for, and may provide particularly advantageous performance for, systems wherein the wavelength of the audio signals may be substantially larger than the size of the microphone array.
The invention may allow low complexity implementation and/or operation. The approach may be suitable for providing improved directivity and may in particular be suitable for scenarios wherein the size of the microphone array is much smaller than a wavelength of interest.
In many embodiments and scenarios, the approach may allow a null to be directed towards a single point interference while substantially reducing diffuse noise. In particular, the approach may in many scenarios allow a reduction of a single point interference corresponding to or better than many prior art interference reduction techniques, while at the same time providing improved diffuse noise.
The approach may in many scenarios allow a low complexity yet highly efficient and advantageous beam steering based on low complexity parallel local minima extraction. In many embodiments, the approach may ensure that at least one of the identified local minima is also a global minimum and thus may allow an efficient estimation of the angle of interference.
The reference beams may be non-adaptive and may be independent of the captured signals and/or the audio conditions. The reference beams may be constant and may be generated by a constant/non-adaptive combination of the signals from the at least three microphones. The reference beams may specifically be Eigenbeams or orthogonal beams.
The first angle interval and the second angle interval may be disjoint intervals and may be adjacent intervals. The first and second angle intervals may together cover the entire 360° interval.
The interfering sound source may be an assumed interfering sound source. A direction estimate for a sound source may be generated independently of whether the sound source is present or not. Thus, even if no interfering point source is detected, the estimation circuit may generate the direction estimate from the microphone signals under the assumption that an interfering sound source is present.
In accordance with an optional feature of the invention, the estimation circuit is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle separating the first angle interval and the second angle interval.
This may provide a particularly efficient and low complexity determination of the direction estimate. The angle may be any angle between the first angle interval and the second angle interval including the end points of one or both of the angle intervals.
In accordance with an optional feature of the invention, the first angle interval comprises angles from 0 to π and the second angle interval comprises angles from π to 2π.
This may provide particularly advantageous performance and may in particular allow adaptation for all possible directions of the interfering sound source.
In accordance with an optional feature of the invention, the estimation circuit is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle of π.
This may provide a particularly efficient and low complexity determination of the direction estimate.
In accordance with an optional feature of the invention, the combining circuit comprises a sidelobe canceller.
This may provide particularly advantageous performance and/or practical implementation.
In accordance with an optional feature of the invention, the sidelobe canceller is arranged to generate the output signal as a weighted combination of at least a primary signal, a first noise reference signal and a second noise reference signal.
This may provide particularly advantageous performance and/or practical implementation. The primary signal may correspond to a beam adapted in the direction of the desired sound source and each of the reference signals may correspond to beams adapted to cancel/reduce noise. The noise reference signals may specifically have notches in the direction of the desired sound source.
In accordance with an optional feature of the invention, the combining circuit is arranged to calculate weights for the first and second noise reference signals in response to the direction estimate and maximization of the directivity cost measure.
This may provide a particularly advantageous performance and/or low complexity implementation. In particular, the weights may be determined as a function of the direction estimate wherein the function is selected to maximize the directivity cost measure.
In accordance with an optional feature of the invention, the estimation circuit is arranged to determine at least one of the first and second angles by a gradient search applied to a sidelobe canceller corresponding to the sidelobe canceller of the combining circuit and having an angle input variable.
This may provide a particularly advantageous performance and/or low complexity implementation. In particular, a gradient search may provide a highly efficient approach for identifying potential minima that may optimize the beamforming operation. An efficient and low complexity adaptation of the beamforming may be achieved which can reduce both diffuse noise and reduce/cancel a single point interference.
In many embodiments both the first and single angle are determined by a gradient search. The gradient search may be performed using a sidelobe canceller operation which is identical to the sidelobe canceller operation used to generate the output signal but with a value of the angle input variable that may be different than the phase value (the direction estimate) used to generate the output signal (thus which can be varied independently).
In some embodiments, a gradient search may be applied in parallel in the two angle intervals using parallel sidelobe canceller operations with independent angle input variables. The output signal of the combining circuit may be selected as the signal of the parallel sidelobe canceller corresponding to the selected angle of the first and second angles.
In some embodiments, a sidelobe canceller corresponding to the sidelobe canceller of the combining circuit may be used to determine a gradient of a power measure of the output signal for a given angle (specifically π) and the selection between the first and second angle may be in response to the gradient.
In accordance with an optional feature of the invention, an update value for the angle input variable is determined as a function of an output signal of the sidelobe canceller for a current phase value of the angle input variable, and a first and second noise reference signal of the sidelobe canceller for the current phase value.
This may provide particularly advantageous performance and/or facilitated implementation and/or operation.
In accordance with an optional feature of the invention, the first and second noise reference signals are weighted as a function of the current phase value.
This may provide particularly advantageous performance and/or facilitated implementation or operation.
In accordance with an optional feature of the invention, the estimation circuit is arranged to determine a power estimate for at least one of the first and second noise reference signals and to perform a normalization of the update value as a function of the power estimate.
This may provide particularly advantageous performance and/or facilitated implementation and/or operation.
In accordance with an optional feature of the invention, the at least two-dimensional microphone array comprises at least four microphones and the apparatus comprises a circuit for combining signals from at least two of the at least four microphones prior to generating the reference beams.
This may provide particularly advantageous performance and/or facilitated implementation and/or operation. In particular, it may provide improved noise performance in many scenarios.
In accordance with an optional feature of the invention, the apparatus of further comprises the at least two-dimensional microphone array, the at least two-dimensional microphone array comprising directional microphones having a maximum response in a direction outwardly of a perimeter of the at least two-dimensional microphone array.
This may provide particularly advantageous performance and/or facilitated implementation and/or operation.
According to an aspect of the invention there is provided a method of audio beamforming comprising: receiving signals from an at least two-dimensional microphone array comprising at least three microphones; generating at least three reference beams from the microphone signals; generating an output signal corresponding to a desired beam pattern by combining the reference beams in response to a first direction of a desired sound source and a direction estimate for an interfering sound source; generating the direction estimate by: determining a first angle corresponding to a local minimum for a power measure of the output signal in a first angle interval, determining a second angle corresponding to a local minimum for a power measure of the output signal in a second angle interval, and determining the direction estimate as an angle selected from a set of angles corresponding to local minima for a power measure of the output signal, the set of angles comprising at least the first angle and the second angle; and wherein the combining of the reference beams comprises determining combination parameters for the combining of the reference beams to provide a notch in an angle corresponding to the direction estimate and a minimization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and an energy averaged gain.
These and other aspects, features and advantages of the invention will be apparent from and elucidated with reference to the embodiment(s) described hereinafter.
Embodiments of the invention will be described, by way of example only, with reference to the drawings, in which
The system processes the received microphone signals to generate a set of constant non-adaptable reference beams. These reference beams are then adaptively combined to generate a desired beam pattern. The combination is adapted such that the resulting beam form is adapted to cancel or substantially attenuate an assumed single point interference source while at the same time minimizing or reducing the impact of diffuse noise.
The system provides an efficient adaptive beamforming where the main lobe can be steered towards the direction of a desired sound source while adapting the directional pattern such that a point interferer from another angle is effectively rejected and a substantially optimal rejection of diffuse (isotropic) noise is achieved. The system of
The system of
In the specific example, the microphone array 101 comprises three microphones which are spaced uniformly on a circle as illustrated in
Thus, in the example a circular array of at least three (omni- or uni-directional) sensors in a planar geometry is used. It will be appreciated that in other embodiments, other arrangements of the microphones may be used. It will also be appreciated that for embodiments wherein more than three microphones are used, these may possibly be arranged in a non-planar geometry, i.e. the microphone array may be a three dimensional microphone array. However, the following description will focus on a three microphone equidistant circular array arranged in the azimuth plane.
The microphone array 101 is coupled to a receiving circuit 103 which receives the microphone signals. In the example of
The receiving circuit 103 is coupled to a reference processor 105 which is arranged to generate at least three reference beams from the microphone signals. The reference beams are constant beams that are not adapted but are generated by a fixed combination of the digitized microphone signals from the receiving circuit 103. In the example of
In the example, the three microphones of the microphone array are directional microphones and are specifically uni-directional cardioid microphones which are arranged such that the main gain is pointing outwardly from the perimeter formed by joining the positions of the microphones (and thus outwardly of the circle of the circular array in the specific example). The use of uni-directional cardioid microphones provides an advantage in that the sensitivity to sensor noise and sensor-mismatches is greatly reduced. However, it will be appreciated that in other scenarios other microphone types may be used, such as omni-directional microphones.
Denoting the responses of the three cardioid microphones as respectively EC0, EC2π/3 and EC4π/3 and ignoring any uncorrelated sensor-noise, the i'th cardioid microphone response is ideally given by:
where θ and Φ are the standard spherical coordinate angles: elevation and azimuth, c is the speed of sound and xi and yi are the x and y coordinates of the i'th microphone.
Using:
with r the radius of the circle we can write:
From the three cardioid microphones, the three orthogonal Eigenbeams can be determined from:
For wavelengths larger than the size of the array, the responses of the Eigenbeams are frequency invariant and ideally equal to:
Em=1.
Ed0(θ,φ)=cos φ sin θ
Edπ/2(θ,φ)=cos(φ−π/2)sin θ.
The directivity patterns of these Eigenbeams are illustrated in
The zero'th-order Eigenbeam Em represents the monopole response corresponding to a sphere whereas the other Eigenbeams represent first order Eigenbeams corresponding to double spheres as illustrated in
The resulting signals from each of the three Eigenbeams are fed to a beamform circuit 107 which proceeds to adaptively combine these signals to provide a desired beam pattern.
Specifically, by suitable combining the first order Eigenbeams, a dipole can be steered to any angle φs. E.g. a weighted summation of the orthogonal diagonals can be generated:
Edφ
where φs represents the desired angle for the resulting dipole.
The steered and scaled superdirectional microphone response can then be constructed by combining the steered dipole with the monopole, e.g. as:
where α≦1 is a parameter for controlling the directional pattern of the first-order response and S is an arbitrary scaling factor (that can also have negative values).
Thus, the beamform circuit 107 can generate a suitable beam pattern by a suitable combination of the reference Eigenbeams. The beamform circuit 107 is arranged to generate a nominal (e.g. unity) gain in the direction of a desired speaker coming from an arbitrary azimuthal angle Φ=φs. The direction of the desired speaker is assumed to be known by the beamform circuit 107. It will be appreciated that any suitable way of determining a desired direction may be used without detracting from the invention. For example, a fixed direction may be used or e.g. a tracking algorithm for a desired speaker or sound source may be used. It will be appreciated that many different algorithms for determining a desired sound source direction will be known to the skilled person.
The beamform circuit 107 is furthermore arranged to adapt the beam such that the sensitivity to diffuse noise is minimized and a notch is generated in an estimated direction of an assumed interfering point source. The system of
The beamform circuit 107 is specifically coupled to an estimation circuit 109 which determines an estimate for the direction to an assumed point source interference. Based on the estimated direction, the beamform circuit 107 generates combination parameters for the combination of the Eigenbeams such that a notch (typically a null) is generated in the estimated direction. However, the combination of three Eigenbeams provides sufficient degrees of freedom to allow a range of solutions to the constraint of providing a nominal gain in a desired direction and a notch in an interference direction. In the system, this additional degree of freedom is used to improve the diffuse noise performance. This is specifically achieved by the combination parameters being selected to maximize a directivity cost measure where the directivity cost measure is indicative of a ratio between a power/energy gain in the first direction and an average power/energy gain. Specifically, the directivity cost measure may be indicative of the gain in the desired direction relative to an average gain of the resulting beam where the averaging is over all angles in the azimuth plane (Le. from 0-2π) or from all directions in the three dimensions. Thus, the directivity cost measure is a function which indicates the attenuation of homogenous spatially diffuse noise (i.e. the same noise level in all direction) provided by the beam pattern.
The estimation circuit 109 is specifically arranged to determine the estimated angle of an interference point by searching for local minima of a power measure for the output signal. Thus, the estimation circuit 109 seeks to minimize the power of the output signal as this will correspond to the lowest noise/interference. In some embodiments, the estimation may only be performed when the desired sound source is inactive (e.g. when a desired speaker is not speaking) but it will be appreciated that this is not necessary for the minimization of the power of the output signal to be an indication of optimal noise/interference operation (specifically the presence of the desired signal may introduce an offset to the power measure but will not change the position of the minimum).
The estimation circuit 109 determines at least two local minima by searching in at least two angle intervals. The two angle intervals are typically disjoint, although in some embodiments some overlap may occur. The local minima are determined in the different angle intervals by a parallel processing based on different angles. Specifically, the estimation circuit 109 may copy the operation of the beamform circuit 107 and evaluate the resulting output signal for different angles in the different angle intervals. The estimation circuit 109 may then select one of the angles that have been found to correspond to a local minima for the output signals and the selected angle is then used as the estimate for the assumed single point interference source. The selected angle is then fed to the beamform circuit 107 which proceeds to perform the combination such that a nominal gain is provided in the direction of the desired source and a notch is provided in the estimated direction of the main single point interference. Furthermore, the combination uses weights that are selected to further minimize the diffuse noise. This constraint is imposed by the weights being selected to maximize directivity cost measure.
The estimation operation and adaptation is independent of the actual noise and interference conditions and specifically is independent of whether a significant single point interferer or diffuse noise is present or not. However, the approach results in very efficient performance across a wide variety of scenarios including scenarios with a dominant single point interference and no diffuse noise as well as scenarios with no single point interference but substantial diffuse noise. Indeed, the approach and underlying assumptions result in an operation that not only adapts to the specific characteristics of the noise and single point interference characteristics but also adapts to the type of noise/interference scenario that is experienced. This also reduces complexity and facilitates operation as there is no need to adapt the algorithm to the type of audio environment being experienced. This also provides increased flexibility and a wider application of the approach.
In the following, a specific example of the system of
The primary response may be given by
Ep(θ,φ)=FαTRφ
where
The primary signal thus corresponds to the desired audio signal but also comprises signals from undesired directions. The impact of these sidelobes is reduced by generation of noise reference signals which are weighted and subtracted from the primary signal to generate the output signal.
Thus, the sidelobe canceller generates the noise reference signals given by
where B is a blocking matrix given by:
It is noted that the noise-references are respectively a cardioid and a dipole response, with a null steered towards the primary signal at azimuth φs and elevation θ=π/2.
The two noise reference signals are then weighted by weights w1 and w2 before being subtracted from the primary signal to provide the output signal. Thus, the overall beam-pattern from the sidelobe canceller is given by:
E(θ,φ)=Ep(θ,φ)−w1Er
The beamform circuit 107 is arranged to generate a nominal gain, in the following a unity gain, in a desired angle φs and a notch, specifically a zero, in the direction φn of an assumed single point interference determined by the estimation circuit 109.
With a unity gain in the direction of φs, the weights required to steer a zero towards the angle φn can be calculated by solving the equation:
Ep(π/2,φ)−w1Er
where φ=φs−φn.
Solving the equation yields:
or alternatively:
As can be seen, the constraints of the unity gain and the direction of the zero do not uniquely define the required weights but provide an extra degree of freedom.
In the system, this degree of freedom is used to optimize diffuse noise performance. In particular, the noise reference weights are selected such that a directivity cost measure is maximized.
A suitable directivity cost measure is given by:
Thus, the directivity cost measure represents a ratio between the gain in the desired direction and the overall (power) gain averaged over the entire sphere. It will be appreciated that in other embodiments, the gain averaging may e.g. only be in a two-dimensional plane such as the azimuth plane.
For the response given by
this can be shown to correspond to:
Inserting the output signal response given by:
E(θ,φ)=Ep(θ,φ)−w1Er
and inserting the value of w1 as given by:
in the directivity cost measure, and differentiating with respect to w2 and setting the result to zero allows for the minima of the directivity cost measure with respect to w2 to be determined. Thus, the value of w2 for which the directivity cost measure is maximized and thus the diffuse noise sensitivity is minimized can be determined. This specifically yields:
With this value of w2, we can also compute w1 as:
Thus, w1 and w2 can be calculated such that a unity gain is provided in the desired direction, a zero is formed in the direction of an assumed interferer and the diffuse noise attenuation is maximized under these constraints.
Thus, once the estimation circuit 109 has determined a suitable angle estimate for the assumed point source interferer, the derived equations can be used to calculate suitable weights that will also maximize the directivity cost measure and thus optimize the diffuse noise performance.
Rather than using the previously derived equation for w1, a value compensated for the effect of the design parameter can be used:
It can be shown that w2 can then be derived from:
Thus, the output signal y[k] is given by:
y[k]=p[k]−(ŵ1[k]+2α)r1[k]−ŵ2[k]r2[k],
with
where is the estimate of the angle of the assumed undesired interferer and φs is the angle of the desired audio source.
The estimation circuit 109 proceeds to determine the direction estimate by minimizing a power measure for the output signal in different angle intervals.
Specifically, the estimation circuit 109 seeks to maximize the cost function given by:
J({circumflex over (φ)})=ε{y2[k]},
where denotes the expected value.
Some observations can be made from
This can further be seen in
However, a further observation is that the correct minimum is always the only local minimum in the phase interval from either 0 to π or from π to 2π. Thus, for an interferer angle within the interval of [0;π], the only local minimum in the interval of [0;π] is the correct value. Similarly, for an interferer angle within the interval of [π;2π], the only local minimum in the interval of [π;2π] is the correct value.
This realization is exploited in the system of
The estimation circuit 109 then proceeds to select one of the two estimated values as the estimated angle that is used to control the beamforming by the beamform circuit 107. Thus, one of the local minima is selected and used to calculate the weights for the noise reference signals using the equations that also optimize diffuse noise performance.
It will be appreciated that different criteria for selecting between the determined local minima may be used. For example, in some embodiments, a simple beamforming may be applied to the microphone signals such that a beam is formed in each of the two directions in order to measure the interference level in those directions. The direction having the highest level is then selected as it corresponds to the most dominant interference.
However, in the specific example, the selection of the correct local minima is based on the gradient of the cost function at a specific angle which separates the two angle intervals (i.e., it is inbetween the two angle intervals and may specifically be an endpoint of one or both of the intervals). In the specific example, the gradient at π is determined and is used to select the appropriate local minimum. Specifically, if the cost function has a negative gradient for then the local minimum in the interval of [0;π] is selected and otherwise the local minimum in the interval of [π;2π] is selected. Indeed, it has been found that such a selection provides a very reliable indication of the correct local minima and thus provides a low complexity but efficient selection approach.
Intuitively, it can be understood as follows. The directional beam pattern for yields a cardioid response with only a single null. The gradient of the cost function for therefore yields the direction toward the true value of φ. If the gradient is positive the true value of φ lies in the interval [π;2π]. If the gradient is positive, the true value of φ lies in the interval [0;π].
It should also be noted that in some embodiments, all the local minima of the function may be determined and separated into the two angle intervals. Indeed, in such an embodiment, the detection of two local minima in one interval may automatically lead to the selection of the other minimum (i.e. the one in the other angle interval). This approach is based on the realization that (as illustrated in
In the system of
Thus, the estimation circuit 109 performs a sidelobe cancelling operation corresponding to that of the beamform circuit 107 while using an input angle value that is constantly updated and biased in the direction that will reduce the cost function. This approach will result in the angle variable ending in a local minimum.
Specifically, a steepest descent update equation for {circumflex over (φ)} can be derived by stepping in the direction opposite to the surface of the cost function with respect to {circumflex over (φ)}:
{circumflex over (φ)}[k+1]={circumflex over (φ)}[k]−μ∇J({circumflex over (φ)}),
with a gradient given by:
and where μ is the update step-size with 0<μ<1.
In practice, the mean value is not available and therefore an instantaneous estimate of the gradient is used:
by inserting the previously derived formulas for y[k] and performing the derivation, this can be shown to lead to:
Thus, the update value for the angle input variable of the gradient search is a function of an output signal of the sidelobe canceller and of the first and second noise reference signals.
In the above example, the update value is dependent on the power of the noise references. In order to compensate for this, the estimation circuit 109 may determine a power estimate for one or both of the noise reference signals and normalize the update value accordingly.
Accordingly, the following update equation for the gradient search may be used:
where ε is a small value to prevent zero division and is the power estimate of the i'th noise reference signal. This can specifically be calculated by a recursive averaging:
{circumflex over (P)}r
where β is a suitable design parameter.
Thus, for each angle interval, the estimation circuit 109 operates a sidelobe canceller applied to the same signals as the sidelobe canceller of the beamform circuit 107. However, the sidelobe cancellers are operated based on an input angle variable which corresponds to a current estimate of the angle to the assumed point source interferer. The input angle variable is continuously updated using the gradient search approach such that it will converge on the local minimum in the angle interval.
The estimation circuit 109 then selects between the current values of the input angle variables and uses this result as the estimated angle for the assumed point source interferer. The selection is based on the gradient of the cost function for an input variable of π. The estimation circuit 109 may specifically determine this by operating a further sidelobe canceller process on the input signals but with a fixed angle value of π. Specifically, the estimation circuit 109 may continuously evaluate the update value:
for {circumflex over (φ)}=π. The derived values may be averaged over time and the sign of the averaged value (i.e. the gradient of the cost function at π) is then used to select which of the angles determined by the gradient searches is used.
It will be appreciated that whereas the previous discussion illustrated the principle by referring to the use of four sidelobe cancellers (one for the beamform circuit 107, one for each gradient search, and one for determining the gradient at π), this is merely used to illustrate the principle. Indeed, in many embodiments, the same sidelobe canceller may be implemented, e.g. as a subroutine, and used for the different purposes and with different input angles.
It will also be appreciated that typically, the beamform circuit 107 will not repeat a sidelobe canceller operation for the estimated angle but will directly use the output signal calculated for the selected angle when performing the estimation.
In the example, the gradient search is arranged to re-initialize the gradient search if the value of the angle input variable moves out of the corresponding angle interval. Specifically, a re-initialization of the gradient search may be performed if the two gradient searches reach a scenario wherein the both have angle values in the same angle interval. For example, if during the gradient search in the [0;π] interval, the updated angle value moves into the [π;2π] interval such that both gradient searches have current values within this interval, the gradient search is re-initialized. The re-initialization is specifically performed by resetting the value of the input angle variable of one of the two gradient searches to an initial value. The initial value may for example be a fixed value such as the midpoint in the interval (i.e. π/2 and 3π/2).
From
Next looking at the lower-left quadrant for the gradient search in the interval [0;π], we can see from
When the re-initialization would be larger than η, there is a risk that the gradient search again ends up in the wrong quadrant, i.e. the upper-left quadrant. Especially when φ is equal or close to zero, it is mandatory that re-initialization is done in the range [0;η], where η≈2.55.
Hence, for the re-initialization, it is safe to choose a mid-point in the interval [0;η], (i.e. η/2), where η≈2.55.
A specific example of the approach that may be used is illustrated in
Step 801 is followed by step 803 wherein it is ensured that {circumflex over (φ)}1[k] is smaller than {circumflex over (φ)}2[k] (if not the two variable values are simply swapped).
Step 803 is followed by step 805 wherein it is determined if the two gradient searches have resulted in angle values, {circumflex over (φ)}1[k], {circumflex over (φ)}2[k], in the same angle interval. If so, the appropriate value is re-initialized to ensure there is one angle in each angle interval.
Step 805 is followed by step 807 wherein the weights for the noise reference signals, the resulting output signal and the cost function gradients are calculated.
Step 807 is followed by step 809 wherein the new values for the angle input variables, {circumflex over (φ)}1[k], {circumflex over (φ)}2[k], of the gradient searches are calculated. Furthermore, the filtered cost function gradient at it is calculated.
Step 809 is followed by step 811 wherein the appropriate angle value is selected based on the filtered cost function gradient at π.
Step 811 is followed by step 813 wherein the power estimates for the noise reference signals used in the update value determination are updated.
After step 813 the method returns to step 803 to process the next sample.
A pseudo-code of an algorithm corresponding to
In the specific example, the number of microphones in the microphone array 101 corresponded to the number of reference beams (i.e. three). However, in some embodiments, the microphone array may comprise more microphones than reference beams.
Specifically the microphone array 101 may comprise at least four microphones. The system may still only generate three reference beams and may specifically be arranged to combine signals from at least two microphones prior to generating the reference beams. Thus, the reference processor 105 may still only receive three input signals and generate three reference beams from these. However, at least one of these input signals may be generated by combining (and specifically averaging or adding (e.g. by a weighted summation)) the signals from at least two microphones. Such an approach may provide improved noise performance in many scenarios as the level of uncorrelated noise may be averaged. Furthermore, using more microphones on a particular area, has the advantage that spatial aliasing will occur at a higher frequency.
It will be appreciated that the above description for clarity has described embodiments of the invention with reference to different functional circuits, units and processors. However, it will be apparent that any suitable distribution of functionality between different functional circuits, units or processors may be used without detracting from the invention. For example, functionality illustrated to be performed by separate processors or controllers may be performed by the same processor or controllers. Hence, references to specific functional units or circuits are only to be seen as references to suitable means for providing the described functionality rather than indicative of a strict logical or physical structure or organization.
The invention can be implemented in any suitable form including hardware, software, firmware or any combination of these. The invention may optionally be implemented at least partly as computer software running on one or more data processors and/or digital signal processors. The elements and components of an embodiment of the invention may be physically, functionally and logically implemented in any suitable way. Indeed the functionality may be implemented in a single unit, in a plurality of units or as part of other functional units. As such, the invention may be implemented in a single unit or may be physically and functionally distributed between different units, circuits and processors.
Although the present invention has been described in connection with some embodiments, it is not intended to be limited to the specific form set forth herein. Rather, the scope of the present invention is limited only by the accompanying claims. Additionally, although a feature may appear to be described in connection with particular embodiments, one skilled in the art would recognize that various features of the described embodiments may be combined in accordance with the invention. In the claims, the term comprising does not exclude the presence of other elements or steps.
Furthermore, although individually listed, a plurality of means, elements, circuits or method steps may be implemented by e.g. a single circuit, unit or processor. Additionally, although individual features may be included in different claims, these may possibly be advantageously combined, and the inclusion in different claims does not imply that a combination of features is not feasible and/or advantageous. Also the inclusion of a feature in one category of claims does not imply a limitation to this category but rather indicates that the feature is equally applicable to other claim categories as appropriate. Furthermore, the order of features in the claims do not imply any specific order in which the features must be worked and in particular the order of individual steps in a method claim does not imply that the steps must be performed in this order. Rather, the steps may be performed in any suitable order. In addition, singular references do not exclude a plurality. Thus references to “a”, “an”, “first”, “second” etc do not preclude a plurality. Reference signs in the claims are provided merely as a clarifying example shall not be construed as limiting the scope of the claims in any way.
Number | Date | Country | Kind |
---|---|---|---|
09166297 | Jul 2009 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2010/053335 | 7/22/2010 | WO | 00 | 1/18/2012 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2011/010292 | 1/27/2011 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20050141731 | Hamalainen | Jun 2005 | A1 |
20060140417 | Zurek | Jun 2006 | A1 |
20090103749 | Goodwin | Apr 2009 | A1 |
Number | Date | Country |
---|---|---|
1455552 | Sep 2004 | EP |
2009034524 | Mar 2009 | WO |
Entry |
---|
Shimizu et al: “Isotropic Noise Suppression in the Power Spectrum Domain by Symmetric Microphone Arrays”; 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct. 21-24, 2007, New Paltz, NY, pp. 54-57. |
McKinney et al: “Directionalizing Adaptive Multi-Microphone Arrays for Hearing Aids Using Cardioid Microphones”; IEEE International Conference on Acoustics, Speech, and Signal Processing, 1993, I-177-I-180. |
Zhang et al: “Maximum Likelihood Sound Source Localization and Beamforming for Directional Microphone Arrays in Distributed Meetings”; IEEE Transactions on Multimedia, vol. 10, No. 3, Apr. 2008, pp. 538-548. |
Saruwatari et al: “Speech Enhancement Using Nonlinear Microphone Array With Noise Adaptive Complementary Beamforming”; IEEE, 2000, pp. 1049-1052. |
R. Derkx: “Adaptive Azimuthal Null-Steering for a First-Order Microphone Response”; Proceedings of the 12th International Workshop on Acoustic Echo and Noise Control, Aug. 30-Sep. 2, 2010, Tel-Aviv, Israel, 4 Page Document. |
R. Derkx: “Optimal Azimuthal Steering of a First-Order Superdirectional Microphone Response”; Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control, Sep. 14-17, 2008, Seattle, WA, 4 Page Document. |
Nagata et al: “Two-Channel Adaptive Microphone Array With Target Tracking”; Electronics and Communications in Japan, Part 3, vol. 83, No. 12, 2000, pp. 19-24. |
Number | Date | Country | |
---|---|---|---|
20120114128 A1 | May 2012 | US |