The present invention relates to a noise eliminating apparatus which eliminates a noise component from an output signal of a microphone.
There exists a technique of receiving a sound wave (speaking voice, etc.) containing a noise with a microphone and eliminating a noise component from an output signal of the microphone. For example, Nonpatent Document 1 describes a study on elimination of the noise component.
However, in some cases, a conventional noise eliminating apparatus cannot effectively eliminate the noise component, or causes sound wave distortion due to the elimination of the noise component.
An object of the present invention is to provide an apparatus which eliminates the noise component from the mixture of noise and sound wave (speaking voice, etc.) and carries out such a process that the sound wave (speaking voice, etc.) can be heard clearly.
To solve the above problems, a noise eliminating apparatus of the present invention comprises a first microphone, a second microphone and a signal processing unit, wherein: the signal processing unit includes a linear prediction filter and a noise resynthesis filter; the linear prediction filter receives an output signal of the first microphone, predicts the output signal of the first microphone by linear prediction and generates a prediction signal; and the noise resynthesis filter is an adaptive filter which receives, as a main input signal, a first difference signal obtained by subtracting one of the output signal of the first microphone and the prediction signal from the other, receives, as an error signal, a second difference signal obtained by subtracting one of an output signal of the second microphone and an output signal of the noise resynthesis filter itself from the other, and updates a filter coefficient so that the error signal is minimized.
In the noise eliminating apparatus, a coefficient vector of the noise resynthesis filter at a time j+1 may be produced by adding an update vector to a coefficient vector at a time j, and when a magnitude of the update vector determined by an adaptive algorithm applied by the noise resynthesis filter is larger than a predetermined value, the magnitude of the update vector may be reduced so as to become the predetermined value without changing a direction of the update vector, and the coefficient vector of the noise resynthesis filter may be updated by the reduced update vector.
Moreover, in the noise eliminating apparatus, the adaptive algorithm applied by the noise resynthesis filter may be a learning identification method.
Moreover, in the noise eliminating apparatus, the linear prediction filter may be an adaptive filter which receives the first difference signal as the error signal, and updates the filter coefficient so that the error signal is minimized.
The noise eliminating apparatus of the present invention can effectively eliminate the noise component without distorting the sound wave.
a is a view showing a basic structure of a proposed noise eliminating apparatus.
b is a view showing a structure of a linear prediction error filter.
A, B: microphone
A basic structure of a proposed noise eliminating apparatus is shown in
Formula (1)
x
a(j)=sa(j)+na(j) [1a]
Then, the noise eliminating apparatus generates a prediction residual, shown by Formula (2) below, obtained as a result of the above analysis.
Formula (2)
e
a(j)=s′a(j)+n′a(j) [1b]
In these formulas, sa(j) denotes a sound wave captured by a microphone A, na(j) denotes a noise, s′a(j) denotes a prediction residual of the sound wave, and n′a(j) denotes a prediction residual of the noise.
Any type of linear prediction error filter may be adopted as a linear prediction error filter of
The linear prediction error filter of
The signal xa(j) having been inputted to the linear prediction error filter branches inside the linear prediction error filter, and the branched signals are respectively inputted to the subtracter and the linear prediction filter. To the subtracter, an output signal y(j) of the linear prediction filter is also inputted. The subtracter subtracts the signal y(j) from the signal xa(j), and outputs a signal ea(j) as the prediction residual obtained as a result of the subtraction.
The linear prediction filter is an FIR filter whose number of taps is P. The output signal y(j) of the linear prediction filter is shown by the following formula.
In this formula, hi(j) denotes an i-th filter coefficient.
The filter coefficient hi(j) is updated so that the power of the prediction residual signal ea(j) is minimized. A learning algorithm (adaptive algorithm) is used for this updating. The learning algorithm (adaptive algorithm) used here may be any type of adaptive algorithm, and for example, an LMS algorithm, an RLS algorithm or an NLMS algorithm (learning identification method) may be used.
Next, a noise resynthesis filter synthesizes x′b(j), shown by Formula (3) below, using the prediction residual ea(j).
In Formula (3), s′b(j) denotes a resynthesized sound wave, and n′b(j) denotes a resynthesized noise.
Meanwhile, a signal, shown by Formula (4) below, generated by overlapping a sound wave sb(j) with a noise nb(j) is inputted to a microphone B.
Formula (4)
x
b(j)=sb(j)+nb(j) [4]
Therefore, in the noise eliminating apparatus of the present embodiment, if the noise resynthesis filter can synthesize only a signal shown by Formula (5) below, an enhanced sound wave shown by Formula (6) below can be obtained as an output of the noise eliminating apparatus of the present embodiment.
Formula (5)
n
b(j)≈n′b(j) [5]
Formula (6)
e
b(j)≈sb(j) [6]
This resynthesis of the noise is carried out simultaneously with system identification in which a sound propagation path from the microphone A to the microphone B is an unknown system. Therefore, due to the identification, a blind corner is caused to be adaptively directed to a noise arrival direction.
The noise resynthesis filter is an adaptive filter. The learning algorithm (adaptive algorithm) applied by the noise resynthesis filter may be any type, such as the LMS algorithm or the RLS algorithm. Especially, by using the NLMS (Normalized-LMS: learning identification method) as the learning algorithm, a high effect (noise eliminating effect) of suppressing noise with comparatively less computation can be obtained. However, echoey distortion of the sound wave (speaking voice) occurs. A component for reducing this distortion is added.
Since the signal inputted to the noise resynthesis filter contains the sound wave and the noise as shown in Formula (3), the noise resynthesis filter resynthesizes both the sound wave and the noise. However, synthesizing only the noise is ideal, and the output sound wave is distorted since the sound wave is also synthesized. The sound wave distortion is significant when the NLMS is used as the learning algorithm, since the noise resynthesis filter functions well.
If the noise resynthesis filter is intended only to the noise, the sound wave distortion should be reduced.
A value of an updated term of NLMS, shown by Formula (7) below, is small when the input is only the noise.
The value becomes large when the sound wave is inputted. Accordingly, proposed here is a method for using an appropriate threshold value in the updated term of the NLMS to carry out a clip process.
The term “clip process” used herein is a process of, when the magnitude of a parameter update vector determined by the adaptive algorithm applied by the noise resynthesis filter is larger than a predetermined value (threshold value), reducing the parameter update vector so that the magnitude of the vector becomes the predetermined value without changing its direction. By the parameter update vector whose magnitude is reduced to the predetermined value, a parameter value of the noise resynthesis filter is updated.
The applicant carried out an experiment under an environment of
The speakers and the microphones were placed on a table whose height was 70 cm from a floor surface and 200 cm from a ceiling, the interval between the microphones was 10.0 cm, the SPS was placed at an angle θ of 135 degrees, and the SPN was placed at an angle θ of 45 degrees. This corresponds to a path difference of 7.07 cm (1.66 wavelengths with respect to an upper limit frequency of 8 kHz when the sonic speed is 340 m). An A-weighted background noise at an experimental place was 46.5 dB. A male announcement was used as the sound wave, and a colored noise that is a fake jet fan noise whose peak is about 1 kHz was used as the noise.
Table 2 shows the throughput, memory utilization, etc. of each of the linear prediction error filter (LPEF) and the noise resynthesis filter (NRF) when each filter is incorporated into a DSP of Table 1. Used as a threshold value of an updated term clip was 0.0001.
An experiment of suppressing the noise was carried out under the above conditions. An original sound waveform, a noise overlapping sound waveform and an enhanced sound waveform are respectively shown in
By carrying out the clip process of the updated term, the sound wave distortion is apparently reduced to human ears. To quantitatively evaluate this reduction, a sound quality evaluation value VE below was calculated.
Since the sound wave inputted to the microphone B is necessary to calculate the value VE, the value VE can be calculated only by a simulation. The value VE was calculated by a computation simulation using an input SNR (SN ratio) of −3 dB, and the same sound wave and noise as those used in the above experiment.
It can be seen that the value VE is slightly better when the clip process of the updated term was carried out.
Moreover, in view of the use in the actual environment, the same experiment was carried out under the same environment and conditions as above except that the noise of crowds was collected and used as the noise. From the results shown in
As above, a noise suppressing apparatus (noise eliminating apparatus) using two microphones was proposed, and its noise suppressing effect was confirmed by the experiment using the real DSP apparatus. Moreover, a solution was proposed for the sound wave distortion generated when the NLMS was used as the learning algorithm of the noise resynthesis filter, and its effectiveness was also confirmed.
Since the noise component can be eliminated from the output signal of the microphone by using the noise eliminating apparatus of the present invention, the present invention is applicable to a technical field of electro-acoustics.
Thus, the noise eliminating apparatus that is one embodiment of the present invention is explained.
Whereas the above embodiment illustrated the noise eliminating apparatus in which the linear prediction filter is the adaptive filter, the linear prediction filter of the noise eliminating apparatus does not have to be the adaptive filter.
From the foregoing explanation, many modifications and other embodiments of the present invention are obvious to one skilled in the art. Therefore, the foregoing explanation should be interpreted only as an example, and is provided for the purpose of teaching the best mode for carrying out the present invention to one skilled in the art. The structures and/or functional details may be substantially modified within the spirit of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
2005-062935 | Mar 2005 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2006/304378 | 3/7/2006 | WO | 00 | 2/11/2009 |