This application is a National Stage of International Application No. PCT/JP2011/071747, filed on Sep. 15, 2011, which claims priority from Japanese Patent Application No. 2010-228655, filed on Oct. 8, 2010, the contents of all of which are incorporated herein by reference in their entirety.
The present invention relates to a signal processing technology which cancels a noise, an interference signal, and an echo or the like mixed in a signal.
Background noise is often superimposed on a voice signal inputted from a microphone, and a hand set or the like, and it becomes a big problem when voice coding or speech recognition is performed. As a signal processing device which aimed at cancelling a noise superimposed acoustically, a two-input type noise cancellation device using two adaptive filters is disclosed in patent literatures 1 and 2. Using a signal to noise ratio in a primary signal estimated by using the first adaptive filter among two adaptive filters, a stepsize calculation unit calculates a coefficient-update stepsize of the second adaptive filter. The first adaptive filter operates like the second adaptive filter, but a value larger than the coefficient-update stepsize of the second adaptive filter is set to a coefficient-update stepsize of the first adaptive filter. For this reason, although the first adaptive filter has a high follow-up performance of output to an environmental variation, its estimation accuracy of the noise is inferior to the second adaptive filter.
The stepsize calculation unit, based on the signal to noise ratio in the primary signal estimated by using the first adaptive filter, assumes that interference by a voice signal is large when the voice signal is larger than the noise, and provides a small coefficient-update stepsize to the second adaptive filter. On the contrary, the step size calculation unit assumes that interference by the voice signal is small when the voice signal is smaller than the noise, and provides a large coefficient-update stepsize to the second adaptive filter. Thus, by controlling the second adaptive filter with the coefficient-update step size provided by the stepsize calculation unit, a noise-cancelled signal which accomplishes sufficient follow-up performance to the environmental variation and low distortion in the signal after noise cancellation, simultaneously.
Further, in patent literature 3, a structure of a noise cancellation device which, by extending a structure of patent literatures 1 and 2 mentioned above, also performs cancellation of the voice signal mixed in the background noise, when an influence of the voice signal mixed in a background noise is large, that is, a so-called crosstalk of the voice signal exists, is disclosed. In patent literature 3, in addition to the structure of patent literatures 1 and 2 mentioned above, a second stepsize calculation unit calculates the coefficient-update stepsize of the adaptive filter which cancels the voice signal from the noise input signal in order to cancel the background noise correctly from the voice signal input.
[Patent Literature 1] Japanese Patent Application Laid-Open No. H10-215193
[Patent Literature 2] Japanese Patent Application Laid-Open No. 2000-172299
[Patent Literature 3] International publication number WO2005/024787 official bulletin
However, in the noise cancellation device in patent literatures 1 to 3, the primary voice signal is delayed by the first adaptive filter and the second adaptive filter, and is inputted to a subtractor for noise cancellation. This represents that the noise-suppressed signal which is an output signal of the noise cancellation device is delayed from the inputted voice signal. Accordingly, in case this noise cancellation device is applied to a two-way communication, the communication becomes unnatural due to the delay of the voice signal. Also, since the noise cancellation device in patent literatures 1 to 3 needs the first adaptive filter for controlling the coefficient-update stepsize of the second adaptive filter, required processing cost also increases.
On the other hand, in case this delay is not applied in patent literatures 1 to 3, an estimated value of a desired-signal to noise ratio in the primary voice signal is obtained later than a correct value, the noise cancellation device in patent literatures 1 to 3 cannot perform correct coefficient-update stepsize control. Accordingly, a residual noise in the signal after noise cancellation increases and distortion of the output signal increases.
The object of the present invention is to provide a technology which solves the problems mentioned above.
To achieve the above object, a device according to an exemplary aspect of the invention includes first input means for inputting a first mixed signal in which a first signal and a second signal are mixed; second input means for inputting a second mixed signal in which the first signal and the second signal are mixed in a different proportion from the first mixed signal; first subtraction means for outputting an estimated first signal by subtracting a pseudo second signal from the first mixed signal; a first adaptive filter for generating the pseudo second signal by performing filter processing to the second mixed signal using a first coefficient updated by a first control signal; first compare means for comparing a value of the estimated first signal and a value of the first mixed signal; and first control means for outputting the first control signal to make an update amount of the first coefficient smaller as compared with a case when a value of the estimated first signal is smaller than the first mixed signal, to the first adaptive filter, in case a value of the estimated first signal is larger than a value of the first mixed signal as a result of comparison by the first compare means.
To achieve the above object, a method according to an exemplary aspect of the invention includes generating an estimated first signal by subtracting a pseudo second signal which is estimated to be mixed in a first mixed signal in which a first signal and a second signal are mixed from the first mixed signal; obtaining the pseudo second signal which is estimated to be mixed in the first mixed signal by performing filter processing to a second mixed signal in which the first signal and the second signal are mixed in a different proportion from the first mixed signal, using a coefficient of a first adaptive filter updated by a first control signal; and outputting the first control signal to make an update amount of the coefficient of the first adaptive filter smaller as compared with a case when the estimated first signal is smaller than the first mixed signal, to the first adaptive filter, in case the estimated first signal is larger than the first mixed signal.
To achieve the above object, a program according to an exemplary aspect of the invention causes a computer to execute a step which generates an estimated first signal by subtracting a pseudo second signal which is estimated to be mixed in a first mixed signal in which a first signal and a second signal are mixed from a first mixed signal; and a step which obtains the pseudo second signal which is estimated to be mixed in the first mixed signal by performing filter processing to a second mixed signal in which the first signal and the second signal are mixed in a different proportion from the first mixed signal, using a coefficient of a first adaptive filter updated by a first control signal, and outputs the first control signal to make an update amount of the coefficient of the first adaptive filter smaller as compared with a case when the estimated first signal is smaller than the first mixed signal, to the first adaptive filter, in case the estimated first signal is larger than the first mixed signal.
According to the present invention, it is possible, from a mixed signal in which a first signal and a second signal are mixed, to remove the second signal at low processing cost and without delay, and as a result, to obtain an estimated first signal which has low residue of the second signal and low distortion.
Hereinafter, exemplary embodiments of the present invention will be described illustratively and in detail with reference to drawings. However, components indicated in the following exemplary embodiments are provided for illustrated purposes only and are not intended to limit the technological scope of the present invention to them.
A signal processing device 100 as a first exemplary embodiment of the present invention will be described using
As shown in
The first input unit 101 inputs the first mixed signal xp(k) in which the first signal and the second signal are mixed. The second input unit 102 inputs a second mixed signal xs(k) in which the first signal and the second signal are mixed in a different proportion from the first mixed signal xp(k).
The subtraction unit 103 subtracts a pseudo second signal n(k) which is estimated to be mixed in the first mixed signal xp(k) from the first mixed signal xp(k) and outputs the estimated first signal e(k). And in order to obtain the pseudo second signal n(k), the adaptive filter 104 performs filter processing to the second mixed signal xs(k) or a signal based on the second mixed signal xs(k) with coefficients 141 updated based on the estimated first signal e(k).
The compare unit 106 compares a value of the first mixed signal xp(k) and a value of the estimated first signal e(k). The coefficient-update control unit 107 outputs a control signal μ(k) to make an update amount of the coefficients 141 of the adaptive filter 104 smaller, to the adaptive filter 104, in case the value of the estimated first signal e(k) is larger than the value of the first mixed signal xp(k) as a result of comparison by the compare unit 106.
According to this exemplary embodiment with such a structure, it is possible to remove the second signal from the mixed signal in which the first signal and the second signal are mixed at low processing cost and without delay, and as a result, to obtain the estimated first signal which has low residue of the second signal and low distortion.
As a signal processing device according to a second exemplary embodiment of the present invention, a noise cancellation device which cancels a part or all of the noise from a deteriorated signal (a signal in which a desired-signal and noise are mixed), and outputs an emphasized signal (a signal in which the desired-signal is emphasized) is described. Here, the deteriorated signal corresponds to the first mixed signal in which the first signal and the second signal are mixed, and the emphasized signal corresponds to a desired voice signal (estimated first signal).
(Explanation of a Basic Technology of Noise Cancellation)
Hereinafter, a basic technology of noise cancellation for cancelling a noise, an interference signal, and an echo or the like, which are mixed with the desired-signal inputted from a microphone, a hand set, and a channel or the like by the adaptive filter, or emphasizing the desired-signal will be described briefly.
As disclosed in patent literatures 1 to 3, the two-input type noise cancellation device generates a pseudo noise (pseudo second signal) corresponding to a noise component which is mixed with a voice at a voice input terminal from a reference signal by using the adaptive filter which approximates an impulse response of an acoustic path from a noise source to the voice input terminal. And the noise cancellation device operates in such a way that the noise component is suppressed by subtracting this pseudo noise from the signal inputted to the voice input terminal (first mixed signal). Here, the mixed signal is a signal in which the voice signal and the noise are mixed and is generally supplied from a microphone or a hand set to the voice input terminal. Also, the reference signal is a signal which has a correlation with the noise component in the noise source and is captured at a location close to the noise source. Since the reference signal is captured at a location close to the noise source as above, it is possible to assume that the reference signal is almost equal to the noise component at the noise source. The reference signal supplied to a reference input terminal is inputted to the adaptive filter.
The coefficients of the adaptive filter are modified based on a correlation between an error obtained by subtracting the pseudo noise from the received signal and the reference signal inputted to the reference input terminal. As a coefficient modification algorithm of such adaptive filter, an “LMS algorithm (Least Mean-Square Algorithm)” and an “LIM (Learning Identification Method)” are disclosed in patent literatures 1 to 3. The LIM is also referred to as a normalized LMS algorithm.
The LMS algorithm and the LIM are one of algorithms called a gradient method. Speed and precision of coefficient update depend on a constant called a coefficient-update stepsize. The filter coefficients are updated by a product of the coefficient-update stepsize and the error. The desired-signal included in the error interferes with coefficient update. In order to reduce its influence, a very small value needs to be set to the coefficient-update stepsize. In patent literatures 1 to 3 mentioned above, the follow-up performance to the environmental variation of the adaptive filter coefficients decreases in case the coefficient-update stepsize is small. Therefore patent literatures 1 to 3 disclose one method to solve the problem such as increasing of the error or occurring of the distortion of the desired-signal.
(Structure of a Noise Cancellation Device of the Second Exemplary Embodiment)
As shown in
The primary signal xp(k) is supplied to the input terminal 201 as a series of sample values. The primary signal xp(k) is transmitted to the subtractor 203 and the error evaluation unit 206. The reference signal xs(k) is supplied to the input terminal 202 as a series of sample values. The reference signal xs(k) is transmitted to the adaptive filter 204.
The adaptive filter 204 performs convolution operation of the reference signal xs(k) and the filter coefficients and transmits the result to the subtractor 203 as the pseudo noise n1(k).
To the subtractor 203, the primary signal xp(k) is supplied from the input terminal 201 and the pseudo noise n1(k) is supplied from the adaptive filter 204. The subtractor 203 subtracts the pseudo noise n1(k) from the primary signal xp(k) and transmits the result to the output terminal 205 and the error evaluation unit 206 as the estimated desired-signal and, at the same time, feeds it back to the adaptive filter 204.
The primary signal and the estimated desired-signal are supplied to the error evaluation unit 206. The error evaluation unit 206 compares the primary signal and the estimated desired-signal and outputs an evaluation result β1(k) decided by their magnitude relation. The easiest determination method of the evaluation result β1(k) is a method which uses β1(k)=1 when a square value of the output e12(k) is larger than a square value of the primary signal xp2(k), and β1(k)=0 when the output is smaller than the primary signal. As equivalent processing, the evaluation result β1(k) may be determined by obtaining a ratio Γ(k) of the primary signal and the output and comparing a value of the ratio and a threshold value δ0 set in advance. That is, the error evaluation unit 206 may use β1(k)=1 if Γ(k)≧δ0, and β1(k)=0 if Γ(k)<δ0 for the following (Equation 1).
Γ(k)=xp2(k)/e12(k) (Equation 1)
The error evaluation unit 206 can also get a similar effect by using an absolute value of the primary signal xp(k) instead of the square value of the primary signal xp(k). Note that, here, although 1 and 0 are used as a value of the evaluation result β1(k), the present invention is not limited to this, and as far as two different states of the output can be expressed, any kind of expression can be used. For example, the value of β1(k) may be 10 and 0, or +1 and −1 and so on.
Although the determination method described so far compares Γ(k) with the threshold value and performs a two-level decision with 1 or 0 (a hard decision), the error evaluation unit 206 may apply a soft decision. For example, in case δ0+δ1 is set to an upper limit threshold value of the soft decision and δ0−δ2 is set to a lower limit threshold value, the error evaluation unit 206 may use β1(k)=1 when Γ(k) exceeds the upper limit threshold value, β1(k)=0 when Γ(k) is less than the lower limit threshold value, and β1(k) between 1 and 0 when Γ(k) takes a value between the lower limit threshold value and the upper limit threshold value. That is, β1(k) may be obtained by the following equation (Equation 2).
β1(k)=Γ(k)/(δ1+δ2)+(δ2−1)/(δ0+δ2) (Equation 2)
Although (Equation 2) is a equation providing a linear interpolation between the upper limit threshold value and the lower limit threshold value, β1(k) may be specified by a more complicated arbitrary function or a polynomial equation. The evaluation result β1(k) calculated by either one of the methods described so far is transmitted to the step size control unit 207.
The evaluation result β1(k) is supplied from the error evaluation unit 206 to the stepsize control unit 207. The stepsize control unit 207 obtains a coefficient update amount adjustment value α1(k) according to the value of the evaluation result β1(k) supplied from the error evaluation unit 206 and finally sets a value according to the error to a coefficient-update stepsize μ1(k).
The coefficient update amount adjustment value α1(k) is a function of the evaluation result β1(k), and is expressed as the following (Equation 3) using the function Ψ1.
α1(k)=ψ1{β1(k)} (Equation 3)
Function Ψ1 is set in such a way that a value of the coefficient update amount adjustment value α1(k) becomes small when a value of the evaluation result β1(k) is large, and α1(k) becomes large when a value of the evaluation result β1(k) is small. Note that, 0≦α1(k)≦1. For example, it may be set such that α1(k)=0.125 for β1(k)=1 and α1(k)=1 for β1(k)=0.
The stepsize control unit 207 obtains the final coefficient-update step size μ1(k) by multiplying an original coefficient update amount μ0 (constant) by the coefficient update amount adjustment value α1(k), and supplies it to the adaptive filter 204. That is, the coefficient-update step size μ1(k) is expressed as the following (Equation 4).
μ1(k)=μ0×α1(k) (Equation 4)
The adaptive filter 204 update coefficients using μ1(k) supplied from the stepsize control unit 207 as the coefficient-update stepsize. The adaptive filter 204 can make the coefficient update amount smaller by using the coefficient update amount obtained through adjusting the original coefficient update amount μ0 with the coefficient update amount adjustment value α1(k). In particular, α1(k)=0 represents suspending of the coefficient update and α1(k)=1 represents a state where the stepsize control unit 207 does not have any influence on the coefficient update at all.
The reason why robustness to the interference signal is increased and signal processing which has minimum residual noise and low distortion of the output signal can be realized by making the coefficient update amount smaller is indicated below. First, the reference signal is represented by xs(k) and the output of the adaptive filter 204 is represented by n1(k). Also, a noise source signal is represented as n0(k), a voice source signal is represented by S(k) and a characteristic vector (impulse response) of the acoustic path from the noise source to the input terminal 201 is represented by H1. In order to make explanation simple, this acoustic path is supposed to be time invariant, and sample number k corresponding to a concept of time is omitted. Then, a coefficient vector of the adaptive filter is described as W(k), number of taps thereof is described as M, a noise source signal vector including a time series sample of the noise source signal n0(k) is described as N0(k), and an element of the characteristic vector in the acoustic path is described as h1(j), j=0, 1 . . . M−1. In this case, W(k), N0(k) and H1 are expressed as the following equations. Here, [ ]T expresses transposition of a matrix.
W(k)=[w(k,0),w(k,1), . . . ,w(k,M−1)]T (Equation 5)
N0(k)=[n0(k),n0(k−1), . . . ,n0(k−M+1)]T (Equation 6)
H1=[h1(0),h1(1), . . . ,h1(M−1)]T (Equation 7)
Since the primary signal xp(k) is expressed as the following (Equation 8), the output e1(k) of the subtractor 203 can be expressed as (Equation 9).
xp(k)=S(k)+H1T*N0(k) (Equation 8)
e1(k)=xp(k)−n1(k)=S(k)+H1T*N0(k)−WT(k)*N0(k)=S(k)+{H1T−WT(k)}*N0(k) (Equation 9)
Here, * represents a convolution operation. In the right-hand side of (Equation 9), the second term can be minimized by updating the coefficient vector W(k) of the adaptive filter 204. S(k) behaves independently of the second term and is an interference signal to the second term. That is, when the first term of the right-hand side is not small enough compared with the second term of the right-hand side, the coefficient update using e1(k) is not performed correctly. Accordingly, it is effective to detect that the first term of the right-hand side is not small enough compared with the second term of the right-hand side and make the coefficient update amount smaller.
In this exemplary embodiment, the magnitude relation between xp(k) and e1(k) is used for this detection. First, S(k)=0 for simplicity. In this case, the following equations hold.
xp(k)=H1T*N0(k) (Equation 10)
e1(k)={H1T−WT(k)}*N0(k) (Equation 11)
When the coefficient vector W(k) of the adaptive filter 204 approximates the characteristic vector H1 of the acoustic path to a certain extent, polarities of them are identical. Also, since setting 0 to the coefficient vector W(k) as an initial value is a general way of use of the adaptive filter, and considering that |w(k, j)|>0, j=0, 1, . . . , M−1, the following (Equation 12) holds based on (Equation 10) and (Equation 11).
xp2(k)>e12(k) (Equation 12)
If (Equation 12) does not hold, the reason is that the assumption S(k)=0 is wrong. That is, S(k)≠0. Accordingly, by detecting that an opposite condition of (Equation 12), that is, detecting that the following (Equation 13) holds, it is possible to know whether an interference signal S(k) is zero.
xp2(k)<e12(k) (Equation 13)
On the other hand, especially just after the coefficient update has been started, the coefficient vector W(k) may hardly approximate the characteristic vector H1 of the acoustic path. In this case, a probability that polarities of the coefficient vector W(k) and the characteristic vector H1 of acoustic path are identical starts from 0.5 and comes close to 1 as the coefficient vector converges to the correct value. Accordingly, it can be considered that a probability that detecting the interference signal S(k) using (Equation 13) is correct also starts from 0.5 and comes close to 1. Considering that the existence of the interference signal S(k) has a decisive bad influence on the coefficient update, misdetection is more desirable than no detection. Accordingly, even when the coefficient vector W(k) hardly approximates the characteristic vector H1 of the acoustic path, that is, just after the coefficient update, the detection of the interference signal S(k) using (Equation 13) is effective.
In the description so far, a square value is used for each of the primary signal and the output. However, a similar effect can also be obtained by using an absolute value instead of the square value for each of the primary signal and the output.
According to this exemplary embodiment, with the structure mentioned above, signal processing which has strong robustness to the interference signal, low residual noise and low distortion of the output signal can be realized. In particular, by detecting the interference signal S(k) using (Equation 13) and making the coefficient update amount smaller when signal S(k) is detected, high robustness can be achieved at low processing cost.
A third exemplary embodiment of the present invention will be described using
wherein, R is a natural number representing a number of samples (window size) used to obtain the average value. When the average value is obtained by the leaky integration operation, e12(k) bar is expressed as the next (Equation 15).
e12(k)bar=γ·e12(k−1)bar+(1−γ)·e12(k) (Equation 15)
wherein, γ is a constant which satisfies 0<γ<1. In the same way, xp2(k) bar can also be obtained. The error evaluation unit 306 uses e12(k) bar and xp2(k) bar obtained in this way instead of e12(k) and xp2(k) of (Equation 13).
Averaging operation can be realized by averaging using a sliding window or leaky integration. When the average value of the estimated desired-signal e1(k) (estimated first signal average value) and the average value of primary signal xp(k) (first mixed signal average value) are represented by e12(k) bar and xp2(k) bar respectively, e12(k) bar which is the output of the averaging unit 361 is indicated as (Equation 14) or (Equation 15). Values of R and γ may be different from values used in (Equation 14) or (Equation 15). Also, xp2(k) bar which is the output of the averaging unit 362 can be described as the following (Equation 16) or (Equation 17).
The values R and γ in these equations can also be decided independently from the values used in (Equation 14) or (Equation 15) or the calculation of e12(k) bar mentioned above.
The evaluation result calculation unit 363 receives the output power average value e12(k) bar and the primary signal power average value xp2(k) bar which are the outputs of the averaging unit 361 and the averaging unit 362 and calculates an evaluation result Γ(k) as shown in the following (Equation 18).
Γ(k)=xp2(k)bar/e12(k)bar (Equation 18)
Also, as shown in the following (Equation 19), a logarithm of the right-hand side of (Equation 18) may be defined as Γ(k).
Γ(k)=log10(xp2(k)bar/e12(k)bar) (Equation 19)
β1(k) is determined similarly to the second exemplary embodiment using Γ(k) obtained by (Equation 18) or (Equation 19). β1(k) is supplied to the stepsize control unit 207.
With the error evaluation using the average values shown above, it is possible to absorb statistical uncertainty of the signal and obtain more correct evaluation result β1(k).
A noise cancellation device 400 as a fourth exemplary embodiment of the present invention will be described using
In
The step size control unit 407 calculates the coefficient update amount μ1(k) by the following equation and transmits it to the adaptive filter 204.
μ1(k)=μ0·α1(k)·m1(k) (Equation 20)
m1(k)=Φ1{SNR1(k)} (Equation 21)
Having high of the SNR estimated value SNR1(k) means that power of the voice source signal S(k) is large. Therefore the function Φ1{ } is designed in such a way that a value of μ1(k) becomes smaller when SNR is high, and a value of μ1(k) becomes larger when SNR is low. That is, it is advisable to use a monotonic decreasing function which gives a small value when a value of SNR1(k) is large and gives a large value when a value of SNR1(k) is small, as the function Φ1{ }. Also, its range is defined as m1min≦m1(k)≦m1max. Note that, m1min and m1max are constants which satisfy m1min<m1max.
The stepsize control unit 407 determines a value of μ1(k) as μ0·α1(k) based on the signal α1(k) supplied from the error evaluation unit 206, as described in the second exemplary embodiment. The stepsize control unit 407 further adjusts the value of μ1(k) determined in this way with a value of m1(k)=Φ1{SNR1(k)} based on the SNR estimated value SNR1(k) supplied from the SNR estimation unit 408.
The simplest example of the function Φ1{ } is a downward sloping straight line, and it can be expressed by (Equation 22) to (Equation 24).
m1(k)=m1max (m1max<Φ1{SNR1(k)}) (Equation 22)
m1(k)=Φ1{SNR1(k)}(m1min≦Φ1{SNR1(k)}≦m1max) (Equation 23)
m1(k)=m1min (m1min>1011{SNR1(k)}) (Equation 24)
When values of SNR1 corresponding to the range m1min and m1max are represented by SNR1min and an upper limit value SNR1max respectively, the monotonic decreasing function Φ1{SNR1(k)} can be expressed, for example, by the following (Equation 25) to (Equation 27).
Φ1{SNR1(k)}=−A·SNR1(k)+B (Equation 25)
A=(m1max−m1min)/(SNR1max−SNR1min) (Equation 26)
B={m1max+m1min+A·(SNR1max+SNR1min)}/2 (Equation 27)
In order to obtain small m1(k) when SNR1(k) is large, and in order to obtain large m1(k) when SNR1(k) is small, the monotonic decreasing function is used. Accordingly, other than the linear function shown in (Equation 25) to (Equation 27), a nonlinear function such as a polynomial function or a trigonometric function or a more complicated function expressed by their combination, and so on may be used. Through the procedure mentioned above, the stepsize control unit 407 outputs μ1(k) obtained by (Equation 20) as the stepsize.
The adaptive filter 204 updates coefficients using μ1(k) supplied from the stepsize control unit 407 as the coefficient-update stepsize. The adaptive filter 204 can make the coefficient update amount smaller when the interference signal to the coefficient update exists, by using the coefficient-update stepsize adjusted with α1(k) and SNR1(k) as shown by (Equation 20).
According to this exemplary embodiment, with the structure mentioned above, signal processing which has strong robustness to the interference signal, low residual noise, and low distortion of the output signal can be realized.
A noise cancellation device 500 as a fifth exemplary embodiment of the present invention will be described using
In
According to this exemplary embodiment, with the structure mentioned above, signal processing which has strong robustness to the interference signal, low residual echo and low distortion of the output signal can be realized.
A noise cancellation device 600 as a sixth exemplary embodiment of the present invention will be described using
According to this exemplary embodiment, signal processing which has strong robustness to the interference signal, low residual echo, and low distortion of the output signal can be realized
In the description so far, it is assumed that the reference signal is the noise itself by capturing the reference signal at a location close to the noise source. However, in reality, there is a case where this condition cannot be satisfied. In such a case, the reference signal includes the noise and the voice signal mixed therewith. Such a mixed component of the voice signal in the reference signal is called a crosstalk. A structure of a noise cancellation device applied when the crosstalk exists is disclosed in WO2005/024787.
In this exemplary embodiment, a second adaptive filter for cancelling the crosstalk, similarly to cancelling the noise, is introduced.
A noise cancellation device 700 generates a pseudo voice signal corresponding to a voice signal component which is mixed at the reference input terminal using the second adaptive filter which approximates impulse response of an acoustic path (a crosstalk path) from the voice signal source to the reference input terminal. And the noise cancellation device 700 cancels the voice signal component (crosstalk) by subtracting this pseudo voice signal from the signal inputted to the reference input terminal (reference signal).
The noise cancellation device 700 as a seventh exemplary embodiment of the present invention will be described using
The noise cancellation device 700 generates a pseudo crosstalk n2(k) by transforming a signal, which has a correlation with the crosstalk to be cancelled (output at the output terminal 205=estimated desired-signal), with the adaptive filter. And the noise cancellation device 700 cancels the crosstalk by subtracting this from the reference signal in which the voice and the noise are mixed. In coefficient update of the adaptive filter based on the crosstalk cancellation result, the noise cancellation device 700 compares the signal before and after subtraction, and when the signal after subtraction is not smaller than the signal before subtraction, judges that it is in an abnormal situation and makes a coefficient update amount smaller.
The reference signal in which the voice and the noise are mixed is supplied to the input terminal 202, as a series of sample values. The reference signal is transmitted to the subtractor 703 and the error evaluation unit 706.
The estimated desired-signal e1(k) is supplied to the adaptive filter 704 from the output terminal 205. The adaptive filter 704 performs convolution operation of the estimated desired-signal e1(k) and the filter coefficients and transmits the result to the subtractor 703 as the pseudo crosstalk n2(k).
To the subtractor 703, the reference signal xs(k) is supplied from the input terminal 202 and the pseudo crosstalk n2(k) is supplied from the adaptive filter 704. The subtractor 703 subtracts the pseudo crosstalk n2(k) from the reference signal xs(k), transmits the result to the adaptive filter 204 and the error evaluation unit 706 and, at the same time, feeds it back to the adaptive filter 704.
Since the operation of the error evaluation unit 706 and the stepsize control unit 707 is similar to the operation of the error evaluation unit 206 and the stepsize control unit 207 described in the second exemplary embodiment, the detailed description is omitted.
According to this exemplary embodiment, with the structure mentioned above, signal processing which has strong robustness to the interference signal, low residual noise, low distortion of the output signal can be realized in the situation where the crosstalk exists.
A noise cancellation device 800 as an eighth exemplary embodiment of the present invention will be described using
The noise cancellation device 800 generates the pseudo crosstalk by transforming the signal, which has a correlation with the crosstalk to be cancelled (output at the output terminal 205=estimated value of the voice), with the adaptive filter. And the noise cancellation device 800 cancels the crosstalk by subtracting this from the noise on which the voice is superimposed. In coefficient update of the adaptive filter based on the crosstalk cancellation result, the noise cancellation device 800 compares the signal before and after subtraction, and when the signal after subtraction is not smaller than the signal of before subtraction, judges that it is in an abnormal situation and makes a coefficient update amount smaller. Further, the noise cancellation device 800 controls the coefficient update amount based on a second SNR estimated value supplied from the SNR estimation unit 808, that is, a ratio of the noise component and the voice signal component at the input terminal 202.
Note that, an ideal output of the subtractor 703 is the noise. Therefore a component which changes by the coefficient update of the adaptive filter 704 is a residual crosstalk, and the noise component which is not affected is the interference component in the coefficient update of the adaptive filter 704. From these relations, a second SNR estimated value SNR2(k) supplied from the SNR estimation unit 808 is not a ratio of the voice signal component to the noise component at the input terminal 202, but is a ratio of the noise component to the crosstalk component (noise to crosstalk ratio).
In
The estimated noise signal e2(k) can be used as an estimated value of the noise n0(k) and the output of the adaptive filter 704 n2(k) can be used as an estimated value of the crosstalk component included in the second mixed signal xs(k).
Accordingly, a ratio of them can be used as the SNR estimated value SNR2(k) (estimated value of noise to crosstalk ratio). The SNR estimation unit 808 supplies the obtained SNR estimated value SNR2(k) to the stepsize control unit 807.
Since the SNR estimation unit 808 is exactly same as the structure of the SNR estimation unit 408 described in the fourth exemplary embodiment using
Next, operation of the step size control unit 807 will be described along with the operation of the stepsize control unit 407 which has been already described in the fourth exemplary embodiment.
A signal β2(k) representing existence of the noise n0(k) is supplied from the error evaluation unit 706 to the stepsize control unit 807. Also, the SNR estimated value SNR2(k) is supplied to the stepsize control unit 807 from the SNR estimation unit 808. The step size control unit 807 calculates a value of μ2(k) based on the information by the following equation and transmits it to the adaptive filter 704.
μ2(k)=μ0·α2(k)·m2(k) (Equation 28)
m2(k)=Φ2{SNR2(k)} (Equation 29)
Having high of the SNR estimated value SNR2(k) means that power of the noise n0(k) is large. Therefore the function Φ2{ } is designed in such a way that a value of μ2(k) becomes smaller when SNR is high, and a value of μ2(k) becomes larger when SNR is low. That is, it is advisable to use a monotonic decreasing function which gives a small value when a value of the SNR estimated value SNR2(k) is large and gives a large value when a value of SNR2(k) is small, as the function Φ2{ }. Also, its range is defined as m2min≦m2(k)≦m2max. Note that, m2min and m2max are constants which satisfy m2min<m2max.
A value of μ2(k) is determined as μ0·α2(k) based on the signal α2(k) supplied from the error evaluation unit 706, as described in the fourth exemplary embodiment. The value of μ2(k) determined in this way is further adjusted with a value m2(k)=Φ2{SNR2(k)} based on the SNR estimated value SNR2(k) supplied from the SNR estimation unit 808.
The simplest example of the function Φ2{ } is a downward sloping straight line, and it can be expressed by (Equation 30) to (Equation 32).
m2(k)=m2max (m2max<Φ2{SNR2(k)}) (Equation 30)
m2(k)=Φ2{SNR2(k)}(m2min≦Φ2{SNR2(k)}≦m2max) (Equation 31)
m2(k)=m2min (m2min>Φ2{SNR2(k)}) (Equation 32)
When values of SNR2 corresponding to the range m2min and m2max are represented by SNR2min and an upper limit value SNR2max respectively, the monotonic decreasing function Φ2{SNR2(k)} can be expressed, for example, by the following (Equation 33) to (Equation 35).
Φ2{SNR2(k)}−−C·SNR2(k)+D (Equation 33)
C=(m2max−m2min)/(SNR2max−SNR2min) (Equation 34)
D={m2max+m2min+C·(SNR2max+SNR2min)}/2 (Equation 35)
In order to obtain small m2(k) when SNR2(k) is large, and in order to obtain large m2(k) when SNR2(k) is small, the monotonic decreasing function is used. Other than the linear function shown in (Equation 33) to (Equation 35), a nonlinear function such as a polynomial function or a trigonometric function or a more complicated function expressed by their combination, and so on may be used. Through the procedure mentioned above, the stepsize control unit 807 outputs μ2(k) obtained by (Equation 28) as the stepsize.
The adaptive filter 704 updates coefficients using μ2(k) supplied from the stepsize control unit 807 as the coefficient-update stepsize. The adaptive filter 704 can make the coefficient update amount smaller when the interference signal to the coefficient update exists, by using the coefficient-update-stepsize adjusted with α2(k) and SNR2(k) as shown by (Equation 28).
According to this exemplary embodiment, with the structure mentioned above, signal processing which has strong robustness to the interference signal, low residual noise, and low distortion of the output signal can be realized, in the situation when the crosstalk exists. Noise cancellation which has high follow-up performance to the environmental variation and low output distortion can be achieved for the input signal with wide range of a desired-signal to noise ratio in the primary signal and the reference signal.
A noise cancellation device 900 as a ninth exemplary embodiment of the present invention will be shown in
According to this exemplary embodiment, signal processing which has low residual noise and low distortion of the output signal can be realized with higher accuracy.
As above, although a plurality of exemplary embodiments of the present invention has been explained in detail, a system or a device which is a combination of separate features included in the respective exemplary embodiments arbitrarily is also included in the category of the present invention.
Also, the present invention may be applied to a system including a plurality of devices or it may be applied to a device of stand-alone. Further, the present invention is also applicable to a case when an information processing program which realizes the functions of the exemplary embodiments mentioned above is supplied directly or from remote to a system or a device. Such a program is executed by a processor such as a DSP (Digital Signal Processor) of which a signal processing device or a noise cancellation device is composed. Moreover, a program installed in a computer in order to realize the functions of the present invention by the computer, a medium which stored the program and a WWW (World Wide Web) server from which the program is downloaded are also included in the category of the present invention.
The CPU 1002 controls operation of the computer 1000 by reading the signal processing program stored in the memory 1004. That is, in Step S1011, the CPU 1002 executing the signal processing program inputs a primary signal in which a desired-signal and a noise are mixed from the input unit 1001 and inputs a reference signal in which the desired-signal and the noise are mixed in a different proportion from the primary signal. Next, in Step S1013, the CPU 1002 obtains a pseudo noise signal by performing filter processing to the reference signal with an adaptive filter using coefficients updated based on an estimated desired-signal.
Further, in Step S1015, the CPU 1002 subtracts the pseudo noise signal from the primary signal and outputs an error. In Step S1019, the CPU 1002 compares a value of the primary signal and a value of the error, and in case the value of the error is larger than the value of the primary signal, proceeds to Step S1021, and makes an update amount of the coefficients smaller as compared with a case when a value of the error is smaller than a value of the primary signal. As a result, the same effect as the first exemplary embodiment can be obtained.
Although a part or all of the exemplary embodiments mentioned above can also be described as the following supplementary notes, they are not limited to the followings.
(Supplementary Note 1)
A signal processing device including:
first input means for inputting a first signal;
second input means for inputting a second mixed signal in which the first signal and the second signal are mixed in a different proportion from the first mixed signal;
first subtraction means for outputting the first mixed signal;
a first adaptive filter for performing filter processing to the second mixed signal using a first coefficient updated based on the estimated first signal in order to obtain the pseudo second signal;
first compare means for comparing with a value of the first mixed signal; and
first control means for outputting a first control signal to make an update amount of the coefficient smaller as compared with a case when it is smaller than a value of the estimated first signal, to the first adaptive filter, as a result of comparison by the first compare means.
(Supplementary Note 2)
The signal processing device according to supplementary note 1, wherein the first adaptive filter inputs the second mixed signal.
(Supplementary Note 3)
The signal processing device according to either of supplementary notes 1 or 2, wherein
the first compare means obtains an estimated first signal average value by averaging values of the first mixed signal, and compares the first mixed signal average value and the estimated first signal average value, and
the first control means outputs the first control signal to make the coefficient update amount of the first adaptive filter smaller as compared with a case when the estimated first signal average value is smaller than the first mixed signal average value, to the first adaptive filter, in case the estimated first signal average value is larger than the first mixed signal average value.
(Supplementary Note 4)
The signal processing device according to either one of supplementary notes 1 to 3 further including a first ratio output unit for outputting a ratio to the estimated first signal as a first ratio, wherein
the first control means outputs the first control signal to make a coefficient update amount of the first adaptive filter smaller as compared with a case when the value of the first ratio is small, to the first adaptive filter, in case a value of the first ratio is large.
(Supplementary Note 5)
The signal processing device according to either one of supplementary notes 1 to 4, wherein
the first signal is a near-end signal, and
the second signal is a far-end signal having no correlation with the near-end signal.
(Supplementary Note 6)
The signal processing device according to supplementary note 1, wherein
the first adaptive filter inputs an estimated second signal based on the second mixed signal, and
further including:
second subtraction means for outputting a pseudo first signal which is estimated to be mixed in the second mixed signal;
a second adaptive filter for performing filter processing using a second coefficient updated based on the pseudo first signal;
second compare means for comparing with a value of the second mixed signal; and
second control means for outputting a second control signal to make an update amount of the second coefficient smaller as compared with a case when it is smaller than a value of the estimated second signal, to the second adaptive filter, as a result of comparison by the second compare means.
(Supplementary Note 7)
The signal processing device according to supplementary note 6, wherein
the first compare means obtains an estimated first signal average value by averaging values of the first mixed signal, and compares the first mixed signal average value and the estimated first signal average value,
the first control means outputs the first control signal to make a coefficient update amount of the first adaptive filter smaller as compared with a case when the estimated first signal average value is smaller than the first mixed signal average value, to the first adaptive filter, in case the estimated first signal average value is larger than the first mixed signal average value,
the second compare means obtains an estimated second signal average value by averaging values of the second mixed signal, and compares the second mixed signal average value and the estimated second signal average value, and
the second control means outputs the second control signal to make a coefficient update amount of the second adaptive filter smaller as compared with a case when the estimated second signal average value is smaller than the second mixed signal average value, to the second adaptive filter, in case the estimated second signal average value is larger than the second mixed signal average value.
(Supplementary Note 8)
The signal processing device according to either of supplementary notes 6 or 7 further including a second ratio output unit for outputting a ratio to the estimated second signal as a second ratio, wherein
the second control means outputs the second control signal to make a coefficient update amount of the second adaptive filter smaller as compared with a case when a value of the second ratio is small, to the second adaptive filter, when a value of the second ratio is large.
(Supplementary Note 9)
The signal processing device according to either one of supplementary notes 1 to 8, wherein
the first signal is a desired voice signal,
the second signal is a noise signal,
the first mixed signal is a signal in which the desired voice signal is mixed more than the noise signal,
and the second mixed signal is a signal in which the noise signal is mixed more than the desired voice signal.
(Supplementary Note 10)
A signal processing method including:
generating an estimated first signal by subtracting a pseudo second signal which is estimated to be mixed in a first mixed signal in which a first signal and a second signal are mixed from the first mixed signal; and
when obtaining the pseudo second signal which is estimated to be mixed in the first mixed signal by a first adaptive filter using a second mixed signal in which the first signal and the second signal are mixed in a different proportion from the first mixed signal, making a coefficient update amount of the first adaptive filter smaller as compare with a case when the estimated first signal is smaller than the first mixed signal, in case the estimated first signal is larger than the first mixed signal.
(Supplementary Note 11)
The signal processing method according to supplementary note 10, wherein the first adaptive filter generates the pseudo second signal by inputting the second mixed signal.
(Supplementary Note 12)
The signal processing method according to either of supplementary notes 10 or 11 further including:
obtaining an estimated first signal average value by averaging values of the first mixed signal;
comparing the first mixed signal average value and the estimated first signal average value; and
making the coefficient update amount of the first adaptive filter smaller as compare with a case when the estimated first signal average value is smaller than the first mixed signal average value, in case the estimated first signal average value is larger than the first mixed signal average value.
(Supplementary Note 13)
The signal processing method according to either one of supplementary notes 10 to 12 further including:
outputting a ratio to the estimated first signal as a first ratio; and
making the coefficient update amount of the first adaptive filter smaller as compared with a case when a value of the first ratio is small, in case a value of the first ratio is large.
(Supplementary Note 14)
The signal processing method according to either one of supplementary notes 10 to 13, wherein
the first signal is a near-end signal, and
the second signals is a far-end signal having no correlation with the near-end signal.
(Supplementary Note 15)
The signal processing method according to supplementary note 10, wherein
the first adaptive filter inputs an estimated second signal based on the second mixed signal, and
further including:
generating a pseudo first signal which is estimated to be mixed in the second mixed signal; and
when obtaining the estimated first signal, making an update amount of the second coefficient smaller as compared with a case when it is smaller than a value of the estimated second signal.
(Supplementary Note 16)
The signal processing method according to supplementary note 15 further including:
obtaining an estimated first signal average value by averaging values of the first mixed signal;
comparing the first mixed signal average value and the estimated first signal average value;
making the coefficient update amount of the first adaptive filter smaller as compared with a case when the estimated first signal average value is smaller than the first mixed signal average value, in case the estimated first signal average value is larger than the first mixed signal average value;
obtaining an estimated second signal average value by averaging values of the second mixed signal;
comparing the second mixed signal average value and the estimated second signal average value; and
making the coefficient update amount of the second adaptive filter smaller as compared with a case when the estimated second signal average value is smaller than the second mixed signal average value, in case the estimated second signal average value is larger than the second mixed signal average value.
(Supplementary Note 17)
The signal processing method according to either of supplementary notes 15 or 16 further including:
outputting a ratio to the estimated second signal as a second ratio; and
making the coefficient update amount of the second adaptive filter smaller as compared with a time when the value of the second ratio is small, when a value of the second ratio is large.
(Supplementary Note 18)
The signal processing method according to either one of supplementary notes 10 to 17, wherein
the first signal is a desired voice signal,
the second signal is a noise signal,
the first mixed signal is a signal in which the desired voice signal is mixed more than the noise signal,
and the second mixed signal is a signal in which the noise signal is mixed more than the desired voice signal.
(Supplementary Note 19)
A signal processing program causing a computer to execute:
a step which generates an estimated first signal by subtracting a pseudo second signal which is estimated to be mixed in a first mixed signal in which a first signal and a second signal are mixed from the first mixed signal; and
a step which, when obtaining the pseudo second signal which is estimated to be mixed in the first mixed signal by a first adaptive filter using a second mixed signal in which the first signal and the second signal are mixed in a different proportion from the first mixed signal, makes a coefficient update amount of the first adaptive filter smaller as compared with a case when the estimated first signal is smaller than the first mixed signal, in case the estimated first signal is larger than the first mixed signal.
While the invention has been particularly shown and described with reference to exemplary embodiments thereof, the invention is not limited to these embodiments. It will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the present invention as defined by the claims.
This application is based upon and claims the benefit of priority from Japanese Patent Application No. 2010-228655, filed on Oct. 8, 2010, the disclosure of which is incorporated herein in its entirety by reference.
101 First input unit
102 Second input unit
201, 202, 501 Input terminal
103 Subtraction unit
203, 703 Subtractor
104, 204, 704 Adaptive filter
205 Output terminal
106 Compare unit
206, 706 Error evaluation unit
107 Coefficient update control unit
207, 707 Stepsize control unit
408, 808 SNR estimation unit
361, 362 Averaging unit
363 Evaluation result calculation unit
550 Speaker
551 Microphone
Number | Date | Country | Kind |
---|---|---|---|
2010-228655 | Oct 2010 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2011/071747 | 9/15/2011 | WO | 00 | 4/3/2013 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2012/046582 | 4/12/2012 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4649505 | Zinser, Jr. | Mar 1987 | A |
5848151 | Boudy | Dec 1998 | A |
6700976 | Zhang | Mar 2004 | B2 |
7072465 | Benesty | Jul 2006 | B1 |
20040114542 | Stopler | Jun 2004 | A1 |
20100217587 | Sato | Aug 2010 | A1 |
20110058667 | Takada | Mar 2011 | A1 |
20120183133 | Lindstrom | Jul 2012 | A1 |
Number | Date | Country |
---|---|---|
2-244098 | Sep 1990 | JP |
7-212278 | Aug 1995 | JP |
8-110794 | Apr 1996 | JP |
10-215193 | Aug 1998 | JP |
2000-172299 | Jun 2000 | JP |
2001-22383 | Jan 2001 | JP |
2001-509659 | Jul 2001 | JP |
2008-42816 | Feb 2008 | JP |
2010-152021 | Jul 2010 | JP |
2005024787 | Mar 2005 | WO |
Entry |
---|
Mirchandani, et al, EEE Trans. Circuit & System II Analog & Digital Signal Processing Oct. 1992, “A New Adaptive Noise Cancellation Scheme in the Presence of Crosstalk”. |
Mader et al Signal Processing 80 (2000) pp. 1697-1719 “Step-size control for acoustic echo cancellation filters—an overview”. |
Mirchandani, et al, EEE Trans. Circuit & System II Analog & Digital Signal Processing Oct. 1992, A New Adaptive Noise Cancellation Scheme in the Presence of Crosstalk. |
Ikeda et al, IEEE Trans. Signal Process, Mar. 1999 “An Adaptive Noise Canceller with Low Signal Distortion for Speech Codecs”. |
iKeda et al, IEICE Trans. Fundamental Aug. 1999 “An adaptive Noise Canceller with Low Signal-Distortion in the Presence of Crosstalk”. |
Communication dated Feb. 2, 2016 from the Japanese Patent Office issued in corresponding Japanese Application No. 2012-537638. |
Communication dated Sep. 29, 2015, issued by the Japan Patent Office in corresponding Japanese Application No. 2012-537638. |
Number | Date | Country | |
---|---|---|---|
20130191119 A1 | Jul 2013 | US |