The present invention will further be explained below with reference to exemplary embodiments illustrated in the accompanying drawings, in which:
The echo canceling device 1′ according to the Prior Art as shown in
The loudspeaker 2 reproduces the far-end signal x that has been amplified by the amplifier 17. This reproduced far-end signal appears as an echo e at the microphone 3, together with the near-end signal s. The echo cancellation device 1′ attempts to remove the echo e from the microphone signal z.
The microphone signal z is combined with an echo cancellation signal y produced by the adaptive filter. The combination unit 19 is typically constituted by an adder having a negative or inverting input (−) to which the echo cancellation signal y is fed, resulting in a subtraction of said signal. The residual signal r produced by the combination unit 19 is therefore the difference signal of the echo cancellation signal y and the microphone signal z: r=z−y.
The filter update unit 15 typically determines the correlation of the far-end signal x and the residual signal r and controls the(coefficients of the) adaptive filter 13 in such a way that this correlation is minimized. It can be seen from
A post-processor (PP) 18 processes the residual signal r so as to remove remaining echo components and produces the processed residual signal rp. The echo cancellation signal y and the microphone signal z are fed to the post-processor 18 as auxiliary signals.
An echo canceling device of this type is disclosed in United States Patent Application US 2003/0031315 (Belt et al./Philips), the entire contents of which are herewith incorporated in this document.
Although the echo canceling device of
The echo canceling device 1 according to the present invention shown merely by way of non-limiting example in
The first adaptive filter 13 produces a first echo cancellation signal y1 which is combined with the microphone signal z in the first combination unit 19 to produce a first residual signal r1. Similarly, the second adaptive filter 14 produces a second echo cancellation signal y2 which the second combination unit 20 combines with the first residual signal r1 to produce a second residual signal r2. The post-processor 18 receives this second residual signal r2 and the echo cancellation signals y1 and y2 to produce a processed residual signal rp from which the echoes are substantially completely removed.
In accordance with the present invention, the second adaptive filter 14 operates on the delayed far-end signal xd produced by the delay unit 21, while the first adaptive filter 13 operates on the original far-end signal x. As a result, the first adaptive filter 13 cancels the echoes corresponding with the first part of the impulse response underlying the echo signal, while the second adaptive filter 14 cancels the echoes of a second, further part of the impulse response. This will be further explained with reference to
The EDC is a measure of the remaining energy of the signal and may be expressed mathematically as
where hm is the amplitude of the mth signal sample. In
The amplitude A of the signal of
The present inventors have realized that the direct part (from t=0 to t=T) and the diffuse part (from t=T) of the impulse response have different properties. The echo signal, which can be described as the convolution of the impulse response and the (amplified) far-end signal, should therefore be canceled using two (or more) echo cancellation signals based upon different parts of the impulse response. The operation of the post-processor may be significantly improved by using these different echo cancellation signals.
Returning to
The operation of the filter units 13 and 14 is controlled by the respective filter (coefficient) update units 15 and 16. These filter update units 15 and 16 determine the coefficients of the filter units 13 and 14 respectively using techniques which may be known per se, for example techniques based on the correlation of the far-end signal x and the first residual signal r1 (filter update unit 15), and the correlation of the delayed far-end signal xd and the second residual signal r2 (filter update unit 16). In accordance with the present invention, both the first echo cancellation signal y1 and the second echo cancellation signal y2 are fed to the post-processor 18 to further process the (second) residual signal r2 so as to substantially remove any remaining echoes. The post-processor of the present invention preferably utilizes spectral subtraction to remove remaining echoes. Accordingly, the absolute value |Rp| of the frequency spectrum Rp of the processed residual signal rp may be calculated by:
|Rp|=|R1|−γ1ε.|Y1|−γ2.|Y2| (2)
where |R1|, |Y1| and |Y2| are the absolute values (that is, magnitudes) of the frequency spectra of the (first) residual signal r1, the first echo cancellation signal y1 and the second echo cancellation signal y2 respectively, γ1 and γ2 are a first and a second over-subtraction factor respectively, and ε is an estimate of the achieved echo return loss enhancement (ERLE) of the first adaptive filter 13. The product ε.|Y1| is therefore an estimate of the residual direct echo signal after the first combination unit 19. The (complex) frequency spectrum Rp is determined using the absolute value |Rp| and the phase of the spectrum R2 calculated from the second residual value r2. Using an inverse Fourier transform, the processed residual signal rp is determined from the complex spectrum Rp.
The ERLE factor ε can be estimated by the formula:
where |R1| is the absolute value of the frequency spectrum of the (first) residual signal r1, |Z| is the absolute value of the frequency spectrum of the microphone signal z, and both |R1| and |Z| are averaged over a (short) time period in which no near-end signal (s in
The over-subtraction factors γ1 and γ2 determine the weights of the respective spectra in the spectral subtraction of formula (2) and are typically slightly greater than or approximately equal to 1, for example 1.1, although values ranging from about 0.5 to about 2.0 may be used. It is preferred that γ2 is greater than γ1. The over-subtraction factors compensate for the fact that the amplitudes of the echo cancellation signals typically have some variance relative to their “ideal” values and can be too small to effectively cancel the echo signal.
It can be seen from formula (2) that the frequency spectra of both the first echo cancellation signal y1 and the second echo cancellation signal y2 are used by the post-processor 18. In addition, the frequency spectrum of the first residual signal r1 is used in formula (2). Although
The value |Rp| is typically determined for each frequency component separately and is therefore frequency-dependent. The value |Rp| may further be used to determine a gain factor G which may be defined by:
which gain factor G is then multiplied with the complex value R2 of the residual signal r2 to obtain the spectrum Rp of the processed residual signal rp:
Rp=G.R2 (5)
The original phase of the (complex) spectrum R2 is used to produce Rp and, using an inverse Fourier transform, the signal rp. It is noted that the gain factor G is preferably also frequency-dependent. Gain factors may advantageously be used to limit the amount of echo suppression in order to reduce signal distortion. To this end, the gain factor may be compared with a minimum gain, the gain factor having a maximum value equal to one.
It is further noted that the post-processor 18 preferably processes the signals z, y1, y2, . . . per frame or block (B), each block being subjected to a fast Fourier transform (FFT) to obtain the complex frequency spectrum. This complex spectrum is separated into a magnitude (absolute value) and a phase using well-known techniques.
By using two distinct echo cancellation signals (y1 and y2) in the post-processor, a much improved post-processing is achieved which results in a virtually complete echo cancellation without distorting the near-end signal.
Instead of using the first residual signal r1 as in formula (2), the post-processor may utilize the microphone signal z to determine the processed residual signal rp using the formula:
|Rp|=|Z|−γ1.|Y1|−γ2.|Y2| (6)
where |Z| is the absolute value of the spectrum Z of the microphone signal z. This is schematically illustrated in
The exemplary device 1 of
Referring to
|Rp|=|Z|−γ1.|Y1|−γ2.|Y2|−γ3.|Y3| (7)
where γ3 is the over-subtraction factor of the absolute value |Y3| of the tail spectrum Y3. As is the case with γ1 and γ2, the over-subtraction factor γ3 is slightly greater than or approximately equal to 1, and γ3>γ2>γ1. As can be seen, formula (7) is largely identical to formula (6), with the exception of an added tail term involving the tail estimation. The tail spectrum Y3 may be estimated as follows:
|Y3|=α.|Y3|−1+|Y2B|−1 (8)
where the subscript “−1” indicates a previous block or frame, and where α is a factor related to the reverberation time (often referred to as “T60 time” by those skilled in the art) of the room in which the device is located, α typically being smaller than one. In addition, Y2B is the spectrum of the signal obtained by convolving the last B coefficients of the second filter 14 with the delayed far-end signal xd:
where l0=N2−B, l1=N2−1, the block size B is a number approximately equal to 80, and h2(1) are the coefficients of the second filter 14. This will later be explained in more detail with reference to
It is noted that the tail estimation unit 22 of
The embodiment of
The structure of the filter 13′ of
Similarly, the filter section 14 has a number of filter cells 401+1-40m and associated filter coefficient units 411+1-41m which produce weighted sample values that are summed by a summation unit 43 to produce a second echo cancellation signal y2. These echo cancellation signals y1 and y2 may be added in an adder 44 to produce a combined echo cancellation signal y.
The length of a data block B is typically less than or equal to the length of a filter section so as to allow an entire block of samples to be filtered simultaneously. The block length B is used in formula (9) above. It can be seen that the signal y2B is equal to the signal y2 if the filter length is equal to the block length B. If the block length B is smaller than the filter length, a separate summation unit 43′ (not shown) may be used to produce the signal y2B of formula (9).
The second filter section 14 receives delayed signal samples from the delay unit 21. It is preferred that the delay imposed by delay unit 21 is approximately equal to the delay caused by the filter units 40i of the first filter section 13 so that the filter sections 13 and 14 process contiguous sets of samples.
The combined filter unit 13′ of
It is noted that no delay unit 21 is required in the embodiment of
The echo cancellation device of the present invention may be incorporated into various other devices, for example consumer devices such as mobile (cellular) telephone apparatus. An exemplary telephone apparatus 80 incorporating the echo cancellation device of the present invention is shown in
The preferred embodiment of the present invention may be summarized as an echo cancellation device for canceling any echo in a microphone signal, the device comprising:
As mentioned above, it has been shown that the two adaptive filters may be replaced with a single adaptive filter having two concatenated sections, each section producing an individual echo cancellation signal, to obtain the same benefits.
The present invention is based upon the insight that an echo cancellation device may advantageously comprise two filters which each operate on a different part of the acoustic impulse response and each produce an individual echo cancellation signal. The present invention benefits from the further insight that two or more individual echo cancellation signals may advantageously be used by a post-processor to remove any remaining echoes from the residual signal.
It is noted that any terms used in this document should not be construed so as to limit the scope of the present invention. In particular, the words “comprise(s)” and “comprising” are not meant to exclude any elements not specifically stated. Single (circuit) elements may be substituted with multiple (circuit) elements or with their equivalents.
It will be understood by those skilled in the art that the present invention is not limited to the embodiments illustrated above and that many modifications and additions may be made without departing from the scope of the invention as defined in the appending claims.
Number | Date | Country | Kind |
---|---|---|---|
04105007.1 | Oct 2004 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB05/53365 | 10/13/2005 | WO | 00 | 4/9/2007 |