IMPROVED METHOD AND APPARATUS FOR DETECTING ECHO PATH CHANGES IN AN ACOUSTIC ECHO CANCELLER

Description

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram showing an echo canceller with echo change detection, according to an embodiment of the invention;

FIG. 2 is a flowchart showing method steps for detecting echo path changes according to an embodiment of the invention; and

FIG. 3 is a flowchart showing the method steps of FIG. 2 for method steps for detecting echo path changes and further steps for detecting double-talk, according to an alternative embodiment.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

With reference to FIG. 1, FIG. 1 an adaptive echo canceller is shown according an embodiment of the invention. A reference signal (FE_signal) is applied as an input to the echo canceller and to the acoustic echo path (i.e. the signal is broadcast via a telephone speaker). The echo path gives rise to an Echo Return Loss (ERL), which is a measure of the actual amount of reflected signal. A high ERL indicates only a relatively small signal reflected back to the talker, and vice versa, as discussed above. An adaptive filter (long filter 100) models an estimation of the echo introduced by the echo path using the well known NLMS algorithm (although other adaptive algorithms may be used), and subtracts the echo signal from the near-end input signal (i.e. NE_signal received via a telephone microphone) which contains the undesirable echo, via a first subtractor 110. Provided that the transfer function of the model of the echo path provided by adaptive filter 100 is identical to the transfer function of the echo path, the error signal becomes zero and the filter converges to the correct transfer function, resulting in perfect echo cancellation. The number of coefficients used in the long filter 100 defines its length (Length=N_long, representing the estimated echo path length).

A short default coefficients filter 120 (Default_Length=N, number of coefficients representing the estimated length of the direct echo path) represents the direct echo path captured by the long adaptive filter 100. Filter 120 is non-adaptive such that it does not track echo path changes and also does not diverge during double-talk, as is known from published Canadian Patent Application 2,451,417, referred to herein above. Subtractor 125 outputs an error signal resulting from echo cancellation via filter 120.

According to an aspect of the invention, a further short adaptive filter 130 (Short_Length=Default_Length) is provided for modeling only the direct echo path and adapting quickly whenever a reference signal is present. This is in contrast with filter 100 which adapts slowly when the reference signal is present. The filter 130 is not used for echo cancellation, but only for echo path detection. Specifically, the filter 130 quickly diverges during double-talk and is able to provide an early indication of any echo path changes. A subtractor 135 outputs an error signal resulting from filter 130.

Decision logic 140 distinguishes between echo path changes and double-talk based on the reference signal and the estimated error signals from all three filters 100, 120 and 130, as discussed in greater detail below.

Finally, a non-linear processor 150 (NLP) is provided, as is conventional in adaptive echo cancellers.

Applying the short adaptive filter 130 to the major (direct) echo path changes minimizes the impact on normal FDHF (Full Duplex Hands Free) behavior. Typically, in FDHF applications, the direct echo path reflection is the major contributor to the echo. Since the secondary echo path changes are much smaller due to attenuation in the room, their impact on FDHF performance is not critical such that NLP 150 is capable of handling them.

One difference between non-adaptive default filter 120 and the short adaptive filter 130 is that filter 130 adapts whenever the reference signal (FE_signal) is present, whereas the default filter 120 statically models the previously captured echo path and never adapts. On the other hand, a difference between the short and long adaptive filters 130 and 100 is that the short filter 130 is an under modeled system (i.e. the short adaptive filter 130 only covers the direct echo path, and never converges as deeply as the long NLMS filter 100 when it is in the stable/converged state, or when there is no echo path change). Therefore, in a single talk scenario a measurable difference will exist in the ERLE between short and long adaptive filters 130 and 100 (i.e. the long filter 100 performs better than the short filter 130). By monitoring the difference between the two error energies from filters 100 and 130 the EPC detector logic 140 indicates a change of state from single talk to one of either an echo path change (EPC) or double talk (DT).

During an echo path change the short adaptive filter 130 converges more quickly to the new echo path than the long adaptive filter 100, as discussed above. Consequently the difference between the two error signals output from filters 100 and 130 will be small or even negative, as the short filter 130 becomes better converged than filter 100.

Turning to FIG. 2, operation of the EPC detection logic 140 is shown according to an aspect of the invention. First, the reference signal (FE_signal) and error signals from subtractors 110, 125 and 135 are applied as inputs to EPC detection logic 140 (step 200). EPC detection logic 140 then calculates (step 205) the energies of the error signals output from filter 100 (E_long), filter 120 (E_def) and filter 130 (E_short).

An EPC timer is set to a predetermined value (EPC_DECISION_HOLD) whenever an echo path change (EPC) is detected, as in step 250. The EPC detection logic 140 remains in an EPC detected state until this timer expires. The NLP 150 remains turned on such that the echo is masked, during the time between when the EPC is detected (the timer is set) and the timer has expired. The value EPC_DECISION_HOLD is used to hold the EPC detected state. In a successful prototype, the constant was chosen to be 600 samples (i.e. or 75 ms).

In the event the state hold timer has expired at step 210 (either expired, or never set since it last expired), the EPC_decision is set to FALSE (step 225). This makes sure that the default-state of the EPC detection logic 140 is one in which no echo path change has been detected.

Next, at step 230, a determination is made as to whether the reference signal is present (i.e. the energy of the reference signal exceeds a Threshold (e.g. −32 dBmo) and the measured ERLE of the long filter 100 exceeds a predetermined value (e.g. 12 dB). If either of these conditions fails, the algorithm exits (step 220), indicating either an absence of echo or that the algorithm has not yet converged, such that there no good condition to decide about the echo path change.

In the event of a “Yes” decision at step 230, a determination is made (step 235) as to whether E_long>=(Thresh_activity*E_short), where Thresh_activity is a threshold of, for example, −6 dB in a successful implementation of an embodiment of the invention. This condition is based on the fact that the long filter 100 cancels echo much better than short filter 130 in a stable/converged single-talk scenario. However, in either a double talk or EPC scenario the long filter 100 does not achieve as good ERLE as in the stable/converged single-talk state. This makes the relation between long filter 100 and short filter 130 change significantly so that the long filter 100 does not achieve 6 dB better than the short filter 130. A “No” event at step 235 indicates there is no double talk or EPC. The algorithm exits (step 220).

In the event of a “Yes” decision at step 235, a determination is made (step 245) as to whether E_short<=(Thresh_epc*E_def), where Thresh_epc is a threshold of, for example, −5 dB in a successful implementation of an embodiment of the invention. This condition is based on the fact that the short filter 130 will quickly adapt to the new echo path while the default filter 120 does not. In the event of an echo path change the short filter 130 achieves better ERLE than the default filter 120, by, e.g. 5 dB. A “No” event at step 245 indicates that the activity detected from step 235 is not for EPC. The algorithm exits (step 220).

In the event of a “Yes” decision at step 245, then at step 250 the EPC_decision is set to TRUE and an Echo Path Change (EPC) is detected. As described above, this state will be held for at least EPC_DECISION_HOLD samples (e.g. 600 samples). and the NLP 150 is set to mask the error (i.e. provide full attenuation of the signal), and the algorithm ends (step 220). Alternatively, rather than control the NLP 150 for masking unwanted echo due to echo path changes, the EPC detection logic 140 may be used to control the NLMS adaptation. Specifically, the EPC detection logic 140 may be used to freeze or slow down the adaptation of the long filter 100 when double-talk is detected and to speed up the adaptation of the long filter when an echo path change (EPC) is detected.

As shown in FIG. 3, analysis of the behavior between the fixed filter 120 and the short adaptive filter 130 may be used to determine when a double talk state is entered into. The EPC steps of FIG. 3 are identical to the steps of FIG. 2, and identical reference numerals are used to denote equivalent steps. After common step 210, the DT algorithm determines whether a DT timer has not yet expired, then double talk is detected whereupon a flag (DT_decision) is set to TRUE, The DT timer is updated at step 310 and the algorithm ends (step 220).

In a double talk scenario the default coefficients continue to be valid and may be used to cancel the echo signal. The short adaptive filter 130, on the other hand, updates its coefficients based on the NE_signal (echo+near-end speech) and thus causes a divergence of the coefficients. If the adaptive filter error is consistently worse relative to the fixed filter error over a period of time a double talk condition is identified.

During double talk, the near-end (NE) speech contributes to the residue echo. That is:

Echo=Real_Echo+NE_Speech; and

Energy_residue_echo=Energy_of_Echo−Estimated echo of the NLMS filter 100.

The presence of the near-end signal results in a decrease of ERLE for both long and short adaptive filters 100 and 130, so that the ratio of the error energies between the two filters can no longer achieve the aforementioned measurable difference (i.e. the difference, in dB, between the two filters in the single talk scenario). By monitoring the difference between the two error energies the EPC detector logic 140 is able to identify one of either an echo path change (as discussed above in connection with FIG. 2) or a double-talk condition. In particular, whenever the error energy ratio between filters 100 and 130 is greater than Thresh_activity (e.g. −6 dB) then there is some activity (either DT or EPC), per a “Yes” decision at step 235. Further processing at steps 330, 335 and 340 distinguishes between DT and EPC. Thus, at step 245, the error of the default non-adaptive (fixed) filter 120 and the error of the short adaptive filter 130 are compared.

During an echo path change, the default coefficients in the fixed filter 120 are no longer valid, while the short adaptive filter 130 converges to the new echo path change. Consequently, the error from the short adaptive filter 130 becomes much smaller than the error from the default fixed filter 120. Therefore, when the error energy ratio between the adaptive filter 130 and fixed filter 120 is less than Thresh epc, (e.g. −5 dB) an echo path change is flagged, as discussed above in connection with FIG. 2.

On the other hand, if E_Short<=(Thresh_epc*E_def) is not true, then the EPC_decision is set to FALSE and a further determination is made (step 330) as to whether E_Short>=(Thresh_dt*E_def), where Thresh_dt is +1 dB according to a successful prototype of the invention. To make a reliable decision for double talk this condition has to be consecutively fulfilled for at least DECISION_TIMER_THRESH times. The DECISION_TIMER_THRESH is chosen for 16 samples, according to a successful prototype. If a double talk state is finally detected this state will at least be held for DT_DECISION_HOLD samples. To hold the DT state a DT_hold_timer is set to DT_DECISION_HOLD. The timer is checked at step 300 and updated at step 310 if necessary.

The many features and advantages of the invention are apparent from the detailed specification and, thus, it is intended by the appended claims to cover all such features and advantages of the invention that fall within the true spirit and scope of the invention. Further, since numerous modifications and changes will readily occur to those skilled in the art, it is not desired to limit the invention to the exact construction and operation illustrated and described, and accordingly all suitable modifications and equivalents may be resorted to, falling within the scope of the invention.

Claims

1. An echo canceller for cancelling echo from a near-end signal, comprising: a first adaptive filter having N_long coefficients for converging to an echo path of said near-end signal;a non-adaptive filter representing a direct echo path portion of said echo path and having N_short default coefficients, where N_long>N_short, for quick convergence of said echo canceller at start-up, said default coefficients being replaced by the first N_short coefficients from said first adaptive filter responsive to an improvement in echo return loss enhancement (ERLE) of said first adaptive filter;a second adaptive filter having N_short default coefficients for modeling said direct echo path;decision logic connected to said first and second adaptive filters and said non-adaptive filter for detecting and distinguishing between an echo path change and double-talk; anda non-linear processor responsive to said echo path change being detected by said decision logic for attenuating said near-end signal.
2. The echo canceller of claim 1, wherein said decision logic detects said echo path change by (i) calculating respective energy levels of signals output from said first and second adaptive filters and said non-adaptive filter, (ii) in the event the signal energy ratio between said first and second adaptive filters is equal or greater than a first threshold then indicating one of either said echo path change and double-talk, and (iii) in the event the signal energy ratio between said second adaptive filter and said non-adaptive filter is equal or less than a second threshold then indicating said echo path change.
3. The echo canceller of claim 2, wherein said first threshold is approximately −6 dB and said second threshold is approximately −5 dB.
4. The echo canceller of claim 1, wherein said decision logic detects said double-talk by (i) calculating respective energy levels of signals output from said first and second adaptive filters and said non-adaptive filter, (ii) in the event the signal energy ratio between said first and second adaptive filters is equal or greater than a first threshold then indicating one of either said echo path change and double-talk, and (iii) in the event the signal energy ratio between said second adaptive filter and said non-adaptive filter is greater than a second threshold and equal or greater than a third threshold then indicating said echo path change.
5. The echo canceller of claim 4, wherein said first threshold is approximately −6 dB, said second threshold is approximately −5 dB, and said third threshold is approximately +1 dB
6. A method of cancelling echo from a near-end signal, comprising: filtering said near-end signal according to a first adaptive filter having N_long coefficients for converging to an echo path of said near-end signal;filtering said near-end signal according a non-adaptive filter representing a direct echo path portion of said echo path and having N_short default coefficients, where N_long>N_short, for quick convergence of said echo canceller at start-up, said default coefficients being replaced by the first N_short coefficients from said first adaptive filter responsive to an improvement in echo return loss enhancement (ERLE) of said first adaptive filter;filtering said near-end signal according a second adaptive filter having N_short default coefficients for modeling said direct echo path;detecting and distinguishing between an echo path change and double-talk; andresponsive to said echo path change being detected then attenuating said near-end signal.
7. The method of claim 6, wherein said echo path change is detected by (i) calculating respective energy levels of signals output from said first and second adaptive filters and said non-adaptive filter, (ii) in the event the signal energy ratio between said first and second adaptive filters is equal or greater than a first threshold then indicating one of either said echo path change and double-talk, and (iii) in the event the signal energy ratio between said second adaptive filter and said non-adaptive filter is equal or less than a second threshold then indicating said echo path change.
8. The method of claim 7, wherein said first threshold is approximately −6 dB and said second threshold is approximately −5 dB.
9. The method of claim 6, wherein said double-talk is detected by (i) calculating respective energy levels of signals output from said first and second adaptive filters and said non-adaptive filter, (ii) in the event the signal energy ratio between said first and second adaptive filters is equal or greater than a first threshold then indicating one of either said echo path change and double-talk, and (iii) in the event the signal energy ratio between said second adaptive filter and said non-adaptive filter is greater than a second threshold and equal or greater than a third threshold then indicating said echo path change.
10. The method of claim 9, wherein said first threshold is approximately −6 dB, said second threshold is approximately −5 dB, and said third threshold is approximately +1 dB.
11. The echo canceller of claim 2, further comprising a timer for causing said decision logic to continue attenuating said near-end signal for a time period responsive to initial detection of said echo path change irrespective of fluctuations in said energy levels of signals output from said first and second adaptive filters and said non-adaptive filter.
12. The method of claim 6, further comprising continuing attenuation of said near-end signal for a time period responsive to initial detection of said echo path change irrespective of fluctuations in said energy levels of signals output from said first and second adaptive filters and said non-adaptive filter.

IMPROVED METHOD AND APPARATUS FOR DETECTING ECHO PATH CHANGES IN AN ACOUSTIC ECHO CANCELLER

Information

Publication Number

Date Filed

Date Published

Inventors

CPC

US Classifications

International Classifications

Abstract

Description

Claims