The invention relates to echo suppression, and more particularly to controlling the echo suppression.
In telephonic communication, it has become common to use speaker phones or other such equipment that does not require users to hold equipment to their ears. This provides for the ability to use both hands while communicating and for multiple people to be involved in the same conversation. The free use of both hands is becoming particularly important in the use of cell phones in an automobile. One of the difficulties in any such system is that the output of a speaker is picked-up by a microphone. One alternative, known as half duplex operation, has been to simply turn off one while the other operates. Thus, only one of the speaker or microphone is operational at a given point in time, but this is not desirable for maximizing natural communication. Another alternative, which provides more natural communication, is to operate in full duplex mode. In order to allow this, echo cancellation has been introduced. This has been effective but often the conversation still does not seem natural because of the cancellation process and the echo cancellation is generally not completely effective.
The performance of a typical acoustic echo canceller in practice is limited by nonlinearities of the devices in the signal path as well as the ambient noise of the near end environment. Due to these limitations an echo canceller may be capable of delivering only 25 dB to 30 dB of attenuation in the echo path. This implies that during periods of single talk, where only the far-end speaker is active, there may still be some small amount of audible echo reflected back to the far end. This can be very annoying to the one speaking at the far end. Therefore it is usually necessary to add some additional attenuation into the signal path during far end only activity. This is typically accomplished by inserting a non-linear echo suppressor function into the echo path following the echo canceller.
Typically echo suppressors add attenuation by ramping the attenuation factor up or down at some predefined constant rate based on the presence of far end and/or near end energies. This has the effect of creating a trapezoidal attenuation window around the speech envelopes and will invariably either cut off the beginning of the near end speech or allow significant echo to pass. The double talk performance of these systems is less than desirable.
Thus, there is a need to provide echo cancellation and attenuation that reduces distortion in a double talk situation.
The present invention is illustrated by way of example and not limited by the accompanying figures, in which like references indicate similar elements, and in which:
Skilled artisans appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions of some of the elements in the figures may be exaggerated relative to other elements to help improve the understanding of the embodiments of the present invention.
In one form, echo suppression is controlled by providing a signal that is a mix of error signal and comfort noise. The mix of these two is selected based on the energies of an input signal and an error signal. This is better understood with reference to the drawings and the following description.
Shown in
In operation, speaker 30 responds to signal x(n) by providing audio when signal x(n) is provided. Fixed filter 24 responds to signal x(n) by providing a signal to adder 12 with a selected relationship to signal x(n). Similarly, adaptive filter 26 responds to signal x(n) by providing a signal to adder 14 with a selected relationship to signal x(n). Signal ef(n) is attenuated by controlled amplifier 16 according to a factor a which is a positive number less than one. If a equals “1” there is no attenuation. If a equals “0” signal ef(n) is completely blocked. Similarly controlled amplifier 18 attenuates signal g(n) from comfort noise generator 22 based on one minus factor a. Thus, adder 17 thus receives signals ef(n) and g(n) in attenuated form. Adder 17 thus provides signal s(n) as a combination of these attenuated forms of signals ef(n) and g(n).
The factor a is obtained by the use of ES control function 20. ES control function 20 calculates ERLE by using the output of adder 14 and signal y(n). ES control function 20 then calculates factor a using the ERLE. The use of an adaptive filter, signal y(n), and an adder to find the ERLE is known in the art and may be achieved in such a manner for this purpose. The a factor is calculated as shown in
The equations for the exponent (e) and factor a, where m is the scaling factor, are as follows:
e=(ERLE/Ceiling)(Noise floor)
a=10me
For the case in which there is no signal coming from the far end, signal x(n) is small, and for simplicity is considered zero for this example. When signal x(n) is zero, adaptive filter 24 provides no canceling signal to adder 12 so that signal ea(n) and signal y(n) are the same. In such case, ERLE is zero. With ERLE zero, the exponent in obtaining factor a is zero. With the exponent at zero, factor a becomes 1. Thus, for the case in which signal x(n) is without energy, factor a is one which causes signal ef(n) to not be attenuated and signal g(n) to be completely blocked. In this case, signal y(n) is effectively passed through as error signal s(n), which is the output provided to the far end. This is the desirable result, because there is no need for additional attenuation when all of the audio that is being generated by the system is at the near end and that audio is being received at the far end.
For the case in which there are relatively large signals at both x(n) and y(n), which is the double talk situation, adaptive filter 26 will diverge due to the uncorrelated energy of signals x(n) and y(n) and will provide no significant echo cancellation. Thus, in such case, ea(n) is very close to y(n). With the result of there being no difference between y(n) and ea(n), ERLE is calculated to be zero. Thus results in a zero exponent so that a is one. With factor a at zero, controlled amplifier 16 provides no attenuation and controlled amplifier provides complete blocking with the result of signal ef(n) passing through as signal s(n). Thus, in the double talk situation, in which the predominant information in both directions is valid, there is no additional attenuation added by the echo suppressor.
For the case in which signal x(n) is large and there is no audio being originated at the near end, which is a single talk situation, signal y(n) receives significant echo cancellation at adder 14 due to adaptive filter 26. Since no audio is being generated from the near end, signal y(n) is all echo so that signal ef(n) is preferably zero and the difference between signal y(n) and signal ea(n) is large. With a large difference between the energy of y(n) and ea(n), ERLE is relatively large. With ERLE relatively large, the exponent is a relatively large negative number with the result that factor a is effectively zero. With factor a at zero, controlled amplifier 16 provides no input to adder 17 whereas controlled amplifier 18 passes signal g(n) unattenuated to adder 17. The result in such a case is that signal s(n) is provided as comfort noise without error signal being present.
Adaptive filter 26 loses correlation very quickly if there is audio originated from the near end. Thus, any speech from the near end will cause signal y(n) to be less correlated to x(n) resulting in adaptive filter 26 losing correlation and making signal ea(n) the same as y(n), which in turn causes factor a to become 1, thereby passing ef(n) as s(n). The equation used for factor a is as follows:
Shown in
In the foregoing specification, the invention has been described with reference to specific embodiments. However, one of ordinary skill in the art appreciates that various modifications and changes can be made without departing from the scope of the present invention as set forth in the claims below. For example, signal x(n) is shown being directed connected to fixed filter 24, adaptive filter 26, and speaker 30. Such signal x(n) would be buffered in different ways for each of these elements. In particular, speaker 30 would need different buffering than would filters 24 and 26. Similarly, controlled amplifiers 18 and 16 may provide an output of a greater magnitude than the input signal for the cases where the signal is to be passed. Also, comfort noise generator 22 may be excluded so that no comfort noise is generated or passed through as signal s(n) while the operation of controlled amplifier 16 could be maintained. Accordingly, the specification and figures are to be regarded in an illustrative rather than a restrictive sense, and all such modifications are intended to be included within the scope of present invention.
Benefits, other advantages, and solutions to problems have been described above with regard to specific embodiments. However, the benefits, advantages, solutions to problems, and any element(s) that may cause any benefit, advantage, or solution to occur or become more pronounced are not to be construed as a critical, required, or essential feature or element of any or all the claims. As used herein, the terms “comprises,” “comprising,” or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Number | Name | Date | Kind |
---|---|---|---|
4064379 | Horna | Dec 1977 | A |
4165449 | Vachon | Aug 1979 | A |
4213014 | Luder | Jul 1980 | A |
4282411 | Stewart | Aug 1981 | A |
4368360 | Sato | Jan 1983 | A |
4546216 | Tegethoff | Oct 1985 | A |
4628156 | Irvin | Dec 1986 | A |
4852081 | Bonnet | Jul 1989 | A |
4864608 | Miyamoto | Sep 1989 | A |
4885755 | Yoshida | Dec 1989 | A |
5016271 | Ford | May 1991 | A |
5271057 | Addeo | Dec 1993 | A |
5283784 | Genter | Feb 1994 | A |
5327495 | Shenoi | Jul 1994 | A |
5381475 | Cavallo | Jan 1995 | A |
5390250 | Janse et al. | Feb 1995 | A |
5563944 | Hasegawa | Oct 1996 | A |
5619566 | Fogel | Apr 1997 | A |
5668794 | McCaslin | Sep 1997 | A |
5790657 | Fujiwara | Aug 1998 | A |
5852661 | Chen | Dec 1998 | A |
6148078 | Romesburg | Nov 2000 | A |
6160886 | Romesburg | Dec 2000 | A |
6282176 | Hemkumar | Aug 2001 | B1 |
6532289 | Magid | Mar 2003 | B1 |
6542611 | Lane et al. | Apr 2003 | B1 |
Number | Date | Country | |
---|---|---|---|
20050058277 A1 | Mar 2005 | US |