1. Field of the Invention
The invention relates to telecommunication equipment, and in particular, to a method for echo cancellation and suppression.
2. Background Art
Echo Cancellers
Echo Canceller (EC) is a device or software which could cancel echo signals by using some reference signal. The reference signal is also called received signal (Rx signal). The echo signal is also called echo return signal which is mixed in the transmission signal (Tx signal). There are two major types of echo cancellers: one is called Acoustic Echo Canceller (AEC); another one is named as Line Echo Canceller (LEC). Obviously, AEC is used to cancel acoustic echoes and LEC is employed to cancel line echoes.
Line echoes result from imperfect impedance matching by hybrids in two-to-four wire signal conversion. Acoustic echoes happen in the field of telecommunication equipment where acoustically coupled echoes are transmitted back to the receiving party while also providing a full duplex connection; it is also a problem in the filed of audio conferencing to prevent adding noise of the inactive talkers into the output while also providing a conference bridge where two or more conferees can talk at one. Some echo is acceptable in voice conversations, however, users are annoyed by listening to their own speech delayed by the round-trip time of the system.
Echo canceller often consists of an adaptive filter with Least Mean Square (LMS) algorithm which generates an echo replica signal similar to the echo signal. A subtraction between the echo return signal and the echo replica signal is conducted in order to cancel the echo return signal. Due to the fact that the replica signal could not be perfect enough to duplicate the echo return signal for various reasons, there is always some residual echo left in the transmission signal. Echo Suppression is an apparatus which could perform echo reduction and/or echo elimination, in particular for reducing or eliminating residual echo signals left after echo canceller. Due to the inherent problems associated with echo cancellers, many solutions rely significantly on an additional echo suppression stage.
Echo Suppression
Echo Suppressor is a device or software which could dramatically reduce the (residual) echo energy without significantly distorting the non-echo speech signal. Although Echo Suppressor can work alone without combining with echo canceller, it often works as a compensation of echo canceller. Not only Echo Suppressor can dramatically suppress the residual echo energy but also reduce background noise energy by taking the advantage of available parameters. Echo Suppressor could be viewed as an independent function or just a part of Echo Canceller system.
Existing Echo Suppression methods are mostly based on the following approaches:
This invention proposed an Echo Suppression approach which can achieve robust performance with low complexity level.
In accordance with the purpose of the present invention as broadly described herein, there is provided model and system for Echo Suppressor.
The invention proposed an Echo Suppressor which can efficiently suppress both echoes and background noise without introducing “choppiness”. The Echo Suppressor System includes said two adaptive gains Gr(RSR) and Gn(NSR), said one adaptive zeros-filter A1(z) and said one adaptive poles-filter A2(z); wherein, thr gain Gr(RSR) is controlled by RSR (Residual echo level to Signal level Ratio); the gain Gn(NSR) is controlled by NSR (Noise signal level to current Signal (Tx) level Ratio); the filter A1(z) is converted from LSF1 obtained from first modification of LSFTx (Line Spectral Frequencies of Tx signal); the filter A2(z) is converted from LSF2 obtained from second modification of LSFTx.
The features and advantages of the present invention will become more readily apparent to those ordinarily skilled in the art after reviewing the following detailed description and accompanying drawings, wherein:
The present invention discloses an Echo Suppressor system which can suppress residual echoes and background noise even at double talker case, without introducing “choppiness”. The following description contains specific information pertaining to Echo Suppressor. However, one skilled in the art will recognize that the present invention may be practiced in conjunction with various algorithms different from those specifically discussed in the present application. Moreover, some of the specific details, which are within the knowledge of a person of ordinary skill in the art, are not discussed to avoid obscuring the present invention.
The drawings in the present application and their accompanying detailed description are directed to merely example embodiments of the invention. To maintain brevity, other embodiments of the invention which use the principles of the present invention are not specifically described in the present application and are not specifically illustrated by the present drawings.
This invention proposed an Echo Suppression approach which can achieve robust performance with low complexity level. It works by gain-controlled filtering processing which can be defined by using the following filtering model:
where Gn( ) is a gain which is a function of NSR (or SNR); NSR is defined as background Noise level to Signal level Ratio. NSR is measured by analyzing the Tx signal before the echo suppressor and utilizing VAD (Voice Activity Detection) information. Gr( ) is a gain which is a function of RSR; RSR is defined as an estimate of Residual echo level to Signal level Ratio in the Tx signal; RSR estimation is more complicated as explained later. A1(z) and A2(z) are linear predictors consisting of LPC coefficients, which are converted from LSF1 and LSF2 where LSF means Line Spectral Frequencies. Both LPC coefficients and LSF are well-known parameters in speech processing domain, that are often used to represent spectral envelope. LSF1 is obtained by first modification of LSFTx, wherein LSFTx is calculated from doing LPC analysis on the Tx signal (input signal of the echo suppressor); LSF2 is obtained from second modification of LSFTx. The LSF modifications are controlled by the parameters SNR, RSR, and another set of LSFRx which is calculated from the LPC analysis on the Rx signal.
In the equation (1), the gains, which usually do not go down to zero but could be small enough in pure echo area so that pure echoes can not be heard, mainly contribute to non-double talk area to significantly reduce the energy of pure echo or noise. The gain factors could be also smaller than 1 in speech or double talk areas, which depend on the parameters of NSR and RSR. Because NSR and RSR are made changing smoothly or slowly, the gain factors are also changed smoothly in time to avoid any “choppiness”.
The LPC filters A1(z) and A2(z) in the equation (1) mainly contribute to suppress the residual echo spectral formants (see
The basic concept has been summarized in the above. The details will be explained later.
So, the current residual echo energy level can be estimates as
Current Residial Echo Level=(RRR) (Current Rx Signal Level) (3)
With the above formula, the current residual echo level can also be estimated during even double talk areas. Finally, RSR is calculated by
According to the definition of (4), RSR is around the value of 1 in pure residual echo areas and it should be smaller than 1 in double talk areas.
Gain Gn(NSR) could be linear function or non-linear function of the parameter NSR. An example of the linear function can be
G
n(NSR)=1−NSR (5)
where Cn is a constant: 0<Cn<1
Gain Gr(RSR) could be linear function or non-linear function of the parameter RSR. An example of the linear function can be
G
r(RSR)=1−Cr·RSR (6)
where Cr is a constant: 0<Cr<1
Estimate of LSF of Tx signal, LSFTx(i), i=0, 1, . . . , Order-1, is based on LPC-analyzing the Tx signal. Typical number of Order is around 10 for narrow band signal at the sampling rate of 8 kHz.
Estimate of LSF of noise signal, LSFnois(i), i=0, 1, . . . , Order-1, is based on the average (or running mean) of LSFTx(i) in background noise areas of the Tx signal.
Estimate of LSF of Rx signal, LSFRx(i), i=0, 1, . . . , Order-1, is based on LPC-analyzing the Rx signal.
Estimate of LSF of (residual) echo signal, LSFecho(i), i=0, 1, . . . , Order-1, is more difficult especially in double talk areas where residual echo signal is mixed in speech signal. As an example, LSFecho(i) can be calculated by using prediction from LSFRx(i). First, the prediction factors, P(i), are evaluated by taking the recent average ratio (or running mean ratio) between LSFRx(i) and LSFTx(i) in the pure residual echo areas:
Then, the current LSFecho(i) of residual echo is estimated by
LSF
echo(i)=P(i)·LSFRx(i), i=0, 1, . . . , Order-1 (8)
where LSFRx(i) is the current Linear Spectral Frequencies of the Rx signal.
LPC predictors A1(z) and A2(z) are respectively converted from two sets of LSF, noted as LSF1(i) and LSF2(i), i=0, 1, . . . , Order-1. Both LSF1(i) and LSF2(i) are based on modifications of LSFTx(i). The modifications are mainly influenced by LSFecho(i), LSFnois(i), NSR, and RSR. As an example of the modifications, LSF1(i) and LSF2(i) are constructed as following:
LSF
1(i)=λ1·LSFTx(i)+β·LSFecho(i)+α·LSFnois(i), i=0,1, . . . ,Order-1 (9)
LSF
2(i)=λ2·[LSFTx(i)−β·LSFecho(i)−α·LSFnois(i)], i=0,1, . . . ,Order-1 (10)
where
β=Cβ·RSR, (11)
α=Cα·NSR, (12)
Cα and Cβ are constants. Their values should be larger than zero and much smaller than 1. λ1 and λ2 are determined in the following way:
The present invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.
Number | Date | Country | |
---|---|---|---|
60877174 | Dec 2006 | US |