1. Technical Field
The present disclosure relates to the technical field of acoustic echo cancellation and, more particularly, to a method for acoustic echo cancellation and an echo cancellation system thereof.
2. Description of Related Art
Acoustic echo interference commonly occurs in communication and audio systems such as mobile phones, video conferencing, teleconferencing, and VoIP phones. Two primary acoustic elements are involved: the woofer and the microphone. When the two elements are disposed in an enclosed space or close to each other, so that enough sound energy travels from the woofer to the microphone, the resulting echo greatly degrades the quality of communication and is uncomfortable for the user to hear.
Adaptive filtering algorithms have been widely employed in many signal processing applications such as equalization, active noise control, acoustic echo cancellation, and biomedical engineering. The normalized least-mean-square (NLMS) adaptive filter is the most popular due to its simplicity. The stability of the basic NLMS is controlled by a fixed step-size μc. This parameter also governs the rate of convergence, speed of tracking ability and the amount of steady-state excess mean-square error (MSE). Aiming to solve the conflicting objectives of fast convergence and low excess MSE associated with the conventional NLMS, a number of variable step-size NLMS (VSS-NLMS) algorithms have been presented in the past two decades.
Kwong used the power of instantaneous error to derive a variable step-size LMS (VSS-LMS) filter. This VSS-LMS employs a larger step size when the estimation error is large, and vice versa. Aboulnasr pointed out that the advantageous performance of this VSS-LMS and several other variable step-size LMS algorithms is usually obtained in a high signal-to-noise environment. She then developed a scheme using the autocorrelation of errors to alleviate the influence of uncorrelated disturbance. Recently Shin, Sayed, and Song developed a variable step-size affine projection algorithm, which employs the norm of the filter coefficient error vector as a criterion for optimal variable step-size.
Another type of variable step-size algorithm is the regularized NLMS. Mandic derived a generalized normalized gradient descent (GNGD) algorithm, which updates the regularization parameter gradient adaptively. Choi presented a robust regularized NLMS (RR-NLMS) filter, which uses a normalized gradient to update the regularization parameter. It should be noted that the RR-NLMS is effectively a “sign GNGD” algorithm. While most variable step-size algorithms need to tune several parameters for better performance, Benesty introduced a relatively tuning-free nonparametric VSS-NLMS (NPVSS) algorithm.
This invention presents a new nonparametric algorithm, which employs the MSE and the estimated system noise power to control the step-size update. The motivation is that a large MSE increases step-size and a large system noise decreases step-size, and vice versa. Our new VSS-NLMS is easy to implement and gives very good performance.
An object of the present invention is to provide a method for acoustic echo cancellation, wherein a new nonparametric algorithm, which employs the MSE and the estimated system noise power to control the step-size update, is adopted such that the hardware thereof would be easy to implement.
Another object of the present invention is to provide a system for acoustic echo cancellation, wherein a new nonparametric algorithm, which employs the MSE and the estimated system noise power to control the step-size update, is adopted such that the performance thereof would be increased.
In order to achieve the abovementioned first object and other objects of the present invention, an echo cancellation system is provided. The echo cancellation system comprises a speaker, a microphone, a convergence statistic generator, a variable step-size generator, a decision device, an adaptive weight control unit, a transversal filter, and an error signal generator. The speaker is used for outputting an acoustic signal. The microphone is used for receiving an input audio signal. The convergence statistic generator receives the input audio signal, the acoustic signal and an error signal to generate a convergence parameter. The variable step-size generator receives the error signal and the acoustic signal to generate a step size parameter in accordance with the error signal and the acoustic signal. The decision device receives the convergence parameter and the step size parameter to output an adjusted step size parameter, wherein the adjusted step size parameter is equal to a boundary value when the convergence parameter is greater than a convergence threshold, and is equal to the step size parameter when the convergence parameter is smaller than the convergence threshold. The adaptive weight control unit receives the acoustic signal, the adjusted step size parameter and the error signal to generate a coefficient vector. The transversal filter receives the input audio signal and the coefficient vector to generate an echo cancellation signal. The error signal generator receives the input audio signal and the echo cancellation signal to generate the error signal in accordance with the difference between the input audio signal and the echoless output signal.
In order to achieve the abovementioned second object and other objects of the present invention, a method for acoustic echo cancellation is provided. The method comprises the steps of: outputting an acoustic signal; receiving an input audio signal; generating a convergence parameter in accordance with the input audio signal, the acoustic signal and an error signal; generating a step size parameter in accordance with the error signal and the acoustic signal; outputting an adjusted step size parameter, wherein the adjusted step size parameter is equal to a boundary value when the convergence parameter is greater than a convergence threshold, and is equal to the step size parameter when the convergence parameter is smaller than the convergence threshold; generating a coefficient vector in accordance with the acoustic signal, the adjusted step size parameter and the error signal; generating an echo cancellation signal in accordance with the input audio signal and the coefficient vector; and generating the error signal in accordance with the difference between the input audio signal and the echoless output signal.
In order to achieve the abovementioned second object and other objects of the present invention, a communication system is provided. The communication system receives a received signal, represented as x(n), from a channel, and comprises a pilot signal generation unit, a convergence statistic generator, a variable step-size generator, a decision device, an adaptive weight control unit, a transversal filter, and an error signal generator. The pilot signal generation unit is used for outputting a pilot signal, which is represented as d(n). The convergence statistic generator receives the pilot signal, the received signal and an error signal for generating a convergence parameter. The variable step-size generator receives the error signal and the received signal for generating a step size parameter in accordance with the error signal and the received signal. The decision device receives the convergence parameter and the step size parameter to output an adjusted step size parameter, wherein the adjusted step size parameter is equal to a boundary value when the convergence parameter is greater than a convergence threshold, and is equal to the step size parameter when the convergence parameter is smaller than the convergence threshold. The adaptive weight control unit receives the received signal, the adjusted step size parameter and the error signal to generate a coefficient vector. The transversal filter receives the received signal and the coefficient vector to generate an equalized signal. The error signal generator receives the pilot signal and the equalized signal to generate the error signal in accordance with the difference between the pilot signal and the equalized signal.
To sum up, the spirit of the present invention is to vary the step size parameter in accordance with the error signal and the output acoustic signal by means of an easy-to-implement nonparametric VSS-NLMS algorithm, which employs the mean-square error and the estimated system noise power to control the step-size update. The new nonparametric VSS-NLMS algorithm has been shown to perform with fast convergence rate, good tracking, and low misadjustment. In comparison with existing VSS-NLMS algorithms, the proposed algorithm has demonstrated consistently superior performance, both in convergence and in final error level, on both simulated data and real speech data.
In order to further understand the techniques, means and effects of the present disclosure, the following detailed descriptions and appended drawings are hereby referred, such that, through which, the purposes, features and aspects of the present disclosure can be thoroughly and concretely appreciated; however, the appended drawings are merely provided for reference and illustration, without any intention to be used for limiting the present disclosure.
The accompanying drawings are included to provide a further understanding of the present disclosure, and are incorporated in and constitute a part of this specification. The drawings illustrate exemplary embodiments of the present disclosure and, together with the description, serve to explain the principles of the present disclosure.
Reference will now be made in detail to the exemplary embodiments of the present disclosure, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers are used in the drawings and the description to refer to the same or like parts.
In order to conveniently describe the present invention, it is assumed that the echo cancellation system is adopted by a mobile phone. Generally, a mobile phone has a speakerphone function. When the speakerphone function is on, the acoustic signal x(n) is amplified, the amplified acoustic signal is picked up by the microphone, and an echo is induced. The echo path is modeled as an unknown system whose input signal is the acoustic signal x(n), whose output signal is y(n), and whose impulse response is h(n). Therefore, if the adaptive filter's impulse response is designed to be h(n), the filtered signal y′(n) will be equal to y(n). As a result, the error signal will be the signal without the echo.
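The system-identification view described above can be illustrated with a short sketch. It is only an illustration under assumed values: the 3-tap impulse response `h`, the sample values in `x` and the helper name `fir_output` are hypothetical and do not come from the disclosure. It shows that when the adaptive filter's coefficients equal h(n) and no near-end signal is present, the error signal is zero.

```python
def fir_output(coeffs, samples):
    """Return y(n) = sum_k coeffs[k] * samples[n - k] for every n (zero initial state)."""
    out = []
    for n in range(len(samples)):
        acc = 0.0
        for k, c in enumerate(coeffs):
            if n - k >= 0:
                acc += c * samples[n - k]
        out.append(acc)
    return out


# Hypothetical echo path h(n) and loudspeaker signal x(n) (illustrative values only).
h = [0.5, 0.3, -0.2]
x = [1.0, -2.0, 0.5, 4.0, 0.0, -1.0]

y = fir_output(h, x)        # echo picked up by the microphone
y_est = fir_output(h, x)    # adaptive filter output when its response equals h(n)
e = [yi - yhi for yi, yhi in zip(y, y_est)]  # error signal with the echo removed
```

In practice h(n) is unknown and time-varying, which is why the coefficients have to be adapted rather than fixed.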
wherein ε is a positive regularization parameter, and μ(n) is the step size parameter. In general, the value of the step size parameter μ(n) is between 0 and 2. In the equation (1), the greater the step size parameter μ(n) is, the faster the convergence, but the worse the stability. If the step size parameter μ(n) is smaller, the steady-state error will be lower. The problem is that the instantaneous error is contaminated by disturbance noise, such as near-end speech in hands-free communications. In this situation, instead of increasing the step-size, one should either freeze the adaptation or use a smaller step-size.
In order to mitigate this problem, we propose a variable step-size which can be used in the equation (1).
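For reference, one iteration of a regularized NLMS filter can be sketched as follows. This is a minimal sketch that assumes equation (1) is the standard update w(n+1) = w(n) + μ(n)·e(n)·x(n)/(ε + xᵀ(n)x(n)); the function name `nlms_update` and its argument names are hypothetical, and the default `eps=0.1` is simply borrowed from the simulation settings given later in this description.

```python
def nlms_update(w, x_vec, d, mu, eps=0.1):
    """One regularized NLMS iteration (assumed standard form).

    w     : current coefficient vector w(n), list of floats
    x_vec : input vector x(n), same length as w
    d     : microphone sample d(n)
    mu    : step size parameter mu(n), normally between 0 and 2
    eps   : positive regularization parameter
    Returns (w_next, e), where e is the error e(n) computed before the update.
    """
    y_est = sum(wi * xi for wi, xi in zip(w, x_vec))   # filter output y'(n)
    e = d - y_est                                      # error signal e(n)
    norm = eps + sum(xi * xi for xi in x_vec)          # eps + x^T(n) x(n)
    w_next = [wi + mu * e * xi / norm for wi, xi in zip(w, x_vec)]
    return w_next, e
```

With `mu=0` the coefficients are frozen, which corresponds to the "freeze the adaptation" option mentioned above for periods of near-end speech.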
The decision device 503 in the first embodiment further comprises a switching unit 511, a boundary value output unit 512 and a boundary value control unit 513. The switching unit 511 comprises a receiving terminal, a control terminal, a first input terminal and a second input terminal. The receiving terminal of the switching unit 511 receives the step size parameter μ(n). The control terminal of the switching unit 511 receives the convergence parameter ζ. The receiving terminal of the switching unit 511 is coupled to the first input terminal if the convergence parameter ζ is greater than a convergence threshold ζth, and is coupled to the second input terminal if the convergence parameter ζ is smaller than or equal to the convergence threshold ζth.
The boundary value output unit 512 comprises an input terminal and an output terminal. The input terminal thereof is coupled to the first input terminal of the switching unit 511. When the input terminal of the boundary value output unit 512 receives the step size parameter μ(n), the output terminal of the boundary value output unit 512 outputs the boundary value, such that the decision device 503 uses the boundary value as the adjusted step size parameter. The boundary value control unit 513 comprises an input terminal and an output terminal. The input terminal thereof is coupled to the second input terminal of the switching unit 511. When the input terminal thereof receives the step size parameter μ(n), the step size parameter μ(n) is bounded and output as the adjusted step size parameter.
In the embodiment, we propose the new step size parameter μ(n) in the model with a new algorithm to vary the step-size. That is,
where β is a positive parameter that adds one degree of freedom to the design. The estimated MSE $\hat{\sigma}_e^2(n)$ and the system noise power $\hat{\sigma}_v^2(n)$ can be obtained recursively as:
Here, $\hat{r}_{ex}(n)$ denotes the cross-correlation between x(n) and e(n), and $\hat{\sigma}_x^2(n)$ is the input signal power. They can be estimated by
$\hat{\sigma}_x^2(n)=\alpha\hat{\sigma}_x^2(n-1)+(1-\alpha)x^2(n)$ (5)
$\hat{r}_{ex}(n)=\alpha\hat{r}_{ex}(n-1)+(1-\alpha)x(n)e(n)$ (6)
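The two recursions (5) and (6) are first-order exponential averages with forgetting factor α. A minimal sketch is given below; the function names `smooth`, `update_input_power` and `update_cross_correlation` are hypothetical, and applying (6) element-wise to the input vector is an assumption about how the vector case is meant to be handled.

```python
def smooth(previous, new_value, alpha):
    """Exponential recursion: alpha * previous + (1 - alpha) * new_value."""
    return alpha * previous + (1.0 - alpha) * new_value


def update_input_power(sigma_x2_prev, x_n, alpha):
    """Equation (5): running estimate of the input signal power."""
    return smooth(sigma_x2_prev, x_n * x_n, alpha)


def update_cross_correlation(r_ex_prev, x_vec, e_n, alpha):
    """Equation (6), applied element-wise to the input vector x(n) (assumption)."""
    return [smooth(r, xi * e_n, alpha) for r, xi in zip(r_ex_prev, x_vec)]
```

A value of α close to 1 gives a long averaging window and smooth estimates; a smaller α reacts faster but is noisier.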
The variable step-size generator 502 can be represented by the equation (2) to output the step size parameter μ(n). The step-size parameter μ(n) is variable. However, to guarantee stability, we set a minimum value μmin and a maximum value μmax, and the variable step-size μ(n) is bounded by μmin and μmax. In practice, we can set 0≤μ(n)≤2, and the boundary control of the step-size parameter μ(n) is implemented in the abovementioned boundary value control unit 513.
After some straightforward manipulations on equation (2), we get
Assuming the adaptive filter converges, it can be shown that
wherein E{•} denotes mathematical expectation, $\sigma_e^2(\infty)$ represents $E\{\hat{\sigma}_e^2(\infty)\}$, and $\sigma_v^2(n)$ is the noise power.
In the early stage, $\hat{\sigma}_e^2(n)$ is generally large due to the system mismatch, so the adaptive filter uses a large μ(n). When the algorithm starts to converge, $\hat{\sigma}_e^2(n)$ becomes smaller, and μ(n) gets smaller. When the adaptive filter converges to the optimum solution, $\sigma_e^2(\infty)$ is very close to $\sigma_v^2(n)$, resulting in a constant step-size, μ(n)≈1/β. The parameter β, which controls the mis-adjustment of the algorithm, is investigated in the performance analysis later in this description. Note that the contribution of system noise to $\hat{\sigma}_e^2(n)$ is balanced because we utilize $\hat{\sigma}_v^2(n)$ in (2). If the system noise becomes larger or near-end speech occurs, μ(n) tends to decrease. This is exactly the desired behavior in acoustic echo cancellation.
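The behavior described above can be sketched in code. Equation (2) is not reproduced in this text, so the recursion below is an assumption: it smooths μ toward the ratio of estimated MSE to β times the estimated noise power, which is the steady-state value this description states, and then applies the μmin/μmax bounds. The function name `next_step_size` is hypothetical, and the defaults are borrowed from the simulation settings given later.

```python
def next_step_size(mu, sigma_e2, sigma_v2, beta, alpha=0.998,
                   mu_min=1e-5, mu_max=1.0):
    """Assumed recursion:
        mu(n+1) = alpha * mu(n) + (1 - alpha) * sigma_e2 / (beta * sigma_v2)
    followed by bounding to [mu_min, mu_max].
    sigma_e2 and sigma_v2 are the current MSE and noise-power estimates.
    """
    if beta <= 0.0 or sigma_v2 <= 0.0:
        raise ValueError("beta and sigma_v2 must be positive")
    target = sigma_e2 / (beta * sigma_v2)      # large MSE raises it, large noise lowers it
    mu_next = alpha * mu + (1.0 - alpha) * target
    return min(mu_max, max(mu_min, mu_next))   # boundary control


# When the MSE estimate equals the noise estimate, mu settles near 1/beta.
mu = 1.0
for _ in range(20000):
    mu = next_step_size(mu, sigma_e2=0.01, sigma_v2=0.01, beta=30.0)
```

Under this assumed recursion, the fixed point is σe²/(βσv²), so a converged filter with σe² ≈ σv² ends up with μ ≈ 1/β, in line with the text.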
In the embodiment, the decision device 503 provides another VSS-NLMS scheme, which includes the echo-path change detection statistic ζ(n) in our algorithm. The adjusted step-size is updated as
where ζth is a small positive quantity. The statistic ζ(n) is the convergence parameter and is calculated in the convergence statistic generator 501. If the convergence parameter ζ is greater than the convergence threshold ζth, the adjusted step size parameter is set to a boundary value, wherein the boundary value is, for example, equal to one.
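The selection performed by the decision device can be sketched as a single comparison. The function name `adjusted_step_size` is hypothetical; the default threshold 0.35 is taken from the simulation settings given later, and the default boundary value of one follows the example above.

```python
def adjusted_step_size(mu, zeta, zeta_th=0.35, boundary=1.0):
    """Return the boundary value when zeta exceeds the threshold, otherwise mu."""
    return boundary if zeta > zeta_th else mu
```

For example, `adjusted_step_size(0.02, 0.5)` returns the boundary value 1.0, which restarts fast adaptation after an echo-path change, while `adjusted_step_size(0.02, 0.1)` passes the small step size 0.02 through unchanged.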
In our algorithm, we provide three definitions of the statistic ζ(n) for implementation, denoted ζ1, ζ2, and ζ3, whose mathematical forms are described below. The convergence parameter ζ1(n) is represented as
where $\hat{r}_{ex}(n)$ is a cross-correlation between the acoustic signal and the error signal, $\hat{r}_{dx}(n)$ is an estimate of E{x(n)d(n)}, and x(n) represents a vector of the acoustic signal.
The convergence parameter ζ2(n) is represented as
where $\hat{\sigma}_v^2(n)$ represents the energy of the noise signal, $\hat{\sigma}_e^2(n)$ represents the energy of the error signal, and $\hat{\sigma}_d^2(n)$ represents the energy of the input audio signal.
The convergence parameter ζ3(n) is represented as
where $\hat{\sigma}_e^2(n)$ is the energy of the error signal, $\hat{\sigma}_d^2(n)$ is the energy of the input audio signal, and $\hat{r}_{de}(n)$ is an estimate of E{d(n)e(n)}, wherein E{•} represents an expectation.
The steady-state MSE and the normalized steady-state squared norm of the system mismatch vector are derived in the following section relying heavily on the methods. The following assumptions are used to make the analysis tractable.
(1) The adaptive filter is operating in the system identification mode, and has the same structure as that of the unknown system.
(2) Input signal x(n) and system noise v(n) are zero-mean, jointly stationary random processes. Moreover, v(n) is white and independent of x(n).
(3) x(n) and d(n) are jointly Gaussian random processes.
(4) The filter coefficient vector w(n) is independent of the input vector x(n) and the desired response d(n).
Let $\xi_{\min}$ denote the minimum MSE at the filter output, and $\xi_{\mathrm{excess}}$ represent the steady-state value of the excess MSE. Assuming the mis-adjustment is small (0.1 or less) and the regularization parameter ε is much smaller than $x^T(n)x(n)$, and employing the mis-adjustment result of an LMS algorithm, the excess MSE of a conventional NLMS with a constant step-size $\mu_c$ can be shown to be
In the system identification mode, $\xi_{\min}=\sigma_v^2$, so the step-size of our VSS-NLMS converges to $\mu(\infty)=\sigma_e^2(\infty)/(\beta\sigma_v^2)=\sigma_e^2(\infty)/(\beta\xi_{\min})$. Substituting $\mu_c=\sigma_e^2(\infty)/(\beta\xi_{\min})$ into (10), we can derive the excess MSE of the proposed VSS-NLMS as
Using $\xi_{\mathrm{excess}}=\sigma_e^2(\infty)-\xi_{\min}$, (11) takes the form
$(\sigma_e^2(\infty))^2-2\beta\xi_{\min}\sigma_e^2(\infty)+2\beta\xi_{\min}^2\approx 0.$ (12)
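The algebra leading to (12) can be written out as follows. This is a sketch that assumes the conventional small-misadjustment NLMS result, in which the excess MSE equals $\mu_c\xi_{\min}/(2-\mu_c)$; that starting point is not reproduced in this text and is an assumption here. The remaining lines use only the substitutions stated above.

```latex
\begin{align*}
\xi_{\mathrm{excess}} &\approx \frac{\mu_c}{2-\mu_c}\,\xi_{\min}
  && \text{(assumed conventional NLMS result)}\\
\xi_{\mathrm{excess}} &\approx
  \frac{\sigma_e^2(\infty)\,\xi_{\min}}{2\beta\xi_{\min}-\sigma_e^2(\infty)}
  && \text{with } \mu_c=\frac{\sigma_e^2(\infty)}{\beta\xi_{\min}}\\
\bigl(\sigma_e^2(\infty)-\xi_{\min}\bigr)\bigl(2\beta\xi_{\min}-\sigma_e^2(\infty)\bigr)
  &\approx \sigma_e^2(\infty)\,\xi_{\min}
  && \text{with } \xi_{\mathrm{excess}}=\sigma_e^2(\infty)-\xi_{\min}\\
\bigl(\sigma_e^2(\infty)\bigr)^2-2\beta\xi_{\min}\sigma_e^2(\infty)+2\beta\xi_{\min}^2
  &\approx 0
\end{align*}
```

Expanding the product in the third line and cancelling the term $\sigma_e^2(\infty)\xi_{\min}$ on both sides gives the quadratic in the last line, which agrees with (12).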
Solving for σe2(∞) of (12) and noting that the misadjustment is small, we have
$\sigma_e^2(\infty)\approx\xi_{\min}\left(\beta-\sqrt{\beta^2-2\beta}\right),\quad \beta>2.$ (13)
Finally, the misadjustment can be obtained as
$\xi_{\mathrm{excess}}/\xi_{\min}\approx(\beta-1)-\sqrt{\beta^2-2\beta}.$ (14)
Note that the mis-adjustment is less than 0.1 for β>6.05.
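The closed-form results above are easy to evaluate numerically. The sketch below only codes the expression for σe²(∞) in (13) and the misadjustment expression that follows it; the function names `steady_state_mse` and `misadjustment` are hypothetical.

```python
import math


def steady_state_mse(xi_min, beta):
    """Steady-state MSE: xi_min * (beta - sqrt(beta^2 - 2*beta)), valid for beta > 2."""
    if beta <= 2.0:
        raise ValueError("beta must be greater than 2")
    return xi_min * (beta - math.sqrt(beta * beta - 2.0 * beta))


def misadjustment(beta):
    """Excess MSE divided by xi_min: (beta - 1) - sqrt(beta^2 - 2*beta)."""
    return steady_state_mse(1.0, beta) - 1.0
```

Evaluating `misadjustment(6.05)` gives 0.1 up to floating-point rounding, which matches the bound stated above, and the value decreases as β grows, so a larger β trades adaptation speed for a lower final error.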
In the following description of the embodiment, we present simulation results to demonstrate the proposed VSS-NLMS as well as confirm the theoretical analysis presented earlier. The adaptive filter is used to identify an acoustic echo system h(n) of length M. The impulse response of the system is normalized, i.e., $h^T(n)h(n)=1$. The sampling rate is 8 kHz. The system noise is assumed to be a white, Gaussian process with zero mean. The parameter settings of our VSS-NLMS are α=0.998, ε=0.1, μ(0)=1, $\mu_{\min}=10^{-5}$, $\mu_{\max}=1$, and $\zeta_{th}=0.35$. Results are ensemble averages over 200 independent runs.
This example confirms the theoretical performance analysis presented in the previous section. Several cases are studied:
(1) β=30 (the corresponding mis-adjustment is 0.018).
(2) Input signals x(n): white input, AR(1) process, and MA(3) process with $\sigma_x^2=10$.
(3) System noise level $\sigma_v^2=0.1$.
(4) Time-invariant systems of length M=64, 128, and 256.
The experimental values were measured by time-averaging the ensemble average of the corresponding quantity over 8,000 time samples in the range [72,001, 80,000]. These are then compared to the theoretical MSE $\sigma_e^2(\infty)$ from (13). The results are presented in
The excess MSE of the proposed VSS-NLMS is controlled by β. We compare the behaviors of six different values of β (5, 15, 30, 50, 75, and 100). The acoustic echo system of length 128 changes signs at time 5 so that we can investigate the tracking performance of the filters. The input signal is a white Gaussian process with $\sigma_x^2=1$. The system noise has a power $\sigma_v^2=0.01$. The system distance curves (in dB) are presented in
Firstly we evaluate these VSS algorithms in the same scenario as the previous example. The parameter settings are tabulated in
The new non-parametric VSS-NLMS algorithm introduced in this embodiment of the present invention has been shown to perform with fast convergence rate, good tracking, and low mis-adjustment. In comparison with existing VSS-NLMS algorithms, the proposed algorithm has demonstrated consistently superior performance both in convergence and for final error level relative to published algorithms in application on both simulated data and real speech data.
In accordance with the abovementioned first embodiment, a method for acoustic echo cancellation can be provided.
In step S1701, an acoustic signal x(n) is outputted.
In step S1702, an input audio signal d(n) is received.
In step S1703, a convergence parameter ζ is generated in accordance with the input audio signal d(n), the acoustic signal x(n) and an error signal e(n). The convergence parameter may be generated as in equation (10), for example, so the detailed description is omitted.
In step S1704, a step size parameter μ(n) is generated in accordance with the error signal and the acoustic signal. The step size parameter μ(n) may be generated as in equation (2), for example, so the detailed description is omitted.
In step S1705, it is determined whether the convergence parameter ζ is greater than a convergence threshold ζth. If the convergence parameter ζ is greater than the convergence threshold ζth, step S1706 is performed. If the convergence parameter ζ is smaller than or equal to the convergence threshold ζth, step S1707 is performed.
In step S1706, the adjusted step size parameter μ(n+1) is equal to a boundary value, such as 1.
In step S1707, the adjusted step size parameter μ(n+1) is equal to the step size parameter μ(n).
Although the abovementioned embodiment provides only one convergence parameter ζ and one convergence threshold ζth, a person having ordinary skill in the art will recognize that more than one convergence parameter ζ is available, such as those of the previous equations (10.1), (10.2) and (10.3). Therefore, the present invention is not limited thereto.
In step S1708, a coefficient vector is generated in accordance with the acoustic signal, the adjusted step size parameter and the error signal. The detailed description is given in the first embodiment, so the description of the coefficient vector is omitted here.
In step S1709, the echo cancellation signal is generated in accordance with the input audio signal and the coefficient vector.
In step S1710, the error signal is generated in accordance with the difference between the input audio signal and the echoless output signal.
The abovementioned algorithm can be used not only for echo cancellation but also in a communication system, for example in an adaptive equalizer.
Similarly, the variable step size generator 1802 in
In the abovementioned
In the training period, the terminal U1 of the switching unit 1906 is electrically connected to the terminal U2 of the switching unit 1906 and is electrically disconnected from the terminal U3 of the switching unit 1906. The error signal generator 1904 receives the pilot signal d(n) so that the impulse response of the adaptive filter equalizer 1907 can be close to the impulse response of the channel. In the tracking mode, the terminal U1 of the switching unit 1906 is electrically connected to the terminal U3 of the switching unit 1906 and is electrically disconnected from the terminal U2 of the switching unit 1906. The hard decision unit 1905 receives the equalized signal y(n) and performs a hard decision on it to obtain a decision signal d′(n). Since the impulse response of the adaptive filter equalizer is close to the impulse response of the channel, the noise, especially the inter-symbol interference (ISI), can be eliminated.
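The two operating modes can be sketched as a choice of reference signal for the error computation. This is an illustrative sketch under two assumptions that are not stated in the text: the symbols are binary (+1 or -1), and in tracking mode the decision signal d′(n) takes the place of the pilot signal as the reference (decision-directed operation). The function names are hypothetical.

```python
def hard_decision(y):
    """Map an equalized sample to the nearest symbol (assumed binary: +1 or -1)."""
    return 1.0 if y >= 0.0 else -1.0


def reference_sample(training, pilot, y):
    """Training period: use the pilot d(n). Tracking mode: use the decision d'(n)."""
    return pilot if training else hard_decision(y)


def equalizer_error(training, pilot, y):
    """Error e(n) = reference - y(n) for the current mode."""
    return reference_sample(training, pilot, y) - y
```

For instance, with `y = 0.75` in tracking mode the decision is +1 and the error is 0.25, regardless of the pilot value passed in.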
In summary, the spirit of the present invention is to provide a variable step size parameter in accordance with the error signal and the output acoustic signal by means of an easy-to-implement nonparametric VSS-NLMS algorithm, which employs the mean-square error and the estimated system noise power to control the step-size update. The new nonparametric VSS-NLMS algorithm has been shown to perform with fast convergence rate, good tracking, and low mis-adjustment. In comparison with existing VSS-NLMS algorithms, the proposed algorithm has demonstrated consistently superior performance, both in convergence and in final error level, on both simulated data and real speech data.
The above-mentioned descriptions represent merely the exemplary embodiment of the present disclosure, without any intention to limit the scope of the present disclosure thereto. Various equivalent changes, alterations or modifications based on the claims of the present disclosure are all consequently viewed as being embraced by the scope of the present disclosure.
Number | Date | Country |
---|---|---|
20130322638 A1 | Dec 2013 | US |