The present invention relates generally to an adaptive filter with improved operational characteristics, and, more specifically, to methods and systems of achieving echo cancellation employing the improved adaptive filter.
Adaptive filters are used in numerous applications to remove undesired frequencies from a signal. In an exemplary application, adaptive filters are used in telecommunication systems, more specifically in echo cancellation systems, to remove from a signal echoes that may arise as a result of the reflection and/or retransmission of modified input signals back to the originator of the input signals. Commonly, echoes occur when signals that were emitted from a loudspeaker are then received and retransmitted through a microphone (acoustic echo) or when reflections of a far end signal are generated in the course of transmission along hybrids wires (line echo).
Although undesirable, echo is tolerable in a telephone system, provided that the time delay in the echo path is relatively short; however, longer echo delays can be distracting or confusing to a far end speaker. Understandably, the telecommunications industry has devoted substantial resources to developing systems that minimize echo, without adversely affecting the ability of two speakers to communicate with one another.
Acoustic echo often occurs in speakerphones that employ one or more microphones together with one or more speakers to enable “hands-free” telephone communication. Line echo originates because telephone facilities usually comprise two-wire circuits within each area connecting individual subscribers with the switching office and four-wire transmission circuits between switching offices in different local exchange areas. A call between subscribers in different exchange areas is carried over a two-wire circuit in each of the areas and a four-wire circuit between the areas, with conversion of speech energy between the two and four-wire circuits being conducted by hybrid circuits. If the hybrid circuit input ports had perfectly matched impedances of the two and four-wire circuits, the signals transmitted from one exchange area to the other would not be reflected or returned to the first area as echo. Unfortunately, due to impedance differences that inherently exist between different two and four-wire circuits, and because impedances must be matched at each frequency in the voice band, it is difficult for a given hybrid circuit to perfectly match the impedances of any particular two and four-wire transmission circuit.
To substantially remove echoes from a communication system, echo cancellation systems and methods employ adaptive filters to generate an estimate of the echo-generating signal (echo estimate) that is then removed from the signal being transmitted back to the originator of the echo-generating signal (far-end source). More specifically, a far-end source transmits a signal (far-end signal) that passes through a connection medium and into an input terminal of a communication unit. The far-end signal received at the input terminal is cross-coupled via a cross-coupling path (either acoustically or in line) and creates a cross-coupling echo component. That echo component combined with a new signal from the near-end (near-end signal) is transmitted back to the far-end source as a composite output signal. In this system, an echo it canceller is conventionally deployed to monitor the far-end signal and generate an estimate of the actual echoes expected to return in the form of a composite signal with the near-end signal. The echo estimates are then applied to a subtractor circuit in the transmit channel to remove or at least reduce the actual echo.
To create an accurate estimate of the actual echoes, various types of adaptation methods are known in the prior art and can be employed in the echo canceller in the form of an adaptive filter. Conventionally, a finite impulse response (FIR) filter is used which carries out the convolution between the far-end signal and the estimated impulse response, N samples in length, of the echo paths. In the most basic model, an adaptive filter (filter vector) operates on a far-end signal vector to produce an estimate of the echo, which is subtracted from the combined near-end and echo signal. The overall output of the adaptive echo canceller is then used to control adjustments made to tap values of the filter vector.
In the aforementioned application, and other applications requiring the use of adaptive filters, a critical design requirement is the ability of the filter system to achieve convergence in a rapid, stable manner and, in the process, use a minimal amount of computational resources. A trade-off traditionally exists between stability, accuracy, and speed of convergence of an adaptive filter.
In that regard, digital filters are commonly categorized into two classes: infinite-length impulse response (IIR) filters and finite-length impulse response (FIR) filters. FIR filters have certain advantages relative to IIR filters, namely that FIR filters are stable and have a linear-phase response. Linear-phase FIR filters are widely used in digital communication systems, image processing, speech processing, spectral analysis and applications where non-linear phase distortion cannot be tolerated. Compared to IIR filters, FIR filters generally require shorter data word length but have much higher orders for the same magnitude specification and, at times, introduce large delays that make them unsuitable for certain applications. For example, when dealing with a system where echo cancellation must be performed for hundreds of channels on the same processor, the use of a conventional linear finite impulse response (FIR) filter to model a long impulse response requires substantial memory and computational resources.
Notwithstanding the above, different types of adaptive filter systems have been employed in echo cancellation systems that attempt to improve the rate of convergence and stability, while still minimizing the computational resources required. U.S. Pat. No. 5,995,620 discloses a method of canceling an echo that, according to the inventors, has an improved convergence time with low complexity. The echo cancellation method includes the step of canceling the echo in a far-end signal with a Kalman filter having a time varying Kalman gain vector K(t) proportion to the vector
(p1(t)×(t−1) . . . pn(t)×(t−n))T
where pi(t) are the diagonal elements of a diagonal matrix P(t) satisfying a Riccati equation, i denotes the ith diagonal elements of P(t), t denotes discrete time, n denotes the number of filter taps, and T denotes transpose.
Despite the aforementioned prior art, an adaptive filter is still needed that achieves rapid convergence without an increase in computational resources or the introduction of instability. Additionally, a method and system of echo cancellation having an improved computational speed, while still remaining stable and minimizing the computational resources required, is also needed.
The present invention is directed toward a novel adaptive filter and novel methods and systems for conducting echo cancellation in telecommunication systems. In one embodiment of the adaptive filter of the present invention, a filter is chosen, preferably an adaptive finite impulse response (FIR) filter of an appropriate length N. Once the filter is chosen, convergence is achieved using a convergence process. With convergence complete, the filter is converted to an infinite impulse response (IIR) filter using a generalization of the ARMA-Levinson approach. In the course of operation, data is received from an input source and used to adapt the zeroes of the IIR filter using the least-mean-square (LMS) approach, keeping the poles fixed. The adaptation process generates a set of converged filter coefficients that are then applied to the input signal to create a modified signal used to filter the data. The error between the modified signal and actual signal received is monitored and used to further adapt the zeroes of the IIR filter.
In a second embodiment of the adaptive filter of the present invention, a filter is chosen, preferably an adaptive finite impulse response (FIR) filter of an appropriate length N. Once the filter is chosen, convergence is achieved using a convergence process. With convergence complete, the filter is converted to an infinite impulse response (IIR) filter using a generalization of the ARMA-Levinson approach. In the course of operation, data is received from an input source and used to adapt the zeroes of the IIR filter using the LMS approach, keeping the poles fixed. The adaptation process generates a set of converged filter coefficients that are then applied to the input signal to create a modified signal used to filter the data. The error between the modified signal and actual signal received is monitored and used to further adapt the zeroes of the IIR filter. If the measured error is greater than a pre-determined threshold, convergence is re-initiated by reverting back to the FIR convergence step.
The present invention is also directed toward an exemplary use of the novel adaptive filter method and system, namely novel media gateways, echo cancellation, and channel equalization methods and systems. Used in media gateways and echo cancellation systems, adaptive filters are used to generate an echo signal component used to cancel the echo generated by the engagement of a far-end signal with a cross-coupling path. The novel adaptive filter method and system presented herein can be used to improve the calculation of the echo impulse response by, among other things, reducing the computational complexity and memory requirements of the coefficient calculation conducted within the adaptive filter. In one embodiment, the novel filter of the present invention is used to generate the echo signal component. After having achieved convergence on a FIR filter and converted the filter to an IIR filter, in accordance with the previously described methodology, the adaptive filter generates an echo estimate by obtaining individual samples of the far-end signal on a receive path, convolving the samples with the calculated coefficients, and then subtracting, at the appropriate time, the resulting echo estimate from the received signal y on the transmit channel. Ongoing adaptation of the filter occurs by the adjustment of the zeroes of the IIR filter.
In a preferred embodiment of an echo cancellation application, the converged FIR filter is the truncated by taking a first set of taps, K, from the truncated FIR filter, taking the last N-K taps of the truncated FIR filter, referred to as hiir, and converting hiir, to an IIR model where K is preferably at or around 10. The truncated FIR filter, together with the IIR filter, is then used in combination to track the system response and filter data.
The present invention provides for adaptive filters that have improved convergence, computational, and memory bandwidth proprieties. When applied to telecommunication applications, the present invention additionally provides for improved methods and systems of canceling echoes.
These and other features and advantages of the present invention will be appreciated as they become better understood by reference to the following Detailed Description when considered in connection with the accompanying drawings, wherein:
Referring now to
The present system will be further described by specific reference to a finite impulse response (FIR) filter. An FIR system is generally described by the difference equation:
where yk represents an output signal that is the summation of the convolution of hi with input signal xk−i. One of ordinary skill in the art would appreciate that there are several methods for implementing an FIR system, including direct form, cascade-form, frequency-sampling, and lattice realizations. While the embodiments of the present invention will be described by reference to a direct form realization, the invention is not limited to direct form realizations and encompass any realization that could be effectively utilized in accordance with the teachings provided herein.
Designing a FIR filter requires the determination of N coefficients from a specification of the desired frequency response of the FIR filter. One of ordinary skill in the art would appreciate how to design a FIR system for a particular application and, more specifically, to define the appropriate length of the FIR filter. In the case of FIR filters being designed for echo cancellation applications, the desired frequency response of the FIR filter is to mimic the behavior of a cross-coupling transformation of a far-end signal, namely the nature and extent of the impulse response generated by the echo path. As such, the length of the FIR filter, designated by N, is calculated based upon the length of the impulse response that needs to be cancelled. For line echo cancellation, the impulse response is typically specified to be 64 ms in duration. The FIR filter length can therefore be approximated by multiplying the expected duration of the echo response, i.e., 64 ms, by the standard sampling rate in telephony systems, 8 kHz. In this case, the FIR filter would comprise 512 coefficients or 512 taps.
The taps, also referred to as coefficients or filter coefficients, of the FIR filter can be realized by using a convergence process that employs any one of several known approaches. To achieve convergence, the approach would typically have to derive a set of filter coefficients that, when applied to an input signal, would generate a signal that was sufficiently close to (converged with) the signal that needed to be filtered. In the aforementioned echo cancellation application, the approach would have to update the filter coefficients once every sample, or every 125 microseconds. The more filter coefficients that need to be updated, the more computational resources and memory that are needed by the filter system. Reducing the number of coefficients would therefore reduce the computational complexity and memory requirements of the adaptive filter. This reduction becomes even more critical when dealing with applications requiring numerous adaptive filter operations, such as multi-channel echo cancellation devices.
A conventionally used approach to achieve convergence comprises the LMS approach. One of ordinary skill in the art would appreciate the calculation that needs to be conducted in order to conduct a convergence process using the LMS approach. Conceptually, by effectuating convergence, the LMS approach enables the determination of values for a set of filter coefficients that comprise a transfer function that best approximates the transformation of an input signal to the noise-containing output signal via a channel. In the case of an echo cancellation application, the LMS approach enables the determination of values for a set of filter coefficients that comprise a transfer function which best approximates the transformation of an input signal to the echo-containing output signal via a cross-coupling pathway. Depending on the length of the FIR filter, the LMS approach can be computationally complex and therefore comprise the most computationally intensive part of any filtration process. Additionally, the LMS approach exhibits slow convergence and requires the trial-and-error determination of an adaptation coefficient, denoted by μ, which controls the speed of convergence and, if improperly selected, effects the stability of the convergence calculation.
Other approaches can be used and may be selected based upon a balance of factors including the rate of convergence, computational requirements, stability, and other properties. The rate of convergence is defined as the number of iterations required for the convergence process, in response to a set of inputs, to converge to the optimum solution. The computational requirements of the convergence process includes: (a) the number of operations (i.e., multiplications, divisions, additions, and subtractions) required to make one complete iteration of the convergence process; (b) the amount of memory needed to store the convergence process and accompanying data; and (c) the engineering investment required to program the convergence process.
As with other recursive algorithmic approaches, the LMS approach computes the coefficient values from the error signal by starting from some set of initial conditions that preferably approximates an assumed initial state of the system and iteratively configuring the taps to minimize the error signal when calculated in some mean-squared sense.
More specifically, given an input vector
J(
where σd2 is the variance of the desired signal d(k), R is the auto-correlation matrix of the input signal
The minimum value of J(
Here,
where ∇J is the gradient of J:
∇J=2E└
where E denotes a statistical expectation. The term in braces in the above equation is the error between the desired and estimated signal, which can be defined as:
e(k)=(d(k)−
In LMS, the statistical expectation is estimated by the instantaneous value of the gradient.
Therefore:
It has been shown that the proper choice of μ should be:
where λmax is the maximum eigenvalue of the auto-correlation matrix R. Since R is not known and, therefore, λmax is not known, one cannot necessarily choose a good value of μ. In practice, a value for μ is usually chosen by trial-and-error. The value of μ affects the filter performance. Smaller values of μ give higher signal-to-noise ratio but take more time to converge. Usually, a designer starts with a relatively large value of μ for fast initial convergence, and then chooses a smaller value for high SNR. Because stability properties have been shown to depend on the energy of the far-end signal, it is preferred to normalize the LMS approach with respect to the energy of the input signal, yielding a normalized least mean square (NLMS algorithm).
In a preferred embodiment, after convergence is achieved on the FIR filter using the above describe LMS method, the FIR filter is then truncated. When applying the novel adaptive filter system and method disclosed herein to an echo cancellation application, high sensitivity to the truncation process may be experienced. As such, inaccuracies in detecting the start and/or end of the response can degrade system performance. Accordingly, it is further preferred to truncate the converged FIR filter, take a first set of taps, K, from the truncated FIR filter, referred to as hfir, take the last N-K taps of the truncated FIR filter, referred to as hfir, and convert hiir to an IIR model, as discussed below. K is preferably at or around 10. The truncated FIR filter, hfir, together with the IIR filter are then used, in combination, to track the system response and filter data, as further discussed below. Although preferred for adaptive filter applications in echo cancellation methods and systems, this truncation step is optional and can be eliminated for other applications.
Referring back to
To one of ordinary skill in the art, the ARMA-Levinson approach is known and appreciated. The approach is a two-channel Levinson algorithm. It does not put any restriction on the number of poles and zeros, except what is required for causality: Number of poles should be greater or equal to the number of zeros. Conceptually, this process enables a FIR filter having N taps to be replaced with a stable pole-zero filter having M taps where M<N.
The input-output relationship of a long FIR filter, denoted by
where y is the output, x is the input and h are the FIR filter coefficients, is approximated in an ARMA model by p poles and q zeroes, denoted by ARMA (p, q) where the output samples yk are given by the following equation:
The FIR to IIR filter conversion requires that we appropriately select the model order p and q. If the chosen model order is too low, then there will be unacceptable errors in modeling. If the model order is too high, then memory and computational resources may be wasted. One method of appropriately selecting the model order selection is to optimize the model order by examining the model order error. Beginning with the definition of the model order error as the mean squared difference between the FIR impulse response, h, and the pole-zero impulse response, i.e. e=10 log (h−ĥ)2, the following calculations are made:
end
The impulse response of the pole-zero filter can be computed by inputting a long unit vector to the pole-zero filter. The length of this unit vector should be greater than the length of the FIR filter.
In the calculation below, define p and q as being the number of zeros and poles respectively and as having a computational complexity on the order (2(max(p,q)2). The approach does not restrict the generated pole-zero approximation to be minimum phase. As such, zeros can he outside the unit circle. Further, assume that p≧q.
Define:
Also, define the autocorrelation matrix:
In light of the above described definitions, the algorithm follows:
Initialization:
Recursions:
For j=1 to p−1
Δj+1e=R(j+1)+R(j)Θ1j++ . . . +R(1)Θjj; Δj+1r=Δ*j+1e.
Kj+1e=−(Σjr)−1Δj+1e.
Kj+1r=−(Σje)−1Δj+1r.
Σj+1e=Σje+Δj+1rKj+1e.
Σj+1r=Σjr+Δj+1eKj+1r.
Θij+1=Θij+j+1eΦj−1+ij; 1≦i≦j.
Θj+1j+1=Kj+1e.
Φij+1=ΦijK+j+1rΘj−1+ij; 1≦i≦j.
Φj+1j+1=Kj+1r,
end
The coefficients of the now converted filter can be directly read from Θ.
During the transition from FIR to the IIR filter, it is preferred to correctly set the initial conditions of the IIR filter in order to avoid errors during the transition. The preferred approach is to correctly set the initial conditions by first flushing the FIR filter with a zero input signal that is equal to length, N, of the FIR filter and saving the resulting output vector, z. This process places the FIR filter in its zero state. Consequently, the coefficients of the SIR filter are also initially set to zero after the conversion because the algorithmic approach described above insures both the FIR and IIR filters have the same zero state response, although not the same zero input response. The flushed output, z, is added appropriately to the output of IIR filter for N samples. The addition of the output vector, z, enables a proper transition from FIR to pole-zero filter.
More specifically,
With the coefficients of the converted filter obtained, the system can filter data and, when necessary, adapt to account for the time variations in the system response, in accordance with the measured error. In systems where the typical time varying response does change enough to generate unacceptable errors, but slowly and without large magnitude variations, it is possible to achieve substantially optimal performance by just adapting the zeroes of the IIR filter and keeping the poles fixed. The adaptation process comprises adapting the zeroes of the IIR filter in accordance with an adaptation process, preferably using the LMS method previously described.
More specifically, where the flushed output, z, is added appropriately to the output of IIR filter for N samples, the recursive LMS method comprises the following steps:
In a second embodiment, the system provides for a secondary adaptation step if measured divergence error is too great. Referring now to
The present invention provides for substantially greater performance relative to a filter system that begins with a FIR filter, converts to an IIR filter, and, when further adaptation is necessary to account for system changes, goes back to a FIR filter, reconverges as a FIR filter, and then reconverts to an IIR filter. When faced with a need to adapt, this system does not adapt its poles and zeros. Any changes in the impulse response after pole-zero or IIR modeling will render the model useless, necessitating a return to the initial FIR convergence step. This increases implementation complexity significantly and adversely affects memory usage and memory bandwidth. Furthermore, because impulse response changes are expected in certain applications, such as acoustic echo cancellation, line echo cancellation, and channel equalization, where the system response changes over time, it is important that the filter model be able to readily adapt to such changes. In this case, it would be highly preferable for the system to be able to adapt as an IIR filter and thereby avoid having to switch between FIR and IIR filters and adversely impact the memory bandwidth and computational resources required.
The novel adaptive filter method and system can be effectively deployed in a telecommunications system in the form of novel echo cancellation methods and systems to effectuate high quality communications, particularly as between users of a public switched telephone network (PSTN) and users of a packet-based network (e.g., the Internet). Referring to
Referring to
s=h‡x
where h is the impulse response of the echo characteristics.
Also being communicated through the transmit line 530 is a near-end signal v 545, communicated from a near-end source 550. The input signal v 545 combines with the echo response s 540 to generate a combined signal y 555. Therefore, the signal sent from the near-end source 550 to the far-end receiver 580, absent echo cancellation, is the signal y 555, which is the sum of the near-end signal v 545 and the echo response s 540.
To reduce and/or eliminate the echo response component s 540 from the signal y 555, a typical system uses an echo canceller 560 having a filter 565 that is capable of applying an impulse response, which is an estimate of the actual impulse echo response h experienced by the far-end signal x 510 as it engages the cross-coupling path 525. As such, a further signal 570 representing an estimate of echo response s 540 is generated by the echo canceller 560 in accordance with the following equation:
=‡x
The echo canceller 560 subtracts the echo estimate signal 570 from the signal y 555 to generate a signal e 575 that is returned to the far-end receiver 580. The signal e 575 thus corresponds to the following equation:
e=s+v−v
The signal returned to the far end receiver 580 is therefore dominated by the signal v of the near-end source 550. To the extent the impulse response more closely correlates to the actual echo impulse response h, then 570 more closely approximates s 540, resulting in the minimization of the magnitude of the echo signal component s 540 on the signal e.
An adaptive filter 565 is used to generate the echo signal component 570. In its simplest form, the adaptive filter 565 generates an echo estimate, i.e., 570, by obtaining individual samples of the far-end signal x 510 on a receive path 513, convolving the samples with an impulse response model of the system, i.e., and then subtracting, at the appropriate time, the resulting echo estimate, 570, from the received signal y 555 on the transmit channel 530. The conventional adaptive filter is a FIR filter using a LMS method for achieving tap convergence.
The novel adaptive filter method and system presented herein can be used to improve the calculation of the echo impulse response by, among other things, reducing the computational complexity and memory requirements of the tap calculation conducted within the adaptive filter. Shown in
As discussed above, to avoid degrading system performance through inaccuracies in detecting the start and/or end of the response, it is preferred in echo cancellation applications to truncate the converged FIR filter, take a first set of taps, K, from the truncated FIR filter, referred to as hfir, take the last N-K taps of the truncated FIR filter, referred to as hiir, and convert hiir to an IIR model, where K is preferably at or around 10. The truncated FIR filter, hfir, together with the IIR filter are then used, in combination, to track the system response and filter data.
Referring now to
Operationally, the novel echo cancellation application has achieved superior performance results in the form of computational savings and decreased filter length. An FIR filter of length N=512 was chosen, converged, and truncated according to the description provided above. The FIR filter was converted to an IIR filter using a pole-zero filter model of p=50 and q=50. To evaluate the ability of the echo cancellation system to adapt to changes over time, two actual hybrid responses, shown as 805, 830 in
Without an adaptive filter, as shown in
The present adaptive filter method and system can be employed in numerous applications employing adaptive processes in conjunction with convolutional coding. Accordingly, another embodiment of the present invention includes a novel method and system for channel equalization. Equalizers are a class of communication system devices used to compensate for distortion experienced in communication channels. Fixed equalizers have the average electrical characteristics of the channel predetermined and a fixed amount of equalization is therefore designed into the equalizer to compensate for the distortion. Adjustable equalizers monitor the channel and provide for equalization that varies, as necessary, to match the distortion determined at the time of monitoring. Adjustable equalizers, also known as adaptive equalizers, can provide for the adaptation using an adaptive process that employs convolutional coding to generate a signal inverse to the distortion, an equalizing signal.
Applied to a channel equalization application, the present invention is used to generate the equalizing signal by achieving convergence on a FIR filter, converting the filter to an IIR filter, in accordance with the previously described methodology and generating an equalizing signal by obtaining samples of a received signal and convolving the samples with the calculated taps. The channel equalizer then applies, at the appropriate time, the resulting equalizing signal to the received signal. Ongoing adaptation of the filter occurs by the adjustment of the zeroes of the IIR filter and not by updating the filter denominator. To match the delay incurred due to conversion of the FIR filter into an IIR filter, a delay is applied. To avoid degrading system performance through inaccuracies in detecting the start and/or end of the response, it is preferred to truncate the converged FIR filter, take a first set of taps, K, from the truncated FIR filter, referred to as hfir, take the last N-K taps of the truncated FIR filter, referred to as hiir, and convert hiir, to an IIR model, where K is preferably at or around 10. The truncated FIR filter, hfir, together with the IIR filter are then used, in combination, to track the system response and filter data. Ongoing adaptation of the IIR filter occurs by the adjustment of the zeroes of the IIR filter and not by updating the IIR filter denominator. To match the delay incurred due to conversion of the FIR filter into an IIR filter, a delay is applied, where D1−1 is a specific value of delay.
The present methods and systems provide for an adaptive filter that significantly reduces the memory requirement, memory bandwidth, and computational resources necessary to operate the filter. Applied to the problem of echo cancellation, embodiments of the present invention obtained a reduction of computational resource usage of roughly 10 times while maintaining an acceptable performance level. While various embodiments of the present invention have been shown and described, it would be apparent to those skilled in the art that many modifications are possible without departing from the inventive concept disclosed herein. For example, the adaptive filter has been applied in an echo cancellation application. It would be appreciated by one of ordinary skill in the art that the filter can be used in any application where the convergence, stability, computational requirements, and memory bandwidth characteristics of the novel filtration method and system could be effectively applied.
Number | Date | Country | |
---|---|---|---|
Parent | 10038367 | Oct 2001 | US |
Child | 11390989 | Mar 2006 | US |