1. Field of the Invention
The present invention relates to techniques for Least Mean Square (LMS) Echo Cancellation (EC) in a DMT (Discrete Multitone Technique) modem, in particular a ADSL Central Office (CO) modem.
2. Description of the Prior Art
In the following reference will be made to several widely known methods and circuits for telecommunications. Reference is made to publications such as N. Benvenuto and G. Cherubini ‘Algoritmi e Circuiti per le Telecomunicazioni’, Libreria Progetto, Padova Italy, S. Haykin, “Adaptive Filter Theory”, Prentice Hall, N.J., 1986, B. Widrow, “Adaptive filters”, in aspects of Network and System Theory, ed. Kalman, N.Y., 1970, G. Cariolaro, “Analisi spettrale”, Libreria Progetto, Padova, Italy, 1995, G. Cariolaro, “La Teoria unificata dei segnali”, UTET, 2001.
The new generation of ADSL modem requires a high performance adaptive echo-cancellation method, that shows improved convergence properties.
Known echo cancellations methods for cancelling the echo part of a received signal in presence of far-end signal, like the methods described in “Discrete Multitone Echo Cancellation” Minnie Ho, Cioffi, IEEE Transactions on Communications Vol. 44 NO. 7 Jul. 1996, show performance drawbacks that make them not usable in a real ADSL application.
The common approach provides for modeling the acoustical echo path using an adaptive linear combiner, which is a digital filter with adjustable coefficients. The coefficient update is performed via some adaptation method.
Recursive Least Squares (RLS) based methods have a too high computational complexity and are not reliable, whereas other time domain Least Mean Squares methods show poor results, i.e. poor convergence vs. tracking properties or no compliancy with ITU G.992.1 standard. Finally, methods based on frequency domain decision feedback with LMS error minimization, that, to decrease the computation complexity, simplify the update equation, show a degradation of convergence because of the loss of information due to such a simplification.
On the other hand “Full LMS” methods, that are efficient computational complexity LMS methods with frequency domain decision feedback, i.e. with no update simplification, are considered to show the best performance among EC LMS based methods. It is known a simplified version of such method called “Leven”, providing for reducing the computational complexity, that does not suffer of some performance degradation like other methods. The main drawback of such “Leven” method, however, is the increase of the peak excess noise that can be tolerated reducing the correction coefficient.
It is also known that the introduction of a per-tone correction can improve the convergence performance but it can sometimes create instability.
Both “Full LMS” and “Leven” methods can be considered optimum from a point of view of LMS error functional.
This is true for a reference stationary white signal.
However, the ADSL downstream DMT signal is:
Said two features make both “Full LMS” and “Leven” sub-optimum when applied to ADSL DMT signal.
In fact, in a non-white condition the Eigen value spread of the received signal correlation matrix is increased and so a reduced convergence speed can be observed.
Further, it is quite apparent that one of the basic properties of all the adaptive theory do not hold: a DMT signal is not a stationary signal. This is mainly due to the symbol based nature of such type of modulation. For example, if only a reduced number of tones is used, then all the samples of a certain symbol are strongly correlated while no correlation occurs with samples of other symbols. It is apparent therefore that increasing the number of tones used then the correlation between different samples of the same symbol decrease reducing the level of non-stationarity of the transmitted signal.
Also the insertion of the Cyclic Prefix CP, that is a partial symbol repetition, increase the level of cyclostationarity.
A phenomenon called “low taps problem” related both to “Full LMS” and “Leven” methods was recently observed, having the following features:
In the following, the result of some simulations is described in order to clarify the impact of “low taps problem” into the speed of convergence on a full LMS echo cancellation method.
In
In
In this condition the performances are:
This means a loss of performance of 0%.
In
In
In
In this condition the performances are:
This means a loss of performance of −3%.
Due to the fact that the convergence is an exponential decreasing function the speed of convergence decrease in function of the time. This problem is a severe drawback for the whole Echo Canceller approach.
Implementation in Central Office modems accounts for the most critical situation in ADSL environment, because the downstream signal is a short bandwidth reference signal and so all estimation problems become relevant.
The object of the present invention is thus to overcome the drawbacks of the prior art arrangements considered in the foregoing while providing an improved solution for echo cancellation in a DMT modem.
According to the invention such an object is achieved by means of a method having the features set forth in the annexed claims, which form an integral part of the description herein. The invention also relates to a corresponding modem apparatus and a corresponding computer program product directly loadable in the memory of a digital computer and comprising software code portions for performing the method of the invention when the product is run on a computer.
Substantially, a method for cancelling the echo part of a received signal comprising a full per-tone compensation method and a low modes compensation method, that is a simplification of the previous one, are proposed, the proposed methods involving decoupling of the non-white reference signal from cyclostationary reference signal problems. To solve the first problem it is proposed a solution that is dependent only on the tone mask used and to solve the second one it is proposed an efficient align-dependent error compensation.
Further objects, features and advantages of the present invention will become apparent from the following detailed description and annexed drawings, which are provided by way of non limiting example, wherein:
It was observed that the low taps problem phenomenon can be explained with the observation that the DMT signal is a non-white cyclostationary signal. As mentioned above, it was observed that is align-dependent and this is due to the cyclostationary nature of the signal. In other words the reference signal, depending on the alignment, does not excite the same taps of the echo cancellation impulse response.
It was also observed that for an extended number of downstream tones (full overlap case) said low taps problem phenomenon cannot be measured. This is in line with the previous consideration about the signal being ciclostationary. This also means that the worst case condition is related to a reduced number of tones (i.e. ITU G.992.1 Annex B AFDD).
Also the fact that the number of the “low taps” is equal to the number of cyclic prefix samples indicates a relation.
Said phenomenon is always present in the same taps of the echo cancellation impulse response and the impact on the performance is not the same depending on the energy of the impulse response under the window with the “low taps problem”.
Since the “low taps problem” is related to the nature of the reference DMT signal and the statistical properties can a priori be known, the basic concept underlying the presently preferred embodiment of the invention is to exploit this knowledge together with the alignment to compensate in part or at all the Eigen value spread, especially into low taps.
Also it is proposed to decouple the two different problem with two different solution:
The proposed method is particularly suitable for implementation in the Central Office modem side, because at CPE (Customer Premises' Equipment) side the per-tone compensation is enough to have good performances for different alignment.
In the following a echo cancellation method that involves full compensation is described.
In the proposed full LMS method with T=T(k) is indicated the state vector matrix of the echo canceller filter, where w(k) indicates an estimated impulse response of a FIR echo path filter in the step (symbol) k, μ indicates the increment correction term and e=e(k) the error of updating.
The update w(k+1) of the FIR filter can be written as:
w(k+1)=w(k)+μ·TH·e Eq. 1
Usually the increment correction term μ is considered a scalar correction. It can be considered, in frequency update environment, a per-tone correction.
In the proposed method it is provided to derive the energy introduced in the calculation per tone that means to derive the mean power (in each row) of
FT(k)=F·TH Eq. 2
where F is a Discrete Fourier Transform matrix, and to fix the per-tone correction A to the inverse of such quantity. The basic idea is to derive the statistical power of each tone vector and to compensate it. In other words it is proposed to extract an increment correction μk for each tone k:
where N indicates the dimension of the state vector matrix T.
In the proposed method it is considered like a square matrix correction.
In the following a stability and performance analysis of the full compensation LMS method based on the mean squared value of the estimation error e(k) is presented.
The weight-error vector is defined like difference of the estimated impulse response of the FIR filter w(k) and w0 the optimum Wiener solution for the tap-weight vector at iteration k.
ε(k)=w(k)−w0 Eq. 4
Defined h the impulse response of the echo-channel it is also defined d=T·h, then the output of estimated can be written
e(k)=d(k)−T(k)·w(k) Eq. 5
Using equation 1 into eq. 4 and finally substituting eq. 5 it can be obtained the weight error vector expression:
ε(k+1)=[I−μ·TH·T]ε(k)+μ·TH·e0(k) Eq. 6
where e0(k) in the estimation error produced in the optimum Wiener solution
e0(k)=d(k)−T··w0 Eq. 7
Now, the direct averaging method used in the proposed echo cancellation method is described.
Equation 6 is a stochastic difference equation in the weight-error vector ε(k). To study the convergence behavior of such stochastic algorithm in an average sense, it is possible to invoke a direct-averaging method, known from H. J. Kushner, “Approximation and Weak Convergence Methods for Random Processes with application to Stochastic System Theory”, MIT Press, Cambridge Mass., 1984. According to this method equation 6, for a small step size parameter μ, is close to the solution of another stochastic difference equation whose system matrix is equal to the ensemble average:
E[I−μ·T
H
·T]=I−μ·R
T Eq. 8
The correlation matrix RT is always nonnegative definite. Also the correlation matrix RT is definite positive except for noise-free signal-processing problems.
Now, the stability of the Least Mean Square procedure is discussed.
To determine the condition for the stability of the proposed LMS update, a method which is similar to the steepest-descent algorithm stability approach is used. The natural modes of the method are examined.
To study the stability of this method the feedback loop term of equation 9 can be considered
ε(k+1)=[I−μ·RT]ε(k) Eq. 10
Using the unitary similarity transformation, the correlation matrix can be expressed as follow
RT=Q·Λ·QH Eq. 11
The matrix Q has as its columns an orthogonal set of eigenvector associated with the Eigen value of the matrix RT.
Also Q is a unitary matrix ( ) QH·Q=I=>Q−q=Q 15 H and is a diagonal matrix whose elements are the Eigen values of the matrix RT.
Substituting eq. 11 into eq. 10 it is obtained
ε(k+1)=[I−μ·Q·Λ·QH]ε(k) Eq. 12
Defining a new set of coordinates ∓as follow
v(k)=QH·ε(k) Eq. 13
The matrix μQ can be seen like a diagonal matrix with element μQ,n. Indeed it is enough to compensate all the modes.
From eq. 14 the nth natural mode of the method can be written as
vn(k+1)=(1−μQ,n·λn)vn(k), n=0, . . . , N−1 Eq. 16
Eq. 16 is a geometric series. For stability the magnitude of the geometric ratio must be less then 1.
For all the real and positive Eigen value a necessary and sufficient condition for the convergence or the stability is
The n-th time constant can be expressed in terms of the step size parameter and Eigen value
Applying the same speed of convergence (same time constant) to all the modes it is obtained
μQ=μ0·Λ−1 Eq. 19
It is clear that all this operations are possible with the knowledge of the correlation matrix RT.
Therefore it is shown in the following the method of derivation of the correlation matrix for a DMT signal.
The major problem of a DMT signal is that it is not a stationary signal.
The correlation of a signal x the function is
rX(τ,t)=E[x(t)·x*(t−τ)] Eq. 21
A DMT signal is a cyclostationary signal. This means
rx(τ,t)=rx(τ,t+Tc) Eq. 22
Into the following considerations will be done the hypothesis of uncorrelation between samples of different symbols.
The DMT Signal Just after demodulator s(t) is expressed as
Without loss of generality it is imposed T=1. Due to correlation periodic property it can be derived rx(τ,t) for −v<t<NT.
This means that it is possible to refer to only one symbol (m=0 for example). Imposing also, αmk=(αmNT−k)* with k=0, . . . , NT/2−1, eq. 23 yields a real baseband signal.
From eq. 23 it is obtained
This equation is rewritten keeping into account the symmetry (hermitian) of the information per-tone.
Into the hypothesis that no information will be inserted into DC and Nyquist tones it is obtained
Eq. 25 can be rewritten in function of only sine and cosine.
Defined
Substituting eqs. 27 which introduce a vector notation into eq. 26 it is derived the DMT Signal Just after demodulator s(t)
Now, the method for deriving the Correlation function Just after modulator is described
The correlation of a DMT signal is derived under the hypothesis
From the previous hypothesis it can be concluded that:
The correct correlation of the signal can be finally written as which
keeps into account the periodicity of the signal.
It is proposed to derive the not null part of the correlation.
From Eq. 28, under the hypothesis of uncorrelation between real and imaginary information per-tone the cross multiplication can be considered null.
From Eq. 29 using hypothesis of uncorrelation between information on different tones the central multiplication can be considered a diagonal matrix. Defined
E[Re[αk]·Re[αk]*]=σkr2 Eq. 30
E[Im[αk]·Im[αk]*]=σk,j2
From Eq. 29 and 30 it is obtained
Under the fourth hypothesis, that the statistical power for real and imaginary part is the same for each tone, it is easily obtained
It can be observed that into its domain of validity, that is function of both t and τ, it is related only to τ. For this reason into the next considerations it will be referred to rx(τ)=rx(τ,t) for −v<t−τ<NT and −v<t<NT.
The cyclostationarity is due to the validity of eq. 32 and 33, i.e. the correlation is null outside the symbol.
An efficient derivation procedure of eq. 33 will be shown in a further part of the description.
In the following the derivation of the correlation function of a filtered DMT signal in the proposed method will be described.
A DMT signal filtered by a time-invariant FIR with impulse response h is considered.
The output of the filter is cyclostationary if the input is cyclostationary.
The correlation is derived into time to time function because is more efficient of a Spectral Density passage.
The output y (domain U) of a filter h ( ) with input signal x (domain I) is (generalized Haar integration)
y(t)=·ldu·h(t−u)×x(u) Eq. 34
Defined
sy(t1, t2)=E[y(t1)·y*(t2)] Eq. 35
sx(t1, t2)=E[x(t1)×(t2)]
It is more interesting to rewrite eq. 36 using eq. 21 definition
ry(τ,t)=sy(t,t−τ)=∫ldu2·h*(t−τ−u2)∫ldu1·h(t−u1)·rx(u1−u2,u1) Eq. 37
Introducing now the hypothesis of cyclostationarity of input signal (domain∈Z(T)/Z(Tc)) the previous relation can be written
ryp(τ,t)=∫ldu2·h*(t−τ−u2)∫ldu1·h(t−u1)·rsp(u1−u2,u1) Eq 38
Keeping into account that I=Z(T) and IC=Z(T)/Z(Tc) it is possible to rewrite eq. 38 as
For a more efficient derivation of eq. 39 it is possible to refer to rx( , ).
It can be observed that
And, to keep account the periodicity of the output,
An efficient derivation of eq. 40 is proposed further on in the description.
Now, for what concerns the derivation of state vector correlation matrix RT, defined the square matrix T with dimension N the state vector matrix of EC FIR filter.
It can be seen that correlation matrix RT is symmetric.
In
The complexity of this increment scaling is, in the proposed implementation due to the decimation before the FIR echo canceller, 2562 multiplication to be added to the standard update using Full-LMS complexity.
In
In
In
In this condition the performances are:
That means a loss of performance of 0%.
The performances of this method are really better then the performances of per-tone increment update version shown with reference to
The proposed full LMS compensation method gives very good results but it is also very complex.
The increment correction matrix is alignment dependent and requires storing a lot of information. The alignment at Central Office modem cannot be known a priori, so ideally 2562 real number should be stored for each alignment. It was proved that 16 bit are just enough to obtain good performances behavior. This is the major problem to be solved for its implementation.
As it can be seen in
Also it can be easily observed that the position of the ‘cross’ is dependent on the alignment and the delay of Echo Path.
In the simulation the impact of low-mode problem is more complex, it depends also on the level of impulse response energy in the ‘low taps’. It is clear that if in those taps there is no energy then no problem of convergence can be found and vice versa.
From the analytic study it is clear that low taps problem is due to cyclic prefix influence onto the correlation properties. Indeed this problem is related to the taps corresponding to the Cyclic Prefix taps.
Therefore in the following a low-complexity embodiment of the proposed method, for the 32-taps problem, it will be described. Said embodiment can be regarded as a simplification of the Full LMS Compensation method disclosed above.
The basic idea underlying said embodiment is to support the optimal per-tone correction and to add one or more correction terms to obtain the convergence.
The full compensation method can be rewritten like sum of each Eigen value/vector contribution of the correlation matrix. Eq. 1 can be written using eq. 20
Since from observation of
The frequency domain per-tone optimal increment matrix μf,PT can be defined as
μf,PT=diag({μk}k=0, . . . , N−1) Eq. 45
where the increment μk with k=0, ..., N−1, is defined further on the in the description, in particular with reference to eq. 57.
The low-mode compensation update can be written
where αn is the Eigen value of the correlation matrix and pn is its corresponding eigenvector that is associated to the low modes and NLM is the number of Low Modes.
With this method it is reduced both the computation complexity and the memory needed to store the Eigen value/vector.
Here the Eigen values/vector of the residual matrix
μres=μfull−F−1μf,PT·F Eq. 47
As it can be seen in
Also the third Eigen value/vector is related to the original low-taps problem but the Eigen value is really smaller.
It can be also observed that this problem relevant only with a misalignment between −26 and +449@2.2 MHz (TBC). In this region the value of the corresponding first Eigen value is very flat.
Also it can be observed from
That is not true between +450 and +519.
In
The complexity of this increment scaling is, in our implementation due to the decimation before the FIR echo canceller, 2562 multiplication to be added the standard update (Full-LMS) complexity. It can be observed that the additional complexity is more than double than the complexity of the base-update.
In
In
In this condition the performances are:
That means a loss of performance of 0%.
The performances are a little worse than the full compensation case. This is due to:
However performances are better than those of known methods, i.e. the method shown with reference to
In the following efficient derivation methods for the above equations are described
An efficient proposed derivation of eq. 33 is based on the only hypothesis that all used tones have the same power level for D1<σk2=σ2 k<D2 with D2<NT. In this case eq. 33 can be written
An efficient derivation of eq. 40 can be obtained, considering the correlation function of the signal just after the modulator (the non-periodic one) function only of τ parameter (−v<t−T<NT and −v<t<NT).
In conclusion eq. 40 can be rewritten as
The complexity of this derivation is (NT+v)2 for each couple t, τ. Eq. 1 cannot be used directly into the computation of correlation matrix due to complexity reason.
It is proposed to define the new function
By observing that the value of this function into (t+1,u+1) is strongly related to the one into (t+1,u+1), it is possible to rewrite Eq. 51
Referring to Eq. 53 into the (t+1) parameter and proving that is strongly related to the value into t and further considering the even symmetry of the rx( ) function then it is obtained for the correlation function
ry(τ,t+1)=ry(τ,t)+h*(t−τ+v+1)·uxN
This equation is an iterative solution into the t parameter.
This derivation is very efficient, some of the terms are function of t, other of (t−τ). This can help to reduce the complexity because this terms can be derived for all the (t, τ) couples.
Other methods to reduce the complexity are related to the hypothesis that it is possible to make on the impulse response (h(.)) for example:
Observing that ry(τ,t) correlation function is not zero for
For what concerns per tone compensation, with full LMS method, there is a different value of energy put for different tones. This is due to PSD shaping of DMT ADSL signal.
A good increment correction have to derive proportionally to the statistical power of input vector. The proposed solution, as mentioned above, is to make this for each per-tone vector.
In other words it has to be derived
RF=E└F·TH·T·F−1┘ Eq. 56
and to fix
Since it is known that DMT signal is a non-stationary signal, so generally Eq. 56 is alignment dependent. In this case it is desirable a scaling factor that have to be considered fixed (so not align-dependent). In this case it is considered the mean spectral analysis
{overscore (r)}sp(τ)=∫Icdt·rxp(τ,t) Eq. 58
Being the correlation (not the periodic one) not null for −v[t−τ<N from eq. 33, 41 and 58 for a DMT signal is obtained
The mean correlation after the Low Pass Filter of echo-emulation (with h impulse response) path is easily
{overscore (r)}yp(τ)=∫Icdu·{overscore (r)}xp(u)·ch(τ−u) Eq. 60
where
ch(τ)=∫Udv·h(v+τ)·h*(v) Eq. 61
A reduced complexity update method can be also used, that will be referred to as “Lambda Even” update. Such a Lambda Even update is described in the European Patent Application No. 02291217.4 in the name of the same Applicant and it was proved in to have good convergence properties, in connection with the full and low modes compensation procedure to improve the performance.
In
On the transmission branch 100 in a block 101 a inverse Fast Fourier transform operation from N real to N/2 complex is performed on the encoded signal to be transmitted and the CP Cyclic Prefix is added. The signal is then transmitted on the channel via digital to analog conversion in a block 103, preceded by filtration stage 102 comprising a biquadratic filter and shaping filters. Interpolation filters can be used as an alternative. The signal transmitted is defined in ITU-T G.992.1, G992.3, G992.5 standards.
Upstream the block 102 the transmitted signal is taken and sent to an adaptive filter 300. Signal is taken before the filtration stage 102 in order to increase the speed of convergence.
On the reception branch 200 a received signal is passed through an analog to digital converter indicated with 201, passed through a filtration block 202 comprising a biquadratic filter to reach a time domain equalizer (TEQ) 205. Downstream the time domain equalizer 205 is disposed a subtraction node 206 where the output of adaptive filter 300 is subtracted from the received signal to cancel echo. It has to be noted that the DSP front-end is not very important. What is important is that the echo signal is removed before the time domain equalizer 205 to avoid to insert TEQ variation in a adaptive equalization solution.
Subsequently, in a block 207 the Cyclic Prefix is removed and a Fast Fourier transform operation from N real to N/2 complex is performed, followed by a frequency equalization (FEQ) performed in a block 208.
Finally the received signal reaches a de-mapper (DEMAP) 209 that performs de-codification of the symbols. The input and the output of de-mapper 209 are however taken and subtracted in a subtraction node 210, which output is then feed to a multiplication node 316 of the adaptive filter 300.
It is important to perform a decision of the received symbol even if it is a hard decision. The algorithm implemented in the de-mapper 209 is very robust and can work even in presence of high probability of wrong decision.
Such an adaptive filter 300 comprises a filtering part 301 and an update part 302. The filtering part 302 comprises a biquadratic filter 303 and a FIR filter 304 with 256 taps.
The signal is taken upstream the FIR filter 304 and sent to the update part 302, firstly to a block 305 performing a Fast Fourier Transform operation from N real to N/2 complex, that yields an even array of values Aeven and an odd array of values Aodd, that represent the odd and even output of the Fast Fourier Transform on on 2*N samples. As it is based on a decimated transmitted signal, N sample are used. The odd array Aodd undergoes a conjugation operation in a block 306, which result is fed to a multiplication node 313. The even array Aeven undergoes a conjugation and hermitian reduction operation in a block 309, which result is fed to a multiplication node 314.
The output of subtraction node 209 is mixed in the multiplication mode 316 with the result of a block 316 that calculates the norm of the output of the frequency equalizer 208. The output of node 316 is fed to a block 307 that performs in association with a block 308 the classical echo cancellation according to a full LMS efficient method.
The output of block 307 is multiplied in a multiplication node 313 by the output of block 306 and fed to a block 308, that provides for removing the aliasing components in reception inserted by the last decimation stage contained in block 202, by replicating a complex FFT. The result is fed then to a multiplication node 315.
To said multiplication node 315 is also sent the output of a further multiplication node 314, that multiplies the output of block 309 with output of a block 310 that performs an operation of hermitian extension on the output of node 316.
The output of node 315 is fed to a block 311 in which the calculation of the increment correction, calculating the compensation for the two lower modes, is performed and feed such increment correction to a block 312 where the new update is calculated. The output of block 312 with update is then fed to the FIR filter 304.
In
As it can be seen the lambda odd array evaluation is suppressed.
From the transmitted signal taken upstream the FIR filter 304 only the even array Aeven is obtained via a folding operation in block 325. Then a real Fast Fourier transform operation is performed in block 326. Then he even array undergoes a conjugation and hermitian reduction operation in the block 309, which result is fed to a multiplication node 314.
The output of subtraction node 209 is mixed in the multiplication mode 316 with the result of a block 317 that calculates the norm of the output of the frequency equalizer 208.
The output of node 314 is then fed to the block 311 in which the calculation of the increment correction is performed and feed such increment correction to a block 312 where the new update is calculated. The output of block 312 with update is then feed to the FIR filter 304.
The proposed method, by introducing low modes compensation increases the performances of the echo canceller adaptive method.
The proposed method advantageously obtains a speed of convergence that is approximately the same for different alignments. Further, the efficiency of the adaptive method is increased, as can be seen comparing the mean quantitative of excess noise cancelled per update normalized by the computational complexity of each update. It was shown that this method can be ten times more efficient during tracking than all other methods known from the prior art.
The advantage in efficiency so obtained can be traded both to reduce the computational complexity, thus obtaining a reduced digital area for a hardware implementation or a less complex Digital Signal Processing unit in a software implementation, or to increase the number of update in the same time unit.
Without prejudice to the underlying principle of the invention, the details and embodiment may vary, also significantly with respect to what has been discussed just by way of example without departing from the scope of the invention, ad defined by the claims that follow.
For instance the solution described herein can be applied to all ADSL modem, in Central Office or Customer Premises' Equipment.
Number | Date | Country | Kind |
---|---|---|---|
03016993.2 | Jul 2003 | EP | regional |