The present invention relates to data processing, transmission, and digital communication. More specifically, it relates to the low-power implementation of echo and NEXT (near-end crosstalk) cancellers for precoder-based DSP transceivers.
Many wireline communication systems, such as DSL (digital subscriber line) systems and gigabit Ethernet systems, suffer from echo and crosstalk. Echo and NEXT (near-end crosstalk) cancellers are widely used to counter the effect of echo and NEXT noise. In a 10 Gigabit Ethernet over copper (10GBase-T) system, full-duplex baseband transmission is performed over four pairs of UTP (unshielded twisted pair). Each received signal is corrupted by echo from its own transmitter and NEXT interference from three adjacent transmitters. To meet the desired throughput and target BER (10^−12) requirements, echo and NEXT noise cancellation are expected to be about 55 dB and 40 dB, respectively.
The typical way to perform echo and NEXT noise cancellation is to use finite impulse response (FIR) filters in the digital domain, where the replica of the echo and NEXT estimated by the FIR filters is subtracted from the received noisy signals. This straightforward approach, however, becomes highly complex when the FIR filters are long, and thus consumes considerable power and silicon area. In the 10GBase-T transceivers, 16 long FIR filters need to be implemented for noise cancellation. Due to the extremely high symbol rate (such as the 800 Mega Baud required for 10GBase-T) and the high noise cancellation level required, each FIR-filter based canceller can have several hundred taps, and the total number of taps is around 7000. Implementing those filters at such a high speed requires a significant amount of power. Therefore, reducing the power consumption of these FIR filters is important for a successful DSP transceiver design.
Designing low-power echo and NEXT cancellers for 10GBase-T transceivers is a challenging task. It is recognized in the industry that the FIR techniques used in 1000BASE-T solutions, if implemented in a straightforward way, would result in a complexity increase on the order of 45× over 1000BASE-T. The high degree of cancellation required at these speeds also makes all-analog cancellation difficult, since adaptive analog filters, if feasible at all, would require both high bandwidth and high power. By using a DFT transformation, the complexity can be reduced by approximately 90% (See, e.g., Sanjay Kasturia and Jose Tellado, “Lower Complexity Architectures for Implementing 10GBT XTalk Cancellers and Equalizers FIRs”, 10GBase-T Study Group Meeting, http://www.ieee802.org/3/10GBT/public/sep03/kasturia_1_0903.pdf, September 2003). However, new issues associated with this technique, such as block processing latency, increased memory, and increased precision, make it unacceptable for the 10GBase-T application. Because the channel impulse responses are inherently time-varying and random, simple techniques to extend the length of the impulse response cancelled, such as continuous-time analog filters or infinite impulse response (IIR) digital filters, are not flexible solutions. Methods proposed to exploit the sparsity of the echo and NEXT impulse responses are also not trivial, as accurate channel estimates are needed before the significant taps with large magnitude can be identified. The problem becomes even worse with the introduction of Tomlinson-Harashima precoding (TH precoding) in 10GBase-T, as the inputs to the echo and NEXT cancellers are no longer simple PAM-M symbols but numbers uniformly distributed on [−M, M). Hence, the wordlength of the inputs to the echo and NEXT cancellers could be as long as 10 bits, which further increases the complexity and cost of the echo and NEXT cancellers (See, e.g., G. Zimmerman, “Downside of TH Precoding”, 10GBase-T Study Group Meeting, http://www.ieee802.org/3/an/public/may04/zimmerman_1_0504.pdf, May 2004).
What is needed is a method for designing efficient echo and NEXT cancellers that achieve minimal power consumption and area costs by reducing word-length requirements.
The present invention provides an efficient implementation of echo and NEXT cancellers by means of a wordlength reduction technique and describes a method for designing low-complexity and low-power echo and NEXT cancellers for 10GBase-T.
In accordance with the present invention, a wordlength reduction technique is proposed for low-complexity and low-power design. A TH precoder is first converted to its equivalent form, in which the TH precoder can be viewed as an infinite impulse response (IIR) filter with an input equal to the sum of the original input to the TH precoder and a finite-level compensation signal. Instead of using the output of the TH precoder as the input to the echo and NEXT cancellers, the sum of the original input to the precoder and the compensation signal is used as the input to the echo and NEXT cancellers. Because the number of possible values of this sum is finite, a data encoding technique can then be used to reduce the wordlength of the canceller input, resulting in a low-complexity and low-power design. Finally, by removing the implicit IIR filter with poles near the unit circle from the adaptive loop of these cancellers, the convergence speed of the adaptation is greatly improved. In addition, an improved design that exploits the statistics of the compensation signal is proposed to further reduce the complexity and power consumption of these cancellers.
Further embodiments, features, and advantages of the present invention, as well as the structure and operation of the various embodiments of the present invention are described in detail below with reference to accompanying drawings.
The present invention is described with reference to the accompanying figures. The accompanying figures, which are incorporated herein and form part of the specification, illustrate the present invention and, together with the description, further serve to explain the principles of the invention and to enable a person skilled in the relevant art to make and use the invention.
Table 1 lists an encoding mapping from the sum of the original input to the TH precoder and the corresponding compensation signal to its 8-bit 2's complement representation.
Table 2 lists an encoding mapping from the original transmitted symbol without the TH precoder to its 4-bit 2's complement representation.
Table 3 lists an encoding mapping from the compensation signal to its 4-bit binary representation.
Recently, TH precoding has been proposed to be used in 10GBase-T because it can eliminate error propagation and allow use of capacity-achieving channel codes, such as low-density parity-check (LDPC) codes, in a natural way. However, the use of TH precoding significantly increases the complexity of echo and NEXT cancellers in 10GBase-T.
Consider the block diagram of the typical 10GBase-T transceiver for one pair in
It is known that the hardware complexity and power consumption of a filter are influenced by many factors, such as the number of taps used, the coefficient range of the taps, and the operating speed. In this invention, we propose to reduce the wordlength of the input signal to the echo and NEXT cancellers to achieve a low-complexity and low-power design.
Consider an equivalent form of the TH precoder in
where H(z) is a causal FIR in the TH precoder feedback path.
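The equivalence between the TH precoder and an IIR filter driven by x(n)+v(n) can be checked numerically. The following is a minimal sketch under stated assumptions, not the implementation of the invention: it assumes EQ. (1) has the form T(z) = [X(z)+V(z)]/H(z) with H(z) = 1 + h1·z^−1 + h2·z^−2 + ..., a modulo interval [−M, M) with M = 16, and hypothetical feedback taps chosen only for illustration. The names `th_precode`, `iir_equivalent`, and `demo` are illustrative.

```python
import random
from fractions import Fraction

M = 16  # PAM-16: the modulo device maps into [-M, M)

def th_precode(x, h, m=M):
    """Direct TH precoder: subtract the feedback, then reduce modulo 2m.

    h holds the feedback taps h1, h2, ... (assumed H(z) = 1 + sum h_k z^-k).
    Returns the precoder output t and the compensation signal v.
    """
    t, v = [], []
    for n, xn in enumerate(x):
        fb = sum(hk * t[n - k] for k, hk in enumerate(h, start=1) if n >= k)
        u = xn - fb
        vn = -2 * m * ((u + m) // (2 * m))  # integer multiple of 2m
        t.append(u + vn)
        v.append(vn)
    return t, v

def iir_equivalent(d, h):
    """Equivalent form: a plain IIR filter 1/H(z) driven by d(n) = x(n) + v(n)."""
    t = []
    for n, dn in enumerate(d):
        fb = sum(hk * t[n - k] for k, hk in enumerate(h, start=1) if n >= k)
        t.append(dn - fb)
    return t

def demo(num_symbols=200, seed=0):
    rng = random.Random(seed)
    symbols = list(range(-15, 16, 2))           # {±1, ±3, ..., ±15}
    x = [rng.choice(symbols) for _ in range(num_symbols)]
    h = [Fraction(3, 4), Fraction(-1, 4)]       # hypothetical feedback taps
    t, v = th_precode(x, h)
    d = [xn + vn for xn, vn in zip(x, v)]
    return x, v, t, iir_equivalent(d, h)
```

Exact rational arithmetic (`fractions.Fraction`) is used so that the two outputs can be compared for equality without rounding effects. In this sketch every v(n) is a multiple of 2M = 32, which matches the level spacing of the compensation signal described in the text.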
From EQ. (1), we see that a TH precoder can be viewed as an IIR filter whose input is the sum of the original TH precoder input and a finite-level compensation signal, i.e., x(n)+v(n). For M=16, the input x(n) is a PAM-16 signal with symbol set {±1, ±3, . . . , ±15} and can be encoded in a binary representation using 4 bits. The number of levels of the compensation signal, v(n), depends on the coefficients of the precoder. Based on the precoder coefficients for the worst case (long cables) (See, e.g., IEEE 802.3an Draft Standard 2.0, 2004), it is found that v(n) has at most 13 possible levels from the set {0, ±32, . . . , ±192}. Thus, v(n) can also be represented in binary using 4 bits. Hence, the sum of these two signals, x(n)+v(n), will have at most 16^2=256 possible values, which can be represented with an 8-bit binary number. Based on this key observation, the signal x(n)+v(n) is proposed to be used as the input to the echo and NEXT cancellers instead of the TH precoder output, t(n), as shown in
Note that the 8-bit encoded sum signal does not carry the actual value of the sum x(n)+v(n); it only identifies one of the 256 possible values. Therefore, directly applying the 8-bit sum signal to the input of the echo and NEXT cancellers is not valid. One method to solve this problem is to use a precomputation technique, since this sum signal takes a finite number of integer values (See, Keshab K. Parhi, “Pipelining of Parallel Multiplexer Loops and Decision Feedback Equalizers”, in Proc. ICASSP 2004, vol. 5, pp. 21-24, May 2004). However, the hardware overhead associated with the precomputation technique is huge; it increases exponentially with the number of filter taps. In this invention, we propose a method to encode the input data x(n)+v(n) before applying it to the echo and NEXT cancellers. The desired output of the overall FIR filter is obtained by using a corresponding decoding process. Thus, the idea in
Table 1 gives the proposed encoding mapping between the real value of x(n)+v(n) and its 2's complement encoded bits. Suppose the value of x(n)+v(n) is d(n), and the value of the corresponding encoded bits is w(n). It can be seen from the table that
d(n)=2×w(n)+1, EQ. (2)
i.e.,
w(n)=2^(−1)×[d(n)−1], EQ. (3)
EQ. (3) can be viewed as the encoding equation, which can be easily implemented with one shifter and one adder as shown in
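The encoding of EQ. (3) and its inverse EQ. (2) can be sketched as follows. This is a minimal illustration that assumes the symbol sets stated earlier (x(n) in {±1, ±3, ..., ±15} and v(n) in {0, ±32, ..., ±192}); the function names are hypothetical, and the subtract-then-shift form only mirrors the "one shifter and one adder" description rather than reproducing any particular circuit.

```python
def encode(d):
    """EQ. (3): w = (d - 1) / 2, as a subtraction followed by an arithmetic right shift."""
    if d % 2 == 0:
        raise ValueError("x(n)+v(n) is always odd under the assumed symbol sets")
    return (d - 1) >> 1

def decode(w):
    """EQ. (2): d = 2*w + 1."""
    return (w << 1) + 1

def all_sum_values():
    """Every value x + v for x in {±1, ..., ±15} and v in {0, ±32, ..., ±192}."""
    xs = range(-15, 16, 2)
    vs = range(-192, 193, 32)
    return sorted({x + v for x in xs for v in vs})

values = all_sum_values()
codes = [encode(d) for d in values]
```

Under these assumptions the 16 × 13 = 208 combinations give 208 distinct odd sums between −207 and 207, and the codes w lie between −104 and 103, which fits in an 8-bit 2's complement word.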
Consider an N-th order FIR filter with output
where g(k) is the tap coefficient, and d(n−k) is the input data at time index n−k without encoding, i.e., the actual value of x(n)+v(n). Substituting EQ. (2) into EQ. (4), we get
The first sum on the right side of the equation is the exact filter output with the 8-bit encoded data as inputs, and the second term is the sum of all the coefficients of the filter, which can be pre-computed. Hence, the desired output of the original FIR filter can be easily obtained from EQ. (6). As an example, the overall architecture for a 3-tap FIR filter implementation with the wordlength reduction technique is shown in
It is easy to extend this design to the application in 10GBase-T, where one echo and three NEXT cancellers are needed for each receiver.
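The decoding step can be illustrated with a small numerical check. Substituting d = 2w + 1 into a convolution gives Σ g(k)·d(n−k) = 2·Σ g(k)·w(n−k) + Σ g(k), so the filter can run on the encoded samples and be corrected by a precomputed constant. The sketch below assumes a 3-tap filter with hypothetical integer (fixed-point) coefficients and hand-picked sample values; `fir` and `fir_via_encoded` are illustrative names.

```python
def fir(g, s):
    """FIR outputs y[n] = sum_k g[k] * s[n-k], for n where the delay line is full."""
    n_taps = len(g)
    return [sum(g[k] * s[n - k] for k in range(n_taps))
            for n in range(n_taps - 1, len(s))]

def fir_via_encoded(g, w):
    """Filter the encoded samples w, then decode: y = 2 * (g * w) + sum(g)."""
    offset = sum(g)  # depends only on the coefficients, so it can be precomputed
    return [2 * y + offset for y in fir(g, w)]

g = [5, -3, 2]                        # hypothetical integer taps
d = [33, -47, 1, 207, -207, 15, -1]   # sample values of x(n)+v(n), all odd
w = [(dn - 1) >> 1 for dn in d]       # EQ. (3)

y_direct = fir(g, d)
y_decoded = fir_via_encoded(g, w)
```

Only outputs with a full delay line are compared here, because an encoded value of w = 0 corresponds to d = 1 rather than d = 0, so an all-zero initial delay line would need separate handling.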
However, the problem with having x(k)+v(k) as input to the cancellers in
To solve this problem, consider
y=c∗tfar+g∗tnear+n EQ. (7)
where ∗ represents convolution. Here c denotes the channel impulse response and g denotes the echo channel impulse response. Writing EQ. (7) in the Z domain, we get
Y(z)=C(z)Tfar(z)+G(z)Tnear(z)+N(z). EQ. (8)
Similarly, we represent the output of the corresponding echo cancellers as
U(z)=Ĝ′(z)Dnear(z) EQ. (9)
where Dnear(z)=Xnear(z)+Vnear(z), and Ĝ′(z) is the Z-transform of the echo canceller impulse response. Then ideal cancellation is achieved when
Ĝ′(z)Dnear(z)=G(z)Tnear(z) EQ. (10)
From EQ. (1), EQ. (10) can be written as
From EQ. (12), we see that the canceller to be designed implicitly contains an IIR filter. In a straightforward approach, an FIR filter is chosen to approximate this IIR filter because of its good stability and ease of implementation. However, the FIR filter may need to be very long when the poles of the IIR filter are close to the unit circle in the z-plane. In this case, the benefit of the wordlength reduction technique is offset by the increased number of FIR taps, and convergence during adaptation is also slow. In addition, system performance may degrade since the FIR filter is only an approximation of the IIR filter. Another approach is to use an adaptive IIR filter, which is more complex to analyze and suffers from stability problems. We solve this problem by removing the implicit IIR filter from the cancellation path, resulting in a solution that suffers from neither stability problems nor slow convergence.
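The growth in FIR length as a pole approaches the unit circle can be illustrated with a deliberately simple model. The sketch below assumes a single-pole inverse filter 1/H(z) = 1/(1 − p·z^−1), whose impulse response is p^n, and truncates it where the tail falls 55 dB below the first tap. Using the 55 dB echo cancellation target as a truncation threshold is an assumption made for illustration, and `taps_needed` is a hypothetical helper.

```python
def taps_needed(p, floor_db=55.0):
    """Smallest L such that |p|**L is below -floor_db dB relative to the first tap.

    Assumes 1/H(z) = 1/(1 - p z^-1), whose impulse response is p**n for n >= 0.
    """
    if not 0 < abs(p) < 1:
        raise ValueError("pole must lie strictly inside the unit circle")
    threshold = 10 ** (-floor_db / 20)
    length = 0
    while abs(p) ** length >= threshold:
        length += 1
    return length

lengths = {p: taps_needed(p) for p in (0.5, 0.9, 0.99)}
```

In this model the required length grows from about ten taps for p = 0.5 to several hundred taps for p = 0.99, which is the effect that motivates removing the implicit IIR filter from the cancellation path.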
Consider
where Ĝ(z) is the Z-transform of the echo & NEXT channel estimate. As we can see, the implicit IIR filter in the echo canceller is balanced by inserting 1/H(z); thus echo noise can be cancelled better. Since the echo cancellation is adaptive, introducing an IIR filter in the adaptation loop will greatly affect the convergence speed, although this might bring better performance than the design in
In the second term on the right side of EQ. (15), the IIR filter shown in EQ. (11) no longer appears, so echo noise cancellers based on an FIR structure can be used to approximate the echo channel in a natural way. EQ. (15) also shows that the received signal is distorted by multiplication with H(z); however, this can be countered by multiplying by 1/H(z) after echo cancellation. Assuming perfect cancellation, the residual signal can be written as
Q(z)=C(z)Tfar(z)H(z)+N(z)H(z) EQ. (16)
and the input of the FFE is
which is the same as the input of the FFE for the original cancellers, whose input is taken from the output of the precoder in
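The effect of multiplying the received signal by H(z) can be checked numerically for the echo term alone. The sketch below assumes EQ. (1) has the form Tnear(z) = Dnear(z)/H(z) with H(z) = 1 + h1·z^−1 + h2·z^−2, omits the far-end signal and the noise, and uses hypothetical coefficients for both H(z) and the echo channel G(z). Under these assumptions G(z)Tnear(z)H(z) equals G(z)Dnear(z), so a plain FIR canceller driven by x(n)+v(n) removes the echo exactly.

```python
from fractions import Fraction

def convolve(a, b, length):
    """First `length` samples of the causal convolution a * b."""
    return [sum(a[k] * b[n - k] for k in range(len(a)) if 0 <= n - k < len(b))
            for n in range(length)]

def inverse_filter(d, h):
    """t = d filtered by 1/H(z), with h = [1, h1, h2, ...] (h[0] must be 1)."""
    t = []
    for n, dn in enumerate(d):
        t.append(dn - sum(h[k] * t[n - k] for k in range(1, len(h)) if n >= k))
    return t

h = [1, Fraction(3, 4), Fraction(-1, 4)]                 # hypothetical H(z)
g = [Fraction(1, 2), Fraction(-1, 8), Fraction(1, 16)]   # hypothetical echo channel
d_near = [33, -47, 1, 207, -207, 15, -1, 65, -97, 3]     # sample x(n)+v(n) values

n = len(d_near)
t_near = inverse_filter(d_near, h)       # precoder output
echo = convolve(g, t_near, n)            # G(z) Tnear(z)
echo_times_h = convolve(h, echo, n)      # echo term after multiplication by H(z)
fir_replica = convolve(g, d_near, n)     # FIR canceller output G(z) Dnear(z)
residual = [a - b for a, b in zip(echo_times_h, fir_replica)]
```

Exact rational arithmetic is used so the residual is exactly zero rather than merely small. In an adaptive setting the canceller taps would be estimated rather than set equal to g, which this sketch does not model.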
The distribution of the compensation signal v(n) was not only found to be uniform but also symmetric, as shown in
In this improved design, we propose to separate v(n) from the sum signal x(n)+v(n), and then the filter with input of x(n)+v(n) can be implemented as two filters, one with input x(n) and another with input v(n), as shown in
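Because an FIR filter is linear, the split into two filters gives the same output as the single filter driven by x(n)+v(n). The sketch below checks this with short-wordlength codes for each branch. The mappings x = 2a + 1 (a 4-bit 2's complement code a) and v = 32m (a level index m from −6 to 6) are assumptions made for illustration and are not necessarily the mappings of Tables 2 and 3; the taps and sample values are hypothetical.

```python
def fir(g, s):
    """FIR outputs y[n] = sum_k g[k] * s[n-k], for n where the delay line is full."""
    n_taps = len(g)
    return [sum(g[k] * s[n - k] for k in range(n_taps))
            for n in range(n_taps - 1, len(s))]

g = [5, -3, 2]                          # hypothetical integer taps
x = [1, -15, 7, 15, -1, 3, -9]          # PAM-16 symbols
v = [32, -32, 0, 192, -192, 0, 64]      # compensation signal levels

a = [(xn - 1) >> 1 for xn in x]         # assumed 4-bit code, x = 2a + 1
m = [vn // 32 for vn in v]              # assumed level index, v = 32m

# Single filter on the sum signal.
y_sum = fir(g, [xn + vn for xn, vn in zip(x, v)])

# Two filters on the short codes, combined and decoded afterwards.
offset = sum(g)
y_split = [2 * ya + offset + 32 * ym
           for ya, ym in zip(fir(g, a), fir(g, m))]
```

The scaling by 2 and by 32 are bit shifts in hardware, so in this sketch the multipliers in both branches only see 4-bit inputs.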
Consider a 2nd-order FIR filter with input v(n) in
Since v(n) is symmetric, we only consider the positive values, which are represented with 3 unsigned bits. A symbol detection circuit is designed to generate the control signals for the set {0, 1}. In
As an example,
A method to design low complexity and low power echo and NEXT cancellers based on wordlength reduction techniques is presented. The resulting new echo and NEXT cancellers can be used for high-speed communication applications, such as 10 Gigabit Ethernet over copper.
While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example only, and not limitation. It will be understood by those skilled in the art that various changes in form and details can be made therein without departing from the spirit and scope of the invention as defined in the appended claims. Thus, the breadth and scope of the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents.
This application claims the benefit of U.S. Provisional Application No. 60/704,318, filed on Aug. 1, 2005, the entire content of which is incorporated herein by reference.
This invention was made with Government support under the SBIR Grant No. DMI-0441632, awarded by the National Science Foundation (NSF). The Government has certain rights in this invention.
Number | Date | Country
---|---|---
60704318 | Aug 2005 | US