The present application relates to U.S. patent application Ser. No. 17/656,751, filed 2022 Mar. 28 by inventors Yu Liao and Junqing Sun, titled “Reduced-Complexity Maximum Likelihood Sequence Detector Suitable for M-ary Signaling”, and hereby incorporated herein by reference in its entirety.
The present disclosure relates to digital communications receivers and, more particularly, to equalizers suitable for use with high-rate signaling and larger signal constellations.
Most integrated circuit devices have become so complex that it is impractical for electronic device designers to design them from scratch. Instead, electronic device designers rely on predefined modular units of integrated circuit layout designs, arranging and joining them as needed to implement the various functions of the desired device. Each modular unit has a defined interface and behavior that has been verified by its creator. Though each modular unit may take a lot of time and investment to create, its availability for re-use and further development cuts product cycle times dramatically and enables better products. The predefined units can be organized hierarchically, with a given unit incorporating one or more lower-level units and in turn being incorporated within higher-level units. Many organizations have libraries of such predefined modular units for sale or license, including, e.g., embedded processors, memory, interfaces for different bus standards, power converters, frequency multipliers, sensor transducer interfaces, to name just a few. The predefined modular units are also known as cells, blocks, cores, and macros, terms which have different connotations and variations (“IP core”, “soft macro”) but are frequently employed interchangeably.
The modular units can be expressed in different ways, e.g., in the form of a hardware description language (HDL) file, or as a fully-routed design that could be directly printed to create a series of manufacturing process masks. Fully-routed design files are typically process-specific, meaning that additional design effort would usually be needed to migrate the modular unit to a different process or manufacturer. Modular units in HDL form require subsequent synthesis, placement, and routing steps for implementation, but are process-independent, meaning that different manufacturers can apply their preferred automated synthesis, placement, and routing processes to implement the units using a wide range of manufacturing processes. By virtue of their higher-level representation, HDL units may be more amenable to modification and the use of variable design parameters, whereas fully-routed units may offer better predictability in terms of areal requirements, reliability, and performance. While there is no fixed rule, digital module designs are more commonly specified in HDL form, while analog and mixed-signal units are more commonly specified as a lower-level, physical description.
Serializer-deserializer (SerDes) cores are a frequent need for device designs that employ modern data communications standards, which continue to evolve towards higher symbol rates and larger numbers of bits per channel symbol due to continuing demand for ever-lower latencies and ever-higher transfer rates. The channel symbols are attenuated and dispersed as they propagate, causing intersymbol interference (ISI) at the receiving end of the channel. For a given channel bandwidth, this ISI worsens at higher symbol rates and larger symbol constellations. When trying to detect the channel symbols, receivers must contend with this ISI in addition to the channel noise that contaminates the receive signal.
Due to their relatively low complexities, linear equalizers and decision feedback equalizers (DFE) are typically preferred for facilitating channel symbol detection without undue noise enhancement. Yet as data rates push ever closer to channel capacity, these equalizers may fail to provide sufficiently low error rates. The maximum likelihood sequence detector (MLSD) employs a symbol detection strategy that is optimal from an error rate perspective, but that is often prohibitive from a complexity and power consumption perspective when configured to detect multibit symbols.
Accordingly, there are disclosed herein receivers and methods using maximum likelihood sequence detection with pseudo partial response equalization. One illustrative receiver includes: a feedforward equalizer that produces an equalized receive signal by diminishing a receive signal's intersymbol interference; a decision element that derives initial symbol decisions from samples of the equalized receive signal; and a filter that applies a partial response to the equalized receive signal or to an equalization error signal to produce input for a maximum likelihood sequence detector (MLSD).
An illustrative receive method includes: diminishing a receive signal's intersymbol interference to produce an equalized receive signal; deriving initial symbol decisions from samples of the equalized receive signal; and applying a partial response to the equalized receive signal or to an equalization error signal to produce input for a MLSD.
An illustrative semiconductor intellectual property core generates circuitry for implementing a receiver and/or receiving method as described above.
Each of the foregoing receiver, method, and core implementations may be embodied individually or conjointly and may be combined with any one or more of the following optional features: 1. a filter that converts the full-response errors into partial response errors. 2. a filter that converts the equalized receive signal into a partial response signal. 3. a circuit that uses the full-response errors to determine a competing symbol decision for each initial symbol decision. 4. the decision element determines a competing symbol decision for each initial symbol decision. 5. a reduced-complexity maximum likelihood sequence detector (rMLSD) that derives final symbol decisions by evaluating state metrics only for each initial symbol decision and its competing symbol decision. 6. the rMLSD includes calculation circuitry that combines the partial response errors with initial symbol decisions and with associated competing symbol decisions to determine corresponding branch metrics. 7. the rMLSD includes calculation circuitry that combines the partial response signal with initial symbol decisions and with associated competing symbol decisions to determine corresponding branch metrics. 8. the calculation circuitry sums each branch metric with an associated state metric to obtain path metrics. 9. the rMLSD includes: comparators that determine a minimum path metric for each initial symbol decision and each competing symbol decision; and a pair of copy-shift registers that respond to outputs of the comparators to assemble a most likely symbol decision sequence ending in that symbol decision and that competing symbol decision. 10. an element that determines a sign of an equalization error between each sample of the equalized receive signal and a level of a corresponding initial symbol decision; and a circuit that uses the sign to determine a competing symbol decision for each initial symbol decision.
While specific embodiments are given in the drawings and the following description, keep in mind that they do not limit the disclosure. On the contrary, they provide the foundation for one of ordinary skill to discern the alternative forms, equivalents, and modifications that are encompassed in the scope of the appended claims.
For context,
The pluggable modules 206 may each include a retimer chip 210 and a microcontroller chip 212 that controls operation of the retimer chip 210 in accordance with firmware and parameters that may be stored in nonvolatile memory 214. The operating mode and parameters of the pluggable retimer modules 206 may be set via a two wire bus such as I2C or MDIO that connects the microcontroller chip 212 to the host device (e.g., switch 112). The microcontroller chip 212 responds to queries and commands received via the two wire bus, and responsively retrieves information from and saves information to control registers 218 of the retimer chip 210.
Retimer chip 210 includes a host-side transceiver 220 coupled to a line-side transceiver 222 by first-in first-out (FIFO) buffers 224.
yk=L(dk)+γL(dk−1)+nk, (1)
where dk∈{0,1,2,3} represents the transmitted PAM4 symbols, L(dk) is a mapping of PAM4 symbols to their corresponding signal levels, and n k represents the noise plus some other impairments.
A summer 406 combines the filtered receive signal with a feedback signal fk to produce an equalized signal sk having reduced trailing ISI and thus (in the absence of significant channel noise) having open decision eyes such as those of
Instead of a DFE, a conventional MLSD could be applied to the filtered receive signal yk. With the trailing ISI limited to one PAM4 symbol interval, the conventional MLSD would employ a four state trellis such as that shown in
Each stage of the trellis has 4 states, sjk, 0≤j<4, representing 4 possible PAM4 symbols of the kth symbol interval. There are 16 possible transitions from previous symbol dk−1 to current symbol dk; each transition has a branch metric bi,jk associated with it, where indices 0≤i<4 and 0≤j<4 represent the 4 possible PAM4 symbols of the previous and current symbol intervals, respectively. The branch metric is computed as follows:
bi,jk=(yk−L(j)−γL(i))2 (2)
Each state has an associated state metric cjk representing the accumulated branch metrics of the maximum likelihood path from start to the state of sjk. The state metrics are computed as follows:
Based on above equations, the MLSD finds the maximum likelihood path from the beginning to the end of the sequence, and the maximum likelihood decision sequence is the decision sequence associated with the branches that constitutes the maximum likelihood path.
The conventional MLSD for PAM4 with one symbol interval of trailing ISI requires sixteen branch metric computation units and four 4-way Add Compare Select (ACS) units in the critical timing path for each symbol interval. For data center applications, the power consumption of such components would be prohibitively high at data rates beyond 50 Gbaud.
To implement a reduced-complexity MLSD (rMLSD), we now introduce the concept of a “competing decision”. Where the DFE's symbol decision {circumflex over (d)}k is the most likely transmitted symbol for a given yk and {circumflex over (d)}k−1, the competing symbol decision {circumflex over (d)}′k is the second most likely transmitted symbol. Defining the equalization error signal ek as:
ek=yk−L({circumflex over (d)}k)−γL({circumflex over (d)}k−1), (4)
the competing symbol decision {circumflex over (d)}′k for PAM4 is:
We observe that under practical operating conditions, it is highly probable that the transmitted symbol dk is either equal to {circumflex over (d)}k or equal to {circumflex over (d)}′k. Thus the MLSD trellis can be simplified to consider just these potential symbol decisions with minimal performance loss.
Mathematically speaking, let the sequence dv={dv,0, dv,1, . . . , dv,n−1}, where dv,k ∈{{circumflex over (d)}k, {circumflex over (d)}′k }, 0≤k<n. The proposed low complexity MLSD finds the maximum likelihood sequence dvmax that satisfies the following:
Now, let “0” represent the state that dv,k={circumflex over (d)}k and “1” the state that dv,k={circumflex over (d)}′k. The proposed low complexity MLSD works on a 2-state trellis shown in
bi,jk=(yk−ŷk)2, (6)
where ŷk is given as:
ŷk=
In Equation (7), ī and
bi,jk=(e′k+j*(L({circumflex over (d)}k)−L({circumflex over (d)}′k))+i*γ*(L({circumflex over (d)}k−1)−L({circumflex over (d)}′k−1)))2 (8)
The state metric cjk, 0≤j<2, is computed as follows:
Each state has a path memory with predefined length to store the decisions of the branches that constitute the survivor path of the state that is the maximum likelihood path from trellis start to the current state. The decision of each branch is the same as the ending state of the branch. Compared to the conventional MLSD, where the decision of each branch has 2 bits, the decision of each branch in the proposed low complexity MLSD has only 1 bit. Unlike the conventional MLSD, which has a complexity that grows quadratically with the constellation size, the rMLSD need not have any complexity increase beyond what is needed by the DFE equalizer.
We note here that variations exist on this receiver implementation. As an alternative to having a CDD circuit 718, the slicer 408 can be modified to provide both the initial symbol decisions {circumflex over (d)}k and the competing symbol decisions {circumflex over (d)}′k. The rMLSD can be configured to operate on the filtered receive signal yk in combination with the symbol decisions, rather than using the calculated equalization error ek. These variations and others are discussed more fully in related application U.S. patent application Ser. No. 17/656,751 (“Reduced-Complexity Maximum Likelihood Sequence Detector Suitable for M-ary Signaling”).
It should be noted that the rMLSD necessarily fails when dk ∉{{circumflex over (d)}k,{circumflex over (d)}′k}. When the magnitude of the trailing ISI coefficient approaches or exceeds 1, the DFE becomes susceptible to error propagation, since the occurrence of a first symbol error causes improper calculation of the feedback signal and thereby substantially increases the likelihood of subsequent decision errors. The equalization error of equation (4) becomes large enough that the rMLSD's failure is almost assured.
To address this potential issue, an alternative receiver implementation is illustrated in
yk=L(dk)+nk (10)
Decision element 408 compares the filtered receive signal to the previously-discussed thresholds to obtain the initial symbol decisions {circumflex over (d)}k. A level circuit 822 maps the initial symbol decisions to their corresponding signal levels L(dk), which a difference element 824 then subtracts from the filtered receive signal to calculate the (full response) equalization error signal ek:
ek=yk−L({circumflex over (d)}k). (11)
A partial response filter 826 derives a pseudo-partial response error e′k given by
e′k=ek+γ·ek−1 (12)
The partial response coefficient γ can be programmable and/or dynamically adapted to minimize the error variance. The CDD circuit 718 and rMLSD 720 may be implemented as in the receiver of
In the following, we show that the full-response equalizer 804 and decision element 408 followed by a partial response filter 826 (applied to either the equalization error ek or to the filtered receive signal yk) and MLSD is theoretically equivalent to a partial-response equalizer and decision element with MLSD. For the rMLSD, however, the implementation of
Using equation (11), equation (12) can be rewritten as
e′k=yk+γ·yk−1−[L({circumflex over (d)}k)+γ·L({circumflex over (d)}k−1)] (13)
Combine the first two terms to represent a partial response filtered receive signal, and we have
y′k=yk+γ·yk−1 (14)
e′k=y′k−[L({circumflex over (d)}k)+γ·L({circumflex over (d)}k−1)] (15)
Substitute (10) into (14), we have
y′k=L(dk)+γ·L(dk−1)+nk+γ·nk−1 (16)
and combine the last two terms to represent the partial response filtered noise signal
n′k=nk+γ·nk−1 (17)
In the following we show that y′k is equivalent to the 1+γz−1 partial response equalization output and n′k is equivalent to the noise/distortion in the 1+γz−1 partial response equalization output. Let X(z) be the z-transform of the ADC output xk, D(z) the z-transform of transmitted symbols L(dk), F(z) the transfer function of the full response equalizer, and Y(z) the z-transform of the equalizer output. Then, we have the following:
Y(z)=X(z)·F(z) (18)
The z-transforms of (14) and (16) give
Y′(z)=(1+γ·z−1)·Y(z) (19)
Y′(z)=(1+γ·z−1)·D(z)+N′(z) (20)
where N′(z) is the z-transform of n′k. Combining (18)-(20) yields
Equation (21) shows that (1+γ·z−1)·F(z) is a partial response equalizer, y′k is the output of the partial response equalizer, and the noise/distortion n′ k associated with the rMLSD input e′k is the noise/distortion in ADC samples shaped by the partial response equalizer 826. Thus the rMLSD in the pseudo-partial response maximum likelihood detector sees the same partial response equalized signal y′k and the same noise/distortion n′k as in the PRML detector, and the error signal e′k in (15) is an estimation of the partial response equalization error.
As previously noted, the 2-state MLSD fails if dk ∉{{circumflex over (d)}k,{circumflex over (d)}′k}. Comparing the DFE equalization error in (4) and the full response equalization errors in (11), the notable difference is the trailing ISI term γ·L({circumflex over (d)}k−1) in (4). Without this term, the probability that dk ∉{{circumflex over (d)}k,{circumflex over (d)}′k} for the full response equalization is much lower than it is for the DFE equalization, especially when the trailing ISI coefficient γ is relatively large. For the full response equalization, the failure condition requires |nk|>2 assuming L(dk)∈{±1, ±3}, while for the DFE equalization, the failure condition for DFE equalization when {circumflex over (d)}k−1≠dk−1 is only |nk+2·γ|>2. Moreover, since there is no decision error propagations in the pseudo partial response equalizer of
Additional implementation detail for the decision element 408 is provided in
The comparator results are provided to the copy-shift register 1118 to steer the initial and competitive symbol decisions through a series of latches in a manner that assembles most-probable symbol decision sequences for each trellis state. A final multiplexer 1120 selects the symbol decision from the front of the assembled sequence determined to be most probable.
Performance simulations were done over a channel with 37 dB insertion loss at the Nyquist frequency and the results are shown in
Lab data (FFE output) was also collected from the chip debug memory and processed offline in accordance with different detector configurations. Approximately 1.6e7 PRBS31 bits were transmitted through a channel having a 36.8 dB insertion loss from bump to bump. The evaluated configurations were the (full response) FFE (BER=3.56e-4), DEF with γ=0.5 (BER=1.43e-4), rMLSD preceded by DFE with γ=0.5 (BER=1.44e-5), and rMLSD with pseudo partial response with γ=0.5 (7.31e-6). In at least this case, the pseudo-PRMLSD outperforms the DFE-rMLSD configuration.
It is contemplated that the disclosed receiver implementation designs can be incorporated into SerDes cores for use by integrated circuit designers and manufacturers creating devices for a host of applications that might benefit from cost-, complexity-, and power-efficient high-bandwidth communications. Numerous alternative forms, equivalents, and modifications will become apparent to those skilled in the art once the above disclosure is fully appreciated. For example, the disclosed principles are applicable to both PAM, QAM, and PSK modulation, and to larger signal constellations including 8-PSK, 16-PAM, etc. Though one trailing ISI interval is described in the foregoing, the disclosed principles are also applicable to longer trailing ISI intervals albeit with an increased number of trellis states. It is intended that the claims be interpreted to embrace all such alternative forms, equivalents, and modifications that are encompassed in the scope of the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
5745315 | Aoyama | Apr 1998 | A |
6385239 | Okamoto | May 2002 | B1 |
6625235 | Coker | Sep 2003 | B1 |
6690739 | Mui | Feb 2004 | B1 |
20010005405 | Shimoda | Jun 2001 | A1 |
20040076245 | Okamoto | Apr 2004 | A1 |
20180167143 | Yao | Jun 2018 | A1 |
20210044462 | Lu | Feb 2021 | A1 |
20210306009 | Riani | Sep 2021 | A1 |
Number | Date | Country |
---|---|---|
WO-2021249650 | Dec 2021 | WO |
Entry |
---|
U.S. Appl. No. 17/656,751 as filed in U.S. Patent and Trademark Office on Mar. 28, 2022 by Inventors Yu Liao and Junqing Sun, titled: “Reduced-Complexity Maximum Likelihood Sequence Detector Suitable for M-ary Signaling”. |
Cioffi, J. M. Chapter 9: Decoding Algorithms. Hitachi Professor Emeritus of Engineering, Marconi Society Fellow, Department of Electrical Engineering, Stanford University. Published no later than Jan. 2021. Retrieved Aug. 25, 2022, from https://cioffi-group.stanford.edu/doc/book/chap9.pdf. |