The present invention relates to forward error correction, and more particularly to Reed-Solomon decoders.
As data storage densities and data transmission rates increase, the ability of hardware devices to correctly recognize binary data diminishes. To ensure data integrity, an error correcting Reed-Solomon (RS) code is often used. Many communications systems perform forward error correction (FEC) to improve data transmission accuracy and to ensure data integrity. FEC helps reduce bit error rates (BER) in applications such as data storage, digital video broadcasts, and wireless communications. Reed-Solomon (RS) error-correcting codes are commonly used for FEC.
Referring now to
Referring now to
In step 28, the RS decoder finds error locations. For example, Chien's search algorithm, which will be described below, can be used. In step 30, error values are found. For example, Forney's algorithm, which will be described below, is often used to find the error values. Steps 28 and 30 may be performed in parallel in hardware implementations.
Referring now to
The error location search can be performed using Chien's search, which is disclosed in R. T. Chien, “Cyclic Decoding Procedure for the Bose-Chandhuri-Hocquenghem Codes”, I.E.E.E. Transactions on Information Theory, Vol. IT-10, pp. 357–363, October 1964, which is hereby incorporated by reference in its entirety. The error value computation can be performed using Forney's algorithm disclosed in G. D. Forney, “On Decoding BCH Codes”, I.E.E.E Transactions on Information Theory, Vol. IT-11, pp. 549–557, October 1965, which is hereby incorporated by reference in its entirety.
While a data storage system will be used below to illustrate the operation of a Reed-Solomon decoder, skilled artisans will appreciate that there are may other uses for Reed-Solomon decoders. In the data storage system illustrated in
The syndrome computation for an nth codeword must be completed by the end of the nth codeword. The error location search and error value computations for the nth codeword must be completed by the end of the (n+1)th codeword. The syndrome computation operates under the symbol clock fsymbol. The error location search and error value computations operate under a system clock fsystem.
The Reed-Solomon decoder 32 operates based on finite fields GF(2m) where each symbol is an m-bit binary symbol εGF(2m). A Reed-Solomon code with correcting power t has a generator polynomial G(x) over GF(2m) with 2t roots:
G(x)=(x+αb) . . . (x+αb+2t−2)(x+αb+2t−1)
G(x)=x2t+g2t-1x2t−1+ . . . +g1x1+g0x0.
where giεGF(2m). A Reed-Solomon code word can be represented using polynomial C(x):
C(x)=cn-1xn−1+ . . . +c1x1+c0x0
C(x)=x2t(dk-1xk+1+ . . . +d0x0)+(r2t-1x2t−1+ . . . +r0x0)
C(x)=x2tD(x)+R(x).
where D(x) represents the data symbols to be encoded and R(x) represents the Reed-Solomon parity symbols. The received code word can be represented using polynomial V(x):
V(x)=vn-1xn−1+ . . . +v1x1+v0x0
V(X)=(cn-1xn−1+ . . . +c0x0)+(en-1xn−1+ . . . +e0x0)
V(x)=C(x)+E(x).
where C(x) is the polynomial representing the Reed-Solomon codeword and E(x) is a polynomial representing the error vector.
During the decoding of a Reed-Solomon code, a set of 2t syndromes S0, S1, . . . , S2t-1 is first computed as shown in step 22 of
Next, in step 24 of
Referring now to
Δ=Σt=0LΛtSr-1
Next, ^(x), B(x), L and ΔB are updated according to rules in box 62. If Δ# 0 as determined in step 64, ^(x) is set equal to ΔB^(x)+ΔxB(x) in step 66. Otherwise, (x) is set equal to ^(x) in step 68. If Δ≠0 and 2L≦r as determined in step 72, B(x) is set equal to ^(x), L is set equal to r+1−L, and ΔB is set equal to Δ in step 74. Otherwise, B(x) is set equal to xB(x) in step 76. In step 80, r is incremented. Control loops from step 80 to step 57.
To find the error locations, a search algorithm such as Chien's search is used to evaluate ^(x) at 1, α−1, α−2, . . . , α−(n−1), where ^(α−i)=0 implies vi is a symbol with error. For error value computation, Forney's algorithm is typically used to compute error values at α−i, where ^(a−i)=0. An error evaluator polynomial is defined by Ω(x)=^(x)S(x)modx2t, where S(x) is the syndrome polynomial. Then, the error value at location α−1 is given by:
wherein Λ′(x) is a formal derivative of Λ(x).
A parallel architecture can be used for the iBMA algorithm implementation. Referring now to
The discrepancy value Δ is computed by multiplying syndromes S0, S1, . . . , S5 with the error locator polynomial coefficients ^0, ^1, ^2, and ^3, respectively, using multipliers Sm0, Sm1, Sm2, and Sm3, respectively. Outputs of multipliers Sm0 and Sm2 are summed by adder 110 to provide an even portion of Δeven discrepancy Δ. Outputs of multipliers Sm1 and Sm3 are summed by adder 112 to provide an odd portion of Δodd discrepancy Δ. (Such a separation of Δeven and Δodd is because of the hardware reuse of iBMA and error evaluation circuit. Δodd is used in the evaluation of xΛ′(x) in the Feng or Forney method.) Δeven and Δodd are summed by adder 114 to provide discrepancy Δ, which is fed back to multipliers Bm0, Bm1, Bm2, and Bm3.
Discrepancy Δ is also fed back through delay device 116 to conditionally update the value of the previous discrepancy ΔB. ΔB is fed to the inputs of multipliers ^m0, ^m1, ^m2, and ^m3. Multipliers Bm0, . . . , Bm3 and ^m0, . . . , ^m3 are used in the computation of the updated value of ^(x) according to
A Reed-Solomon decoder includes an inversionless Berlekamp-Massey algorithm (iBMA) circuit with an error locator polynomial generator that generates an error locator polynomial. A scratch polynomial generator generates a scratch polynomial. A discrepancy generator generates discrepancy values based on the error locator polynomial and the scratch polynomial. Multipliers used by the error locator polynomial generator are also used by the discrepancy generator of the iBMA circuit to reduce circuit area.
In other features, a syndrome calculating circuit generates syndromes that are output to the iBMA circuit. An error location finder circuit communicates with the iBMA circuit and that identifies error locations. The error location finder circuit can implement Chien's search.
In other features, an error value computation circuit communicates with the iBMA circuit and that generates error values. The error value computation circuit implements one of Forney's and Feng's algorithm. The iBMA circuit completes one iteration every two clock cycles.
In still other features, a second delay circuit communicates with an output of the scratch polynomial generator. The discrepancy generator calculates discrepancy values over two clock cycles.
In still other features, a first delay circuit delays the discrepancy values. A feedback loop feeds back the delayed discrepancy values to the error locator polynomial generator and the scratch polynomial generator. The iBMA circuit is integrated with at least one of the error location finder circuit and the error value computation circuit.
Further areas of applicability of the present invention will become apparent from the detailed description provided hereinafter. It should be understood that the detailed description and specific examples, while indicating the preferred embodiment of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention.
The present invention will become more fully understood from the detailed description and the accompanying drawings, wherein:
The following description of the preferred embodiment(s) is merely exemplary in nature and is in no way intended to limit the invention, its application, or uses. For purposes of clarity, the same reference numbers will be used in the drawings to identify similar elements.
The present invention provides an efficient and high-speed Reed-Solomon decoder. Higher speeds are achieved by pipelining the feedback loop in the iBMA circuit one or more times. Circuit area is reduced by hardware sharing between the iBMA pipelines and hardware sharing between the iBMA, the error location finder and the error value computation circuit.
Referring now to
In the exemplary implementation, a register 152 is positioned at the output of the adder 114. The register 152 outputs Δd, which is a delayed discrepancy value. Instead of completing one iBMA iteration per clock cycle, two clock cycles are required for each iBMA iteration.
Referring now to
Additional increases in clock speeds can be achieved by further pipelining the feedback loop. For example, the discrepancy Δ computation can be broken into two steps and three clock cycles can be used to complete each iBMA iteration.
After pipelining the feedback loop as taught in
After the 2t iterations of the iBMA, the ^(x) coefficients values are ready to be used in Chien's search to find error locations. At the same time, error values can be computed using Forney's algorithm. The error values can also be computed using “Error Evaluator for Inversionless Berlekamp-Massey Algorithm In Reed-Solomon Decoders”, Feng et al. Ser. No. 10/304,511, filed Nov. 26, 2002 which was based on U.S. Provisional Application No. 60/371,898, filed Apr. 11, 2002 and which are both hereby incorporated by reference in their entirety. When there is no erasure, Feng's evaluation can be used to simplify the computation. In other words, the error evaluator polynomial generator 38 in
Both equations are evaluated at α−1 for the error value, where ^′(x) is the formal derivative of ^(x), and B(x) is the scratch polynomial used in the iBMA algorithm.
To reduce circuit area, the pipelined iBMA algorithm, Chien's search and the error value computation can share the same circuitry. Referring to
After the 2t iBMA iterations, the ^(x) coefficients values are stored in registers ^0, ^1, ^2, and ^3. To evaluate ^(x) at 1, α−1, α−2, . . . , α−(n−1), the multipliers Sm0, . . . , Sm3 and adders A0, . . . , A3 in
Alternately, if Forney's algorithm is used, the Ω(x) coefficients can be computed by Ω(x)=^(x)S(x)modx2t by using the multipliers ^m0, ^m1, ^m2, and ^m3 and the adders 110, 112 and 114. The B(x) registers are used to store the Ω(x) coefficients values. The iBMA and error value calculation algorithms share the hardware, as shown in
Those skilled in the art can now appreciate from the foregoing description that the broad teachings of the present invention can be implemented in a variety of forms. Therefore, while this invention has been described in connection with particular examples thereof, the true scope of the invention should not be so limited since other modifications will become apparent to the skilled practitioner upon a study of the drawings, the specification and the following claims.
This application claims the benefit of U.S. Provisional Application No. 60/370,644, filed Apr. 8, 2002, which is hereby incorporated by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
6092233 | Yang | Jul 2000 | A |
6119262 | Chang et al. | Sep 2000 | A |
6122766 | Fukuoka et al. | Sep 2000 | A |
6209115 | Truong et al. | Mar 2001 | B1 |
6256763 | Oh et al. | Jul 2001 | B1 |
6286123 | Kim | Sep 2001 | B1 |
6317858 | Cameron | Nov 2001 | B1 |
6347389 | Boyer | Feb 2002 | B1 |
6374384 | Ohta et al. | Apr 2002 | B1 |
6378104 | Okita | Apr 2002 | B1 |
Number | Date | Country |
---|---|---|
WO9909694 | Feb 1999 | WO |
Number | Date | Country | |
---|---|---|---|
60370644 | Apr 2002 | US |