Not Applicable
Not Applicable
1. Technical Field
This invention relates in general to error correction coding and, more particularly, to an improved Reed-Solomon decoder.
2. Description of the Related Art
Modern digital communication systems typically make use of error correction coding (ECC) schemes to improve system performance under imperfect conditions. For digital magnetic storage devices, for example, signals on the magnetic medium may degrade over time, or imperfections in the magnetic medium may cause recording errors. For optical storage devices, such as compact disks (CDs) and digital video devices (DVDs), physical wear and tear may result in misreading data. In both these cases, some errors can be correct through the use of a proper ECC scheme.
Perhaps more importantly, in digital communication systems, such as satellite communications, mobile phones, smart phones, mobile processing systems, and high-speed modems, errors can be introduced through the transmission medium. For clear, error-free transmissions, it is important to have a robust ECC scheme.
Reed-Solomon error correcting codes are used throughout the electronics industry to provide error correction, largely because of their relative simplicity of implementation and their ability to detect and correct multiple errors, especially for channels subject to burst errors. Because, Reed-Solomon error correction is word-based, a single error encompasses any or all errors within a single word, as opposed to other correction techniques in which each bit error is an independent error event.
Nonetheless, the increasing speeds of data transmission require increase speeds for error correction. Reed-Solomon error correction involves complicated matrix operations that must be performed in real time in most cases. In many devices, it is desirable to perform the Reed-Solomon error correction within a programmable processing device, such as a DSP (digital signal processor). Accordingly, techniques for improving the speed in which the matrix calculations can be performed are extremely valuable.
Therefore, a need has arisen for providing an improved Reed-Solomon decoder.
In the present invention, for Reed-Solomon decoding, vector of υ syndromes Ei and υ error locations lj are determined from a received codeword, and error magnitudes el
where a is a primitive of the codeword. Error magnitudes are determined by triangularizing a υ×υ Vandermonde matrix of the elements ail
For a more complete understanding of the present invention, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, in which:
a illustrates a flow chart describing a preferred embodiment of triangularizing a root matrix;
b illustrates a table showing the results of the operation of
a illustrates a flow chart describing a preferred embodiment of adjusting a syndrome vector pursuant to triangularizing the root matrix;
b and 4c illustrates a table showing the results of the operation of
The present invention is best understood in relation to
Reed-Solomon codes are generated using a polynomial of the form:
g(x)=(x−a0)(x−a1)(x−a2) . . . (x−a2t−1), where a is referred to as a “primitive”, and the codeword is generated as:
c(x)=g(x) i(x), where i(x) is the data block.
The encoded data, c(x), is transferred through, or to, a medium 14, such as a communication channel or a storage medium. At this point, noise or other errors, e(x), may enter the encoded data stream. The data stream received at the Reed-Solomon decoder 16 is thus r(x)=c(x)+e(x). The Reed-Solomon decoder 16 decodes the data and corrects errors.
The syndrome calculator 20 calculates the 2t known coefficients of the Fourier transform E(x) from the received message. These are known as the “syndrome coefficients” and denoted St.
The next block is the error locator polynomial 22. The error locator polynomial has roots that give the error locations. The two most popular methods of finding the error locater polynomial are Euclid's Algorithm and the Berlekamp-Massey Algorithm.
The next block is the error locations block 24. In this block, using the error location polynomial from block 22, the roots of the error location polynomial are determined, typically using a Chien search.
The next block is the error magnitudes block 26. At this point, the location of the errors is known, but not the errors themselves. The syndromes and the error polynomial roots are used to derive the error values. In the prior art, this is typically accomplished using the Forney method. The present invention presents a different method of computing the error magnitudes that improves performance.
The final block is the error correction block 28. After the error magnitudes block 26, the error polynomial e(x) is known completely and can be subtracted from the received polynomial r(x) to yield the codeword c(x).
It should be noted that Reed Solomon structures use Galois arithmetic. Some DSPs (digital signal processors) perform Galois operations directly, while others can be programmed to implement Galois arithmetic.
The error magnitude block solves the equation:
where υ is the number of errors, lj are the error locations, el
The equation set forth above is the commonly used matrix equation for solving for e(x). The method set forth below provides an efficient way of solving the matrix equation, which can be particularly useful for software implementations using a DSP.
For simplification, the notation Ax is used in place of al
This υ×υ matrix is a Vandermonde matrix and can be efficiently triangularized by the methods discussed below. By triangularizing this matrix, the determination of the error magnitudes can be simplified. In this case, an upper triangular matrix is shown, but a lower triangular matrix can be used as well. The triangularized matrix for υ=4 is:
The triangularization of the error polynomial coefficient matrix requires 1+2+3+ . . . +(v−1) (Galois) multiplications and 1+2+3+ . . . +(v−1) (Galois) additions. To preserve the equation, the same operation are performed on the other side of equation (1); hence the syndrome vector is rewritten as:
These computations can also be done using 1+2+3+ . . . +(υ−1) multiplications and the same number of additions.
Once triangularized matrixes are formed, the equations can be solved with relative ease. Solving at the last row (first row for a lower triangular matrix) of the error polynomial coefficient matrix yields an equation with one unknown, namely (for the 4×4 matrix example):
The equation is easily solved for el
The next row of the matrix equation can be solved for el
A more generalized implementation of the triangularization techniques are now described. In the following discussion, R(Ax)z indicates a replicate function, found on some DSPs as a native command that replicates a scalar across a vector z times. Hence R(A1)3 would yield [A1 A1 A1]. Further, the vector multiplication and addition is element-by-element; hence if V1=[A1 A2 A3] and V2=[A3 A2 A1], then V1*V2=[A1A3 A2A2 A3A1]. Likewise V1+V2=[A1+A3 A2+A2 A3+A1].
The υ×υ error polynomial coefficient matrix can be triangularized using the formula Vn=(V(n−1)+R(A(n−1))υ−n+1)*V(n−1) with V1 being the vector of the top row of the matrix and Vv being the vector of the bottom row of the matrix.
V1={A1 A2 A3 . . . Av}
V2=(V1+R(A1))*V1
V3=(V1+R(A2))*V2
V4=(V1+R(A3))*V3
. . .
Vv=(V1+R(Av)υ−n+1)*V(v−1)
a illustrates a flowchart showing the computations for the triangularized error polynomial coefficient matrix. In step 30, the top row of the error polynomial coefficient matrix is defined as {A1 A2 A3 . . . Av}. The index, n, is set to “1”. In block 32, the index n is incremented and in block 34, the next row of the error polynomial coefficient matrix is computed as Vn=(V(n−1)+R(A(n−1))υ−n+1)*V(n−1). Steps 32 and 34 are repeated until the final row (n=υ) is computed in decision block 36. As can be seen, the computation of the triangular matrix is structured such that the previous row (V(n−1)) is multiplied (element-by-element) by V1+R(A(n−1). Accordingly, a minimum of calculations are performed to compute each successive row.
On a DSP, such as the TEXAS INSTRUMENTS C6XHP family of processors, computing each successive vector would take two instruction cycles. Triangularizing an 8×8 matrix, therefore, would take about 16 cycles.
The operations performed on the syndrome vector can be described as follows:
where T(n−1)<<1 is the previous value of T, left-shifted and right-filled with a “0”. In the addition of R(A(n))n*T(n−1)+T(n−1)<<1, the value of R(A(n))n*T(n−1) is left-filled with a “0” so that the vectors are the same size. Once again, additions and multiplications between vectors are element-by-element. Sum(U(n)) is the sum of all elements of U(n).
a illustrates a flow chart describing the recursive calculation of the syndrome vector. In step 40, the index n is set to “0” and an initial vector T0 is defined. In step 42, the index is incremented and in step 44, T(n) is defined as T(n)=R(A(n))n*T(n−1)+T(n−1)<<1. For T1 (i.e., n=1), this calculation would be equivalent to {A1}*{1}+{1 0}. Therefore, T1={1 A1}. In step 46, U(n) is calculated as U(n)=T(n−1)*{E(n) E(n−1) . . . E1}. For n=1, U(n)={1}{E1}={E1}. In step 48, W(n) is calculated as W(n)=sum(U(n)). For n=1, U(n)=E1. The modified syndrome vector is set to:
Decision block 50 repeats steps 42 through 48, until all elements of the modified syndrome vector are calculated. As can be seen, each element is calculated from a previous T vectors in steps 44 and 46.
For n=2, the new value of T is calculated as the previous value of T−{1 A1}−multiplied element-by-element with {A2 A2} to render {A2 A1A2}. This is added to the left-shifted T(n−1)−{1 A1 0} to render {1 A1+A2 A1A2}. U2 is calculated as {1 A1} multiplied by {E2 E1} to render {E2 A1E1}. Hence, W2=E2+A1E1.
On a modern DSP, such as the C6XHP family of processors, computing each successive T and U vector takes two instruction cycles (two shifts, two XORs, four GF(256) multiplies, and two replicates). Computing W[2] would take a shift and an add, W[3] and W[4] would take two shifts and two adds each, and W[5-8] would take two shifts and three adds each. Thus for a 1×8 matrix, about 8 instruction cycles would be required to compute the W vector from the U values. Including the time to compute the T and U vectors, about 24 instruction cycles would be required to compute the W vector.
In block 64, the error magnitudes are calculated by substitution. The kernel execution time for back-substitution would be about six instruction cycles per iteration. The operations would be:
Another iteration of step 6 cannot be started until step 11 is complete; however, the loading of a reciprocal and packing of a column into a vector can be started prior to step 11 to conserve time. The execution time of back-substitution would be about forty-eight instruction cycles for an 8×8 error polynomial coefficient matrix.
The worst-case execution time of finding the error magnitudes on a C6XHP or similar processor would be about 88 (16+24+48) instruction cycles. This could probably be reduced by specific coding for vectors that mostly contain zeroes.
The present invention provides an alternative method for the Forney method of calculating error magnitudes. On modern DSPs, the present invention can provide increased speed in the calculation. Whereas the Forney method operates on scalars, the present invention operates on vectors to take better advantage of hardware that computes several Galois field at a time. As processors are able to operate on larger Galois field vectors, the savings in execution time using the present invention increases.
Although the Detailed Description of the invention has been directed to certain exemplary embodiments, various modifications of these embodiments, as well as alternative embodiments, will be suggested to those skilled in the art. The invention encompasses any modifications or alternative embodiments that fall within the scope of the Claims.
Number | Name | Date | Kind |
---|---|---|---|
4665523 | Citron et al. | May 1987 | A |
6012159 | Fischer et al. | Jan 2000 | A |
6131178 | Fujita et al. | Oct 2000 | A |
Number | Date | Country |
---|---|---|
341004 | Aug 1992 | DE |
Number | Date | Country | |
---|---|---|---|
20030126542 A1 | Jul 2003 | US |