Decoder for error correcting block codes

Information

  • Patent Grant
  • 6637002
  • Patent Number
    6,637,002
  • Date Filed
    Wednesday, October 21, 1998
    26 years ago
  • Date Issued
    Tuesday, October 21, 2003
    21 years ago
Abstract
A decoder for decoding block error correction codes is described. The decoder includes a first search circuit to find roots of an error location polynomial corresponding to an error location and a second search circuit to find roots of an error location polynomial corresponding to an error location. A multiplexer is fed by the first search circuit and the second search circuit to produce an error location from the error location polynomial.
Description




BACKGROUND




This invention relates to error correcting code techniques.




Error correcting code techniques are used in digital systems to correct and detect errors in digital data. Such techniques are particularly employed with magnetic disks which typically have relatively high rates of error due to manufacturing defects and so forth. Several types of error correcting techniques are known. One type of error correcting technique uses so-called “block codes.” Two examples of block codes are the so called “Reed Solomon” (RS) code and “Bose-Chaudhuri-Hocquenghem” (BCH) code.




Block codes such as “Reed Solomon” and “Bose-Chaudhuri-Hocquenghem”, operate on data symbols. Data symbols are vectors of binary bits of data. Symbols are generated by grouping bits at a particular bit position for a group of words into a vector. When block codes are used in a disk drive, before a string of data symbols are recorded on a disk, the string is mathematically encoded producing error correcting code symbols that are appended to the data symbols to form code words. These code words i.e., data symbols and error correcting code symbols are stored on the disk. When the stored data is retrieved from the disk, the code words are mathematically decoded to form decoded code words.




In order to correct errors, the locations and magnitudes or values of the errors are needed. A conventional approach for determining the location of such errors uses a decoder including a syndrome generator to form error syndromes. The error syndromes are used to generate an error location polynomial and the error location polynomial is used to determine an error location. The conventional approach uses a single Chien search circuit to search for roots of the error location polynomial. The Chien search circuit evaluates the error location polynomial for all possible error locators by iteratively evaluating the error location polynomial “n” times, where n is the code length. This is a time consuming process when n is large and the code rate is high.




The roots of the error location polynomial (i.e., when the error location polynomial is zero) correspond to the location of the errors in the code word. An error evaluation polynomial is evaluated to determine error values based on the determined error locations. Once the error locations and the corresponding error values are known, the data can be corrected.




SUMMARY




According to an aspect of the invention, a decoder for decoding block error correction codes includes a first search circuit to find roots of an error location polynomial corresponding to an error location and a second search circuit to find roots of an error location polynomial corresponding to an error location. A multiplexer is fed by the first search circuit and the second search circuit to produce an error location from the error location polynomial.




According to a further aspect of the invention, a decoder for decoding block codes includes a parallel error location polynomial search circuit. The parallel error location search circuit includes a first search circuit that finds roots of an error location polynomial that correspond to an error location. The circuit includes a first modulo adder circuit that adds odd terms of the error location polynomial to produce a derivative of the error location polynomial. The circuit also includes a second search circuit that finds roots of an error location polynomial that correspond to an error location. A second modulo adder circuit adds odd terms of the error location polynomial to produce a derivative of the error location polynomial. The circuit also includes a first multiplexer fed by error location outputs from the first search circuit and the second search circuit, a second multiplexer fed by derivatives of error location outputs from the first and the second modulo adders. The circuit also includes an error evaluation circuit that produces the error evaluator polynomial and a logic circuit including a divider circuit to divide the error evaluator polynomial by the output from the second multiplexer.




According to a still further aspect of the invention, a method of decoding an error correcting block code includes evaluating an error location polynomial produced from an error location polynomial generation circuit by applying the error location polynomial to a parallel root search circuit that searches for roots of the error location polynomial in a parallel manner.




One or more advantages are provided by the above aspects. The two-way parallel root search circuit searches for roots of the error location polynomial in a parallel manner. This saves substantial computation time to facilitate real time decoding of high rate Reed Solomon and BCH codes. For example, a two-way parallel root search circuit can use two duplicate root search circuits to reduce the search time to determine the roots of an error-locator polynomial by half. An additional advantage of the arrangement is that the parallel root search circuit can be modified to provide a term that can be used with an error evaluator without the need to compute a derivative of the error location polynomial.











BRIEF DESCRIPTION OF THE DRAWINGS





FIG. 1

is a block diagram of a computer including a storage device.





FIG. 2

is a block diagram of an error correction apparatus for the storage device of FIG.


1


.





FIG. 3

is a block diagram of an error correction detection circuit.





FIGS. 4 and 5

are block diagrams of parallel error search circuits.





FIGS. 6 and 7

are block diagrams of multi-way error search circuits.





FIGS. 8 and 9

are block diagrams of parallel search circuits and error evaluator circuits.





FIG. 10

is a detailed diagram of the error search and evaluator circuit of FIG.


8


.





FIG. 11

is a block diagram of a generalized U-way error search and evaluator circuit.





FIG. 12

is a block diagram of an initializer circuit.





FIG. 13

is a block diagram of an error evaluator circuit.











DETAILED DESCRIPTION




Referring now to

FIG. 1

, a computer system


10


includes a processor


12


, a main memory


14


and a magnetic storage subsystem


15


, all coupled via a computer bus


16


. The magnetic storage system


15


includes a bus adapter


18


, a storage bus


19


and a storage device


20


. A preferred example of the storage device


20


is a magnetic disk. Other examples include a magnetic tape drive and a solid state disk.




The storage device


20


includes an error correcting code symbol generator


22


and an error correction circuit


24


, as well as, a disk drive


26


.




Referring now to

FIG. 2

, a portion of the storage device


20


is shown including the error correcting code generator


22


. The error correcting code generator


22


is fed input data from the bus adapter


18


and generates error correcting code symbols. During a write to disk operation, the error correcting code symbols are concatenated with data symbols and fed to a disk interface


23


for storage on the disk drive


26


(FIG.


1


). During a read from disk operation, the data from the disk


26


are fed to the disk interface


23


and to an error correction circuit (EC)


24


. The EC circuit


24


uses a parallel decoding technique in order to correct the errors. The EC circuit


24


corrects such errors providing they do not exceed the capability of the EC circuit


24


.




Referring now to

FIG. 3

, details of the error correction circuit


24


are shown. A syndrome generator


30


produces error syndromes from results of “exclusive ORing” retrieved error correcting code symbols in a conventional manner. The syndromes are fed to an error location polynomial circuit


32


to produce an error locator polynomial σ. The roots of the error location polynomial are determined in a two-way, parallel root search circuit


34


. The roots of the error location polynomial, if any, correspond to the error locations, if any, in the codewords. The evaluator circuit


36


evaluates the errors at the locations identified by the parallel root search circuit


34


. As will be described below, optionally, the error evaluator and root search circuits can share circuits.




The two-way parallel root search circuit


34


searches for roots of the error location polynomial σ in a parallel manner. This saves substantial computation time to facilitate real time decoding of high rate Reed Solomon and BCH codes. The two-way parallel root search circuit


34


implements a variation of the so-called “Chien” search algorithm.




A Chien search algorithm evaluates an error location polynomial using Galois field (GF) arithmetic. Considering an m bit symbol of a Reed Solomon code of length 2


m


−1, such a symbol can be considered to be an element “α” defined in the Galois field GF(2


m


). If α is a primitive element of the Galois field GF(2


m


) then the possible error locations in the multi-bit symbol are in the set of {1, α, . . . α


n−1


}. The value α is a primitive element if α can generate all of the values of the field except 0. If σ(x) is an error location polynomial that contains k error locations, namely, α


t1


, α


t2


, . . . α


tk−1


and α


tk


then






σ(χ)=(α


t1


χ−1)(α


t2


χ−1) . . . (α


tk−1


χ−1)(α


tk


χ−1)=σ


k


χ


k





k−1


χ


k−1


+ . . . +σ


1


χ+σ


0








The error locations α


αj


are found from the given coefficients σ


i


, i=0, . . . , k. The simplest way to find an error-location from σ(χ) is by trial and error, i.e., by evaluating σ(χ) for all possible values i.e., χε{1, α


−1


, . . . , α


−(n−1)


}, where σ(α


−1)=


0, α


−1


corresponds to an error location. The Chien search is an implementation of this trial and error method in an orderly manner.




Consider the evaluation of σ(χ) with α


−j


for j≧0. Every possible value of α


−j


is evaluated as:






σ(α


−j


)=σ


k


α


−kj





k−1


α


−(k−1)j


+ . . . +σ


1


α


−j





0








The order of trials can be either in ascending order or in descending order of j. For an ascending order search, the expression σ(α


−j


) is evaluated to find error locations by iterating from 0 to n−1. Given a bank of registers reg


k


(j) to reg


0


(j), and denoting reg


i


(j)=σ


i


α


−ij


provides






σ(α


−j


)=reg


k


(


j


)+reg


k−1


(


j


)+ . . . +reg


1


(


j


)+reg


0


(


j


)






where reg


i


(j) can be considered as the i-th register value in registers reg


k


(j) to reg


0


(j) at iteration cycle j. Reg


i


(j) can be obtained iteratively from the value of the same register in the previous iteration cycle, i.e.,






reg


i


(


j


)=α


−i


reg


i


(


j


−1) for


j


≧1






The initial values of these registers are:






reg


i


(


0


)=σ


i


for


i


=0


, . . . k








A Chien search can also be performed in a descending order. For a descending order search the expression a is iteratively evaluated from n−1 to 0.




The error location polynomial evaluated from n−1 to 0 can be expressed as:






σ(α


−(n−1−j)


)=σ


k


α


−k(n−1−j)





k−1


α


−(k−1)(n−1−j)


+ . . . +σ


1


α


−n−1−j)





0








By defining






reg


i


(


j


)=σ


i


α


−i(n−1−j)


=(α


i(1−n)


σ


i





ij








we have






σ(α


−(n−1−j)


)=reg


k


(


j


)+reg


k−1


(


j


)+ . . . +reg


1


(


j


)+reg


0


(


j


)






where the initial value of the register reg


i


is






reg


i


(


0


)=α


i(1−n)


σ


i








and the iterative relation is






reg


i


(


j


)=α


i


reg


i


(


j


−1) for


j


≧1






An “n” number of iterations are performed to evaluate the error location polynomial, where n is the code length. When n is large, the number of the iterations may be too large for real-time hardware decoding.




Referring now to

FIG. 4

, the two-way parallel root search circuit


34


is shown. The two-way parallel search circuit


34


includes a first search circuit


34




a


comprised of a series of multipliers


40


and a first bank of registers


42


that feed results on an iterative basis to a bank of “exclusive OR” gates


44


. The banks of exclusive OR gates


44


and


45


are fed by register


0


REG (


0


)


47


. The bank of exclusive or gates


44


“exclusive OR's” (i.e., adds modulo


2


) the outputs of register


47


and the register bank


42


.




The two-way parallel root search circuit


34


includes a second search circuit


34




b


comprised of multipliers


41


and a second bank register


43


that also evaluate the error polynomial on an iterative basis. The outputs of the second bank of registers


43


are coupled to a second bank of exclusive OR gates


45


to evaluate the results from the second bank of registers


43


.




The output of exclusive OR gate banks


44


and


45


are fed to a multiplexer


48


that is controlled by a select signal (not shown) that alternately selects one of the search circuits


34




a


,


34




b


, as an output each cycle of the decoder


34


. When the multiplexer


48


outputs a logic zero, this corresponds to a root of the error location polynomial.




The two-way parallel root search circuit


34


reduces the number of search iterations and, as a consequence, the amount of search time, needed to find the roots of the error location polynomial. The two-way parallel search circuit uses a pair of chains each chain having different initial values and multiplier coefficients. The initial values for each chain are derived as follows:




Let v be any integer such that 0≦v≦n−1, then








v


=2


j+l








with l=0 or 1 and 0≦j≦(n−1)/2.




For a two-way parallel root search from 0 to n−1, σ(α


−v


) the v


th


root of the error location polynomial is given by:










σ


(

α

-
v


)


=







σ
k



α

-

k


(


2

j

+
1

)





+


σ

k
-
1




α


-

(

k
-
1

)




(


2

j

+
1

)




+

+


σ
1



α


-

(


2

j

+
1

)


+

σ
0











=







(


α

-
lk




σ
k


)



α


-
2


k





j



+


(


α

-

l


(

k
-
1

)






σ

k
-
1



)



α


-
2



(

k
-
1

)


j



+

+














(


α

-
1




σ
1


)



α


-
2


j



+

σ
0








=








i
=
0

k








(


α

-
il




σ
i


)



α


-
2






ij

















by defining






reg


i




(0)


(


j


)=σ


i


α


−(2i)j


and reg


i




(1)


(


j


)=(α


−i


σ


i





−(2i)j








σ can be expressed in two different chains and is given, as in Equations 1 and 2:






σ(α


−(2j)


)=reg


k




(0)


(


j


)+reg


k−1




(0)


(


j


)+ . . . +reg


1




(0)


(


j


)+reg


0




(0)


(


j


)  (Equation 1)






and






σ(α


−(2j+1)


)=reg


k




(1)


(


j


)+reg


k−1




(1)


(


j


)+ . . . +reg


1




(1)


(


j


)+reg


0




(1)


(


j


)  (Equation 2)






for 0≦j≦(n−1)/2. Moreover, the values of the registers


42


(reg


i




(0)


)


42


and registers


43


(reg


i




(1)


) have the same iterative relations for 0≦j≦(n−1)/2, as given in equations 3 and 4.






reg


i




(0)


(


j


)=α


−2i


reg


i




(0)


(j−1)  (Equation 3)








reg


i




(1)


(


j


)=α


−2i


reg


i




(1)


(j−1)  (Equation 4)






The initial values of these two banks


42


,


43


of registers are:




register


42


reg


i




(0)


(


0


)=σ


i


and




register


43


reg


i




(1)


(


0


)=α


−i


σ


i






The values of reg


i




(0)


(j) and reg


i




(1)


(j) can be computed independently. The two-way parallel root search circuit


34


uses two duplicate root search circuits


34




a


,


34




b


which have different initial values in registers


42


,


43


. This parallel circuit


34


reduces the search time to determine the roots of the error-locator polynomial by half. The initial values of reg


i




(0)


registers


42


are provided by the error-locator polynomial coefficients σ


i


, whereas an initializer


50


converts the initial values of reg


i




(0)


registers


43


to those of reg


i




(1)


. The initializer


50


is described in conjunction with FIG.


12


. The initial values for the registers are given in TABLE 1:


















TABLE 1









i




k




k − 1




. . .




2




1




0











reg


i




(0)


[0]




σ


k






σ


k−1







σ


2






σ


1






σ


0








reg


i




(1)


[0]




α


−k


σ


k






α


−(k−1)


σ


k−1







α


−2


σ


2






α


−1


σ


1






σ


0
















Referring now to

FIG. 5

, a parallel two-way root search circuit


34


′ that searches from n−1 to 0 is illustrated. The circuit


34


′ has a similar structure to circuit


34


. Circuit


34


′ therefore includes a first search circuit


34




a


′ including multipliers


40


′, registers


42


′, an “exclusive OR” bank


44


, and a second search circuit


34




b


′ including multipliers


41


′, registers


43


′ and an “exclusive OR” bank


45


′. The search circuit


34


′ includes a register (


0


)


47


′ and a multiplexer


48


′. Multiplexer


48


′ is controlled by a select signal (not shown) that alternately selects one of the search circuits


34




a


,


34




b


, as an output for each cycle of the decoder


34


′. Circuit


34


′ differs from circuit


34


(

FIG. 4

) in that the coefficients used to multiply the contents of registers are different and the initial values in registers


42


′,


43


′ are also different, as illustrated in the accompanying TABLE 2.


















TABLE 2









i




k




k − 1




. . .




2




1




0











reg


i




(0)


[0]




α


k(1−n)


σ


k






α


(k−1)(1−n)


σ


k−1







α


2(−n)


σ


2






α


1−n


σ


1






σ


0








reg


i




(1)


[0]




α


k(2−n)


σ


k






α


(k−1)(2−n)


σ


k−1







α


2(1−n)


σ


2






α


2−n


σ


1






σ


0
















The two-way parallel root search


34


′ from n−1 to 0 can be explained, as follows. For circuit


34


′, the initial values of the two sets of registers


42


′,


43


′ reg


i




(0)


and reg


i




(1)


are:






reg


i




(0)


(


0


)=α


(1−n)i


σ


i


and reg


i




(1)


(


0


)=α


(2−n)i


σ


i








and the iteration relations are:






reg


i




(l)


(


j


)=α


2i








 reg


i




(l)


(


j


−1)




for 0≦j≦(n−1)/2. The error location polynomial σ is given by equation 5:






σ(α


−(n−2j−l)


)=reg


k




(l)


(


j


)+reg


k−1




(l)


(


j


)+ . . . +reg


1




(l)


(


j


)+reg


0




(l)


(


j


)  (Equation 5)






for 0≦j≦(n−1)/2, where l=0 and 1.




Referring now to

FIGS. 6 and 7

, generalized U-way parallel root search circuits


60


and


60


′ are shown for search directions


0


to n−1


60


(

FIG. 6

) and for n−1 to


0




60


′ (FIG.


7


), respectively. As shown in

FIG. 6

, the U-way parallel search circuit


60


includes a plurality of (U number of) root search circuits


34




0


-


34




u−1


, each evaluating one of the iterations for n and each comprising a plurality of registers/multiplier combinations. Each one of the search circuits


34




a


-


34




u


includes a bank of “exclusive OR” gates, as generally shown for bank


34




a


or


34




a


′, respectively. The outputs of the root search circuits


34




0


-


34




u−1


are fed to a multiplexer


49


that is controlled by a select signal (not shown) that alternately selects one of the inputs as an output each cycle of the decoder. When the multiplexer


49


outputs a logic zero, this corresponds to a root of the error location polynomial.




The generalized U-way search circuit can be explained as follows: If u is a positive integer, then for any integer v where 0≦v≦n−1,








v=uj+l








with l ε{0,1, . . . , u−1} and 0≦j≦(n−1)/u. The error location polynomial is given by:










σ


(

α

-
v


)


=







σ
k



α

-

k


(

uj
+
l

)





+


α

k
-
1




α


-

(

k
-
1

)




(

uj
+
l

)




+

+


σ
1



α

-

(

uj
+
l

)




+

σ
0








=








i
=
0

k








α

-
il




σ
i



α

-
iuj

















A U-way parallel root search from 0 to n−1 also can be provided. For branches l=0, . . . , u−1, registers reg


i




(l)


can be defined with the initial values, as given in equation 6.






reg


i




(l)


(


0


)=α


−il


σ


i


  (Equation 6)






The iterative relation for the registers is given by equation 7 as:






reg


i




(l)


(


j


)=α


−iu


reg


i




(l)


(


j


−1), 0


≦j≦(n−


1)/u  (Equation 7)






Thus, for branches l=0,1, . . . , u−1, the error location polynomial is given by equation 8 as:






σ(α


−(uj+l)


)=reg


k




(l)


(


j


)+reg


k−1




(l)


(


j


)+ . . . +reg


1




(l)


(


j


)+reg


0




(l)


(


j


)  (Equation 8)






with 0≦j≦(n−1)/u.




Referring now to

FIG. 7

, a u-way parallel Chien search from n−1 down to 0 can be provided based on the teachings set out above.




For the branches l=0, . . . , u−1, registers reg


i




(l)


can be defined with the initial values, as given in equation 9.






reg


i




(l)


(


0


)=α


(l+1−n)i


σ


i


  (Equation 9)






The iterative relation is thus given by equation 10 as:






reg


i




(l)


(


j


)=α


iu


reg


i




(l)


(


j


−1), 0


≦j≦(n−


1)/u.  (Equation 10)






Then, for branches l=0,1, . . . , u−1, the error location polynomial is given by equation 11 as:






σ(α


−(n−uj−l)


)=reg


k




(l)


(


j


)+reg


k−1




(l)


(


j


)+ . . . +reg


1




(l)


(


j


)+reg


0




(l)


(


j


)  (Equation 11)






with 0≦j≦(n−1)/2.




The u registers values reg


i




(0)


(j), reg


i




(l)


(j), . . . , reg


i




(u−1)


(j) can be updated in parallel. Therefore, at the same clock cycle, u results






σ(α


−uj)


, σ(α


−(uj+1)


), . . . , σ(α


−(uj+u−1)


)






can be obtained. Based on equation 11, a u-way parallel root search circuit


60


can have u number of root search circuits


34




0


-


34




u−1


having u different sets of initial values. This circuit


60


can search from 0 to n−1 (

FIG. 6

) and n−1 to 0 (

FIG. 7

) depending on the initial values and coefficients. As mentioned above, the initializer


20


is provided to convert the initial values σ


i


of reg


i




(0)


to those of reg


i




(l)


for l≦1.




Referring now to

FIG. 8

, a portion of a two-way parallel shared search and evaluator circuit


70


is shown. The two-way parallel search and evaluator circuit


70


includes a first root search circuit


74




a


SCF(


0


) and a second root search circuit


74




b


SCF(


1


) with associated multiplexers


75




a


-


75




b


. From the outputs of the multiplexers


75




a


-


75




b


are provided α


−jL


σ(α


−j


) and α


−j(L+1)


σ′(α


−j


). The term α


−j(L+1)


σ′(α


−j


) is one of the terms used to implement the Forney formula for error evaluation, as is described below. The value “L” is the leading exponent of consecutive roots of the Read-Solomon code.




The error location circuits


74




a


and


74




b


include registers and multipliers and three banks


76




a


-


76




c


of “exclusive or” gates. The derivative of the error evaluation polynomial is provided from bank


76




c


whereas, the error location polynomial is provided from banks


76




b


and


76




a


of “exclusive or” gates, as shown.




Each root search circuit


74




a


,


74




b


is a shared circuit that provides an error polynomial root search result and the derivative of the error-location polynomial. The derivative of the error location polynomial is provided by considering that an error-correction m-bit symbol RS code of length n can be generated by:








g


(χ)=(χ−α


L


)(χ−α


L+1


) . . . (χ−α


L+2t−1


)






If the redundancy symbols of the code are located in positions


0


,


1


, . . .


2


t−1, then when a corrupt word r=r


0


, . . . , r


n−1


is received, the decoder


70


will first compute the syndromes by








S
i

=





j
=
0


n
-
1









r
j



α



(

L
+
1

)


j

,







i


=
0


,





,


2

t

-
1











From these syndromes, an error-locator polynomial σ(χ) can be obtained by using a decoding algorithm, such as Berlekamp-Massey algorithm or Euclidean algorithm. The error evaluator polynomial ω(χ) can be computed by:






ω(χ)=σ(χ)


S


(χ)modχ


2t








where







S


(
χ
)


=




i
=
0



2

t

-
1









S
i




χ
i

.













After obtaining the error location j from the error-locator polynomial σ(χ) by the root search, the associated error values can be computed by the Forney formula










e
j

=


ω


(

α

-
j


)




α

-

j


(

L
+
1

)







σ




(

α

-
j


)








Equation





12













where σ′(χ) is the derivative of σ(χ). Additional detail concerning the above discussion can be found in “Theory and Practice of Error Control Codes” by Blahert, Addison-Wesley Publishing Company, 1983.




To implement the Forney formula, two values are needed ω(α


−j


) and α


−j(L+1)


σ′(α


−j


). The value α


−j(L+1)


σ′(α


−j


) can be produced from the root search circuits


74




a


,


74




b.






Since








σ




(
χ
)


=




i
=
0

k







i






σ
i



χ

i
-
1














and the code is over GF(2


m


), we have











α

-

j


(

L
+
1

)







σ




(

α

-
j


)



=






α

(

-

j


(

L
+
1

)



)







i





is





odd










σ
i



(

α

-
j


)



i
-
1










=








i





is





odd









σ
i



α


-

(

L
+
i

)



j

















Recalling that the standard root (i.e., Chien) search technique is given by






reg


i


(


j


)=σ


i


α


−(L+i)j








for i=0,1, . . . , k and j=0,1, . . . , n−1.




The Chien search circuit can also be used to determine the derivative of the error location polynomial for use in an error evaluation circuit. The outputs from the root search circuit are given by Equation 13:











α

-
jL




σ


(

α

-
j


)



=




i
=
0

k








reg
i



(
j
)







Equation





13













and equation 14 gives the outputs of the root search circuit that provide the term for the error evaluator











α

-

j


(

L
+
1

)







σ




(

α

-
j


)



=




i





is





odd





reg
i



(
j
)







Equation





14













for 0≦j≦n−1. By initializing the register reg


i


to reg


i


(


0


)=σ


i


, the register reg


i


has the following iterative relation






reg


i


(


j


)=α


−(L+i)


reg


i


(


j


−1)






for 1≦j≦n−1. Since α


−jL


σ(α


−j


)=0 if and only if σ(α


−j


)=0, χ


L


σ(χ) can also be used to find error-locations. Therefore, according to the above equations 13 and 14, by slightly modifying the root search circuits, the function of finding error-locations and calculation of α


−jL+1


σ′(α


−j


) can share the same circuit. The modification of the root search circuits involve generating a bank of exclusive OR gates to form the modulo sum terms corresponding to odd elements, to provide the derivative of the error location polynomial. Also, by initializing






reg


i


(


0


)=α


(L+i)(1−n)


σ


i








and defining the iteration relationship






reg


i


(


j


)=α


(L+i)


reg


i


(


j


−1)






provides








α


-

(

n
-
1
-
j

)



L




σ


(

α

-

(

n
-
l
-
j




)



=




i
=
0

k








reg
i



(
j
)













and








α


-

(

n
-
l
-
j

)




(

L
+
1

)






σ




(

α

-

(

n
-
1
-
j




)



=




i





is





odd









reg
i



(
j
)













for 0≦j≦n−1. In addition, as shown in

FIG. 9

, a shared, parallel root search circuit


70


′ can be provided in the descending order from n−1 to 0.




Referring now to

FIG. 9

, a portion of a two-way parallel shared search and evaluator circuit


70


′ is shown. The two-way parallel search and evaluator circuit


70


′ includes a first root search circuit


74




a


′ SCF(


0


) and a second root search circuit


74




b


′ SCF(


1


) with associated multiplexers


75




a


′-


75




b


′. From the outputs of the multiplexers


75




a


-


75




b


are provided α


−L(n−1−j)


σ(α


−(n−j)


) and α


−(L+1)(n−1−j)


σ′(α


−(n−j)


).




The error location circuits


74




a


′ and


74




b


′ include registers and multipliers and three banks


76




a


-


76




c


of “exclusive or” gates and are similar to those described above for

FIG. 8

except for different initial values. The derivative of the error evaluation polynomial is provided from bank


76




c


′ whereas, the error location polynomial is provided from banks


76




b


′ and


76




a


′ of “exclusive or” gates, as shown.




A U-way parallel root search that also computes α


−(n−1−j)(L+1)


σ′(α


−i


) can be provided by the U-way parallel root search circuit and modifying the root circuit to have the root search circuit calculate the derivative of the error-locator polynomial in a similar manner, as shown above for the two way circuits


70


,


70


′ in

FIGS. 8 and 9

.




Let v=uj+1. Then




1) Define reg


i




(l)


(


0


)=α


−(L+i)l


σ


i


and the iterative relationship






reg


i




(l)


(


j


)=α


−(L+i)u


reg


i


(


j


−1)






for l=0, . . . , u−1, we have








α


-

(

uj
+
l

)



L




σ


(

α

-

(

uj
+
l




)



=





i
=
0


k









reg
i
l



(
j
)







and








α


-

(
uj
)




(

L
+
1

)






σ




(

α

-

(

uj
+
l




)



=




i





is





odd









reg
i
l



(
j
)













for 0≦j≦(n−1)/u. This U-way shared parallel circuit searches locations from 0 to n−1.




The U-way parallel, shared circuit can search locations from n−1 to 0. By defining reg


i




(l)


(


0


)=α


(L+i)(l+1−n)


σ


i


and the iterative relationship






reg


i




(l)


(


j


)=α


(L+i)u


reg


i


(


j


−1)






for l=0, . . . , u−1, provides








α


-

(

n
-
uj
-
l

)



L




σ


(

α

-

(

n
-
uj
-
l




)



=





i
=
0


k









reg
i
l



(
j
)







and








α


-

(

n
-
uj
-
l

)




(

L
+
1

)






σ




(

α

-

(

n
-
uj
-
l




)



=




i





is





odd









reg
i
l



(
j
)













for 0≦j≦(n−1)/u.




Therefore, one of advantage of the arrangement of the shared circuits


70


,


70


′ for the two way case or for the U-way case is that the denominator of equation (12) is readily available from the root search circuit without the need to compute the derivative σ′(α


−j) of the error location polynomial σ(α




−j).






Referring to

FIG. 10

, the parallel combined root search and error evaluator circuit


70


includes two-way parallel root search circuits


74




a


,


74




b


that share circuitry with an error evaluator circuit


80


. The circuit


70


includes a decoder


82


that uses the Berlekamp-Massey decoding algorithm or Euclidean algorithm to provide at decoder outputs both error-locator polynomial that is fed to the root search circuits


74




a


,


74




b


.







σ


(
χ
)


=




l
=
0

k








σ
i



χ
i













and error-value polynomial that is fed to the error evaluator circuit


80


.







ω


(
χ
)


=




l
=
0

k








ω
i



χ
i













The error-locator polynomial σ(χ) fed to the two-way parallel combined root search circuit


74




a


,


74




b


and provides the roots of the error location polynomial and the derivative calculation of σ(χ) which is carried out by the circuits


74




a


,


74




b


as shown. Since the initial values in the registers of sCF(


1


) are






σ


0


, σ


1


α, . . . , σ


k


α


k








the coefficients of σ(χ) are provided from an initializer


50


(as will be described in FIG.


12


). The initializer


50


accepts the coefficients of σ(χ) as input and outputs initial values σ


0


, σ


1


, . . . , σ


k


for the first root search circuit at the first cycle time, and outputs initial values σ


0


, σ


1


α, . . . , σ


k


α


k


for the second root search circuit at the second cycle time. These two sets of outputs are used as initial values in the registers (

FIG. 8

) of the parallel shared search/evaluator circuits


74




a


and


74




b


. The output of circuits


74




a


,


74




b


will be σ(α


−(2j+l)


)σ′(α


−(2j+l)


) and location 2j+l, respectively for l=0 and 1.




While the processors


74




a


,


74




b


are running, the ω(χ) evaluator circuit


80


(

FIG. 13

) is also running. This evaluator


80


accepts the coefficients of ω(χ) from a Berlakamp-Massey decoder


82


as input, and outputs to an initializer


50


(

FIG. 12

) w


i


α


−2ij


for i=0,1, . . . , k at every cycle time j. It is known that if 2j is an error location, then sCF(


0


) will output σ(α


−2j


)=0, α


−2j


σ′(α


−2j


) and the location 2j. Thus, by directly dividing the output via a divider


84


α


−2j


σ′(α


−2j


) of sCF(


0


) by the summation of the output of the evaluator


80


, the error value l


2j


is obtained. If 2j+1 is an error-location, the values output from the evaluator


80


cannot be used directly. The initializer


50


is used to produce values ω


i


α


−(2j+1)i


, i=0,1, . . . , k from the evaluator


80


that can be used to evaluate the error evaluation polynomial ω(χ). Thus, the output of the evaluator


80


is sent to initializer


50


. Initializer


50


outputs ω


i


α


−2j


, i=0,1, . . . , k at the first cycle time and ω


i


α


−(2j+1)


, i=0,1, . . . , k at the second cycle time. Each of the sets of these two outputs are added together, respectively, by the “exclusive Or” gates


86


. The output of the “exclusive Or” gates are ω(α


−2j


) and ω(α


−2j−1


), respectively.




The processors


74




a


and


74




b


are each comprised of gates that have two inputs a and b and output c such that c=b if a=0, and c=0 otherwise. Thus, when 2j+1 is an error location, the inputs to the divider


84


are






α


−(L+1)(2j+l)


σ′(α


−(2j+l)


) and ω(α


−(2j+l)


),






where f=0 or 1. The divider


84


outputs the error values











e


2


j

+
lL

=



ω


(

α

-

(


2

j

+
l

)



)




α


-

(

L
+
1

)




(


2

j

+
l

)






σ




(

α

-

(


2

j

+
l

)



)




.











The other output of the decoder circuit


84


is the error location.




Referring now to

FIG. 11

, a general circuit


100


for a U-way parallel combined Chien search and Forney formula using a similar approach, as described in

FIG. 10

, is shown. The circuit


100


includes circuits


74




a


-


74




u


coupled to data selectors


89




a


,


89




b


. Multiplexer


89




a


provides the error location and has as inputs the outputs of the circuits


74




a


-


74




u


. Multiplexer


89




b


produces the error value and has as inputs the output of dividers


84




a


-


84




u


. Logic circuits


87




a


-


87




u


are replicated for each input to the multiplexer


89




b


. The other elements are generally as in

FIG. 10






It should be noted that the circuit for evaluating ω(χ) could be omitted to save gate counts. However, without the circuit for evaluating ω(χ) in parallel, all of the error locations would need to be stored and at the end of root search, additional cycles would be needed to compute the error values.




Therefore, a circuit for evaluating ω(χ) in parallel to the root search reduces the decoding delay while maintaining approximately the same gate counts. Also each value of ω(χ) can be provided instead of the initializer circuit


50


. This may prove to be a more costly implementation.




Referring now to

FIGS. 12 and 13

initializer circuit


50


and initializer circuit


20


are shown. These circuits use register


112


, multiplier


114


and a multiplexer


116


to output constants for the multipliers in the search circuits described above.

FIG. 12

shows a generalized initializer


50


for producing initial values for circuits of

FIGS. 4 and 5

whereas,

FIG. 13

shows a generalized circuit for producing initial values for the evaluator circuit


82


in

FIGS. 10 and 11

.




Other Embodiments




It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.



Claims
  • 1. A decoder for decoding block error correction codes comprises:a first search circuit that evaluates an error location polynomial to find a root corresponding to an error location; a second search circuit that evaluates the error location polynomial to find a root corresponding to an error location; and a multiplexer fed by the first search circuit and the second search circuit to produce an error location from a said root.
  • 2. The decoder of claim 1 wherein the first and second search circuits each comprise respectively a storage device for holding a first initial value and a multiplier device for multiplying the contents of the storage device by a coefficient.
  • 3. The decoder of claim 2 wherein the initial values of storage devices of the first search circuit further comprise an initializer circuit to produce initial values for the storage devices of the first search circuit and the second search circuit.
  • 4. The decoder of claim 3 wherein the initializer circuit produces the initial values for the second search circuit from the initial values of the first search circuit.
  • 5. The decoder of claim 1 wherein said first search circuit and said second search circuit iteratively find roots of an error location polynomial starting from an initial iteration of the polynomial to the last iteration of the polynomial.
  • 6. The decoder of claim 5 wherein the first search circuit and the second search circuit and multiplexer implement the following equations:σ(α−(2j))=regk(0)(j)+regk−1(0)(j)+ . . . +reg1(0)(j)+reg0(0)(j) for the second search circuit andσ(α−(2j+1))=regk(1)(j)+regk−1(1)(j)+ . . . +reg1(1)(j)+reg0(1)(j) for the second search circuit.
  • 7. The decoder of claim 1 wherein said first search circuit and said second search circuit iteratively find roots of an error location polynomial starting from the last iteration of the polynomial to the first iteration of the polynomial.
  • 8. The decoder of claim 5 wherein the first search circuit and the second search circuit and multiplexer implement the following equation:σ(α−(n−2j−l))=regk(l)(j)+regk−1(l)(j)+ . . . +reg1(l)(j)+reg0(l)(j) for 0≦j≦(n−1)/2, where l=0 and 1.
  • 9. The decoder of claim 1 further comprising a plurality of additional search circuits, each search circuit initialized with corresponding initial values, to find roots of the error location polynomial that correspond to an error location; andwherein the multiplexer is fed by each of the plurality of additional search circuits to produce an error location from the error location polynomial.
  • 10. The decoder of claim 9 wherein the first and second search circuits and each of the plurality of additional search circuits comprise a storage device to hold initial values assigned to each of the respective search circuits and a multiplier device for multiplying the contents of the storage device by a coefficient.
  • 11. The decoder of claim 10 wherein the initial values of the storage devices of the first search circuit are provided from the error location polynomial and wherein the decoder further comprises:an initializer circuit for initializing the storage devices of the second search circuit and the plurality of additional search circuits.
  • 12. The decoder of claim 11 wherein the initializer circuit produces initial values for the second search circuit and the plurality of additional search circuits by iteratively multiplying values of a previous one of the iterations of the search circuits by a constant.
  • 13. The decoder of claim 9 wherein said first search circuit and said second search circuit iteratively find roots of an error location polynomial starting from an initial iteration of the polynomial to the last iteration of the polynomial.
  • 14. The decoder of claim 13 wherein the first search circuit, the second search circuit and plurality of additional search circuits and multiplexer implement the following equation:σ(α−(uj+l))=regk(l)(j)+regk−1(l)(j)+ . . . +reg1(l)(j)+reg0(l)(j) with 0≦j≦(n−1)/u.
  • 15. The decoder of claim 9 wherein said first search circuit and said second search circuit iteratively find roots of an error location polynomial starting from the last iteration of the polynomial to the first iteration of the polynomial.
  • 16. The decoder of claim 15 wherein the first search circuit, the second search circuit and plurality of additional search circuits and multiplexer implement the following equation:σ(α−(n−uj−l))=regk(l)(j)+regk−1(l)(j)+ . . . +reg1(l)(j)+reg0(l)(j) with 0≦j≦(n−1)/u.
  • 17. A decoder for decoding block codes comprises:a parallel error location polynomial search circuit comprising: a first search circuit that evaluates an error location polynomial to find a root that corresponds to an error location, said first search circuit further comprising: a first modulo adder circuit that adds odd terms of the error location polynomial to produce a derivative of the error location polynomial; a second search circuit that evaluates the error location polynomial to find a root that corresponds to an error location, said second search circuit further comprising: a second modulo adder circuit that adds odd terms of the error location polynomial to produce a derivative of the error location polynomial; a first multiplexer fed by the first search circuit and the second search circuit to produce an error location from a said root; a second multiplexer fed by the error locator polynomial derivatives from the first and second modulo adders; an error evaluation circuit that produces an error evaluator polynomial; and a logic circuit including a divider circuit to divide the error evaluator polynomial by the output from the second multiplexer.
  • 18. The decoder of claim 17 further comprising an initializer circuit that receives the error evaluator polynomial from the error evaluator circuit to produce error values for error locations identified by the error location polynomial circuit.
  • 19. The decoder of claim 18 wherein the initializer circuit produces initial values for error locations identified by the second search circuit.
  • 20. The decoder circuit of claim 17 further comprising a plurality of additional root search circuits.
  • 21. The decoder of claim 17 further comprising an initializer circuit that receives,the error evaluator polynomial from the error evaluator circuit to produce error values for error locations identified each of the root search circuits.
  • 22. A method of decoding an error correcting block code comprises:evaluating an error location polynomial produced from an error location polynomial generation circuit by applying the error location polynomial to a parallel root search circuit that searches for roots of the error location polynomial in a parallel manner.
  • 23. The method of claim 22 wherein applying further comprises applying the error location polynomial to a pair of root search circuits.
  • 24. The method of claim 22 wherein applying applies the error location polynomial to a plurality of search circuits.
  • 25. The method of claim 23 further comprising alternatively selecting from the first and the second search circuits and the plurality of search circuits to provide error locations for the error location polynomial.
  • 26. A decoder to decode block error correction codes, comprising:parallel search circuits that evaluate a single error location polynomial to find roots that correspond to error locations of data from a storage medium; wherein a first root σ(α−(2j)) of the single error location polynomial comprises: σ(α−(2j))=regk(0)(j)+regk−1(0)(j)+ . . . +reg1(0)(j)+reg0(0)(j); and wherein a second root σ(α−(2j+1)) of the single error location polynomial comprises: σ(α−(2j+1))=regk(1)(j)+regk−1(1)(j)+ . . . +reg1(1)(j)+reg0(1)(j) with values for reg comprising data register values.
  • 27. The decoder of claim 26, further comprising:circuitry to obtain an error location from the roots.
  • 28. The decoder of claim 27, wherein the parallel search circuits and the circuitry implement the following equation:σ(α−(n−2j−l))=regk(l)(j)+regk−1(l)(j)+ . . . +reg1(l)(j)+reg0(l)(j) for 0≦j≦(n−1)/2, where l=0 and 1.
  • 29. The decoder of claim 26, wherein the parallel search circuits comprise first and second search circuits.
  • 30. A decoder comprising:a generator circuit that generates an error location polynomial from error correcting code symbols that correspond to data on a storage medium; parallel search circuits that search for roots of the error location polynomial; and an evaluator circuit that evaluates errors in the error correcting code symbols at locations that correspond to the roots.
  • 31. The decoder of claim 30, wherein each of the parallel search circuits includes a storage device for holding a first initial value and a multiplier device for multiplying contents of the storage device by a coefficient.
  • 32. The decoder of claim 31, wherein the parallel search circuits iteratively find roots of the error location polynomial starting from an initial iteration of the error location polynomial using the initial value to a last iteration of the error location polynomial.
  • 33. The decoder of claim 32, wherein a first root σ(α−(2j)) of the single error location polynomial comprises:σ(α−(2j))=regk(0)(j)+regk−1(0)(j)+ . . . +reg1(0)(j)+reg0(0)(j); and wherein a second root σ(α−(2j+1)) of the single error location polynomial comprises: σ(α−(2j))=regk(0)(j)+regk−1(0)(j)+ . . . +reg1(0)(j)+reg0(0)(j); and with values for reg comprising data register values.
  • 34. The decoder of claim 32, wherein the parallel search circuits and other circuitry implement the following equation:σ(α−(n−2j−l))=regk(l)(j)+regk−1(l)(j)+ . . . +reg1(l)(j)+reg0(l)(j) for 0≦j≦(n−1)/2, where l=0 and 1.
  • 35. The decoder of claim 30, wherein the parallel search circuits iteratively find roots of the error location polynomial starting from a last iteration of the error location polynomial to a first iteration of the error location polynomial.
  • 36. A method comprising:generating an error location polynomial from error correcting code symbols that correspond to data on a storage medium; searching for roots of the error location polynomial via parallel search circuits; and evaluating errors in the error correcting code symbols at locations that correspond to the roots.
  • 37. The method of claim 36, further comprising:storing, in each of the parallel search circuits, a first initial value; and multiplying contents of the storage device by a coefficient.
  • 38. The method of claim 37, wherein the parallel search circuits iteratively find roots of the error location polynomial starting from an initial iteration of the error location polynomial using the initial value to a last iteration of the error location polynomial.
  • 39. The method of claim 38, wherein a first root σ(α−(2j)) of the single error location polynomial comprises: σ(α−(2j))=regk(0)(j)+regk−1(0)(j)+ . . . +reg1(0)(j)+reg0(0)(j); andwherein a second root σ(α−(2j+1)) of the single error location polynomial comprises: σ(α−(2j+1))=regk(l)(j)+regk−1(l)(j)+ . . . +reg1(l)(j)+reg0(l)(j), with values for reg comprising data register values.
  • 40. The method of claim 38, wherein the parallel search circuits and other circuitry implement the following equation:σ(α−(n−2j−l))=regk(l)(j)+regk−1(l)(j)+ . . . +reg1(l)(j)+reg0(l)(j) for 0≦j≦(n−1)/2, where l=0 and 1.
  • 41. The method of claim 36, wherein the parallel search circuits iteratively find roots of the error location polynomial starting from a last iteration of the error location polynomial to a first iteration of the error location polynomial.
  • 42. A method comprising:searching for roots of a single error location polynomial using parallel search circuits.
  • 43. The method of claim 42, wherein:a first root σ(α−(2j)) of the single error location polynomial comprises: σ(α−(2j))=regk(0)(j)+regk−1(0)(j)+ . . . +reg1(0)(j)+reg0(0)(j); and a second root σ(α−(2j+1)) of the single error location polynomial comprises: σ(α−(2j+1))=regk(1)(j)+regk−1(1)(j)+ . . . +reg1(1)(j)+reg0(1)(j), with values for reg comprising data register values.
  • 44. The method of claim 42, further comprising obtaining an error location from the roots.
  • 45. The method of claim 42, wherein searching is performed using a Chien search algorithm.
  • 46. The method of claim 42, wherein the parallel search circuits and additional circuitry implement the following equation:σ(α−(n−2j−l))=regk(l)(j)+regk−1(l)(j)+ . . . +reg1(l)(j)+reg0(l)(j) for 0≦j≦(n−1)/2, where l=0 and 1.
  • 47. The method of claim 42, further comprising:storing, in each of the parallel search circuits, a first initial value; and multiplying contents of the storage device by a coefficient.
  • 48. The method of claim 47, wherein the parallel search circuits iteratively find roots of the error location polynomial starting from an initial iteration of the error location polynomial using the initial value to a last iteration of the error location polynomial.
  • 49. The method of claim 42, further comprising:selecting an output of one of the parallel search circuits for different cycles of a decoder.
  • 50. The method of claim 42, wherein the parallel search circuits comprise U-way parallel search circuits.
  • 51. A decoder for decoding block error correction codes, comprising:means for evaluating a single error location polynomial to find a first root corresponding to an error location; means for evaluating the single error location polynomial to find a second root corresponding to an error location; and means for producing an error location of the root based on the first and second roots.
  • 52. The decoder of claim 51, wherein each of the means for evaluating comprise, respectively, a storage device for holding a first initial value and a multiplier device for multiplying the contents of the storage device by a coefficient.
  • 53. The decoder of claim 52, wherein the means for evaluating comprise a first search circuit and a second search circuit.
  • 54. The decoder of claim 53, further comprising an initializer circuit to produce initial values for the storage devices of the first search circuit and the second search circuit.
  • 55. The decoder of claim 54, wherein the initializer circuit produces the initial values for the second search circuit from the initial values of the first search circuit.
  • 56. The decoder of claim 51, wherein the means for evaluating iteratively find roots of an error location polynomial starting from an initial iteration of the polynomial to the last iteration of the polynomial.
  • 57. The decoder of claim 51, wherein the first root σ(α−(2j)) of the error location polynomial comprises:σ(α−(2j))=regk(0)(j)+regk−1(0)(j)+ . . . +reg1(0)(j)+reg0(0)(j); and wherein the second root σ(α−(2j+1)) of the error location polynomial comprises: σ(α−(2j+1))=regk(1)(j)+regk−1(1)(j)+ . . . +reg1(1)(j)+reg0(1)(j), with values for reg comprising data register values.
  • 58. The decoder of claim 57, wherein the means for evaluating and means for producing implement the following equation:σ(α−(n−2j−l))=regk(l)(j)+regk−1(l)(j)+ . . . +reg1(l)(j)+reg0(l)(j) for 0≦j≦(n−1)/2, where l=0 and 1.
  • 59. A decoder comprising:means for generating a single error location polynomial from error correcting code symbols that correspond to data on a storage medium; means for searching for roots of the single error location polynomial via parallel search circuits; and means for evaluating errors in the error correcting code symbols at locations that correspond to the roots.
  • 60. The decoder of claim 59, further comprising:means for storing, in each of the parallel search circuits, a first initial value; and means for multiplying contents of the storage device by a coefficient; wherein the parallel search circuits iteratively find roots of the error location polynomial starting from an initial iteration of the error location polynomial using the initial value to a last iteration of the error location polynomials.
US Referenced Citations (12)
Number Name Date Kind
4833678 Cohen May 1989 A
4866716 Weng Sep 1989 A
5309449 Gandini et al. May 1994 A
5321704 Erickson et al. Jun 1994 A
5666369 Nakamura Sep 1997 A
5771244 Reed et al. Jun 1998 A
5974582 Ly Oct 1999 A
5983389 Shimizu Nov 1999 A
6031875 Im Feb 2000 A
6061826 Thirumoorthy et al. May 2000 A
6192497 Yang et al. Feb 2001 B1
6279137 Poeppelman et al. Aug 2001 B1
Foreign Referenced Citations (1)
Number Date Country
403121627 May 1991 JP
Non-Patent Literature Citations (3)
Entry
Ganapathy et al. << Designing a scalable processor array for recurrent computations; IEEE Transactions on Parallel and Distributed Systems, , vol. 8, Issue: 8, p. 840-856, Aug. 1997.*
Richard E. Blahut “A Theory and Practice of Error Control Codes”, Addison-Wesley Publishing Company, Reading MA, table of contents, chapter 4 pp. 65-92, chapter 5, pp. 93-129, and chapter 8 pp. 207-247, 1983.
Peterson et al., “A Error-Correcting Codes, Second Edition”, MIT Press, Cambridge, MA, London, England, table of contents, chapter 4 76-115, chapter 6 pp. 144-169, chapter 7 pp. 170-205, and chapter 8 pp. 206-265, 1972.