Compact chien-search based decoding apparatus and method

Description

FIELD OF THE INVENTION

The present invention relates to a compact Chien based decoding apparatus and method.

BACKGROUND OF THE INVENTION

The term “Chien search” is used herein to refer to any typically iterative method or apparatus for determining roots of polynomials defined over a finite field. The term is also used herein to refer to any method or apparatus used for finding the roots of error-locator polynomials encountered in decoding, e.g., Reed-Solomon codes and BCH codes in various applications including but not limited to flash memory and other data storage applications, and data communications applications.

The error locator polynomial (denoted Λ) has the following format:

Λ(x)=Λ₀+Λ₁*x+Λ₂*x²+ . . . +Λ_t*x^t (Equation 1)

The Chien search includes evaluating the error locator polynomial for multiple elements of a Galois field GF(2^m) over which the error locator polynomial is defined. The elements are powers of the primitive element in the field, alpha (α).

Accordingly, the Chien search includes evaluating the error locator polynomial for various powers of alpha, by setting powers of alphas in equation 1 the following sets of equations are obtained:

Λ(α)=Λ₀+Λ₁*α+Λ₂*α²+ . . . +Λ_t*α^t
Λ(α²)=Λ₀+Λ₁*α²+Λ₂*α⁴+ . . . +Λ_t*α^2t
Λ(α^m)=Λ₀+Λ₁*α^m+Λ₂*α^2m+ . . . +Λ_t*α^mt

The different powers of α are all elements in a finite field (such as a Galois field) over which the error locator polynomial is defined. Any power of alpha for which the above error locator polynomial is zero, is termed a root. These roots provide an indication about the location of the error in the received or read data. In other words, if αⁿis a root of the error locator polynomial then if binary BCH code is being used, an error has occurred in bit n of the data being read or received. In BCH, each error is a flipped bit. In Reed-Solomon, each error is a symbol in which at least one bit is wrong.

The evaluation of the error locator polynomial can be implemented in an iterative manner by a hardware circuit 10 that is illustrated in FIG. 1. Hardware circuit 10 includes: (i) a group of registers 12(1)-12(t) that are initially fed with the coefficients (Λ₁, Λ₂. . . Λ_t) of the error locator polynomial, (ii) a group of Galois multipliers 14(1)-14(t) that multiply a previous content of registers 12(1)-12(t) by various powers of alpha (α, α², . . . α^t) to provide preliminary results that are written to the registers and are also provided to an adder, (iii) a Galois adder 16 that adds the preliminary results to provide a Chien search result. During each iteration a previous content of the k'th register is multiplied by α^k. A content of the k'th register is denoted λ_k, the m'th bit of that register is denoted λ_k,m. If the Chien search result equals to minus one (or plus one for a binary field) then a root is found. (It is noted that if the Chien search result equals to zero than a root is found, when considering Λ₀which always equals to 1.

The evaluation of the error locator polynomial can also be evaluated in parallel by a hardware circuit 20 that is illustrated in FIG. 2A. Hardware circuit 20 includes: (i) a group of registers 12(1)-12(t) that are initially fed with the coefficients (Λ₁, Λ₂. . . ,Λ_t), (ii) multiple groups of Galois multipliers 14(1,1) . . . 14(1,t) . . . 14(p,1) . . . 14(p,t) that multiply a previous content of registers 12(1)-12(t) by various powers of alpha (α, α₂, . . . α^t)to provide preliminary results that are provided to Galois adders, wherein Galois multipliers of different groups of Galois multipliers can receive different powers of alpha; wherein the preliminary results of one group of Galois multipliers are written to registers 12(1)-12(t), (iii) a group of Galois adders 16(1)-16(p)—each group of Galois multipliers is connected to a dedicated Galois adder that provides a Chien search result. Accordingly, hardware circuit 20 provides p Chien search results per iteration. The parallel hardware that is described in FIG. 2A can be also implemented in a variant way, as described in FIG. 2B. In this parallel architecture all the multipliers 14(1,1) . . . 14(p,1) are all connected to the same register 12(1). In the same way all the multipliers 14(1,t) . . . 14(p,t) are all connected to the same register 12(t).

It is noted that elements of a Galois field GF(pⁿ) can be represented as polynomials of degree strictly less than n over GF(p). Operations are then performed modulo R where R is an irreducible polynomial of degree n over GF(p), for instance using polynomial long division.

The constant multipliers 14(1,1) . . . 14(p,1) includes a modulo R operation (R is an irreducible polynomial of degree n over GF(p)).

Referring back to the examples set forth in FIG. 1, FIG. 2A and FIG. 2B, the Galois multipliers and Galois adders include many logic gates. The number of gates in Galois multipliers and Galois adders can be responsive to the number of bits n in the variables that are being added to each other or multiplied with each other. The number of gates in Galois multipliers, and specifically in constant multipliers (multipliers that one of the multiplicand is a constant) can be responsive to the irreducible polynomial. In addition, the number of gates in Galois constant multipliers can be responsive to the number of set bits (‘1’) in the powers of a as well as their location.

For example, an adder that adds two n-bit numbers in the Galois field is about 2-bit XOR gates. Even more gates are required to implement Galois adder 16 that adds J n-bit numbers. Another example is that constant multiplier which its constant multiplicand is 101010101010101 (15 bits) consume much more gates than a constant multiplier which its constant multiplicand is 000000000001111 (15 bits). The second constant multiplicand has less set bits (1), and the sets bits are located in the LSB (Least Significant Bit).

Yet for another example, FIG. 3 illustrates an area consumed by sixty six groups of four Galois constant multipliers each, wherein each Galois constant multiplier performs a multiplication between two n-bits number in the Galois field. Graph 20 illustrates the number of set bits in coefficients (α, α², . . . , α^t), the x-axis represents the power of alphas, and graph 30 illustrates the area consumed by the Galois multipliers. It is apparent that there is a correlation between the number of set bits in the coefficients (α, α², . . . , α^t) and the area consumed by the respective Galois multiplier.

There is a growing need to provide a compact Chien search based decoding apparatus and method.

SUMMARY OF EMBODIMENTS OF THE INVENTION

BCH and RS (Reed-Solomon) are among the most widely used cyclic error correcting codes. They are used in various practical fields such as storage and communication. When these coding schemes are used in mobile applications, power consumption is a major design constraint which sometimes even affects the actual viability of the applicability of the schemes to the mobile applications.

At least the decoding functionality of the above codes may typically employ a Chien search. An objective of certain embodiments of the present invention is to provide low power and low area Chien search apparatus with no impact on its performance (throughput or latency). This apparatus may be useful in a variety of applications, including, for example, mobile applications, memory applications including flash memory applications, and other suitable applications.

An apparatus according to embodiments of the present invention is provided having Chien search capabilities and including a first hardware circuit and a second hardware circuit. The first hardware circuit evaluates an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined to provide a first set of intermediate results and a first Chien search result and provides the first set of intermediate results to the second hardware circuit. The second hardware circuit evaluates the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results. The first hardware circuit may be different from the second hardware circuit. For example, the first hardware circuit may be substantially larger (consume more area) than the second hardware circuit. The first and second hardware circuits can be tailored to evaluate different elements—the first element may differ from the second element.

The first hardware circuit may include a mask and add unit to sum unmasked bits representative of preliminary results obtained during an evaluation of the error locator polynomial thereby to provide the first set of intermediate results.

The first hardware circuit may include a shift and add unit to shift the first set of intermediate results by different shift factors thereby to provide shifted results and to add the shifted results to provide a first shifted sum.

The first hardware circuit may include a modulo circuit that may perform modulo operation on the first shifter sum thereby to provide the first Chien search result.

The second hardware circuit may include a squaring circuit to square the first set of intermediate results thereby to provide a second set of intermediate results.

The second hardware circuit may include a shift and add unit to shift the second set of intermediate results by different shift factors thereby to provide shifted results and to add the shifted results thereby to provide a second shifted sum.

The second hardware circuit may include a modulo circuit to perform a modulo operation on the second shifted sum thereby to provide the second Chien search result.

The apparatus according to embodiments of the present invention may include more than two hardware circuits. For example, the apparatus may include a third hardware circuit to evaluate the error locator polynomial for a third element of the finite field thereby to provide a third Chien search result in response to a second set of intermediate results generated by the second hardware circuit. It will be recognized that in some embodiments of the invention, the first hardware circuit may be different from the third hardware circuit. For example, the first hardware circuit may be substantially larger than the third hardware circuit; and wherein the third element differs from the second element and from the first element.

It will be recognized that each error locator polynomial evaluates the error locator polynomial for a different element of the finite field, an apparatus according to embodiments of the invention may include multiple hardware circuits, wherein each of the multiple hardware circuits performs a modulo operation only at a modulo circuit that provides a Chien search result. In some embodiments of the invention, each of these hardware circuits may include a mask and add unit to sum unmasked bits representative of preliminary results obtained during an evaluation of the error location polynomial.

The apparatus according to embodiments of the invention may include a recovery circuit to recover errors in response to Chien search results.

The apparatus according to embodiments of the invention may include a flash memory that stores data encoded in accordance with a Reed-Solomon decoding algorithm and wherein the stored data is Reed-Solomon decoded by a decoder that comprises at least the first and second hardware circuits.

The apparatus according to embodiments of the invention may include a flash memory to store data encoded in accordance with a BCH encoding algorithm and a BCH decoder.

A method according to embodiments of the present invention for Chien search is provided. According to some embodiments of the invention, the method may include evaluating, by a first hardware circuit an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined to provide a first set of intermediate results and a first Chien search result; providing the first set of intermediate results to a second hardware circuit; and evaluating, by the second hardware circuit, the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results, wherein the first hardware circuit may be substantially larger than the second hardware circuit and wherein the first element differs from the second element.

The method according to embodiments of the invention may include masking bits representative of preliminary results obtained during an evaluation of the error location polynomial; and summing unmasked bits representative of the preliminary results to provide the first set of intermediate results.

The method according to embodiments of the invention may include shifting the first set of intermediate results by different shift factors to provide shifted results; and adding the shifted results to provide a first shifted sum.

The method according to embodiments of the invention may include performing a modulo operation on the first shifted sum to provide the first Chien search result.

The method according to embodiments of the invention may include squaring the first set of intermediate results to provide a second set of intermediate results.

The method according to embodiments of the invention may include shifting the second set of intermediate results by different shift factors to provide shifted results; and adding the shifted results to provide a second shifted sum.

The method according to embodiments of the invention may include performing a modulo operation on the second shifted sum to provide the second Chien search result.

The method according to embodiments of the invention may include evaluating, by a third hardware circuit, the error locator polynomial for a third element of the finite field to provide a third Chien search result in response to a second set of intermediate results that is generated by the second hardware circuit; wherein the first hardware circuit is substantially larger than the third hardware circuit; and wherein the third element differs from the second element and from the first element.

The method according to embodiments of the invention may include evaluating the error locator polynomial for different elements of the finite field; wherein each evaluation comprises applying a modulo operation only at a last stage of the evaluating.

The method according to embodiments of the invention may include recovering errors in response to Chien search results.

The method according to embodiments of the invention may include retrieving data stored in a flash memory and performing Reed-Solomon decoding.

The method according to embodiments of the invention may comprising retrieving data stored in a flash memory and performing BCH decoding

BRIEF DESCRIPTION OF THE DRAWINGS

Certain embodiments of the present invention are illustrated in the following drawings:

FIG. 1 is a functional block diagram illustration of an “in series” prior art circuit;

FIG. 2A and FIG. 2B are functional block diagram illustrations of “in parallel” prior art circuits;

FIG. 3 illustrates area consumed by prior art Galois multipliers;

FIG. 4 is a simplified functional block diagram of a system using a compact Chien search, the system being constructed and operative in accordance with certain embodiments of the present invention;

FIG. 5 is a simplified functional block diagram of a decoder of FIG. 4, which uses a compact Chien search, which is constructed and operative in accordance with certain embodiments of the present invention;

FIG. 6A is a simplified functional block diagram of flash memory apparatus that includes, e.g. in an internal microcontroller, the encoding/decoding system of FIG. 4 and particularly the decoder of FIG. 5, all operative in accordance with certain embodiments of the present invention;

FIG. 6B illustrates a portion of an error location polynomial and a compact Chien searcher according to an embodiment of the invention;

FIG. 7. is a simplified functional block diagram of a compact Chien searcher according to an embodiment of the invention;

FIG. 8. is a simplified functional block diagram of hardware circuits of the compact Chien searcher of FIG. 7 according to an embodiment of the invention;

FIG. 9. is a simplified functional block diagram of hardware circuits of the compact Chien searcher of FIG. 7 according to an embodiment of the invention;

FIG. 10. is a simplified functional block diagram of hardware circuits of the compact Chien searcher of FIG. 7 according to an embodiment of the invention; and

FIG. 11. is a flow chart of a method for compact Chien search according to an embodiment of the invention.

DETAILED DESCRIPTION OF THE DRAWINGS

In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the invention. However, it will be understood by those skilled in the art that the present invention may be practiced without these specific details. In other instances, well-known methods, procedures, and components have not been described in detail so as not to obscure the present invention.

Reference is now made to FIG. 4 which is a simplified functional block diagram of an encoding/decoding system that includes a compact Chien searcher in accordance with certain embodiments of the present invention.

In FIG. 4, message source 115 provides a message m(x) which it may be desired to transmit or to store, e.g. in flash memory, to Error Correction Coding (ECC) encoder 110. ECC encoder 110 may include BCH or Reed-Solomon cyclic error correction coding apparatus and is typically operative for computing and for adding, to the message m(x), redundancy bits, thereby to generate a codeword c(x) of a known codebook such as BCH or Reed-Solomon with known parameters. Channel 120, which may include any medium through which the message is conveyed from ECC encoder 110 to ECC decoder 130. Channel 120 adds errors e(x) to the codeword c(x). ECC encoder 110 can be included in a transmitter while ECC decoder 130 is included in a receiver.

The errors may stem from various physical processes such as thermal noise, deterioration of storage medium over time and, especially after many read/write operations, inaccuracies in the transmitter or receiver hardware. Each error occurs at a particular location within the message, which is assumed to comprise a sequence of bits or of symbols. In the former case, binary BCH code is typically used for encoding and decoding, whereas in the latter case, non-binary BCH code, or RS code is used. In the first, binary, instance, n is used in the foregoing discussion to indicate a bit of the data being read or received in which an error has occurred. In the second, non-binary, instance, n is used in the foregoing discussion to indicate a symbol of the data being read or received in which an error has occurred.

The received data r(x) equals the following: r(x)=c(x)+e(x). Received data r(x) is typically received by an error correcting decoder 130, also termed herein the “receiver”. ECC decoder 130, using the redundancy that was added to the message and the known codebook, is operative to substantially reconstruct the original message m(x) and convey it to the intended target, message sink 140. According to certain embodiments of the present invention, the ECC decoder 130 includes a compact Chien searcher.

Reference is now made to FIG. 5 which is a simplified functional block diagram of ECC decoder 130 of FIG. 4. As shown, the ECC decoder 130 includes a compact Chien searcher 220 and is constructed and operative in accordance with certain embodiments of the present invention.

The ECC encoder 110 can be described in terms of a generation matrix G, thus the encoding process performed by ECC encoder 110 includes a matrix multiplication c=mG. As described above, c is the transmitted codeword and m is the message to be transmitted or, for data storage applications, the data to be stored. The ECC decoder 130 of FIG. 4 is operative to perform syndrome computation (functionality 200 in FIG. 5), such that there exists a parity check matrix H which has the following property: GH^T=0. It follows that cH^T=mGH^T=0 (formula IV). As described above, the received vector r comprises the transmitted codeword c and the errors added in the channel 120 i.e. r=c+e. The ECC decoder (which in flash memory applications, may be implemented within microcontroller 244 of FIG. 2) computes the syndrome vector s using the parity check matrix. Specifically (formula V):

s=rH^T=cH^T+eH^T=mGH^T+eH^T=0+eH^T=eH^T, or in short s=eH^T.

ECC 130 can generate an Error Locator Polynomial (functionality 210 in FIG. 5). Due to the special form of the BCH and RS codes and of the parity check matrix H the set of equations s=eH^Tmay be solved directly by exhaustive search in the decoder 130 to find the error vector e and correctly decode the received message r(x), however, the exhaustive search is computationally unattractive. Therefore, typically an Error Locator Polynomial (ELP) is introduced, the roots of which correspond to a one to one mapping of the error locations as described above and as is known in the art.

Once the error locator polynomial has been generated by functionality 210, compact Chien searcher 220 that has Error Locator Polynomial evaluation functionality evaluates the Error Locator Polynomial for all the elements of the field over which the Error Locator Polynomial is defined. The elements in the field that zero the error locator polynomial are the error locations. Computations are typically performed in the GF(q^m) field which is a finite field. The evaluation of the Error Locator Polynomial includes searching the roots of the Error Locator Polynomial.

Error correction unit 230 corrects errors in response to the roots of the error locator polynomial that were found by compact Chien searcher 220.

FIG. 6A is a simplified functional block diagram of a flash memory apparatus comprising, e.g. in an internal microcontroller 244, the encoding/decoding system of FIG. 4 and particularly the decoder of FIG. 5, all operative in accordance with certain embodiments of the present invention. As shown, the flash memory apparatus of FIG. 6A typically interacts with a host 240 and typically includes the microcontroller 244 as well as one or more erase sectors 246 each comprising one or more pages 248 each including cells 249. The microcontroller 244 effects erasing of, writing on and reading from the erase sector/s 246, by suitably controlling erasing circuitry 250, writing circuitry 252 and reading circuitry 254, respectively. According to certain embodiments of the present invention, microcontroller 244 includes an error correction code decoder operative to receive data from the reading circuitry 254, to decode the data, including performing a compact Chien search for error locations, and to provide the data thus decoded to the host 240 which therefore constitutes both source 100 and sink 140 of FIG. 4, in memory applications.

In flash memory applications, the channel 120 generally represents the deterioration in the data stored in memory over time and due to repeated cycling, and the encoding and decoding (functionalities 110 and 130 in FIG. 4) are performed within one or more suitable controllers e.g. the microcontroller 244 of FIG. 6 which is external to the flash memory device 245 or an external controller operatively associated with the host 240 and external to device 245.

Microcontroller 244 can include (or otherwise has the functionality of) compact Chien searcher 220. Compact Chien searcher 220 can be characterized by at least one of the following characteristics or a combination thereof: (i) utilizing dependencies between intermediate results generated during different evaluations of the error locator polynomial—generating sets of intermediate results by hardware circuits and utilizing these intermediate results by smaller hardware circuits; (ii) performing modulo operations at the end of the Chien Search; (iii) replacing addition and/or multiplication operation by masking operations and shifting operations.

FIG. 7. is a simplified functional block diagram of a compact Chien searcher 220 according to an embodiment of the invention.

Compact Chien searcher 220 is illustrated for a case in which p=8 (eight Chien searches are provided per cycle) t=66 and the Galois field is GF(2¹⁵).

Compact Chien searcher 220 includes a set of registers 12(1)-12(t). This set of registers includes sixty six registers, each fifteen bit long, that are initially fed with the elements of the error location polynomial (ELP) output from error location polynomial calculation unit 210. FIG. 6B illustrates registers 12(1)-12(t) that are connected to error location polynomial calculation unit 210 via switches 17(1)-17(t), each switch configured to provide to a register the output of error location polynomial calculation unit 210 or an initial value. The registers provide their output to multipliers 14(1)-14(t), that multiply the output of the registers by different powers of α⁸, thus multiplier 14(1) multiples the output of register 12(1) by α⁸and multiplier 14(t) multiples the output of register 12(t) by α^8t.

Compact Chien searcher 220 also includes eight hardware circuits 710, 720, 730, 740, 750, 760, 770 and 780—each provides one Chien search value by evaluating the error locator polynomial for a single element.

Hardware circuit 710 calculates r(1), hardware circuit 720 calculates r(2), 730 calculates r(3), 740 calculates r(4), 750 calculates r(5), 760 calculates r(6), 770 calculates r(7) and 780 calculates r(8).

Hardware circuit 710 is referred to as a first hardware circuit. It includes mask and add unit 810, shift and add unit 820 and modulo unit 830.

Hardware circuit 720 is referred to as a second hardware circuit. Each of hardware circuits 720, 740 and 780 includes squaring unit 840, shift and add unit 820 and modulo unit 830.

Hardware circuit 740 is referred to as a third hardware circuit.

Each of the hardware circuits 730, 750, 760 and 770 includes inner summing unit 850, constant multiplier unit 860, outer summation unit 870, modulo unit 830 and constant multiplier unit 880.

A set of intermediate results calculated by mask and add unit 810 of hardware circuit 710 is provided to squaring unit 840 of hardware circuit 720. A set of intermediate results calculated by squaring unit 840 of hardware circuit 720 is provided to squaring unit 840 of hardware circuit 740. A set of intermediate results calculated by squaring unit 840 of hardware circuit 740 is provided to squaring unit 840 of hardware circuit 780.

It is noted that the intermediate results calculated by mask and add unit 810 of hardware circuit 710 can be provided to hardware circuit 740 but in this case the squaring module of these hardware circuits will be required to perform more than a single squaring operation. The same applies to a provision of the set of intermediate results calculated by squaring unit 840 of hardware circuit 720 to squaring unit 840 of hardware circuit 780.

First hardware circuit 710 is bigger than second and third hardware circuits 720 and 740 as the mask and add unit 810 consumes more area than squaring unit 840.

The over all size of hardware circuits 710-780 is smaller than the size of a prior art circuit (as illustrated in FIG. 2A or 2B) in the combination of units 14(j,1) . . . 14(j,t) and 16(j) for some j) due to: (i): using intermediate results of R(1) when calculating R(2), R(4), R(8) (hardware sharing) (ii): Applying only one modulo operation on the sum of p products (instead of sum of p modulo operation of the products), (iii): The calculation is separated to an inner sum that is followed by a multiplication by a constant, and an outer sum in R(3), R(5), R(6) and R(7) calculation. Each of these hardware circuits (referring to (iii)) includes an inner summation unit 850, a constant multiplier unit 860, an outer summation module 870, modulo circuit 830 and can also include a constant multiplier 880.

The following mathematical description illustrates how the size reduction can be achieved.

Λ(αⁱ) or Λ(α^8k+i) (where k is some non-negative integer) is denoted by r(i). The compact Chien search includes evaluating the error locator polynomial for each value of i (each power of alpha) it can be re-written as follows:

$\begin{matrix} r (i) = \sum_{j = 0}^{t} λ_{j} α^{ij} \\ = 1 + \sum_{j = 0}^{t} λ_{j} α^{ij} \\ = 1 + \sum_{j = 1}^{t} (\sum_{m = 0}^{14} (λ_{j, m} X^{m} α^{ij}) \mod P (X)) \\ = 1 + \sum_{m = 0}^{14} (\sum_{j = 1}^{t} λ_{j, m} α^{ij}) X^{m} \mod P (X) \\ = 1 + \sum_{m = 0}^{14} V_{i, m} X^{m} \mod P (X) \\ = 1 + \mod P (X) [\sum_{m = 0}^{14} V_{i, m} X^{m}] \end{matrix}$

Where

$V_{i, m} = \sum_{j = 1}^{t} λ_{j, m} α^{ij}$

and λ_j,mis the m'th bit of the content λ_iof the j'th register. α^ijis a constant that is calculated ahead of time.

Different hardware circuits can be designed for different elements.

Consider the case of i=1. In this case r(1)=1+Σ_m=0¹⁴V_1,mX_mmod P(X); where

$V_{1, m} = \sum_{j = 1}^{t} λ_{j, m} α^{j}$

The calculation of r(1)—which evaluates if alpha is a root of the error locator polynomial can be divided into three stages: (i) calculation of V_1,mto provide a set of intermediate results; (ii) calculating

$\sum_{m = 0}^{14} V_{i, m} X^{m}$

and (iii) performing a modulo operation.

The calculation of V_1,mcan be performed by masking and summation operation, as λ_j,mis one bit long. If λ_j,mis zero (‘0’) α^jis masked and if λ_j,mis set (‘1’) α^jis not masked and can be added to other unmasked powers of α. Accordingly the masking does not require gate count at all, and the summation requires an adder that include XOR gates depending on the number of set bits in α^j.

The calculation of V_1,mcan be calculated by mask and add unit 810 of FIG. 8. Mask and add unit 810 sums unmasked bits representative of preliminary results obtained during an evaluation of the error locator polynomial to provide the first set of intermediate results. The preliminary results are stored in a group of registers.

Mask and add unit 810 includes fifteen masking units and adding circuits denoted 810(1)-810(15). Each masking unit (also referred to as multiplier) receives α, α², . . . , α¹⁵and a set of masking bits. The m'th masking unit (810(m)) receives α, α², . . . , α¹⁵, multiplies the i'th power of alpha (i ranges between 1 and 15) by the m'th bit of the i'th registers, and add the results of these multiplications. The multiplication by λ_i,mis equivalent to a masking operation.

For example, masking unit 810(1) calculates

$V_{1, 0} = \sum_{j = 0}^{14} λ_{j, 0} * α^{j}$

—by multiplying the different powers of alpha by the least significant bits of different registers and then adding the unmasked bits.

Yet for another example, masking unit 810(14) calculates

$V_{1, 14} = \sum_{j = 0}^{14} λ_{j, 14} * α^{j}$

—by multiplying the different powers of alpha by the most significant bits of different registers and then adding the unmasked bits.

The calculation of V_i,m*X^mcan be performed by performing shift operations—and especially by performing m shifts of V_i,m. Calculating

$\sum_{m = 0}^{14} V_{i, m} X^{m}$

requires a sequence of shift operations (by different shift factors) and a summation. The shift operation does not require gate count at all. The summation requires adders that include XOR gates depending on the overlapping between V_i,m*X^m.

The calculation of ΣV_i,mX^mcan be performed by shift and add unit 820 of FIG. 8. Shift and add unit 820 shifts the first set of intermediate results by different shift factors (the shift factor m has values that range between zero and fourteen) to provide shifted results and adds the shifted results to provide a first shifted sum. Shift and add unit 820 includes fifteen shifters 820(1)-820(15)—each shifts V_i,mby a shift factor and also includes an adder 821 that adds the shifted results of shifters 820(1)-820(15).

The modulo operation can be executed by any prior art modulo operation circuit. For example, a 29 bit number can be concerted by a 29 bits number by applying a modulo operation that involves performing XOR operations between constant vectors xⁱmodulo p(x), depending on whether in the original value the bit corresponding to xⁱwas 1 or 0.

The evaluation of the error locator polynomial for elements that equal α^qwhere q is bigger than one and is a power of two (q=2^k) can utilize intermediate results calculated by a hardware circuit that calculates the error locator polynomial for an element that equals 2^k−1. In other words—a hardware circuit that calculates r(2^k) can utilize intermediate results generated by another hardware circuit that calculates r(2^k−1). This is also true for the case of r(f×2^k) and r(f×2^k−1).

This is illustrated by the following example:

$\begin{matrix} V_{2, m} = \sum_{j - 1}^{t} λ_{j, m} α^{2 j} \\ = \sum_{j - 1}^{t} {(λ_{j, m} α^{j})}^{2} \\ = {(\sum_{j - 1}^{t} (λ_{j, m} α^{j}))}^{2} \\ = V_{1, m^{2}} \end{matrix}$

Thus: V_2,m=V_1,m²; V_4,m=V_2,m²and V_8,m=V_4,m²

Therefore, r(2) can be calculated by:

$r (2) = 1 + \sum_{m = 0}^{14} V_{2, m} X^{m} \mod P (X) = 1 + \sum_{m = 0}^{14} {V_{1, m}}^{2} X^{m} \mod P (X)$

The intermediate results can be squared by a squaring module. Squaring modules are known in the art and are quite simple and require relatively small number of gates—for example only 7 XOR gates in GF(2¹⁵) where the field is defined by the polynomial P(X)=X^15+X+1.

FIG. 9 illustrates a second hardware circuit 720 according to an embodiment of the invention.

Second hardware circuit 720 includes squaring unit 840, shift and add unit 820 and modulo unit 830.

Squaring unit 840 includes fifteen squaring circuits 840(1)-840(15), each squares a single intermediate result provided by a corresponding masking unit and adding circuit of mask and add unit 810.

According to yet another embodiment of the invention the evaluation of an error locator polynomial for elements that differ from a power of two can be executed by a compact hardware unit that includes an inner summing unit 850, constant multiplier unit 860, outer summation unit 870, modulo unit 830 and constant multiplier 880.

This can be explained by re-writing the error locator polynomial as follows:

$r (i) = \sum_{j = 0}^{t} λ_{j} α^{ij} = \sum_{r = 0}^{⌈ \frac{t}{s} ⌉} (\sum_{j = 0}^{s - 1} λ_{j + sr} * α^{ij}) α^{irs}$

$r (i) - 1 + \sum_{j = 1}^{t} λ_{j} α^{ij} - 1 + \sum_{k = 0}^{t - 1} λ_{k + 1} α^{i (k + 1)} - 1 + \sum_{r = 0}^{⌈ \frac{t - 1}{s} ⌉} (\sum_{j = 0}^{s - 1} λ_{(j + 1) + sr} * α^{ij}) α^{i (rs + 1)}$

$r (i) = 1 + (\sum_{r = 0}^{⌈ \frac{t - 1}{s} ⌉} (\sum_{j = 0}^{s - 1} λ_{(j + 1) + sr} * α^{ij}) α^{irs}) * α^{i}$

Inner summing unit 880 may operate by using the same technique used to calculate r(1) but being responsive to only s elements of λ. It calculates the following expression:

$\sum_{j = 0}^{s - 1} λ_{j + 1 + sr} * α^{ij} .$

This configuration performs a majority of calculations with constants that have smaller number of ones (in relation to the prior art constants) and hence require less area in the implementation.

Constant multiplier unit 860 and outer summation unit 870 do not perform a modulo operation and calculate the following expression:

$\sum_{r = 0}^{⌈ \frac{t - 1}{s} ⌉} (\sum_{j = 0}^{s - 1} λ_{(j + 1) + sr} * α^{ij}) α^{irs}$

Modulo unit 830 performs modulo operation to provide an intermediate modulo result.

Constant multiplier 880 multiples the intermediate modulo result by a power of alpha (i) that is responsive to the index of the element for which the error locator polynomial is evaluated.

By implementing the re-written equation, a much simpler and compact constant multiplier can be used.

FIG. 10 illustrates hardware circuit 730 according to an embodiment of the invention.

Hardware circuit 730 includes inner summation unit 850, constant multiplier unit 860, outer summation module 870, modulo circuit 830 and can also include a constant multiplier 880.

Inner summation unit 850 includes multiple inner summation units 850(1)-850(11). The outputs of these units is fed to multiple constant multipliers 860(1)-860(11) that multiply these outputs by a constant without performing modulo operation to provide multiple results. The multiple results are fed to outer summation circuit 870 that sums the multiple results to provide another result that is fed to modulo circuit 830. The output of module circuit can be fed to constant multiplier 880 that multiplies the output of modulo unit 830 by α^r. For example, in hardware circuit 740—that calculated ELP(r=3) the constant multiplier 880 multiples the output of modulo unit 830 by α³.

FIG. 11 illustrates method 1100 for a compact Chien search according to an embodiment of the invention. The compact Chien search provides Chien search results and evaluates the Chien search results. The evaluation may involve determining which Chien Search result is indicative of a root of the error location polynomial.

Method 1100 can include stage 1110

Stage 1110 includes evaluating, by a first hardware circuit an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined to provide a first set of intermediate results and a first Chien search result.

Stage 1120 includes providing the first set of intermediate results to a second hardware circuit. Stage 1120 follows the generation of the first set of intermediate results by the first hardware circuit but can be executed before stage 1110 ends by a provision of the first Chien search result.

Stage 1120 is followed by stage 1130 of evaluating, by the second hardware circuit, the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results. The first hardware circuit is substantially bigger than the second hardware circuit and wherein the first element differs from the second element.

Stage 1110 can includes either one of stages 1112, 1114, 1116 or a combination thereof.

Stage 1112 includes masking bits representative of preliminary results obtained during an evaluation of the error location polynomial and summing unmasked bits representative of the preliminary results to provide the first set of intermediate results.

Stage 1114 includes shifting the first set of intermediate results by different shift factors to provide shifted results and adding the shifted results to provide a first shifted sum.

Stage 1116 includes performing modulo operation on the first shifted sum to provide the first Chien search result.

Stage 1130 can includes either one of stages 1132, 1134, 1136 or a combination thereof.

Stage 1132 includes squaring the first set of intermediate results to provide a second set of intermediate results.

Stage 1134 includes shifting the second set of intermediate results by different shift factors to provide shifted results and adding the shifted results to provide a second shifted sum.

Stage 1136 includes performing modulo operation on the second shifted sum to provide the second Chien search result.

Method 1100 can also include stage 1150.

Stage 1150 includes providing the second set of intermediate results to a third hardware circuit. Stage 1150 follows the generation of the second set of intermediate results by the second hardware circuit but can be executed before stage 1130 ends by a provision of the second Chien search result.

Stage 1150 is followed by stage 1160 of evaluating, by a third hardware circuit, the error locator polynomial for a third element of the finite field to provide a third Chien search. The first hardware circuit is substantially bigger than the third hardware circuit and the third element differs from the second element and from the first element. Referring to the example set fourth in previous figures, a second set of intermediate results from hardware circuit 720 can be fed to hardware circuit 740.

Method 1100 can include evaluating the error locator polynomial for different elements of the finite field, wherein each evaluation comprises applying modulo operation only at a last stage of the evaluating. Referring to the example set fourth in previous figures, each hardware circuit out of 710, 720, 730, 740, 750, 760, 770 and 780 performs the modulo operation only at its last stage.

Either one of stages can be followed by stage 1180 of recovering errors in response to Chien search results.

Method 1100 can include performing the Chien Search to detect errors in encoded data stored in a flash memory, wherein the data is encoded in accordance with a Reed-Solomon.

Method 1100 can include performing the Chien Search to detect errors in encoded data stored in a flash memory; wherein the data is encoded in accordance with a BCH algorithm.

Certain operations are described herein as occurring in the microcontroller internal to a flash memory device. Such description is intended to include operations which may be performed by hardware which may be associated with the microcontroller such as peripheral hardware on a chip on which the microcontroller may reside. It is also appreciated that some or all of these operations, in any embodiment, may alternatively be performed by the external, host-flash memory device interface controller including operations which may be performed by hardware which may be associated with the interface controller such as peripheral hardware on a chip on which the interface controller may reside. Finally it is appreciated that the internal and external controllers may each physically reside on a single hardware device, or alternatively on several operatively associated hardware devices.

Any data described as being stored at a specific location in memory may alternatively be stored elsewhere, in conjunction with an indication of the location in memory with which the data is associated. For example, instead of storing page- or erase-sector-specific information within a specific page or erase sector, the same may be stored within the flash memory device's internal microcontroller or within a microcontroller interfacing between the flash memory device and the host, and an indication may be stored of the specific page or erase sector associated with the cells.

It is appreciated that the teachings of the present invention can, for example, be implemented by suitably modifying, or interfacing externally with, flash controlling apparatus. The flash controlling apparatus controls a flash memory array and may comprise either a controller external to the flash array or a microcontroller on board the flash array or otherwise incorporated therewithin. Examples of flash memory arrays include Samsung's K9XXG08UXM series, Hynix's HY27UK08BGFM Series, Micron's MT29F64G08TAAWP or other arrays such as but not limited to NOR or phase change memory. Examples of controllers which are external to the flash array they control include STMicroelectrocincs's ST7265x microcontroller family, STMicroelectrocincs's ST72681 microcontroller, and SMSC's USB97C242, Traspan Technologies' TS-4811, Chipsbank CBM2090/CBM1190. Examples of commercial IP software for Flash file systems are: Denali's Spectra™ NAND Flash File System, Aarsan's NAND Flash Controller IP Core and Arasan's NAND Flash File System. It is appreciated that the flash controller apparatus need not be NAND-type and can alternatively, for example, be NOR-type or phase change memory-type.

Flash controlling apparatus, whether external or internal to the controlled flash array, typically includes the following components: a Memory Management/File system, a NAND interface (or other flash memory array interface), a Host Interface (USB, SD or other), error correction circuitry (ECC) typically comprising an Encoder and matching decoder, and a control system managing all of the above.

The present invention may for example interface with or modify, as per any of the embodiments described herein, one, some or all of the above components and particularly with the ECC component.

It is appreciated that software components of the present invention including programs and data may, if desired, be implemented in ROM (read only memory) form including CD-ROMs, EPROMs and EEPROMs, or may be stored in any other suitable computer-readable medium such as but not limited to disks of various kinds, cards of various kinds and RAMs. Components described herein as software may, alternatively, be implemented wholly or partly in hardware, if desired, using conventional techniques.

Included in the scope of the present invention, inter alia, are electromagnetic signals carrying computer-readable instructions for performing any or all of the steps of any of the methods shown and described herein, in any suitable order; machine-readable instructions for performing any or all of the steps of any of the methods shown and described herein, in any suitable order; program storage devices readable by machine, tangibly embodying a program of instructions executable by the machine to perform any or all of the steps of any of the methods shown and described herein, in any suitable order; a computer program product comprising a computer useable medium having computer readable program code having embodied therein, and/or including computer readable program code for performing, any or all of the steps of any of the methods shown and described herein, in any suitable order; any technical effects brought about by any or all of the steps of any of the methods shown and described herein, when performed in any suitable order; any suitable apparatus or device or combination of such, programmed to perform, alone or in combination, any or all of the steps of any of the methods shown and described herein, in any suitable order; information storage devices or physical records, such as disks or hard drives, causing a computer or other device to be configured so as to carry out any or all of the steps of any of the methods shown and described herein, in any suitable order; a program pre-stored e.g. in memory or on an information network such as the Internet, before or after being downloaded, which embodies any or all of the steps of any of the methods shown and described herein, in any suitable order, and the method of uploading or downloading such, and a system including server/s and/or client/s for using such; and hardware which performs any or all of the steps of any of the methods shown and described herein, in any suitable order, either alone or in conjunction with software.

Features of the present invention which are described in the context of separate embodiments may also be provided in combination in a single embodiment. Conversely, features of the invention, including method steps, which are described for brevity in the context of a single embodiment or in a certain order may be provided separately or in any suitable subcombination or in a different order. “e.g.” is used herein in the sense of a specific example which is not intended to be limiting.

Claims

1. An apparatus that has Chien search capabilities, the apparatus comprising: a first hardware circuit to evaluate an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined, and to provide a first set of intermediate results and a first Chien search result; and a second hardware circuit, wherein the first hardware circuit is to provide the first set of intermediate results to the second hardware circuit, and wherein the second hardware circuit is to evaluate the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; wherein the first hardware circuit comprises a mask and add unit to sum unmasked bits representative of preliminary results obtained during an evaluation of the error locator polynomial to provide the first set of intermediate results.
2. An apparatus that has Chien search capabilities, the apparatus comprising: a first hardware circuit to evaluate an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined, and to provide a first set of intermediate results and a first Chien search result; and a second hardware circuit, wherein the first hardware circuit is to provide the first set of intermediate results to the second hardware circuit, and wherein the second hardware circuit is to evaluate the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; wherein the first hardware circuit comprises a shift and add unit to shift the first set of intermediate results by different shift factors to provide shifted results and adds the shifted results to provide a first shifted sum.
3. The apparatus according to claim 2, wherein the first hardware circuit comprises a modulo circuit to perform a modulo operation on the first shifter sum to provide the first Chien search result.
4. The apparatus according to claim 2, wherein the first hardware circuit further comprises a mask and add unit to sum unmasked bits representative of preliminary results obtained during an evaluation of the error locator polynomial to provide the first set of intermediate results.
5. An apparatus that has Chien search capabilities, the apparatus comprising: a first hardware circuit to evaluate an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined, and to provide a first set of intermediate results and a first Chien search result; and a second hardware circuit, wherein the first hardware circuit is to provide the first set of intermediate results to the second hardware circuit, and wherein the second hardware circuit is to evaluate the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; wherein the second hardware circuit comprises a squaring circuit to square the first set of intermediate results to provide a second set of intermediate results.
6. The apparatus according to claim 5, wherein the second hardware circuit comprises a shift and add unit to shift the second set of intermediate results by different shift factors to provide shifted results and adds the shifted results to provide a second shifted sum.
7. The apparatus according to claim 6, wherein the second hardware circuit comprises a modulo circuit to perform a modulo operation on the second shifted sum to provide the second Chien search result.
8. An apparatus that has Chien search capabilities, the apparatus comprising: a first hardware circuit to evaluate an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined, and to provide a first set of intermediate results and a first Chien search result; and a second hardware circuit, wherein the first hardware circuit is to provide the first set of intermediate results to the second hardware circuit, and wherein the second hardware circuit is to evaluate the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; comprising a third hardware circuit to evaluate the error locator polynomial for a third element of the finite field to provide a third Chien search result in response to a second set of intermediate results generated by the second hardware circuit.
9. An apparatus that has Chien search capabilities, the apparatus comprising multiple hardware circuits, the multiple hardware circuits comprise: a first hardware circuit to evaluate an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined, and to provide a first set of intermediate results and a first Chien search result; and a second hardware circuit, wherein the first hardware circuit is to provide the first set of intermediate results to the second hardware circuit, and wherein the second hardware circuit is to evaluate the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; wherein each hardware circuit of the multiple hardware circuits evaluates the error locator polynomial for a different element of the finite field, wherein each of the multiple hardware circuits is to perform a modulo operation only at a modulo circuit that provides a Chien search result.
10. The apparatus according to claim 9, wherein each hardware circuit comprises a mask and add unit to sum unmasked bits representative of preliminary results obtained during an evaluation of the error location polynomial.
11. A method for providing Chien search results comprising: evaluating, by a first hardware circuit, an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined to provide a first set of intermediate results and a first Chien search result; providing the first set of intermediate results to a second hardware circuit; evaluating, by the second hardware circuit, the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; masking bits representative of preliminary results obtained during an evaluation of the error location polynomial; and summing unmasked bits representative of the preliminary results to provide the first set of intermediate results.
12. A method for providing Chien search results comprising: evaluating, by a first hardware circuit, an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined to provide a first set of intermediate results and a first Chien search result; providing the first set of intermediate results to a second hardware circuit; evaluating, by the second hardware circuit, the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; shifting the first set of intermediate results by different shift factors to provide shifted results; and adding the shifted results to provide a first shifted sum.
13. The method according to claim 12, comprising performing a modulo operation on the first shifted sum to provide the first Chien search result.
14. The method according to claim 12, comprising: masking bits representative of preliminary results obtained during an evaluation of the error location polynomial; and summing unmasked bits representative of the preliminary results to provide the first set of intermediate results.
15. A method for providing Chien search results comprising: evaluating, by a first hardware circuit an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined to provide a first set of intermediate results and a first Chien search result; providing the first set of intermediate results to a second hardware circuit; evaluating, by the second hardware circuit, the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; and squaring the first set of intermediate results to provide a second set of intermediate results.
16. The method according to claim 15, comprising: shifting the second set of intermediate results by different shift factors to provide shifted results; and adding the shifted results to provide a second shifted sum.
17. The method according to claim 16, comprising performing a modulo operation on the second shifted sum to provide the second Chien search result.
18. A method for providing Chien search results comprising: evaluating, by a first hardware circuit, an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined to provide a first set of intermediate results and a first Chien search result; providing the first set of intermediate results to a second hardware circuit; evaluating, by the second hardware circuit, the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; and evaluating, by a third hardware circuit, the error locator polynomial for a third element of the finite field to provide a third Chien search result in response to a second set of intermediate results generated by the second hardware circuit.
19. A method for providing Chien search results comprising: evaluating, by a first hardware circuit, an error locator polynomial for a first element of a finite field over which the error locator polynomial is defined to provide a first set of intermediate results and a first Chien search result; providing the first set of intermediate results to a second hardware circuit; evaluating, by the second hardware circuit, the error locator polynomial for a second element of the finite field to provide a second Chien search result in response to the first set of intermediate results; and evaluating the error locator polynomial for different elements of the finite field, wherein each evaluation comprises applying modulo operation only at a last stage of the evaluating.
20. The method according to claim 19, comprising: masking bits representative of preliminary results obtained during an evaluation of the error location polynomial; and summing unmasked bits representative of the preliminary results to provide the first set of intermediate results.

REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Patent Application No. 61/166,834, filed Apr. 6, 2009, the entire contents of which are incorporated herein by reference.

US Referenced Citations (240)

Number	Name	Date	Kind
4463375	Macovski	Jul 1984	A
4584686	Fritze	Apr 1986	A
4589084	Fling et al.	May 1986	A
4866716	Weng	Sep 1989	A
5077737	Leger et al.	Dec 1991	A
5297153	Baggen et al.	Mar 1994	A
5657332	Auclair et al.	Aug 1997	A
5729490	Calligaro et al.	Mar 1998	A
5793774	Usui et al.	Aug 1998	A
5926409	Engh et al.	Jul 1999	A
5942005	Hassner et al.	Aug 1999	A
5956268	Lee	Sep 1999	A
5974582	Ly	Oct 1999	A
5982659	Irrinki et al.	Nov 1999	A
6038634	Ji et al.	Mar 2000	A
6094465	Stein et al.	Jul 2000	A
6119245	Hiratsuka	Sep 2000	A
6182261	Haller et al.	Jan 2001	B1
6192497	Yang et al.	Feb 2001	B1
6195287	Hirano	Feb 2001	B1
6199188	Shen et al.	Mar 2001	B1
6209114	Wolf et al.	Mar 2001	B1
6259627	Wong	Jul 2001	B1
6278633	Wong et al.	Aug 2001	B1
6279133	Vafai et al.	Aug 2001	B1
6301151	Engh et al.	Oct 2001	B1
6370061	Yachareni et al.	Apr 2002	B1
6374383	Weng	Apr 2002	B1
6504891	Chevallier	Jan 2003	B1
6532169	Mann et al.	Mar 2003	B1
6532556	Wong et al.	Mar 2003	B1
6553533	Demura et al.	Apr 2003	B2
6560747	Weng	May 2003	B1
6581180	Weng	Jun 2003	B1
6637002	Weng et al.	Oct 2003	B1
6639865	Kwon	Oct 2003	B2
6674665	Mann et al.	Jan 2004	B1
6704902	Shinbashi et al.	Mar 2004	B1
6751766	Guterman et al.	Jun 2004	B2
6772274	Estakhri	Aug 2004	B1
6781910	Smith	Aug 2004	B2
6792569	Cox et al.	Sep 2004	B2
6873543	Smith et al.	Mar 2005	B2
6891768	Smith et al.	May 2005	B2
6914809	Hilton et al.	Jul 2005	B2
6915477	Gollamudi et al.	Jul 2005	B2
6952365	Gonzalez et al.	Oct 2005	B2
6961890	Smith	Nov 2005	B2
6990012	Smith et al.	Jan 2006	B2
6996004	Fastow et al.	Feb 2006	B1
6999854	Roth	Feb 2006	B2
7010739	Feng et al.	Mar 2006	B1
7012835	Gonzalez et al.	Mar 2006	B2
7038950	Hamilton et al.	May 2006	B1
7068539	Guterman et al.	Jun 2006	B2
7079436	Perner et al.	Jul 2006	B2
7149950	Spencer et al.	Dec 2006	B2
7177977	Chen et al.	Feb 2007	B2
7191379	Adelmann et al.	Mar 2007	B2
7196946	Chen et al.	Mar 2007	B2
7203874	Roohparvar	Apr 2007	B2
7290203	Emma et al.	Oct 2007	B2
7292365	Knox	Nov 2007	B2
7301928	Nakabayashi et al.	Nov 2007	B2
7441067	Gorobets et al.	Oct 2008	B2
7447982	Thurston	Nov 2008	B1
7466575	Shalvi et al.	Dec 2008	B2
7533328	Alrod et al.	May 2009	B2
7558109	Brandman et al.	Jul 2009	B2
7593263	Sokolov et al.	Sep 2009	B2
7697326	Sommer et al.	Apr 2010	B2
7706182	Shalvi et al.	Apr 2010	B2
7804718	Kim	Sep 2010	B2
7805663	Brandman et al.	Sep 2010	B2
7805664	Yang et al.	Sep 2010	B1
7844877	Litsyn et al.	Nov 2010	B2
7961797	Yang et al.	Jun 2011	B1
8020073	Emma et al.	Sep 2011	B2
8122328	Liu et al.	Feb 2012	B2
20010037483	Sankaran et al.	Nov 2001	A1
20010052103	Hirofuji et al.	Dec 2001	A1
20020063774	Hillis et al.	May 2002	A1
20020085419	Kwon et al.	Jul 2002	A1
20020154769	Petersen et al.	Oct 2002	A1
20030065876	Lasser	Apr 2003	A1
20030101404	Zhao et al.	May 2003	A1
20030105620	Bowen	Jun 2003	A1
20030140303	Litwin et al.	Jul 2003	A1
20030192007	Miller et al.	Oct 2003	A1
20040015771	Lasser et al.	Jan 2004	A1
20040030971	Tanaka et al.	Feb 2004	A1
20040153722	Lee	Aug 2004	A1
20040153817	Norman et al.	Aug 2004	A1
20040181735	Xin	Sep 2004	A1
20050013165	Ban	Jan 2005	A1
20050018482	Cemea et al.	Jan 2005	A1
20050083735	Chen et al.	Apr 2005	A1
20050117401	Chen et al.	Jun 2005	A1
20050120265	Pline et al.	Jun 2005	A1
20050128811	Kato et al.	Jun 2005	A1
20050138533	Le-Bars et al.	Jun 2005	A1
20050144213	Simkins et al.	Jun 2005	A1
20050144368	Chung et al.	Jun 2005	A1
20050169057	Shibata et al.	Aug 2005	A1
20050172179	Brandenberger et al.	Aug 2005	A1
20050172208	Yoon	Aug 2005	A1
20050213393	Lasser	Sep 2005	A1
20060059406	Micheloni et al.	Mar 2006	A1
20060059409	Lee	Mar 2006	A1
20060064537	Oshima et al.	Mar 2006	A1
20060101193	Murin	May 2006	A1
20060203587	Li et al.	Sep 2006	A1
20060221692	Chen	Oct 2006	A1
20060248434	Radke et al.	Nov 2006	A1
20060268608	Noguchi et al.	Nov 2006	A1
20060294312	Walmsley	Dec 2006	A1
20070025157	Wan et al.	Feb 2007	A1
20070063180	Asano et al.	Mar 2007	A1
20070103992	Sakui et al.	May 2007	A1
20070104004	So et al.	May 2007	A1
20070109858	Conley et al.	May 2007	A1
20070124652	Litsyn et al.	May 2007	A1
20070143561	Gorobets	Jun 2007	A1
20070150694	Chang et al.	Jun 2007	A1
20070168625	Cornwell et al.	Jul 2007	A1
20070171714	Wu et al.	Jul 2007	A1
20070171730	Ramamoorthy et al.	Jul 2007	A1
20070180346	Murin	Aug 2007	A1
20070223277	Tanaka et al.	Sep 2007	A1
20070226582	Tang et al.	Sep 2007	A1
20070226592	Radke	Sep 2007	A1
20070228449	Takano et al.	Oct 2007	A1
20070253249	Kang et al.	Nov 2007	A1
20070253250	Shibata et al.	Nov 2007	A1
20070263439	Cornwell et al.	Nov 2007	A1
20070266291	Toda et al.	Nov 2007	A1
20070271494	Gorobets	Nov 2007	A1
20080010581	Alrod et al.	Jan 2008	A1
20080028014	Hilt et al.	Jan 2008	A1
20080055989	Lee et al.	Mar 2008	A1
20080082897	Brandman et al.	Apr 2008	A1
20080092026	Brandman et al.	Apr 2008	A1
20080104309	Cheon et al.	May 2008	A1
20080116509	Harari et al.	May 2008	A1
20080126686	Sokolov et al.	May 2008	A1
20080127104	Li et al.	May 2008	A1
20080128790	Jung	Jun 2008	A1
20080130341	Shalvi et al.	Jun 2008	A1
20080137413	Kong et al.	Jun 2008	A1
20080148115	Sokolov et al.	Jun 2008	A1
20080158958	Sokolov et al.	Jul 2008	A1
20080159059	Moyer	Jul 2008	A1
20080162079	Astigarraga et al.	Jul 2008	A1
20080168216	Lee	Jul 2008	A1
20080168320	Cassuto et al.	Jul 2008	A1
20080168335	Mead	Jul 2008	A1
20080181001	Shalvi	Jul 2008	A1
20080198650	Shalvi et al.	Aug 2008	A1
20080198652	Shalvi et al.	Aug 2008	A1
20080219050	Shalvi et al.	Sep 2008	A1
20080225599	Chae	Sep 2008	A1
20080263262	Sokolov et al.	Oct 2008	A1
20080282106	Shalvi et al.	Nov 2008	A1
20080285351	Shlick et al.	Nov 2008	A1
20080301532	Uchikawa et al.	Dec 2008	A1
20080320371	Hsu	Dec 2008	A1
20090024905	Shalvi et al.	Jan 2009	A1
20090043951	Shalvi et al.	Feb 2009	A1
20090072303	Prall et al.	Mar 2009	A9
20090091979	Shalvi	Apr 2009	A1
20090103358	Sommer et al.	Apr 2009	A1
20090106485	Anholt	Apr 2009	A1
20090113275	Chen et al.	Apr 2009	A1
20090125671	Flynn et al.	May 2009	A1
20090144600	Perlmutter et al.	Jun 2009	A1
20090150748	Egner et al.	Jun 2009	A1
20090157964	Kasorla et al.	Jun 2009	A1
20090158126	Perlmutter et al.	Jun 2009	A1
20090168524	Golov et al.	Jul 2009	A1
20090187803	Anholt et al.	Jul 2009	A1
20090199074	Sommer	Aug 2009	A1
20090213653	Perlmutter et al.	Aug 2009	A1
20090213654	Perlmutter et al.	Aug 2009	A1
20090228761	Perlmutter et al.	Sep 2009	A1
20090240872	Perlmutter et al.	Sep 2009	A1
20090292976	Kikuchi et al.	Nov 2009	A1
20100005270	Jiang	Jan 2010	A1
20100058146	Weingarten et al.	Mar 2010	A1
20100064096	Weingarten et al.	Mar 2010	A1
20100088557	Weingarten et al.	Apr 2010	A1
20100091535	Sommer et al.	Apr 2010	A1
20100095186	Weingarten	Apr 2010	A1
20100110787	Shalvi et al.	May 2010	A1
20100115376	Shalvi et al.	May 2010	A1
20100122113	Weingarten et al.	May 2010	A1
20100124088	Shalvi et al.	May 2010	A1
20100131580	Kanter et al.	May 2010	A1
20100131806	Weingarten et al.	May 2010	A1
20100131809	Katz	May 2010	A1
20100131826	Shalvi et al.	May 2010	A1
20100131827	Sokolov et al.	May 2010	A1
20100131831	Weingarten et al.	May 2010	A1
20100146191	Katz	Jun 2010	A1
20100146192	Weingarten et al.	Jun 2010	A1
20100149881	Lee et al.	Jun 2010	A1
20100180073	Weingarten et al.	Jul 2010	A1
20100199149	Weingarten et al.	Aug 2010	A1
20100199154	Wu et al.	Aug 2010	A1
20100211724	Weingarten	Aug 2010	A1
20100211833	Weingarten	Aug 2010	A1
20100211856	Weingarten	Aug 2010	A1
20100251066	Radke	Sep 2010	A1
20100253555	Weingarten et al.	Oct 2010	A1
20100257309	Barsky et al.	Oct 2010	A1
20100293321	Weingarten	Nov 2010	A1
20110047441	Yamaga	Feb 2011	A1
20110051521	Levy et al.	Mar 2011	A1
20110055461	Steiner et al.	Mar 2011	A1
20110096612	Steiner et al.	Apr 2011	A1
20110119562	Steiner et al.	May 2011	A1
20110153919	Sabbag	Jun 2011	A1
20110161775	Weingarten	Jun 2011	A1
20110214029	Steiner et al.	Sep 2011	A1
20110214039	Steiner et al.	Sep 2011	A1
20110246792	Weingarten	Oct 2011	A1
20110246852	Sabbag	Oct 2011	A1
20110252187	Segal et al.	Oct 2011	A1
20110252188	Weingarten	Oct 2011	A1
20110271043	Segal et al.	Nov 2011	A1
20110302428	Weingarten	Dec 2011	A1
20120001778	Steiner et al.	Jan 2012	A1
20120005554	Steiner et al.	Jan 2012	A1
20120005558	Steiner et al.	Jan 2012	A1
20120005560	Steiner et al.	Jan 2012	A1
20120008401	Katz et al.	Jan 2012	A1
20120008414	Katz et al.	Jan 2012	A1
20120051144	Weingarten et al.	Mar 2012	A1
20120063227	Weingarten et al.	Mar 2012	A1
20120066441	Weingarten	Mar 2012	A1
20120110250	Sabbag et al.	May 2012	A1

Non-Patent Literature Citations (38)

Entry
Fedorenko, S.V.; Trifonov, P.V.; , “Finding roots of polynomials over finite fields,” Communications, IEEE Transactions on , vol. 50, No. 11, pp. 1709-1711, Nov. 2002, doi: 10.1109/TCOMM.2002.805269.
Search Report of PCT Patent Application WO 2009/118720 A3.
Search Report of PCT Patent Application WO 2009/095902 A3.
Search Report of PCT Patent Application WO 2009/078006 A3.
Search Report of PCT Patent Application WO 2009/074979 A3.
Search Report of PCT Patent Application WO 2009/074978 A3.
Search Report of PCT Patent Application WO 2009/072105 A3.
Search Report of PCT Patent Application WO 2009/072104 A3.
Search Report of PCT Patent Application WO 2009/072103 A3.
Search Report of PCT Patent Application WO 2009/072102 A3.
Search Report of PCT Patent Application WO 2009/072101 A3.
Search Report of PCT Patent Application WO 2009/072100 A3.
Search Report of PCT Patent Application WO 2009/053963 A3.
Search Report of PCT Patent Application WO 2009/053962 A3.
Search Report of PCT Patent Application WO 2009/053961 A3.
Search Report of PCT Patent Application WO 2009/037697 A3.
Yani Chen, Kcshab K. Parhi, “Small Area Parallel Chien Search Architectures for Long BCH Codes”, Ieee Transactions on Very Large Scale Integration(VLSI) Systems, vol. 12, No. 5, May 2004.
Yuejian Wu, “Low Power Decoding of BCH Codes”, Nortel Networks, Ottawa, Ont., Canada, in Circuits and systems, 2004. ISCAS '04. Proceeding of the 2004 International Symposium on Circuits and Systems, published May 23-26, 2004, vol. 2, pp. II-369-II-372 vol. 2.
Michael Purser, “Introduction to Error Correcting Codes”, Artech House Inc., 1995.
Ron M. Roth, “Introduction to Coding Theory”, Cambridge University Press, 2006.
Akash Kumar, Serge! Sawitzki, “High-Throughput and Low Power Architectures for Reed Solomon Decoder”, (a.kumar at tue.nl, Eindhoven University of Technology and sergei.sawitzki at philips.com).
Todd K.Moon, “Error Correction Coding Mathematical Methods and Algorithms”, A John Wiley & Sons, Inc., 2005.
Richard E. Blahut, “Algebraic Codes for Data Transmission”, Cambridge University Press, 2003.
David Esseni, Bruno Ricco, “Trading-Off Programming Speed and Current Absorption in Flash Memories with the Ramped-Gate Programming Technique”, Ieee Transactions on Electron Devices, vol. 47, No. 4, Apr. 2000.
Giovanni Campardo, Rino Micheloni, David Novosel, “VLSI-Design of Non-Volatile Memories”, Springer Berlin Heidelberg New York, 2005.
John G. Proakis, “Digital Communications”, 3rd ed., New York: McGraw-Hill, 1995.
J.M. Portal, H. Aziza, D. Nee, “EEPROM Memory: Threshold Voltage Built in Self Diagnosis”, ITC International Test Conference, Paper 2.1.
J.M. Portal, H. Aziza, D. Nee, “EEPROM Diagnosis Based on Threshold Voltage Embedded Measurement”, Journal of Electronic Testing: Theory and Applications 21, 33-42, 2005.
G. Tao, A Scarpa, J. Dijkstra, W. Stidl, F. Kuper, “Data retention prediction for modern floating gate nonvolatile memories”, Microelectronics Reliability 40 (2000), 1561-1566.
T. Hirncno, N. Matsukawa, H. Hazama, K. Sakui, M. Oshikiri, K. Masuda, K. Kanda, Y. Itoh, J. Miyamoto, “A New Technique for Measuring Threshold Voltage Distribution in Flash EEPROM Devices”, Proc. IEEE 1995 Int. Conference on Microelectronics Test Structures, vol. 8, Mar. 1995.
Boaz Eitan, Guy Cohen, Assaf Shappir, Eli Lusky, Amichai Givant, Meir Janai, Ilan Bloom, Yan Polansky, Oleg Dadashev, Avi Lavan, Ran Sahar, Eduardo Maayan, “4-bit per Cell NROM Reliability”, Appears on the website of Saifun.com.
Paulo Cappelletti, Clara Golla, Piero Olivo, Enrico Zanoni, “Flash Memories”, Kluwer Academic Publishers, 1999.
JEDEC Standard, “Stress-Test-Driven Qualification of Integrated Circuits”, JEDEC Solid State Technology Association. JEDEC Standard No. 47F pp. 1-26.
Dempster, et al., “Maximum Likelihood from Incomplete Data via the EM Algorithm”, Journal of the Royal Statistical Society. Series B (Methodological), vol. 39, No. 1 (1997), pp. 1-38.
Mielke, et al., “Flash EEPROM Threshold Instabilities due to Charge Trapping During Program/Erase Cycling”, IEEE Transactions on Device and Materials Reliability, vol. 4, No. 3, Sep. 2004, pp. 335-344.
Daneshbeh, “Bit Serial Systolic Architectures for Multiplicative Inversion and Division over GF (2)”, A thesis presented to the University of Waterloo, Ontario, Canada, 2005, pp. 1-118.
Chen, Formulas for the solutions of Quadratic Equations over GF (2), IEEE Trans. Inform. Theory, vol. IT-28, No. 5, Sep. 1982, pp. 792-794.
Berlekamp et al., “On the Solution of Algebraic Equations over Finite Fields”, Inform. Cont. 10, Oct. 1967, pp. 553-564.

Related Publications (1)

	Number	Date	Country
	20100257433 A1	Oct 2010	US

Provisional Applications (1)

	Number	Date	Country
	61166834	Apr 2009	US

Compact chien-search based decoding apparatus and method

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

US Classifications

Field of Search

US

International Classifications

Term Extension