Embodiments described herein relate generally to a memory controller, a semiconductor storage device, and a memory control method.
The Chien search is the algorithm to find the roots of an error locator polynomial over a field by substituting every element of the field. A serial Chien search circuit searches the roots by substituting a single element per clock cycle. In order to speed up the searching time, a parallel structure of the Chien search is required. However, its parallelism increases the area of the circuit.
In order to make our embodiment clear, we divide a Chien search circuit into two function blocks: root shift block and substitution block. Root shift block of p-parallel Chien search transforms the current polynomial which roots are, for instance, {β1, β2} into another one which roots are {αp×β1, αp×β2}. Substitution block of p-parallel Chien search substitutes α0, α1, . . . , and αp-1 into a shifted polynomial which is an output from root shift block. When the parallel factor p increases, substitution block needs an additional circuit to substitute another field element, while root shift block does not need an additional circuit but requires only replacement of the shift factor. Therefore, as the parallel factor p increases, so the area of substitution block increases, while increase of the area of root shift block is much less than that of substitution block.
In general, according to one embodiment, a memory controller including a syndrome calculation unit which calculates syndrome based on code word which have the ability to correct t bits, an error locator polynomial calculation unit, and a Chien search unit. The Chien search unit includes a root shift block which shifts all roots, a division block which divides the output from the root shift block by a predetermined polynomial, of which the order is smaller than t, and substitution block which substitutes elements into the remainder polynomial to examine if they are the roots of the remainder, and wherein the predetermined polynomial has at least one root which value is the same as one of the substituted elements.
Hereinafter, a memory controller, a semiconductor storage device, and a memory control method according to the embodiments will be described in detail with reference to the attached drawings. In addition, the present invention is not limited to the embodiments.
The semiconductor memory unit 3 is a non-volatile memory which stores data in a non-volatile manner, for example, a NAND memory. In addition, herein, although an example where a NAND memory is used as the semiconductor memory unit 3 is described, any storage unit other than the NAND memory may be used as the semiconductor memory unit 3. In the NAND memory, a write unit (minimum write) generally referred to as a page exists, and writing is performed in units of a page.
The memory controller 2 controls writing of the semiconductor memory unit 3 according to a write command (request) from the host 4 and controls reading from the NAND memory 3 according to a read command (request) from the host 4. The memory controller 2 includes a Host I/F 21, a memory I/F 22, a control unit 23, and an ECC (Error correction code) unit 24. The ECC unit 24 includes an encoding unit 25 and a decoding unit 26. The Host I/F 21, the memory I/F 22, the control unit 23, the encoding unit 25, and the decoding unit 26 are connected to one another via an internal bus 20.
The Host I/F 21 performs a process according to an interface standard with respect to the host 4 to output instructions, user data, and the like received from the host 4 to the internal bus 20. In addition, the Host I/F 21 transmits user data read from the semiconductor memory unit 3, responses from the control unit 23, and the like to the host 4.
The memory I/F 22 controls a process of writing write data in the semiconductor memory unit 3 and a process of reading write data from the semiconductor memory unit 3 based on instructions of the control unit 23.
The control unit 23 collectively controls components of the semiconductor storage device 1. In the case where an instruction is received from the host 4 through the Host I/F 21, the control unit 23 performs control according to the instruction. For example, the control unit 23 instructs the memory I/F 22 to write a code word (user data and parity) in the semiconductor memory unit 3 and to read the code word from the semiconductor memory unit 3 according to an instruction from the host 4.
The encoding unit 25 encodes the user data transmitted to the internal bus 20 to generate the code word which consists of the user data and the parity. Although any code may be used as ECC, for example, a BCH code, or the like may be used. Hereinafter, an example where the BCH code is used will be described.
The decoding unit 26 decodes the word read from the semiconductor memory unit 3. Since the received data almost always contains error, it is sent to the decoding unit 26 from the semiconductor memory unit 3 through the memory I/F 22 to identify the error.
First, if the decoding process is started, the decoding control unit 264 instructs the syndrome calculation unit 261 to compute the value of syndrome (Step S1). Next, the decoding control unit 264 instructs the error locator polynomial calculation unit 262 to derive error locator polynomial from syndrome (Step S2).
Next, the decoding control unit 264 instructs the Chien search unit 263 to perform Chien search by using error locator polynomial (Step S3) to identify error positions, and the process is ended. The decoding control unit 264 performs error correction by flipping bits which are identified as erroneous ones by the Chien search.
Hereinbefore, in the Chien search of the decoding process described, the roots of an error locator polynomial are searched by sequentially substituting candidates for the root into the polynomial. A parallel Chien search circuit is required to realize high-throughput.
In
The root shift block 5 includes a selector 51-i, a register 52-i, and a multiplier 53-i for each term of the error locator polynomial, wherein i=1, . . . , t. At the initial step, the selector 51-i selects σi of the input error locator polynomial and stores the coefficient in the register 52-i.
In the first cycle, the current value σi stored in the register 52-i is output into each of the substitution circuits 6-1 to 6-p in substitution block. In addition, the selector 51-i selects an output of the multiplier 53-i in order to store the root shifted coefficient which is the current value σi multiplied by αip. Hereafter, the same process is performed in each cycle until all roots are found.
Herein, the substitution circuit 6-j (j=1, 2, . . . , p) is configured so as to substitute αp-j into the input polynomial, which is the output of root shift block 5 and its maximum degree is t. For example, the substitution circuit 6-1 includes t multipliers 61-1 to 61-t and an addition circuit 62 which performs addition (XOR) of all multiplication results and the constant term of input polynomial. Notice that this constant term is not explicitly illustrated in
In generalization of the substitution circuits 6-1 to 6-p, the substitution circuit 6-j includes t multipliers 6j-i (i=1, . . . , t) which multiply the coefficience of xi of input polynomial and α(p−1)i together and an addition circuit 62 which performs addition (XOR) on multiplication results output from the multipliers 6j-1 to 6j-t and the constant term of input polynomial. In the substitution circuit 6-p, since αp-p=α0 is substituted, the multipliers 6p-1 to 6p-t are unnecessary.
Next, the Chien search circuit according to the embodiment will be described.
The division operation according to the embodiment will be described. Divisor, quotient and remainder are denoted by m(x), q(x) and r(x), respectively. Dividend denoted by σ(x) is an output of the root shift block 5. In general, these polynomial obey the following relation.
σ(x)=m(x)·q(x)+r(x) (1)
In the case where the dividend σ(x) and the divisor m(x) have a common root αc, σ(x) and m(x) can be factored into the following expressions (2) and (3), respectively.
σ(x)=(x+αc)σ/αc(x) (2)
m(x)=(x+αc)m/αc(x) (3)
In these expressions, σ/αc(x) and m/αc(x) denote the remaining factors of σ(x) and m(x) after the factorization of the common root αc, respectively.
By substituting Equations (2) and (3) into Equation (1), Equation (1) turns into the following Equation (4).
r(x)=(x+αc){/αc(x)−m/αc(x)·q(x)} (4)
It can be understood from Equation (4) that the remainder polynomial r(x) also has the common root αc. Therefore, instead of examining if an element is a root of σ(x), we can examine if the element is a root of r(x) when divisor has the element as a root. Since degree of remainder is smaller than that of dividend, the area of substitution circuit for remainder is smaller than that of dividend.
The division operation according to the embodiment may be configured in an arbitrary manner. With respect to the method of selecting the divisor, the number of steps per cycle of long division, and the like, various examples can be considered. For example, the following Equation (5) may be used as a divisor.
m(x)=(x+α0)(x+α1)(x+α2) . . . (x+αp-1) (5)
The set of all roots of this divisor is equivalent to the set of all field elements substituted by substitution circuits 7-1 to 7-p. A divisor may not necessarily include all the roots. In the case, as described in the embodiment described below, substitution block consists of two types of substitution circuits. Input polynomial of some substitution circuits is dividend, that is, the output of root shift block 5, whereas that of the other substitution circuits is the remainder of the dividend.
Next, the number of cycle to finish the division will be described. If a circuit to perform all steps of long division in 1 cycle is implemented, the number of cycle to finish the division is 1. If a circuit to perform all steps of long division in 2 cycles is implemented, the number is 2. The circuit area of the division block 8 is inversely proportional to the number of cycle to finish the division.
For example, when the divisor is Equation (5), since the degree of the divisor is p, p field multiplication operations are needed for one step of division. When the degree of dividend is t, the number of all steps is t−p+1. In the case where the number of cycle to finish the division is 1, the number of multiplication needed for the division operation is p×(t−p+1), where p is the number of multiplication per step.
On the other hand, the number of multiplication in substitution circuit 7-1 is t′=p−1, because the input polynomial into substitution circuit in
However, if the number of cycle to finish the division is 2, the area of division block 8 is reduced to half, because the circuit performs half of all steps in every cycle. In this case, the net difference of multiplication between
In Step S13, in the case where all steps of long division are finished (Yes in Step S13), the remainder is input into the substitution circuits 7-1 to 7-p, and substitution process is performed by each of the substitution circuits 7-1 to 7-p (Step S15). The substitution circuit 7-j (j=1, 2, . . . , p) determines whether the root of the input polynomial (the remainder) is α(p−j) (Step S16). In the case where the input polynomial has the root (Yes in Step S16), 1 is added to a value e which indicates the number of the identified roots of the error locator polynomial (Step S17). Otherwise (No in Step S16), e does not change.
Next, the decoding control unit 264 (or Chien search control unit) determines whether the value e (number of identified roots) is equal to the error number obtained through the error locator polynomial calculation unit 262 (Step S18). In the case where the value e is equal to the error number (Yes in Step S18), the process is ended. Otherwise (No in Step S18), the root shift block 5 turns a current polynomial, which is stored in registers 52-1 to 52-t, into the shifted polynomial all of which roots are the current polynomial roots multiplied by α−p (Step S19), and the process turns to Step S11. After all roots of the error locator polynomial are identified, the error correction is performed.
In the case where the error number is 31 or less, the number of cycle to finish the division is 1. Hereinafter, since the number of elements checked in every one cycle is equal to the parallel factor, the throughput of the case is denoted by throughput 1.
In the case where the error number is 32 or more and 47 or less, this 16-parallel Chien search circuit checks 16 elements in every second cycle, and hence the throughput is ½.
In the case where the error number is 48 or more and 63 or less, this 16-parallel Chien search circuit checks 16 elements every third cycle, and hence the throughput is ⅓.
As described hereinbefore, since the throughput of parallel Chien search can be decreased according to the error number obtained through error locator polynomial calculation unit 262, the circuit area of the division block 8 is reduced, and hence that of parallel Chien search is reduced. The condition for the throughput is reasonable with respect to power consumption. Power consumption of parallel Chien search circuit is proportional to the error number, while the power consumption is limited in order to satisfy the requirement of power consumption of decoding unit 26. In the examples described with reference to
Next, a parallel Chien search method according to a second embodiment will be described. The configuration of the semiconductor storage device according to the embodiment is the same as that of the first embodiment except that divisor used in the division block 8 in the Chien search unit 263 is replaced. Hereinafter, difference from the first embodiment will be described.
In the first embodiment, the circuit area of the division block 8 is too large to reduce the circuit area of parallel Chien search if the number of cycle to finish the division is 1. Hence, in the first embodiment, the circuit area of the division block 8 is suppressed by decreasing the throughput according to the error number. In the case where the decreasing is not desired, the number of the multiplication operations can be reduced by using a different type of divisor from the previous one. For example, the divisor is a product of minimum polynomials.
In the case where the divisor is defined by using minimum polynomials, there is a case where roots corresponding to all the substitution circuits may not be included in the divisor. Specific examples will be described. In this example, error correction capability t is set to 80, parallel factor p is set to 16, and a code length is set to 214−1=16383 bit or less. For example, the following Equation (6) is used as a divisor.
m(x)=(x+α0)M1(x)M3(x)M5(x) (6)
M1(x) is a minimum polynomial including the root α and is expressed by the following Equation (7). M1(x) has all conjugates of α as roots, and hence M1(x) includes the roots α2, α4, and α8, which are the same as some of substitution elements in substitution block.
M1(x)=x14+x10+x6+x+1 (7)
M3(x) is a minimum polynomial including the root α3 and is expressed by the following Equation (8). M3(x) includes conjugate roots α6 and α12 which are the same as some of substitution elements which are from α0 to α15.
M3(x)=x14+x8+x6+x5+x2+x+1 (8)
M5(x) is a minimum polynomial including the root α5 and is expressed by the following Equation (9). M5(x) includes conjugate root α10.
M5(x)=x14+x10+x9+x7+x6+x4+x3+x+1 (9)
In the case where the divisor is Equation (6), some of its roots, which are α0, α, α2, α3, α4, α5, α6, α8, α10, and α12 (10 elements), are the same as some of substitution elements. Hence, an input of substitution circuits of these common elements (10 elements) is a remainder of division blocks, whereas an input of substitution circuits of the other elements is an output of root shift block 5. The divisor defined by using a minimum polynomial is not limited to the above examples, but the divisor may consist of another set of minimum polynomials.
In the case where the divisor is Equation 6), since the degree of the divisor is 43, the degree of the remainder polynomial is 42 or less, that is, substitution circuits 7 include 42 multiplication circuits. Therefore, the area of substitution circuits 7 is less than that of substitution circuits 6 which include 81 multiplication circuits.
Since coefficient of minimum polynomials is limited to 0 or 1, if the divisor consists of minimum polynomials, coefficient of the divisor is also limited to 0 or 1. Therefore, if the number of cycle to finish the division is 1, the circuit area of this 16-parallel Chien search with the divisor can be smaller than that of a general 16-parallel Chien search. It goes without saying that the number of cycle to finish the division can be more than 1 in this embodiment.
For example, in the case where an operating frequency is 200 MHz, the Chien search circuit including the division block 8a is implemented under the same condition as that of
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel embodiments described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the embodiments described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
This application is based upon and claims the benefit of priority from Provisional Patent Application No. 61/695,158, filed on Aug. 30, 2012; the entire contents of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
6131178 | Fujita et al. | Oct 2000 | A |
6571368 | Chen | May 2003 | B1 |
7020826 | Litwin et al. | Mar 2006 | B2 |
7249310 | Feng et al. | Jul 2007 | B1 |
7581155 | Lee et al. | Aug 2009 | B2 |
7890846 | Lee et al. | Feb 2011 | B2 |
8055977 | Ito | Nov 2011 | B2 |
8667367 | Khellah et al. | Mar 2014 | B2 |
Number | Date | Country |
---|---|---|
8-125547 | May 1996 | JP |
2001-44853 | Feb 2001 | JP |
2001-86005 | Mar 2001 | JP |
Number | Date | Country | |
---|---|---|---|
20140068392 A1 | Mar 2014 | US |
Number | Date | Country | |
---|---|---|---|
61695158 | Aug 2012 | US |