This application is based upon and claims the benefit of priority from the prior Japanese Patent Application No. 2010-273311, filed on Dec. 8, 2010, the entire contents of which are incorporated herein by reference.
The embodiment relates to a memory system and memory controller.
As the memory capable of storing mass data for use, variable resistance memories (ReRAM) and so forth have received attention because they can be easily formed in 3 dimensions. Even though any types of memory cells are used, however, a trump card, which requires no development of new process technologies for mass storage, is how to put many bits in a single memory cell by multi-leveling the state of the memory cell, of which examples can be found in NAND flash memories. If the memory cell is multi-leveled, though, the instability of setting the state causes failed read/write easily. This has a tradeoff with the increase in the number of physical quantity levels of the memory cell. In the case of the NAND flash memory, the number of physical quantity levels per cell is equal to 8 levels or 16 levels at most and a complicated association is caused between the data bit information and the physical quantity level.
A memory system according to the embodiment comprises a memory device including plural memory cells capable of storing d bits of data (d is an integer of 2 or more) in accordance with plural physical quantity levels and operative to read/write data at every page composed of specific bits in certain ones of the plural memory cells; and a memory controller operative to control the memory device. The memory controller includes a page buffer operative to hold page data to be read from/written in a page of the memory device and send/receive the page data to/from the memory device, a data processing unit operative to detect and correct an error in the page data by processing target data in a finite field Zp modulo p generated based on the page data (p is a prime that satisfies 2<p<2d), and a mapping unit operative to execute mapping of the target data from the data processing unit as page data within the page buffer.
Memory systems and memory controllers for use therein according to the embodiments will now be described below with reference to the drawings.
A memory system realizes a higher-density storage capacity by finely fabricating a cell array, forming the cell array and so forth in 3-dimensional structures, and thinking physical phenomena for use in memory cells. If the memory system is further required to have a higher density after completion of stable process steps, multi-leveling of memory cells is effective means.
In particular, as for NAND flash memories, multi-leveling proceeds promptly and 3-bit cells capable of storing 8 levels per cell become practical. Much higher multi-leveling of memory cells associates with a sharp deterioration of reliability, however, and a single-bit increase per cell suffers a 2 or more exponent increase of error rate at least. Thus, from the problem on the reliability and production yield, the progression of multi-leveling of memory cells has a difficult problem even though it is expected as means for realizing high-capacity file memories.
If the problem on multi-leveling of memory cells can be overcome to achieve effective use, NAND flash memories can be attained to have a higher-density storage capacity while utilizing stable process steps on NAND flash memories for longer terms.
Therefore, under the following gist, memory controllers using a Lee metric code and memory systems utilizing such the memory controllers are proposed. Hereinafter, the multi-level memory cell may also be referred to as a “MLC” (Multi Level Cell). In addition, the NAND flash memory utilizing the MLCs may also be referred to as a “MLC NAND flash memory”.
(1) An error in a MLC, in many cases, may be caused by writing to an adjacent physical quantity level (hereinafter simply referred to as a “level”) in the MLC and reading from an adjacent level in the MLC. Even in such the case, the assignment of binary bits to the conventional MLC may cause a larger error quantity. Therefore, the memory controller according to the embodiment described below simplifies the assignment of values to the MLC levels to improve the reliability. At the same time, it is made possible to utilize the presently existing MLC NAND flash memory as it is, thereby improving the productivity additionally.
(2) The ECC (Error Correcting Code) utilized in the conventional MLC NAND flash memory can correspond to every error. Therefore, it is possible to correct failed MLC-level identification that occurs at an extremely lower possibility, regarding it as an error in a bit contained in a page. This is to say, in other words, that the ECC excessively corresponds to the types of error occurrences. Therefore, it is intended to generally correspond to the error caused most between adjacent levels, thereby suppressing the excessive increase in check bit and constructing an efficient ECC.
(3) Under the above gist, and for the purpose of representing the MLC levels, it is intended to propose mapping of plural pages of binary data belonging to the same memory cell assigned to the p-adic elements in Zp.
First, an example of the MLC NAND flash memory including 16-level MLCs is used to describe the relation between multi-levels and binary data.
The use of 16-level MLCs allows storage of 4 bits of data per cell. The 4 bits of data may correspond to the MLC levels in various patterns as can be considered. The utilization of Gray code is considered here. Such the utilization of Gray code allows the MLC levels to correspond to the binary data such that the variation between adjacent levels just contains a variation in only 1 bit of 4 bits. The 4 bits of binary data can form different respective pages <0>-<3> as sets of data. If the MLC levels are numbered 0 to 15 in order from the lowermost level, the bits in each page corresponding to the MLC levels are as shown in
A bit p0 in page <0>, a bit p1 in page <1>, a bit p2 in page <2>, and a bit p3 in page <3> can be determined by respective binary functions P0, P1, P2, P3 of digits B0, B1, B2, B3 when the value L of the MLC level is represented by a binary number as shown in Expression 1.
L=B
020±B121+B222+B323
p0=P0(B0,B1,B2,B3)
p1=P1(B0,B1,B2,B3)
p2=P2(B0,B1,B2,B3)
p3=P3(B0,B1,B2,B3) [Expression 1]
In particular, Gary code is so coded that any one of p0, p1, p2 and p3 varies in response to the variation in L by 1. In general, any variation in L can be represented by the variation in a bit p0-p3 in each page. Therefore, if 1-bit error detection/correction is possible in each of 4 pages <0>-<3>, with regard to 1 cell, it is possible to deal errors on all level variations in the MLC. Further, if the variation in the MLC level is just 1, the use of Gray code makes it possible to correct the error by a correction of 1 bit to any one page. Therefore, if 1 bit can be corrected independently in each page, it is possible to correct errors in 4 independent MLCs.
Thus, the method of associating the MLC levels with the binary data using pages and then correcting a bit error at every page is extremely strong and is possible to deal any errors in the MLC.
In consideration of the fact that most of the errors arise between adjacent levels in the MLC, the error detecting and correcting method as described above excessively corresponds to errors and, by the amount, deals a larger amount of information that is not required usually in error detection/correction.
Next, the usually-not-required information is described.
This example corresponds to 16-level MLCs and therefore provides 512-byte MLCs with 4 series of page buffers. These 4 series of page buffers comprise 512-byte memory cells in total.
A consideration is given, for such the configuration, to a system that can be easily extended from the conventional ECC system and mounted on the memory system.
As 512 bytes include 212 bits, the ECC system using binary data utilizes a Galois field GF(213). In this case, check bits required for 1-bit error correction include 13 bits. The number of bits per page buffer required for t-bit error correction (t is an integer of 1 or more) is equal to the sum of 512 bytes of data and 13t bytes of check bits. Therefore, if 512 bytes of data can be dealt, the easily extendable number, t, of correctable bits is required to satisfy an inequality, 212+13t≦213. With such the configuration, and on the assumption that only the error between adjacent levels arises in the MLC, the use of Gray code makes it possible to correct 4t pieces of MLCs among (512 bytes+13t) pieces. The example shown in
The following description is given to the system efficiency on construction of a Lee metric code-utilizing ECC system using 16-level MLCs in comparison with that on construction of the convent ional binary-utilizing ECC system.
L: The number of MLC levels required for the ECC system. When a Lee metric code is utilized, the number of elements in the Lee metric code, that is, 13 is selected for use from 16 MLC levels as shown on the columns a-d in
p: A prime for use in a Lee metric code. When the Lee metric code is utilized, the prime comes to p=13 as shown on the columns a-d in
h: The number of bits required for representing L. When a Lee metric code is utilized, the number comes to h=4 because it is sufficient to represent just 16 levels of 0 to 15 as shown on the columns a-d in
M: The quantity of information data dealt in batch in ECC processing, which is represented by the number of bits. M varies in accordance with the later-described error-correctable quantity ε. When a Lee metric code is utilized, as the error-correctable quantity ε increases 1 by 1, for example, M decreases from 36 bits to 24 bits 4-bit by 4-bit as shown on the columns a-d in
ε: The error-correctable quantity. When a Lee metric code is utilized, it is possible to correct errors caused in 12 MLCs for use in code storage if they have the total sum of Lee metrics equal to ε or below. In the case of the error in the MLC between adjacent levels, that is, in the case of the error when the Lee metric is equal to 1, ε indicates the number of correctable cells of 12 MLCs. On the other hand, in the case of a binary system, 315 is selected as the maximum t that satisfies the inequality, 212+13t≦213, as shown on the column e, and the number of cells 4 times larger than that is made correctable among 4×512 bytes.
δ: The number of digits when M is regarded as sets of h bits. It is the number of digits when M is represented by a 2h-adic number. In the case of a binary system, it is equal to the number of cells for use in storage of information data contained in a sector as shown on the column e.
n: The number of cells required for storage of information data after it is coded. When a Lee metric code is utilized, the number comes to n=12 as shown on the columns a-d in
M/n: The number of information bits storable per MLC, that is, the quantity indicative of the number of effective bits in the code stored in the MLC. An increased error-correctable quantity ε makes the code redundant and accordingly decreases M/n. Note that, in the case of a binary system, a 4-bit MLC is used, as shown on the column e, while the need for providing check bits just allows storage of only 2 bits of information data.
log2L: The number of bits storable when the most of the utilized MLC levels are used. The brackets on the columns a-d in
code/M: The redundancy of the code contained in the ECC. It is a ratio between the number of bits in the code (=nh) and the number of bits in the quantity M of information data contained therein, which is the quantity indicative of how many times the number of bits in the code is larger than the number of bits in the quantity M of information data. Note that when a Lee metric code is utilized, code/M increases little by little as ε increases as shown on the columns a-d in
ε/n: A proportion between the number of cells required for storage of 1 code and the maximum number of error-correctable cells among those. The ε/n provides an index indicative of how many error-caused MLCs of the code storing MLCs are allowed to exist to restore the code. In the case of the binary system, the efficiency is poorer than that when a Lee metric code is utilized as shown on the column e in
(M/n)/log2L: The utilization efficiency of the MLC levels. It shows a proportion of those utilized as multi-levels when an ECC is incorporated. The brackets on the columns a-d in
The binary system is possible to correct any errors caused in 1 cell in the case of the use of 16-level MLCs and the ECC of 1-bit per page, and thus possible to correct errors in 4 cells at the maximum over the memory cells of the page length. A consideration is given to an ECC system capable of correcting errors in 3 cells of 12 cells corresponding to the use of a Lee metric code of ε=3. In this case, it is required to correct almost 512 bytes when the page length includes 2 k bytes, which correspond to 128 bytes of 512 bytes of MLCs such that BCH in GF(213) shown on the column e in
A specific example of the memory controller and memory system according to the present embodiment is described mainly assuming the ECC system shown on the column b in
The following description is given to the system efficiency on construction of a Lee metric code-utilizing ECC system using 8-level MLCs in comparison with that on construction of the conventional binary code-utilizing ECC system.
In the case of the 8-level MLC, the system utilizing the Lee metric code decreases its advantage compared to the 16-level MLC over the binary system though it can achieve the system efficiency more than that of the binary system if the error tolerance is required 15% or more.
L: When a Lee metric code is utilized, the number of elements in the Lee metric code, that is, 7 is selected for use from 8 MLC levels as shown on the columns a-c in
p: When the Lee metric code is utilized, the prime comes to p=7 as shown on the columns a-c in
h: When a Lee metric code is utilized, the number comes to h=3 because it is sufficient to represent just 8 levels of 0 to 7 as shown on the columns a-c in
M: When a Lee metric code is utilized, as the error-correctable quantity c increases 1 by 1, for example, M decreases from 9 bits to 3 bits 3-bit by 3-bit as shown on the columns a-c in
ε: When a Lee metric code is utilized, it is possible to correct errors caused in 6 MLCs for use in code storage if they have the total sum of Lee metrics equal to ε or below. In the case of the error in the MLC between adjacent levels, ε indicates the number of correctable cells of 6 MLCs. On the other hand, in the case of a binary system, 315 is selected as the maximum t that satisfies the inequality, 212+13t≦213, as shown on the column d in
δ: In the case of a binary system, it is equal to the number of cells for use in storage of information data contained in a sector as shown on the column d in
n: When a Lee metric code is utilized, the number comes to n=6 as shown on the columns a-c in
M/n: Note that, in the case of a binary system, a 3-bit MLC is used, as shown on the column d in
log2L: The brackets on the columns a-c in
code/M: Note that when a Lee metric code is utilized, code/M increases little by little as c increases as shown on the columns a-c in
ε/n: In the case of the binary system, the efficiency is poorer than that when a Lee metric code is utilized as shown on the column d in
(M/n)/log2L: The brackets on the columns a-c in
The binary system is possible to correct any errors caused in 1 cell in the case of the use of 8-level MLCs and the ECC of 1-bit per page, and thus possible to correct errors in 3 cells at the maximum over the memory cells of the page length. A consideration is given to an ECC system capable of correcting an error in 1 cell of 6 cells corresponding to the use of a Lee metric code of ε=1. In this case, it is required to correct almost 341 bytes when the page length includes 2 k bytes, which correspond to 85 bytes of 512 bytes of MLCs such that BCH in GF(213) shown on the column d in
As described with reference to
The memory system according to the present embodiment comprises a memory device 100, and a memory controller 200 operative to control the memory device 100.
The use of the memory controller 200 makes it possible to construct a memory system including a Lee metric code-utilizing ECC system and having a higher error correction efficiency, using the existing memory device 100. The following example uses a MLC NAND flash memory as the existing memory device 100.
Binary data input to the memory controller 200 is converted to the quantity in a finite field Zp modulo a prime p, which is the data processing system in the memory controller 200. The binary data is processed at every M bits in the memory controller 200. The binary data input to the memory controller 200 is first fed to a “binary to p-adic decode” circuit block 210. The “binary top-adic decode” circuit block 210 converts binary data to p-adic data.
The p-adic data is fed to a “Lee metric code ECC system” circuit block 220 (data processing unit). The “Lee metric code ECC system” circuit block 220 uses the numerals in Zp on digits of the p-adic data to generate a Lee metric code (target data).
The Lee metric code is fed to a “p-adic < > MLC binary mapping” circuit block 230 (mapping unit). The “p-adic < > MLC binary mapping” circuit block 230 associates the numerals in Zp on digits of the Lee metric code with the MLC levels for conversion so as to establish the correspondence between the Lee metrics in Zp and the distances between MLC levels. In accordance with the “p-adic < > MLC binary mapping” circuit block 230, the numerals in Zp are converted to the cell levels having bit components composed of page data in the MLC, and held in page buffers 240.
When 16-level MLCs are used, the page buffers 240 are required in 4 series. The page buffers 240 are required to reflect the relation between the MLC levels and the pages in the memory device 100. When these page buffers 240 are filled with pieces of data, these pieces of data are transferred from the controller 200 to the memory device 100. This data transfer makes it possible to store the numerals in Zp as the MLC levels in the memory device 100.
The memory device 100 programs the data transferred from the controller 200 in turn at every page. Such the operation of the memory device 100 is executed based on a command <2> supplied from a “control” circuit block 250 in the memory controller 200. The command <2> is generated at the “control” circuit block 250 in accordance with a command <1> fed to the controller 200.
On reading data from the memory system, the command <2> from the “control” circuit block 250 is applied first to the memory device 100 to readout therefrom a series of data capable of representing the levels of 1 MLC. The series of data are transferred to and held in the page buffer 240 in the memory controller 200 in turn from the memory device 100.
Subsequently, the data held in the page buffer 240 is processed in the “p-adic < > MLC binary mapping” circuit block 230. The “p-adic < > MLC binary mapping” circuit block 230 converts the data held in the page buffer 240 to the numeral in Zp, that is, the quantity representative of the MLC level.
Subsequently, the data represented by the numeral in Zp is fed to the “Lee metric code ECC system” circuit block 220. The “Lee metric code ECC system” circuit block 220 applies error correction to the data to restore the correct Lee metric code and then converts the Lee metric code to p-adic data.
Finally, the p-adic data is fed to a “p-adic to binary decode” circuit block 260. The “p-adic to binary decode” circuit block 260 restores the p-adic data to M bits of binary data and then provides them to the outside of the memory controller 260.
The following description is given to the block configuration seen from the flow of data processing at the data transfer system in the memory controller 200.
Hereinafter, the environment for processing data in binary such as the memory device 100 is referred to as a “binary world”. In contrast, the environment for processing data in numerals in Zp such as the memory controller 200 is referred to as a “p-adic Zp world”. In this case, the “binary world” may also be referred to as a “p-adic Z2 world”.
The memory controller 200 comprises a 13-adic converter unit 210′, an encoder unit 221, a “p-adic < > MLC binary mapping” circuit block 230, page buffers 240, a syndrome generator unit 222, a solution searching polynomial generator unit 223, a Euclidean iteration processing unit 224, a first Hasse differential polynomial generator unit 225, a second Hasse differential polynomial generator unit 226, a code restorer unit 227, a decoder unit 228, and a 24-adic converter unit 260′.
Among those, the 13-adic converter unit 210′ is contained in the “binary to p-adic decode” circuit block 210. The encoder unit 221, the syndrome generator unit 222, the solution searching polynomial generator unit 223, the Euclidean iteration processing unit 224, the first Hasse differential polynomial generator unit 225, the second Hasse differential polynomial generator unit 226, the code restorer unit 227 and the decoder unit 228 are contained in the “Lee metric code ECC system” circuit block 220. The 24-adic converter unit 260′ is contained in the “p-adic to binary decode” circuit block 260.
Binary data D input, as sets of 32 bits (M=32), to the memory controller 200 is first fed to the 13-adic converter unit 210′.
The 13-adic converter unit 210′ converts the input binary data D to a 13-adic number, thereby generating data A having digits represented by the elements in Zp. The data A is sent to the encoder unit 221.
The encoder unit 221 applies a generator matrix G to the input data A, thereby generating a Lee metric code C from the data A. This code C is sent to the “p-adic < > MLC binary mapping” circuit block 230.
The “p-adic < > MLC binary mapping” circuit block 230 uses the input Lee metric code C to generate data to be transferred to the memory device 100, which is then fed to the page buffer 240.
Thereafter, the data held in the page buffer 240 is transferred to the memory device 100 and programmed in MLCs.
Subsequently, the pieces of data programmed in the memory device 100 are read out as a series of page data to the page buffers 240. The data is turned at the “p-adic < > MLC binary mapping” circuit block 230 to a code Y that contains an error due to failed MLC-level identification. Subsequent processing includes restoration of an errorless Lee metric code C from the code Y.
The code Y output from the “NAND < > MLC binary mapping” circuit block 230 is provided to the syndrome generator unit 222.
The syndrome generator unit 222 executes operational processing, S=YHt, to the input code Y to generate a syndrome S. If S0 obtained through the operation has a Lee metric of 3 or more (|S0|≧3), the sum of individual error code components always becomes equal to 3 or more. In a word, a key to solution with ε=2 may not be possible to correct an error in the code Y. Therefore, NG signal is provided to the outside of the memory controller 200, and the code Y before error correction is directly fed to the decoder unit 228. If S=0, no error occurs in the code Y, and accordingly the code Y is fed to the decoder unit 228 not through error correction. In the case of other syndromes, the generated syndrome S is sent to the solution searching polynomial generator unit 223.
The solution searching polynomial generator unit 223 uses the input syndrome S=(S0, S1, S2) to generate a polynomial Ψ(x). The polynomial Ψ(x) has coefficients φ0, φ1 and φ2, which are as shown in Expression 2. The polynomial Ψ(x) is given to the Euclidean iteration processing unit 224.
ψ0=1
ψ1=−S1
ψ2=−(ψ1S1+ψ0S2)/2 [Expression 2]
The Euclidean iteration processing unit 224 uses the input polynomial Ψ(x) and x3 to try the generation of polynomials λ(x) and ν(x) through the Euclidean iterative method. Unless the polynomials λ(x) and ν(x) can be generated, NG signal is provided to the outside of the memory controller 200, and the code Y before error correction is directly fed to the decoder unit 228. On the other hand, if the polynomials λ(x) and ν(x) can be generated, the flow shifts processing to the first Hasse differential polynomial generator unit 225 and the second Hasse differential polynomial generator unit 226.
The first Hasse differential polynomial generator unit 225 generates a Hasse differential polynomial from the polynomial λ(x) and substitutes r=1-12 into the Hasse differential polynomial to extract a root r that satisfies [λ(r)][0]=0. Subsequently, it uses the extracted root r to derive the multiplicity n that satisfies [λ(r)][n−1]=0 and [λ(r)][n]≠0. Then, the position t=r−1 and the error et=n based on these root r and multiplicity n are provided to the decoder unit 227.
Similarly, the second Hasse differential polynomial generator unit 226 generates a Hasse differential polynomial from the polynomial ν(x) and substitutes r=1-12 into the Hasse differential polynomial to extract a root r that satisfies [ν(r)][0]=0. Subsequently, it uses the extracted root r to derive the multiplicity n that satisfies [ν(r)][n−1]=0 and [ν(r)][n]≠0. Then, the position t=r−1 and the error et=13-n based on these root r and multiplicity n are provided to the code restorer unit 227.
The code restorer unit 227 uses the input position t and error et to restore the error-corrected Lee metric code C=(c1, c2, . . . , c3) from ct=yt−et. The Lee metric code C is fed to the decoder unit 228.
The decoder unit 228 applies an inverse conversion of the generator matrix G to the input Lee metric code C, thereby generating p-adic data A having digits represented by the elements in Zp. The p-adic data A is fed to the 24-adic converter unit 260′.
The 24-adic converter unit 260′ converts the input p-adic data A to binary data, that is, 24-adic binary data D. The binary data D is provided to the outside of the memory controller 200.
The page buffer 240 is used only in data communications with the memory device 100 in
The mapping unit, that is, the “p-adic < > MLC binary mapping” circuit block 230 is described on the function thereof in detail in the case of, for example, p=13 and the number of MLC levels equal to 16.
Mapping is executed by determining the correspondence between 16 levels of the MLC and a finite field Zp modulo p=13 first. The following description is given to 2 cases.
The case 1 relates to a method of assigning the elements in Zp to the MLC levels in turn from the lower level. In the case of
The case 2 relates to an assigning method of setting larger level intervals to higher levels of the MLC levels. In the case of
Thus, standing on the characteristics of the 16-level NAND memory, it is preferred to determine the above-described assigning method in the case 1 or the case 2 by limiting the purpose, for example, to that for suppressing error occurrences or reducing the programming time. It is also desirable for the memory controller 100 to set any assigning method at any time in accordance with the situation of the memory device 200. In the cases 1 and 2 the elements in Zp are assigned to the MLC levels in turn from the lower level while the elements in Zp can be assigned to the MLC levels in turn from the upper level as well.
Subsequently, the correspondence between the MLC levels and the bits in a page is described with reference to
Once the correspondence between the elements in Zp and the MLC levels is determined in this way, the elements in Zp are decoded based on 4 bits in the pages <0>-<3> corresponding to the MLC levels and then held in the page buffers 240. Therefore, the page buffers 240 are arranged in 4 series in parallel such that the 4 bits of data decoded through the above-described mapping can be held in respective series of the page buffers 240 in turn.
One series of page buffers 240 in the memory controller 200 corresponds to the page buffer contained in the memory device 100. When the page buffers 240 in the memory controller 200 are completely filled with data, the data is transferred series by series in turn to the page buffer in the memory device 100. The data in 4 series of page buffers 240 is programmed continuously in the MLCs at the memory device 100. Through such the data write procedure, the elements in Zp obtained at the memory controller 200 can be stored as the MLC levels in the memory device 100.
Reading data from the MLC in the memory device 100 is executed through the opposite procedure to the above-described data write procedure. In a word, 4 pages of data in the same MLC are read out in turn and these pages of data are transferred to the 4 series of page buffers 240 in the memory controller 200. The data transferred to the 4 series of page buffers 240 is decoded to obtain the numerals in Zp indicative of the MLC levels.
The multiple use of the page buffers 240 during the process of data processing is described next with reference to
The memory controller 200 described herein is determined to have an I/O data transfer rate of 1 byte/cycle. 1 sector data is transferred to the page buffers 240 in 512 cycles. Processing at every certain-bit data is referred to as a process and, to distinguish it from the general process, it is represented by a “PROCESS”.
A first description is given to the case where ε=4, that is, errors in 4 cells of 12 cells are correctable at the maximum.
In this case, as shown in
Data processing in the case of ε=4 is executed to deal M=24 bits at every PROCESS as shown on the column d in
If 24 bits of data are processed in 1 PROCESS, processing of data in 1 sector (512 bytes) requires 171 PROCESS. Though, 171-PROCESS data includes 171×24=4104 bits (513 bytes) more than the data in 1 sector, that is, 4096 bits (512 bytes). Therefore, it is required to add 8 dummy bits. The dummy bits may be fixed data generated in a circuit.
The external input binary data D in the case of ε=4 is held in the registers <1:3> of REGISTER6. The binary data D held in the registers <1:3> is processed at the following steps S1-S4.
First, at step S1, the binary data D is processed in 171 PROCESS and converted to 13-adic data A. Through this conversion, 24 bits of 1-PROCESS data (δ=6) in the “binary data” are converted to 28 bits of data (δ=7) in the “p-adic Zp world”, containing additional 4 bits corresponding to 1 digit of p-adic number. Therefore, the 13-adic data A generated through 171 PROCESS includes 171 PROCESS×28 bits=4788 bits. The 4788 bits of data correspond to the capacity of byte registers on 3.5 columns.
Subsequently, at step S2, the 13-adic data A, having a data quantity corresponding to the byte registers on 3.5 columns, is held in the registers <1:4> of REGISTER6. Therefore, the area in REGISTER6 used to hold the external input binary data D is overwritten with the 13-adic data A.
Subsequently, at step S3, the 13-adic data A is processed in 171 PROCESS and converted to a Lee metric code C. Through this conversion, 28 bits of 1-PROCESS, 13-adic data A are used to generate a 48-bit code in Zp composed of p−1=12 components. Therefore, the Lee metric code C generated through 171 PROCESS includes 171 PROCESS×48 bits=8208 bits (1026 bytes). The 1026 bytes of data correspond to the capacity of byte registers on 6 columns.
Subsequently, at step S4, the Lee metric code C is held in REGISTER6. Therefore, the area in REGISTER6 used to hold the 13-adic data A is overwritten. All the registers <1:6> are used at step S4.
Thus, the binary data D input from the “binary world” in the case of ε=4 is held in the registers <1:3> of REGISTER6. The remaining registers <4:6> are used and overwritten until the stage of holding the final data processing result, that is, the Lee metric code C, together with the register <1:3>.
Data processing at steps S1-S4 includes converting the binary data D to the 13-adic converted data A (steps S1 and S2), then converting the 13-adic data A to the Lee metric code C (steps S3 and S4), and scanning this, as shown in
In the case of data processing shown in
The following description is given to the case where ε=3, that is, errors in 3 cells of 12 cells are correctable at the maximum.
In this case, as shown in
Data processing in the case of ε=3 is executed to deal M=28 bits at every PROCESS as shown on the column c in
If 28 bits of data are processed in 1 PROCESS, processing of data in 1 sector (512 bytes) requires 147 PROCESS. Though, 147-PROCESS data includes 147×28=4116 bits more than the data in 1 sector, that is, 4096 bits (512 bytes). Therefore, it is required to add 20 dummy bits. The dummy bits may be fixed data generated in a circuit.
The external input binary data D in the case of ε=3 is held in the registers <1:7> of the registers <1:12>. The binary data D held in the registers <1:7> is processed at the following steps S1-S4.
First, at step S1, the binary data D is processed in 147 PROCESS and converted to 13-adic data A. Through this conversion, 28 bits of 1-PROCESS data (δ=6) in the “binary world” are converted to 32 bits of data (δ=7) in the “p-adic Zp world”, containing additional 4 bits corresponding to 1 digit of p-adic number. Therefore, the 13-adic data A generated through 147 PROCESS includes 147 PROCESS×32 bits=4704 bits. The 4704 bits of data correspond to the capacity of byte registers on 8 columns.
Subsequently, at step S2, the 13-adic data A is held in the registers <1:8> of REGISTER12. Therefore, the area in REGISTER12 used to hold the external input binary data D is overwritten with the 13-adic data A.
Subsequently, at step S3, the 13-adic data A is processed in 147 PROCESS and converted to a Lee metric code C. Through this conversion, (32×2) bits of 2-PROCESS, 13-adic data A are used to generate a (48×2)-bit code in Zp composed of p−1=12 components. Therefore, the Lee metric code C generated through 147 PROCESS includes 147 PROCESS×48 bits=7056 bits (882 bytes). The 882 bytes of data correspond to the capacity of byte registers on 12 columns.
Subsequently, at step S4, the Lee metric code C is held in the registers <1:12>. Therefore, the area in REGISTER12 used to hold the 13-adic data A is overwritten. All the registers <1:12> are used at step S4.
Thus, the binary data D input from the “binary world” in the case of ε=3 is held in the registers <1:7> of REGISTER12. The remaining registers <8:12> are used and overwritten until the stage of holding the final data processing result, that is, the Lee metric code C, together with the registers <1:7>.
Data processing at steps S1-S4 includes converting the binary data D to the 13-adic converted data A (steps S1 and S2), then converting the 13-adic data A to the Lee metric code C (steps S3 and S4), and scanning this, as shown in
In the case of data processing shown in
The following description is given to the case where ε=2, that is, errors in 2 cells of 12 cells are correctable at the maximum.
In this case, as shown in
Data processing in the case of ε=2 is executed to deal M=32 bits at every PROCESS as shown on the column b in
If 32 bits of data are processed in 1 PROCESS, processing of data in 1 sector (512 bytes) requires 128 PROCESS. Though, 128-PROCESS data includes 128×32=4096 bits (512 bytes) just the same as the data in 1 sector, that is, 4096 bits (512 bytes). The addition of dummy bits as in the case of ε=4, 3 is not required.
The external input binary data D in the case of ε=2 is held in the registers <1:4> of REGISTER6. The binary data D held in the registers <1:4> is processed at the following steps S1-S4.
First, at step S1, the binary data D is processed in 128 PROCESS and converted to 13-adic data A. Through this conversion, 32 bits of 1-PROCESS data (δ=6) in the “binary data” are converted to 36 bits of data (δ=7) in the “p-adic Zp world”, containing additional 4 bits corresponding to 1 digit. Therefore, the 13-adic data A generated through 128 PROCESS includes 128 PROCESS×36 bits=4608 bits. The 4608 bits of data correspond to the capacity of byte registers on 4.5 columns.
Subsequently, at step S2, the 13-adic data A, having a data quantity corresponding to the byte registers on 4.5 columns, is held in the registers <1:5> of REGISTER6. Therefore, the area in REGISTER6 used to hold the external input binary data D is overwritten with the 13-adic data A.
Subsequently, at step S3, the 13-adic data A is processed in 128 PROCESS and converted to a Lee metric code C. Through this conversion, 36 bits of 1-PROCESS, 13-adic data A are used to generate a 48-bit code in Zp composed of p−1=12 components. Therefore, the Lee metric code C generated through 128 PROCESS includes 128 PROCESS×48 bits=6144 bits (768 bytes). The 768 bytes of data correspond to the capacity of byte registers on 6 columns.
Subsequently, at step S4, the Lee metric code C is held in the registers <1:6>. Therefore, the area in REGISTER6 used to hold the 13-adic data A is overwritten. All the registers <1:6> are used at step S4.
Thus, the binary data D input from the “binary world” in the case of ε=2 is held in the registers <1:4> of REGISTER6. The remaining registers <5:6> are used and overwritten until the stage of holding the final data processing result, that is, the Lee metric code C, together with the register <1:4>.
Data processing at steps S1-S4 includes converting the binary data D to the 13-adic converted data A (steps S1 and S2), then converting the 13-adic data A to the Lee metric code C (steps S3 and S4), and scanning this, as shown in
In the case of data processing shown in
Next, in the case of ε=2, an example is described on the relation between REGISTER6 and the page buffers with reference to
REGISTER6 is a group of 8-row, 6-column registers. When this is viewed as the page buffers 240 in the memory controller 200, it turns to a group of 12-row, 4-column registers as shown in
With regard to a certain i-th PROCESS, the input binary data D is held in 8 MLCs <i+0> to <i+7> of 12 MLCs <i+0> to <i+11> first as shown in
As described above, the memory controller 200 according to the present embodiment uses the page buffers 240 in multiple during the process of data processing.
Next, in the case of ε=2, the process of processing sector data in REGISTER6 at the time of data writing is described with reference to
The sector register, or REGISTER6, is configured to include 1024 registers aligned in 6 columns and has a receiving portion of 1024 rows for 768-byte data transfer. Of 6 columns, No. 1-4 columns are used in data transfer and the remaining No. 5-6 columns are used in holding the target data after data transfer.
At S1 in
Thus, the sector register stores 128-PROCESS, 32-bit binary data corresponding to h=4, δ=8 of a 10-digit, 24-adic number.
At S2 in
At S3 in
The Lee metric code is data having components that correspond to (p−1) (=12) elements in Zp. A Lee metric code by 1 PROCESS is 48-bit data resulted from 36-bit, p-adic data multiplied by a G matrix. The 1-PROCESS Lee metric code is overwritten and stored in one 8-row, 6-column register block. In this case, 1 sector register is entirely filled with the Lee metric code in 128 PROCESS. Thus, the sector register can hold the data to be written in 192-byte pieces of MLCs.
Thus, processing in 3 stages shown at S1-S3 in
In the memory controller 200, the sector register is overwritten with data twice: at the time of conversion from 512-byte binary data to p-adic data; and at the time of conversion from the p-adic data to a Lee metric code, which is written in 192-byte pieces of MLCs.
For example, if the memory device 100, that is, the MLC NAND flash memory has a page configuration of 8 sectors each composed of (512+16) bytes, 4 pages contain data for ((512++16) bytes×8) pieces of MLCs. Accordingly, the levels of 192-byte pieces of MLCs correspond to 22 sector registers herein described. Therefore, the page buffers 240 in the memory controller 200 comprise 22 sector registers and the data in the page buffers 240 are transferred as 4 pages of data to the MLC NAND flash memory.
If the memory device 100, that is, the MLC NAND flash memory has a page configuration of 4 sectors each composed of (512+16) bytes, 4 pages contain data for ((512+16) bytes×4) pieces of MLCs. Accordingly, the levels of 192-byte pieces of MLCs correspond to 11 sector registers herein described. Therefore, the page buffers 240 in the memory controller 200 comprise 11 sector registers and the data in the page buffers 240 are transferred as 4 pages of data to the MLC NAND flash memory.
Next, the process of data processing in the sector register at the time of data read is described with reference to
At S1 in
The 128-PROCESS, 768-byte data stored in the sector register is subject to computation processing by ECC, at every 1-PROCESS data stored in an 8-row, 6-column register block, beginning from the sector register end, then error-corrected if enable, and left as it is if unable. Then, 128 PROCESS can be applied to complete ECC processing over all data in 1 sector to restore the Lee metric code.
At S2 in
At S3 in
Thereafter, the binary data is read out at every 8-row, 5-column register group in column order at a rate of 1 byte/cycle and provided as 1-sector data.
Thus, in the memory controller 200, the sector register is overwritten with data twice: at the time of conversion from the Lee metric code to p-adic data executed after ECC processing; and at the time of conversion from the p-adic data to binary data.
A specific example is described next on the important part of the specification of the memory controller, that is, the method of using the page buffers 240, and data transfer processing. The characteristics are grouped here in the case where the number of levels of MLCs used in the memory device 100 is equal to 16 and 8.
The number of MLC levels determines the prime p for use, thereby determining the number of cells required for coding accordingly. The use of 16-level MLCs requires p=13 and accordingly 12 (=p−1) MLCs are used to hold coded data. Therefore, in the case of ε=2, it is possible to correct errors in 2 MLCs of 12 MLCs at the maximum.
On the other hand, the use of 8-level MLCs requires p=7 and accordingly 6 (=p−1) MLCs are used to hold coded data. Therefore, in the case of ε=1, it is possible to correct an error in 1 MLC of 6 MLCs at the maximum.
Even if either 16-level or 8-level MLCs are used, it is possible to remedy 2 or 1 mLC at every small number of MLCs. Accordingly, the memory controller 200 has higher remedy efficiency over errors in MLCs as can be said.
If it is intended to attain the error remedy efficiency equivalent to that in the memory controller 200 through the conventional ECC method, for example, with the use of a binary BCH code, a wider memory redundant area and an enormous circuit area are required.
Subsequently, the configuration of the memory controller 200 is shown below.
In the case of p=13, ε=2: A data interface is configured at every×8 1-byte. The data interface operates in sync with clocks. Data transfer is executed at every sector of 512 bytes.
As shown on the column b in
The NAND flash memory has a page size of 4 sectors or 8 sectors while the data transfer from the memory controller 200 to the memory device 100 is executed continuously over 4 pages. Therefore, the memory controller 200 forms a page with a larger number of sectors than that in the memory device 100. For example, the previous example forms a page with 11 sectors or 22 sectors while the number of sectors contained in a page is not specified here because it varies in accordance with the application field of the memory system.
The sectors contained in a page are accessed at random.
Access from the memory controller 200 to 16-level MLCs in the memory device 100 is executed at every 4 pages.
In the case of p=7, ε=1: A data interface is configured at every×8 1-byte. The data interface operates in sync with clocks. Data transfer is executed at every sector of 512 bytes.
As shown on the column a in
The NAND flash memory has a page size of 4 sectors or 8 sectors while the data transfer from the memory controller 200 to the memory device 100 is executed continuously over 3 pages. Therefore, the memory controller 200 forms a page with a larger number of sectors than that in the memory device 100. The number of sectors contained in a page is not specified in the specification skeleton because it varies in accordance with the application field of the memory system.
The sectors contained in a page are accessed at random.
Access from the memory controller 200 to 16-level MLCs in the memory device 100 is executed at every 3 pages.
<13-adic Converter Unit>
A specific example is described below on a circuit that applies data processing at the memory controller 200.
The following description is given to a binary/p-adic (13-adic) converter circuit required at the entrance and the exit of the memory controller 200.
A square shown with “4 res” in
At the start, at the 0-th step (S0 in
Subsequently, 5-bit binary data, of which No. 4-1 bits are the outputs from the previous “4 res” circuit block (the residue represented by 4 bits) and No. 0 bit (the least significant bit) is the most significant bit D63 (=D27) of d6, is input to the next “4 res” circuit block, which generates the residue and a quotient or carry C127 derived from the input binary data divided by 13.
Thereafter, until No. 0 bit of 5-bit binary data input to the “4 res” circuit block reaches the least significant bit D00 (=D0) of d0, 29 “4 res” circuit blocks are used to generate carries C10-C128. Binary data expressed by these generated carries C10-C128 indicates the number of 13s contained in the data D.
The output from the “4 res” circuit block having the input of d0 has a binary representation of a0 in a 9-digit, 13-adic number D (a0, a1, . . . , a8).
Subsequently, at the 1st step (S1 in
At the 1st step, numeration of 13 is executed to the carries C10-C128 from C128 on the rightmost side. At the time of the numeration of 13 executed to the carries C125-C128, the flow directly generates the residue and a carry C225 by substituting 0 for the most significant bit of 5-bit input binary data and dividing the input binary data by 13.
Subsequently, 5-bit binary data, of which No. 4-1 bits are the outputs from the previous “4 res” circuit block (the residue represented by 4 bits) and No. 0 bit (the least significant bit) is C124, is input to the next “4 res” circuit block, which generates the residue and a quotient or carry C224 derived from the input binary data divided by 13.
Thereafter, until No. 0 bit of 5-bit binary data input to the “4 res” circuit block reaches the least significant bit C10 of C1i, 26 “4 res” circuit blocks are used to generate carries C20-C225. Binary data expressed by these generated carries C20-C225 indicates the number of primes 132s contained in the data D.
The output from the “4 res” circuit block having the input of C10 has a binary of a1 in a 9-digit, 13-adic number D (a0, a1, . . . , a8).
At the subsequent 2nd step (S2 in
Thereafter, the flow advances up to the 8th step (S8 in
The carries C90-C94 at the 8th step are not used in computation.
A consideration is given next to the configuration of the 13-adic converter unit 210′ using “5 bit mod 13” circuit blocks. The “5 bit mod 13” circuit block is a circuit that compares an input A or 5-bit binary data with a prime 13, then provides PF0=‘1’ if A is equal to or higher than 13 and provides the residue Q of A by the prime 13. The details are described later.
Here, the j-th digit is denoted with dj when data is subject to an 8-digit, 24-adic expression. In this case, dj can be indicated in 4-bit binary though the coefficient D of the indication is expressed in common with the coefficient D of other d. For that purpose, sub-indexes are used as shown in Expression 3.
d
j
=D
j
0
+D
j
12+Dj222+Dj323
d
j(24)j=D4j24j+D4j+124j+1+D4j+224j+2+D4(j+1)−124(j+1)−1 [Expression 3]
The input to the operation at the k-th step, that is, the carry at the previous step (the (k−1)-th step) is Ck0−Ck4(8−k)+k−1, which is the coefficient as a binary including the sub-index as an exponential of 2, and the numeral expressed by this binary indicates the number of 13ks contained in the data.
At the k-th step, the input includes 4(8−k)+k pieces of binaries (carries Ck0-Ck4(8−k)+k−1) as shown in
The 1st “5 bit mod 13” circuit block <1> receives Ck4(8−k)+k−4 to Ck4(8−k)+k−1 and 0 on the input binary A0-A3, A4, and provides R4(8−(k+1))+k0 to R4(8−(k+1))+k3 and Ck+14(8−(k+1))+k from the outputs Q0-Q3 and the carry PF0, respectively.
The 2nd “5 bit mod 13” circuit block <2>, not shown, receives a carry Ck4(8−(k+1))+k−1 and the outputs R4(8−(k+1))+k0 to R4((8−(k+1))+k3 from the 1st “5 bit mod 13” circuit block <1> on the input binary A0, A1-A4, and provides R4(8−(k+1))+k−10 to R4(8−(k+1))+k−13 and Ck+14(8−(k+1))+k−1 from the outputs Q0-Q3 and the carry PF0, respectively.
Thereafter, as shown in
Thus, the conversion of a binary to a 13-adic number is executed sequentially from the most significant bit of the carry C as in the schematic diagram shown in
A circuit including 37 “5 bit mod 13” circuit blocks thus configured is shown in
As shown in
Next, the “5 res” circuit block shown in
The “5 bit mod 13” circuit block provides the residue Q of the input binary A by the prime 13, and provides ‘1’ from PF0 if the input binary A is equal to or more than 13 and ‘0’ from PF0 if it is lower than 13.
In the case of h=4, p=13, relations shown in Expression 4 can establish among the binary A, the binary Q and the prime p.
a=A
0
+A
12+A222+A323+A424
Q≡a(mod p)(a=Q+PF0×p)
Q=Q
0
+Q
12+Q222+Q323 [Expression 4]
The “5 bit mod 13” circuit block comprises a PF0 generator unit U1, 2 half adders HA1-HA3, and a full adder FA1.
The PF0 generator unit U1 includes serially connected PMOS transistors QP1-QP3 and NMOS transistors QN1-QN3 between the Vcc terminal supplied with a certain voltage and the Vss terminal supplied with the ground voltage. These transistors QP1, QP2, QP3, QN1, QN2 and QN3 are controlled by A4, A1, A0, A0, A2 and A3, respectively.
The PF0 generator unit U1 also includes 2 PMOS transistors QP4, QP5, 2 NMOS transistors QN4, QN5, and an inverter IV1.
The transistors QP4, QP5 are connected between the source of the transistor QP2 and the drain of the transistor QP3 in parallel. The transistor QN4 is connected between the source and drain of the transistor QN1 in parallel. The transistor QN5 is connected between the source of the transistor QN1 and the drain of the transistor QN3 (Vss terminal) in parallel. These transistors QP4, QP5, QN4 and QN5 are controlled by A2, A3, A1 and A4, respectively.
The inverter IV1 has an input connected to the sources of transistors QN1, QN4 and QN5. The output from the inverter IV1 provides the carry PF0.
The half adder HA1 has inputs A0 and PF0, an output Q0, and a carry output C0. The full adder FA1 has inputs C0 and A1, a carry input PF0, an output Q1, and a carry output C1. The half adder HA2 has inputs C1 and A2, an output Q2, and a carry output C2. The half adder HA3 has inputs C2 and A3, an output Q3.
In accordance with the above configuration, the PF0 generator unit U1 decides if the binary A input to the “5 bit mod 13” circuit block is equal to or more than 13, and provides the result from PF0. If the binary A is equal to or more than 13, the half adders HA1-HA3 and the full adder FA1 are used to add 3, a complement of the 5-bit binary 13, to the binary A in order to subtract 13 from the binary A.
The following description is given to the core part of the 13-adic converter unit 210′, that is, the p-adic” circuit block.
The “p-adic” circuit block receives B0-B9, I0-I39, and provides r0-r35 as shown in
Specifically, the inputs I0-I3, I4-I7, . . . , I35-I39 are fed via the control switches SW1 to the “X to p” circuit block as C00-C03, C04-C07, . . . , C035-C039, respectively. These control switches SW1 are controlled by the inputs B1-B9, respectively.
One control switch SW1 includes a transfer transistor TQ operative to connect the input IN with the output OUT, and an NMOS transistor QN operative to pull down the output OUT to the ground voltage. The transfer transistor TQ turns on when the control signal is CNT=‘0’ while the transistor QN turns on when the control signal is CNT=‘1’.
In the case of the control switches SW1, the control signals CNT include /B1-/B9. Therefore, I0-I39 are provided directly as C00-C039 if B=‘1’, and the output turns to ‘0’ independent of the input if B=‘0’. This is effective to prevent the input to the “X to p” circuit block from becoming infinite even if the inputs I0-I39 to the “p-adic” circuit block are infinite.
The “X to p” circuit block on receipt of C00-C039 provides R00-R323, C10-C136 as described before.
The outputs C10-C136 from the “X to p” circuit block pass through the control switches SW2 and turn to r4-r40, that is, the outputs from the “p-adic” circuit block. These control switches SW2 are controlled by the inputs /B1-/B9. Therefore, these control switches SW2 directly pass C10-C136 as r4-r40 if B=‘1’.
The outputs R00-R323 from the “X to p” circuit block pass through the control switches SW3 and turn to r0-r35, that is, the outputs from the “p-adic” circuit block. These control switches SW3 are controlled by B0□/B1 to B8□/B9, respectively. Therefore, the control switch SW3 located between R00 and r0, for example, directly provides R00 as r0 only if B0=‘0’ or B1=‘1’.
B0-B9 for use in control of the control switches SW are timing signals, which are signals that rise sequentially. In sync therewith, the paths for the inputs I open at every 4 bits from the lower bit side and the paths for the outputs r switch to the paths for the outputs R.
In order to provide the result at the present step until the flow enters the computation process at the next step, R corresponding to a coefficient A on each digit of a 13-adic number is provided to the later-described external “D-r” register via the control switches SW3, which are on/off-controlled by the signals resulted from the logical operation with the adjacent timing signals B.
The following description is given to the 13-adic converter unit 210′ including the above-described circuits grouped together.
The “D-r” register is a register controlled by the timing signal B and the clock clk as shown in
The “D-r” register includes a flip-flop FF composed of 2 inverters at every bit. The flip-flop FF receives Dj (j=0-39) via the control switch SW1 and receives rj via the control switch SW2. On the other hand, the flip-flop FF is connected at the output side thereof to an inverter IV1 via the control switch SW3. The output from the inverter IV1 provides Ij.
The control switches SW1-SW3 are controlled by the timing signal B0 and the clock clk. Specifically, the control switch SW1 turns on if /clk □/B0=‘1’, the control switch SW2 if /clk □ B0=‘1’, and the control switch SW3 if clk=‘1’, respectively.
D32-D39 not contained in the data input to the “D-r” register are held at ‘0’.
In the initial state of the “D-r” register, a binary D0-D31 is set, and the rest is filled with ‘0’. Thereafter, when B0 rises, data rj is taken in sync with the fall of clk, and the taken rj is provided as Ij in sync with the rise of clk.
The “D-r” register couples with the “p-adic” circuit block to advance the computation step at every timing signal Bj. The state of variations in each clock is shown in
At each computation step, each digit Aj of a 13-adic number is obtained as the output r from the lower side, and this is held at the same timing as that for taking Ij in the second half of the timing signal Bj.
After completion of all the computation steps, the “D-r” register holds the coefficients Ajm on the digits when the coefficients a on the digits of 13-adic data D are converted to binaries.
In the case of p=13, the number of computation steps is equal to 10, and the number of “5 bit mod 13” circuit blocks contained in the “p-adic” circuit block is equal to 37.
<24-adic Converter Unit>
A square shown with “4 add 13” in
At the start, at the 0-th step (S0 in
Subsequently, at the 1st step (S1 in
Thereafter, the same steps as the 0-th step and the 1st step are repeated and, at the 8th step (S8 in
A consideration is given next to the configuration of the 24-adic converter unit 260′.
It is possible to express a coefficient aj on the digit at the j-th degree of a 9-digit, 13-adic representation, in 5-bit binary. For the purpose of bringing the coefficient A in this binary representation into a representation in common with coefficients a on other digits, sub-indexes are used as shown in Expression 5.
a
j
=A
j
0
+A
j
12+Aj222+Aj323
a
j(24)j=A4j24j+A4j+124j+1+A4j+224j+2+A4(j+1)−124(j+1)−1 [Expression 5]
The input to the operation at the k-th step, that is, the carry at the previous step (the (k−1)-th step) includes Ck0-Ck4(k+1)−1, of which sub-index is a coefficient in the form of a binary of an exponent of 2. The number expressed by this binary indicates the number of 138−ks contained in the data.
At the k-th step, 4 (k+1) pieces of “5 bit add 13” circuit blocks are used in processing as shown in
The 1st “5 bit add 13” circuit block <1> receives Ck0, Q−10-Q−13 at the carry and the inputs B0-B3, and provides Ck+l0, Q00-Q03 from Q0, Q1-Q4, respectively.
The 2nd “5 bit add 13” circuit block <2>, not shown, receives Ck1, Q00-Q03 at the carry and the inputs B0-B3, and provides Ck+11, Q10-Q13 from Q0, Q1-Q4, respectively.
Thereafter, the “5 bit add 13” circuit blocks having the similar inputs and outputs are aligned 4 (k+1) pieces in total as shown in
Thus, the conversion of a 13-adic number to a binary is executed sequentially from the least significant bit of a carry C as in a schematic diagram shown in
The circuit including 4 “5 bit add 13” circuit blocks thus configured is referred to as an “a to X” circuit block.
As shown in
The following specific description is given to the “4 add 17” circuit block shown in
In the case of h=4, p=13, the relation shown in Expression 6 can establish between the binary B and the binary Q.
b=B
0
+B
12+B222+B323
Q=b+carry×p
Q=Q
0
+Q
12+Q222+Q323+Q424 [Expression 6]
The “5 bit add 13” circuit block includes 2 half adders HA1, HA2, and 2 full adders FA1, FA2.
The half adder HA1 has inputs B0 and carry, an output Q0, and a carry output C0. The half adder HA2 has inputs C0 and B1, an output Q1, and a carry output C1. The full adder FA1 has inputs B2 and carry, a carry input C1, an output C2, and a carry output Q2. The full adder FA2 has inputs B3 and carry, a carry input C2, an output Q3, and a carry output Q4.
In accordance with the above configuration, the “5 bit add 13” circuit block adds a prime 13 to the input binary B if carry=‘1’.
Next, the “a to X” circuit blocks described before are used to configure a circuit for 1 step operative to lower the degree of the 13-adic number by 1. Hereinafter, the circuit is referred to as a “p to X” circuit block. The “p to X” circuit block can be used in all the computation steps in common.
The “p to X” circuit block comprises 8 “a to X” circuit blocks.
The 1st “a to X” circuit block <1> receives part of the inputs to the “p to X” circuit block, that is, Q−0-Q−13, and C70-C73, and provides Q′30-Q′33 and part of the outputs from the “p to X” circuit block, that is, C80-C83.
The 2nd “a to X” circuit block <2> receives Q30-Q33 and part of the inputs to the “p to X” circuit block, that is, C74-C77, and provides Q′70-Q′73 and part of the outputs from the “p to X” circuit block, that is, C85-C87. Among the inputs, Q30-Q33 are signals fed via the control switches SW1, through which the outputs Q′30-Q′33 from the 1st “a to X” circuit block <1> are controlled by the timing signal B7.
The 3rd “a to X” circuit block <3> receives Q70-Q73 and part of the inputs to the “p to X” circuit block, that is, C78-C711, and provides Q′110-Q′113 and part of the outputs from the “p to X” circuit block, that is, C88-C811. Among the inputs, Q70-Q73 are signals fed via the control switches SW2, through which the outputs Q′70-Q′73 from the 2nd “a to X” circuit block are controlled by the timing signal B6.
Thereafter, similar connections will be made up to the 8th “a to X” circuit block <8>.
The inputs and outputs of the “a to X” circuit block are connected via the control switches SW in this way, because the connection of the input is switched between the external input and the internal input at every computation step, and for the purpose of preventing the output from the internal circuit from interfering in the case of the external input.
In the case of the circuitry of
Subsequently, when the timing signal B1 also turns to ‘1’, the 2nd “a to X” circuit block <7> from the right in
Thereafter, at every sequential rise of the timing signals B2-B7, the “a to X” circuit block required at each step is activated.
The following description is given to the core part of the 24-adic converter unit 260′, that is, the “binary” circuit block.
As shown in
Specifically, the inputs I4-I35 are fed as C70-C731 to the “p to X” circuit block via the control switches SW1. These control switches SW1 are controlled by the timing signals B7-B0. Therefore, the control switches SW1 pass I4-I35 directly as C70-C731 if B=‘1’ and keep the outputs at ‘0’ independent of the inputs if B=‘0’.
In addition, the inputs I0-I35 are fed as Q−10-Q313 to the “p to X” circuit block via the control switches SW2, respectively. These control switches SW2 are controlled in accordance with /B9ΛB8 to /B1ΛB0, respectively. Therefore, the control switch SW2 located between I0 and Q−10 passes I0 directly as Q−10 only if B9=‘0’, B8=‘1’.
The outputs C80-C835 from the “p to X” circuit block turn to r0-r35, that is, the outputs from the “binary” circuit block via the control switches SW3. These control switches SW3 are controlled by the timing signals B7-B0. Therefore, the control switches SW3 pass C80-C835 directly as r0-r35 if B=‘1’.
In accordance with the above circuitry, the “p to X” circuit block responds to each computation step while increasing the bit widths of the input and output 4-bit by 4-bit sequentially. While taking the numerals A on the digits of the 13-adic number from the upper digit sequentially at each computation step, and when all the computation steps are completed, a binary representation of data can be obtained.
As described earlier, the timing signals B0-B9 are the signals that rise in order. In accordance therewith, the paths to the inputs I and outputs r conduct 4-bit by 4-bit from the upper bit side.
The numeral A on each digit of the 13-adic number is initially set in the later-described external “A-r” register, and fed to the “A-r” register via the control switch SW3 on/off-controlled by the adjacent timing signal B such that the path switches selectively until the flow enters the next computation step.
The following description is given to the 24-adic converter unit 260′ including the above-described circuits grouped together.
The “A-r” register is a register controlled by the timing signal B0 and the clock clk and having the inputs r0-r35, A0-A35 and the outputs I0-I35 as shown in
The “A-r” register includes, at every bit, a flip-flop FF composed of 2 inverters. The flip-flop FF receives Aj (j=0-35) via the control switch SW1 and receives rj via the control switch SW2. On the other hand, the flip-flop FF is connected at the output thereof to an inverter IV1 via the control switch SW3. The output from the inverter IV1 provides Ij.
The control switches SW1-SW3 are controlled by the timing signal B0 and the clock clk. Specifically, the control switch SW1 turns on if /clk □/B0=‘1’, the control switch SW2 if /clk □ B0=‘1’, and the control switch SW3 if clk=‘1’, respectively.
The initial state of the “A-r” register includes the digits A0-A35 of the 13-adic number.
Thereafter, when the timing signal B0 rises, rj taken in sync with the fall of the clock clk is provided as Ij in sync with the rise of the clock clk.
The “A-r” register couples with the “binary” circuit block to advance the computation step at every timing signal Bj. The state of variations in each clock is similar to
After completion of all the computation steps, the “A-r” register holds a binary representation Dj of the input, that is, the 13-adic number A.
In accordance with the 24-adic converter unit 260′ thus configured, the number of computation steps is equal to 10, and the number of “5 bit add 13” circuit blocks contained in the “binary” circuit block is equal to 32.
The memory controller 200 of the present embodiment contains the “Lee metric code ECC system” circuit block 220, which serves as the ECC system for ensuring the reliability of data stored in the existing memory device 100 (such as the MLC NAND flash memory 100).
The “Lee metric code ECC system” circuit block 220 uses a Lee metric code for making simpler the computation required for error detection and correction. Therefore, the Lee metric code is described briefly here.
Symbols c representative of a code are integers shown in Expression 7.
c
j
εGF(p)=Zp,0≦cj<p [Expression 7]
When the metrics of these integers are represented by |cj| as Lee metrics and all Lee metrics |cj| are represented by integers of p/2 or below, the Lee metrics |cj| can be defined as in Expression 8.
0≦cj<p/2:|cj|=cj
p/2<c<p:|cj|=p−cj [Expression 8]
As a code C can be considered a row of n (=p−1) symbols cj, it can be represented by C=(c1, c2, . . . , cn), and a metric w(C) of the code C can be defined as the sum of Lee metrics |cj| of the symbols cj as in Expression 9.
w(C)=|c1|+|c2|+ . . . +|cn| [Expression 9]
The distance between codes can be defined by the sum of Lee metrics of the differences between the symbols corresponding to the codes. Here, a difference between 2 symbols candy (Lee distance), dL (c, y), is given as Expression 10.
d
L(c,y)=w(c−y) [Expression 10]
Further, the minimum Lee distance of the code C can be defined by the minimum metric of the metrics w(C) of the code C as shown in Expression 11.
d
L(C)=minw(C) [Expression 11]
In this case, the Lee metric code is such a code that has the minimum distance of 2γ between codes having a generator matrix G and a syndrome matrix H shown in Expression 12 and that can correct (γ−1) or lower Lee metric errors.
Here, when the word length of the code C is denoted with n and the word length of data is denoted with k, then γ=n−k where γ represents the redundant word length contained in the code C.
For the purpose of generating the Lee metric code thus configured, the input-converted data is represented by a k-digit, p-adic number. The numerals on the digits of the p-adic number correspond to the elements in Zp and therefore can be used as a data word X of the Lee metric code to obtain a code expression through an operation C=XG with the generator matrix G. The obtained code word is stored in the memory. Information about the error caused in the numeral in Zp stored is used as a data word Y of the Lee metric code read out of the memory to obtain a syndrome through an operation S=YHt (Ht is a transpose of H) so that the position and quantity of the error can be computed to correct the error.
Next, as the premise of the description of the ECC system using the Euclidean iterative method with p=13, γ=ε+1=3, ε=2 dealt in the present embodiment, the general relational equations on the Euclidean method are described collectively.
The binary data of the code C recorded in the memory device 100 includes a collection of 12 code word symbols each composed of 4 bits, which may cause variations symbol by symbol when suffer various disturbances. Thus, the operation required to restore the code C from the code Y is decoding. Prior to decoding, syndromes are sought at the start.
Syndromes S can be sought as elements S0, S1, S2 shown in Expression 13 through an operation S=YHt using a syndrome matrix H.
Here, Ht is a transpose of H. The generator matrix G and the syndrome matrix H are configured to satisfy GHt=0 (mod 13). Accordingly, a substitution of Y=C+E yields S=EHt. With E=(e1, e2, . . . , e12), a syndrome S0 corresponds to the total sum of errors in the code word symbols as can be found. These syndromes S provide the only information on errors and, on the basis of these syndromes S, the correct code C is restored as follows.
Subsequently, the principle of decoding is described. Here, n=p−1=12 error symbols are classified into 2 sets J+ and J− as in Expression 14.
J
+
={jε(1,2, . . . , 13); ej<13/2}
J
−
={jε(1,2, . . . , 13); ej>13/2} [Expression 14]
Namely, they are classified into J+ that is an arrangement of the positions j of the code symbols cj in the case of the symbol error quantity ej<13/2, and J− that is an arrangement of the positions j of the code symbols cj in the case of the symbol error quantity ej>13/2. Polynomials Λ(x), V(x) in the Galois field GF(13) are configured on the basis of these sets as in Expression 15.
Thus, the polynomial Λ(x) is a polynomial that has a reciprocal number of the position j of an error code word symbol in J+ as a root and that has a Lee metric ej of that code word symbol as the multiplicity of the root. On the other hand, the polynomial V (x) is a polynomial that has a reciprocal number of the position j of an error code word symbol in J− as a root and that has a Lee metric, 13-ej, of that code word symbol as the multiplicity of the root. Decoding is a process of finally configuring these polynomials only from the information on the syndrome S1 and solving them, thereby obtaining information on errors. In a word, it is required to seek a relation between these polynomials Λ(x), V (x) and the syndrome S1.
Subsequently, as shown in Expression 16, when it is configured with a series polynomial having each syndrome S1 on a coefficient of the corresponding degree, it is represented by a rational polynomial having the error quantity ej, the position j and the value of the symbol.
From Expression 16, a relational equation shown in Expression 17 can establish among the polynomials Λ(x), V (x) and the syndrome S(x).
Subsequently, the relational equation shown in Expression 17 is utilized to seek the polynomials Λ(x), V(x) from the syndrome S(x).
The syndrome S(x) is used to find a polynomial Ψ(x) at a degree of (γ−1) or lower shown in Expression 18.
In an expansive expression of the polynomial Ψ(x), a coefficient φj can be sought, from a comparison between the coefficients at the homogeneous degree on both sides of the expression shown in Expression 18, through an iterative method, using the syndrome Si and an already-determined coefficient φj−1. The coefficients φ0-φ2 of the polynomial Ψ(x) are sought from the syndromes S1, S2. The results are shown in Expression 19.
ψ0=1
ψ1=−ψ0S1=−S1
ψ2=−(ψ1S1+ψ0S2)/2 [Expression 19]
The polynomial Ψ(x) is a polynomial equivalent to Λ(x)/V(x) while the polynomials Λ(x), V(x) are given key conditions shown in Expression 20. Therefore, they can be sought through a Euclid iterative method applied to x3 and the polynomial Ψ(x) to eliminate constant multiples.
V(x)Ψ(x)≡Λ(x)mod x3)
degΛ(x)+degV(x)<3
Λ(x) and V(x)are coprimes
degΛ(x)−degV(x)≡S0(mod 13) [Expression 20]
Therefore, if the polynomial Ψ(x) can be configured from the syndromes S1, S2, the syndrome S0 can be used as the stop condition on the iterative method to seek the polynomials Λ(x), V(x). Namely, the polynomials Λ(x), V(x) sought from the set of (S0, Ψ(x)) through the iterative method can represent the position j and the error quantity ej of an error code word symbol.
The Euclidean iterative method is detailed later.
Next, the procedures of data processing described with reference to
Arranged below, with respect to an ECC operation in the “p-adic Zp world”, are the procedure of encoding data input as Zp data to the memory system, and the procedure of decoding the code read out of the memory system to finally obtain the data A.
The procedure of encoding is as follows.
At the start, the input data D is converted at the 13-adic converter unit 210′ to a 13-adic-represented 9-digit data word D(h) shown in Expression 21 (S1 in
D(h)=(a0,a1, . . . , a8) [Expression 21]
Subsequently, the data D is multiplied by the generator matrix G to yield 12 code word components c1-c12 of the code C (S2 in
Finally, the code word component cj is stored in memory cells (S3 in
The Lee metric code C is stored in the memory device 100 and turned to an error-containing code Y.
At the time of data read, the code Y read out of the memory device 100 is held in the page buffers 240 in the memory controller 100 (S4 in
Y=(y1,y2, . . . , y12)
Y=C+E,E=(e1,e2, . . . , e12) [Expression 23]
The memory controller 200 applies ECC error correction to the code Y held in the page buffers 240 to generate binary data and provides it to external. The procedure of data processing at the time of data read is as follows.
(Procedure 1) The code Y held in the page buffers 240 is used to compute the syndromes S=(S0, S1, S2) based on a computational equation shown in Expression 24 (S5 in
As S0=Σej, so if |S0|>2, the total sum of Lee metrics of errors is equal to or more than 3. Accordingly, the flow determines the code Y error-uncorrectable and terminates error searching as no solution (S6, S7 in
(Procedure 2) With respect to ε=2, that is, the upper limit of the number of correctable code word components, the coefficients φj of the solution searching polynomial Ψ(x) are computed first in turn from the syndromes S0-S2 obtained at Procedure 1 based on computational equations shown in Expression 25 (S8 in
Subsequently, the Euclidean method is applied to x3 and the polynomial Ψ(x) to derive polynomials λ(x) and ν(x) that can provide an ECC solution of the Lee metric code C (S9 in
(Procedure 3) The polynomial λ(x) or ν(x) obtained in Procedure 2 is used as φ(x) to derive coefficients of a Hasse differential polynomial required for obtaining the multiplicity of the root of φ(x) (S12 in
Specifically, they can be represented as in Expression 27.
[φ(x)][0]=φ(x)=1+φ1x+φ2x2
[φ(x)][1]=φ1+2φ2x
[φ(x)][2]=φ2 [Expression 27]
(Procedure 4) The Hasse differential polynomial obtained in Procedure 3 is given a substitution of the elements 1-12 in Zp to seek an element r that makes zero the 0-th degree differential polynomial (=φ(x)). Subsequently, as shown in Expression 28, such the degree n is sought that makes an (n−1)-th degree differential polynomial zero and an n-th degree differential polynomial non-zero, for each element r (S13 in
[φ(r)][0]=φ(r)=0,
[φ(r)][n−1]=0,[φ(r)][n]≠0 [Expression 28]
The obtained r is an inverse element of the position number t of the error-caused code component, and the corresponding n indicates the quantity converted from the error quantity et caused. This process is executed over all the elements r.
(Procedure 5) In this procedure, the error quantity is obtained through conversion from the multiplicity n of the solution (S14 in
Subsequently, the code yt and the error quantity et are used to restore the correct Lee metric code C=(c1, c2, . . . , c12) based on ct=yt−et (S15 in
(Procedure 6) A pluralistic system of linear equations of a relation AG=C among the codes C and A and the generator matrix G is applied to obtain 9 elements a0-a8 in Zp. The obtained elements a0-a8 are used to create a 9-digit, 13-adic number A (=a0, a1, . . . , a8) (S16 in
Thus, data processing in the “p-adic Zp world” in the memory controller 200 is finished. Subsequently, a conversion is made to restore the data A to binary data at the exit of the “p-adic Zp world”. The data A is converted from the 9-digit, 13-adic representation to an 8-digit, 24-adic representation such that a numeral on each digit is represented in binary as 8 digits. This turns to the binary data D input to the memory system.
Thus, the data is restored completely.
From now on, examples are described on specific circuits for realizing the above procedures.
First, a computing circuit operative to seek the product of Zp is described. Hereinafter, the computing circuit is referred to as an “X Zp” circuit block.
The “X Zp” circuit block is roughly divided into a circuit for processing a computation step group in the first half, and a circuit for processing a computation step group in the second half.
The circuit for processing the first-half computation step group includes an AND gate G1, and 3 “4 bit AD mod 13” circuit blocks <1>-<3>.
The AND gate G1 yields the logical product of the i-th bit (i=0-3) of a multiplicand a and the j-th bit (j=0-3) of a multiplier b, and provides it as Mij. The numerals a and b are represented as in Expression 29.
a=A
0
+A
12+A222+A323
b=B
0
+B
12+B222+B323 [Expression 29]
The “4 bit AD mod 13” circuit block is a circuit operative to seek the sum of 2 numerals in Zp modulo 13. The “4 bit AD mod 13” circuit block has the inputs A0-A3 and B0-B3 and the outputs Q0-Q3. The specific configuration of the “4 bit AD mod 13” circuit block is described later.
The 1st “4 bit AD mod 13” circuit block <1> receives M10-M30, ‘0’, M01-M31 at A0-A2, A3 and B0-B3, and provides Q00-Q03 from Q0-Q3, respectively.
The 2nd “4 bit AD mod 13” circuit block <2> receives Q01-Q03, that is, the output from the “4 bit AD mod 13” circuit block <1>, ‘0’, and M02-M32 at A0-A2, A3 and B0-B3, and provides Q10-Q13 from Q0-Q4, respectively.
The 3rd “4 bit AD mod 13” circuit block <3> receives Q11-Q13, that is, the output from the “4 bit AD mod 13” circuit block <2>, 0, and M03-M33 at A0-A2, A3 and B0-B3, and provides Q20-Q23 from Q0-Q4, respectively.
As described above, the circuit for processing the first-half computation step group includes the “4 bit AD mod 13” circuit blocks <1>-<3>, of which inputs and outputs are connected in order.
The circuit for processing the second-half computation step group includes 3 “5 bit mod 13” circuit blocks <1>-<3>. The “5 bit mod 13” circuit blocks <1>-<3> are the circuits shown in
The 1st “5 bit mod 13” circuit block <1> receives Q10 and Q20-Q23 at A0 and A1-A4, and provides Q30-Q33 from Q0-Q3, respectively.
The 2nd “5 bit mod 13” circuit block <2> receives Q00, and Q30-Q33, that is, the outputs from the “5 bit mod 13” circuit block <1> at A0 and A1-A4, and provides Q40-Q43 from Q0-Q3, respectively.
The 3rd “5 bit mod 13” circuit block <3> receives M00, and Q40-Q43, that is, the outputs from the “5 bit mod 13” circuit block <2> at A0 and A1-A4, and provides Q60-Q63 (Q0-Q3) from Q0-Q3, respectively.
As described above, the circuit for processing the second-half computation step group includes the “5 bit mod 13” circuit blocks <1>-<3>, of which inputs and outputs are connected in order.
All the circuits operate not in sync with clocks and decide the output Q when the input Mij is given.
Thus, the number of circuit blocks contained in the “X Zp” circuit block is equal to 6.
The following description is given to the “4 bit AD mod 13” circuit block shown in
The “4 bit AD mod 13” circuit block seeks the sum of numerals a and b input from A and B, and provides the residue of the resultant sum by a prime 13 as the output from Q. The sum of inputs may not be dealt as the numeral in Zp but as a 4-bit binary number, as required. Accordingly,
In the case of h=4, p=13, the numerals a, b and the binary representation Q of the residue establish relations therebetween shown in Expression 30.
a=A
0
+A
12+A222+A323
b=B
0
+B
12+B222+B323
a+b≡Q(mod 13)
Q=Q
0
+Q
12+Q222+Q323 [Expression 30]
The “4 bit AD mod 13” circuit block and the “4 bit AD (16) mod 13” circuit block comprise a PF0 generator unit U1, 4 half adders HA1-HA4, and 4 full adders FA1-FA4.
The PF0 generator unit U1 includes serially connected PMOS transistors QP0, QP1, QP2 and NMOS transistors QN1-QN3 between the Vcc terminal supplied with a certain voltage and the Vss terminal. These transistors QP0, QP1, QP2, QN1, QN2 and QN3 are controlled by C3 or Vss, S1, S0, S0, S2 and S3, respectively.
The PF0 generator unit U1 additionally includes 2 PMOS transistors QP3, QP4, 2 NMOS transistors QN4,QN5, and an inverter IV1.
The transistors QP3, QP4 are connected between the source of the transistor QP1 and the drain of the transistor QP2 in parallel. The transistor QN4 is connected between the source and drain of the transistor QN1 in parallel. The transistor QN5 is connected between the source of the transistor QN1 and the drain of the transistor QN3 in parallel. These transistors QP3, QP4, QN4 and QN5 are controlled by S2, S3, S1, and C3 or Vss, respectively. The inverter IV1 has an input connected to the sources of the transistors QN1, QN4 and QN5. The output from the inverter IV1 provides a carry PF0. Thus, the input of Vss instead of C3 realizes the “4 bit AD(16) mod 13” circuit block.
The half adder HA1 has inputs A0 and B0, an output S0, and a carry output C0′. The full adder FA1 has inputs A1 and B1, a carry input C0′, an output S1, and a carry output C1′. The full adder FA2 has inputs A2 and B2, a carry input C1′, an output S2, and a carry output C2′. The full adder FA3 has inputs A3 and B3, a carry input C2′, an output S3, and a carry output C3′. The half adder HA2 has inputs S0 and PF0, and an output Q0, and a carry output C0. The full adder FA4 has inputs S1 and PF0, a carry input C0, an output Q1, and a carry output C1. The half adder HA3 has inputs C1 and S2, an output Q2, and a carry output C2. The half adder HA4 has inputs C2 and S3, and an output Q3.
In accordance with the above configuration, the PF0 generator unit U1 determines if the sum of binaries A and B input to the “4 bit AD mod 13” circuit block is equal to or higher than 13 and if the sum of the numerals input, not as the numerals in Zp but as the 4-bit binary numbers, to the “4 bit AD(16) mod 13” circuit block is equal to or higher than 13. If the sum of binaries A and B is equal to or higher than 13, then the half adders HA1-HA3 and the full adders FA1-FA3 are used to add 3, a complement of 13 in 4-bit binary, to the sum of A and B in order to subtract 13 from the sum of A and B.
An example is described next on the circuitry of the encoder unit 221.
As shown in
Among these clocks ck and cl, ck is applied to control a “Counter (1 to 12)” circuit block and a “Ri (1-12)” register unit, and cl to control a “Ro (0-8)” register unit, an “X k-times” circuit block and a “Rgstr” register.
The “Counter (1 to 12)” circuit block is a circuit having an initial state of 0. It starts counting the number of clocks again at every rise of the clock ck and provides the number. Namely, of ckj (j=1-12), it provides j to the “X k-times” circuit block at the rise of ckj.
The “Ri (1-12)” register unit includes registers operative to store components cj of the code word C and capable of storing 12 numerals in total. The “Ri (1-12)” register unit stores numerals in the individual registers sequentially in sync with the timing of the rise of ck. Namely, data, that is, the element cj is taken into the register at the timing of the rise of ckj+1. At the rise of ck13, 12 elements cj are taken in the registers. In a word, the code C can be thus stored.
The “X k-times” circuit block is a circuit that multiplies the output by the input at every rise of the clock cl. The “X k-times” circuit block multiplies the output by the input j at every rise of cl, 9 times in total. Namely, the rises of cli (i=0-8) bring the output from the “X k-times” circuit block to (j)i+1. This output is fed to the “X Zp” circuit block.
The “Ro (0-8)” register unit includes registers capable of storing 9 numerals, and stores 9 components a0-a8 of the code A in the initial state. The “Ro (0-8)” register unit receives the clock cl, and provides the components a0-a8 of the code A in order at every rise of the clock cl. Namely, on receipt of cli (i=0-8), it provides ai.
The “X Zp” circuit block is a circuit operative to execute a multiplication of the input in Zp. The “X Zp” circuit block receives the output (j)i+1 from the “X k-times” circuit block and the output ai from the “Ro (0-8)” register unit at every clock cli, and provides (j)i+1ai. The output numeral (j)i+1ai is summed in a combination of the “4 bit AD mod 13” circuit block and the “Rgstr” register.
The “4 bit AD mod 13” circuit block is a circuit that seeks the sum of 2 input numerals modulo 13. On the other hand, the “Rgstr” register is a register having an initial state of 0. It blocks any input from the “4 bit AD mod 13” circuit block at every input of the clock cl and feeds the self-holding contents to the “4 bit AD mod 13” circuit block. The connection of the “4 bit AD mod 13” circuit block with the “Rgstr” register as shown in
The “X k-times” circuit block shown in
The “X k-times” circuit block is a circuit operative to compute a power (X)j of the input X, which is controlled by the clock clj (j=1-12) as shown in
The “X k-times” circuit block includes an “X Zp” circuit block, and a “Rgstr” register <1> and a “Rgstr” register <2> operable in sync with the clock cl as shown in
The “Rgstr” register <1> has an input connected to X, and an output connected to one output of the “X Zp” circuit block. The “Rgstr” register <2> has an input connected to the output of the “X k-times” circuit block, and an output connected to one input of the “k-times” circuit block. The “Rgstr” register <2> holds ‘1’ as the initial state.
In accordance with this circuitry, the “X k-times” circuit block takes its own output with a 1-cycle delay to obtain the product of the input X and the output (X)j.
The output (X)j is multiplied cumulatively by the input X at every input of the clock cl. The data X is set in the “Rgstr” register <1> before the clock clj rises, and the “Rgstr” register <2> having an initial state of ‘1’ is brought into sync therewith, thereby obtaining (X)j at the j-th clj.
The circuitry of the syndrome generator unit 222 is described below.
A circuit block is described first, which is frequently used in subsequent operational processing, including the syndrome generator unit 222, for computing the m-th power of an element j in Zp over all the elements j in batch. Hereinafter, this circuit block is referred to as a “(j)i (j=1 to 12)” circuit block.
The “(j)i (j=1 to 12)” circuit block is controlled by the clocks cki (i=0-11) and clj (j=1-12), and provides (j)i and (j)i+1 in sync with the rise of the clock clj.
The “(j)i (j=1 to 12)” circuit block is a circuit that computes the 0-th to 11-th powers of all the elements 1-12 in Zp other than zero in order, and holds them in registers.
The “(j)i (j=1 to 12)” circuit block includes an “X Zp” circuit block, a “Counter (1 to 12)” circuit block, and a “R (1-12)” register unit as shown in
The “Counter (1 to 12)” circuit block is connected to one input of the “X Zp” circuit block. It uses cki as the start signal and counts up at the timing of the rise of clj within a range of 1-12.
The “R (1-12)” register unit includes 12 registers. It stores the inputs i1-i12 in No. 1-12 registers in order at the rise of the clock /clj at in, and provides the contents i1-i12 of No. 1-12 registers in order at the rise of the clock clj at out.
As shown in
Next, an example is described on the circuitry of the syndrome generator unit 222, that is, an operating circuit operative to seek component elements of the syndrome S in batch. Hereinafter, this operating circuit is referred to as a “syndrome component element generating circuit”.
The syndrome component element generating circuit is configured based on the equations shown in Expression 31.
The syndrome component element generating circuit includes, as shown in
The code Y readout of the memory device 100 is converted at the “p-adic < > MLC binary mapping” circuit block 230 and stored in the “Ro (1-12)” register as the initial setting. The “(j)i (j=1 to 12)” circuit block generates (j)i. The “Ro (1-12)” register and the “(j)i (j=1 to 12)” circuit block are synchronized with each other by the clock cli and controlled so that (j)i is provided at the same time as the output of the component yj of Y. The “X Zp” circuit block yields the product thereof, (j)iyj, which is summed, in sync with the clock clj (j=1 to 12), through the loop including the “4 bit AD mod 13” circuit block and the “Rgstr” register, to create a syndrome component Si. The resultant Si is stored in the i-th register in the “Ri (0-2)” register at the clock cki+1. This process is treated at the clock cki for i=0-2 to obtain all syndrome components, which are then stored in the “Ri (0-2)” register unit.
The decision on the success/fail of error searching for the code Y requires derivation of Lee metrics |S0| of syndrome vector components S0.
Therefore, the following description is given to an operational circuit element operative to compute a Lee metric of the element in Zp. Hereinafter, this operational circuit element is referred to as a “4 bit LM 13” circuit block.
As for an element a in Zp represented in 4-bit binary, the associated Lee metric Q=|a| can be expressed by Q=/PF0×a+PF0×(13−a). Here, PRO becomes ‘1’ if a ≧7, and ‘0’ if a <7. Therefore, a derivation of the Lee metric of a in the case of a ≧7 just requires a subtraction of a from 13, that is, an addition of the complement of a to 13.
A relation between A and Q in the case of h=4, p=13 is as in Expression 32.
a=A
0
+A
12+A222+A323
Q=|a|(Q=/PF0×a+PF0×(13−a))
Q=Q
0
+Q
12+Q222+Q323 [Expression 32]
The “4 bit LM 13” circuit block receives a 4-bit binary A0-A3, and provides a 4-bit binary Q0-Q3.
The “4 bit LM 13” circuit block comprises a PF0 generator unit U1, an XOR gate G1, a half adder HA1, and 3 full adders FA1-FA3.
The PF0 generator unit U1 includes, between the Vcc terminal and the Vss terminal, serially connected PMOS transistors QP1, QP2 and NMOS transistors QN1-QN3. These transistors QP1, QP2, QN1, QN2 and QN3 are controlled by A3, A0, A0, A1, and A2, respectively.
The PF0 generator unit U1 also includes 2 PMOS transistors QP3, QP4, an NMOS transistor QN4, and an inverter IV1.
The transistors QP3, QP4 are connected between the source and drain of the transistor QP2. The transistor QN4 is connected between the source of the transistor QN1 and the drain of the transistor QN3 (Vss terminal). These transistors QP3, QP4 and QN4 are controlled by A1, A2 and A3, respectively. The inverter IV1 has an input connected to the sources of the transistors QN1 and QN4. The output from the inverter IV1 provides a carry PF0.
The XOR gate G1 receives A (j=0-3) and PF0, and provides Bj.
The full adder FA1 has inputs B0 and PF0, a carry input PF0, an output Q0, and a carry output C0. The half adder HA1 has inputs C0 and B1, an output Q1, and a carry output C1. The full adder FA2 has inputs B2 and PF0, a carry input C1, an output Q2 and a carry output C2. The full adder FA3 has inputs B3 and PF0, a carry input C2, an output Q3.
If the input reaches 7 or above, the complement of a is added to 13. The complement of a in the case of PF0=1 is created at the XOR gate G1 by inverting Aj, that is, each bit representation of a, to yield B1, and adding 1 thereto.
As p=13 is 13=(1101)2, this is represented by PF0, and further PF0 is used as 1, then these are added to Bj as the sum of binaries.
The “4 bit LM 13” circuit block operates not in sync with the clocks and, on receipt of inputs, provides the computed Lee metric. If the Lee metric is equal to or larger than 3, solution searching is halted.
The configuration of the solution searching polynomial generator unit 223 is described below.
First, an operating circuit for seeking a solution searching polynomial Ψ(x) is described. This operating circuit is referred to as a “solution searching polynomial generating circuit”.
A processing equation is shown in Expression 33, which is required in operational processing for deriving a coefficient φj at each degree j of x in the solution searching polynomial Ψ(x).
The solution searching polynomial generating circuit includes a first part unit U1 operative to derive the 1st order coefficient φ1, and a second part unit U2 operative to derive the 2nd order coefficient φ2.
The first part unit U1 includes an inverter IV1 operative to receive S1, and a “4 bit AD(16) mod 13” circuit block <1> operative to receive the output from the inverter IV1 and 14 (=p+1).
As the 1st order coefficient is φ1=−S1, it is sufficient to derive a complement of S1 in a 4-bit representation. The derivation of the complement in 4-bit binary makes it possible to represent a negative numeral in Zp. Therefore, the first part unit U1 uses the inverter IV1 to invert S1, to which 14 is added at the “4 bit AD(16) mod 13” circuit block <1>, so that the output from the “4 bit AD(16) mod 13” circuit block <1> can provide φ1.
The first part unit U1 is configured based on a computational equation for the input A shown in Expression 33.
−A≡p−A=p+A*(mod 2h)=p+1+/A(mod 2h)≡p+1+/A(mod 2h) [Expression 33]
where A* indicates a complement of A in h bits, /A indicates an inverted number of A in h bits, and ≡ indicates the sum mod p.
The first part unit U1 embodies p+1+/A (mod 2h) in Expression 33. Therefore, for computation of the sum of 4-bit numbers mod 13 at the “4 bit AD(16) mod 13” circuit block <1>, it receives the output from the inverter IV1 corresponding to /A, and 14 corresponding to p+1.
The second part unit U2 comprises an “X Zp” circuit block <1> operative to receive S1 and φ1; a “4 bit AD mod 13” circuit block <2> operative to receive the output from the “X Zp” circuit block <1> and S2; an “X Zp” circuit block <2> operative to receive the inverse element of 2, that is, 7 (=2−1) and the output from the “4 bit AD mod 13” circuit block <2>, that is, φ1S1+S2; an inverter IV2 operative to receive the output from the “X Zp” circuit block <2>; and a “4 bit AD(16) mod 13” circuit block <3> operative to receive the output from the inverter IV2 and 14 (=p+1). The output from the “4 bit AD(16) mod 13” circuit block <3> provides φ2.
The second part unit U2 realizes φ2=−(φ1S1+S2)/2 faithfully using the “X Zp” circuit blocks <1>, <2>, the “4 bit AD mod 13” circuit block <2>, and the “4 bit AD(16) mod 13” circuit block <3>. The inverse element of 2 is equal to 7 and thus a division by 2 turns to a multiplication by 7.
All the circuits contained in the solution searching polynomial generating circuit are asynchronous circuits in simple circuitry using no clocks.
The Euclidean iteration process ing unit 224 is described below.
First, as the premise of the use of a Euclidean method for deriving an error searching polynomial from Ψ(x) and x3, the Euclidean method is described.
Below described is a method for deriving polynomials λ(x) and ν(x) that satisfy a congruence equation, ν(x)Ψ(x)≡λ(x) (mod x3), using the Euclidean iterative method, in accordance with the stop condition, deg λ(x)−deg ν(x)≡S0 (mod 13). Among the polynomials λ(x) and ν(x) derived, one that satisfies deg λ(x)+deg ν(x)<3 is the error searching polynomial.
The Euclidean iterative method is a method for deriving functions f0, f1, . . . , fn in turn using divisions of polynomials. These quantities have relations therebetween shown in Expression 34.
In this case, increases in n inevitably find fn having a smaller degree than the previous degrees at a certain n as shown in Expression 34. This fact leads to the condition indicative of the presence of the possibility of deriving a solution that satisfies the stop condition on the Euclidean iterative method.
In particular, with sequential introductions of pn and qn from the quotient polynomials kn obtained in the process of divisions, these polynomials can satisfy simple relations. Therefore, fn can be represented by f0, f1, pn−1 and qn−1 as fn=(−1)n (qn−1f0−pn−1f1) such that as the degree of fn decreases, the degrees of pn−1 and qn−1 increase. The degree of pn−1 increases as the degree of fn decreases in this way, and accordingly the difference between degrees decreases inevitably to satisfy the stop condition.
Generally, in the Euclidean method, fn=knfn+1+fn+2 utilizes the sequential decreases in degree in relation to divisions while kn assumes the 1st or 0-th degree equation here and accordingly the degree cannot decrease always. Rather, the degree may increase but cannot exceed the degree of f0. In any way, if the relation about the factorization of the above polynomial Ψ(x) establishes, the relation shown in Expression 34 can establish. Therefore, it is possible to finally lower the degree to satisfy the stop condition on the iteration.
Therefore, when substitutions of f0=x3, f1=Ψ(x) are made to create a congruence equation at x3, the stop condition on the iteration comes to deg fn−deg pn−1≡S0 (mod 13). With respect to n that satisfies this stop condition, if deg fn+deg pn−1<3, then substitutions of λ(x)=fn, ν(x)=pn−1 are possible.
On the other hand, if it is not possible to achieve such the degree relation, it means the occurrence of an error that cannot satisfy the error-correctable condition.
An example is described next on the circuitry of the Euclidean iteration processing unit 224.
Described first is a circuit operative to derive kn in computation of fn=knfn+1+fn+2. Hereinafter, this circuit is referred to as a “kn” circuit block.
The “kn” circuit block is configured based on the relational equations shown in Expression 35.
f
n
=k
n
f
n+1
+f
n+2
f
n
=a
(n)
0
+a
(n)
1
x+a
(n)
2
x
2
+a
(n)
3
x
3
k
n
=b
(n)
0
+b
(n)
1
x
b
(n)
1
=a
(n)
m
/b
(n+1)
m−1
,b
(n)
0=(a(n)m−1−b(n)1a(n+1)m−2)/a(n+1)m−1 [Expression 35]
The “kn” circuit block computes the relation between coefficients, b(n)1=a(n)m/b(n+1)m−1, b(n)0=(a(n)m−1−b(n)1a(n+1)m-2)/a(n+1)m−1.
The “kn” circuit block receives a(n)m−1 and a(n)m, and provides a(n+1)m−1, a(n+1)m−2, b(n)1, and b(n)0 as shown in
The “kn” circuit block includes 3 “X Zp” circuit blocks <1>, <2>, <3>, a “4 bit AD(16) mod 13” circuit block <1>, a “4 bit AD mod 13” circuit block <2>, a “j−1 dec” circuit block, an XNOR gate G1, and an OR gate G2.
The “X Zp” circuit block <1> computes the product of a(n+1)m−2 and b(n)1. The result b(n)1a(n+1)m−2 is fed to the “4 bit AD(16) mod 13” circuit block <1> via the inverter IV1.
The “4 bit AD(16) mod 13” circuit block <1> receives the output from the inverter IV1 and 14 (=p+1) to obtain a complement of b(n)1a(n+1)m−2.
The “4 bit AD mod 13” circuit block <2> receives the output from the “4 bit AD (16) mod 13” circuit block <1> and a(n)m−1, and provides the result, a(n)m−1−b(n)1a(n+1)m−2, to the “X Zp” circuit block <2>.
The “j−1 dec” circuit block receives a(n+1)m−1, and provides the inverse element thereof, (a(n+1)m−1)−1, to the “X Zp” circuit blocks <2> and <3>.
The “X Zp” circuit block <2> computes the product of the output, a(n)m−1−b(n)1a(n+1)m−2, from the “4 bit AD mod 13” circuit block <2> and the output, (a(n+1)m−1)−1, from the “j−1 dec” circuit block to obtain the result b′(n)0.
The “X Zp” circuit block <3> computes the product of a(n)m and the output, (a(n+1)m−1)−1, from the “j−1 dec” circuit block to obtain the result b(n)1, that is, the output from the “kn” circuit block.
The XNOR gate G1 receives the output, b′(n)0, from the “X Zp” circuit block <2> and the output, b(n)1, from the “X Zp” circuit block <3>. The OR gate G2 receives the output from the XNOR gate G1 and b′(n)0. In this case, the output from the OR gate G2 turns to the output, b(n)0, from the “kn” circuit block.
The “kn” circuit block applies the relations between coefficients shown in Expression 35 to derive an unknown coefficient from the known coefficients. On the assumption that the polynomial of kn is at the 1st degree, and that the coefficient is determined 0 in the process of computation if the coefficient at the highest degree becomes 0 as well, the process advances the procedure of computation sequentially. At this time, the inverse element of 0 is determined 0. In addition, the logics of the XNOR gate G1 and the OR gate G2 are introduced to deal kn as 1 to prevent kn from coming to zero in relation to the degrees of fn and fn+1, so that the computation advances without entering the repetition loop. At this time, the degree of fn+1 may increase in the process of computation though the degree can be made lower than the previous degrees before long because the degree is lowered 1 by 1 in and after the next cycle.
The following description is given to a circuit operative to derive fn+2 in computation of fn=knfn+1+fn+2. Hereinafter, this circuit is referred to as an “fn” circuit block.
The “fn” circuit block is configured based on the relational equations shown in Expression 36.
f
n
=k
n
f
n+1
+f
n+2
f
n+2
=f
n
−k
n
f
n+1
,k
n
=b
(n)
0
+b
(n)
1
x
f
n+2
=f
n−(b(n)0fn+1+b(n)1xfn+1)
a
(n+2)
m
=a
(n)
m−(b(n)0a(n+1)m+b(n)1a(n+1)m−1) [Expression 36]
The “fn” circuit block applies the relations between coefficients shown in Expression 36 to derive an unknown coefficient from the known coefficients. It utilizes the already-obtained coefficients of the polynomial of kn. The “fn” circuit block computes a(n+2)m=a(n)m−(b(n)0a(n+1)m+b(n)1a(n+1)m−1).
The “fn” circuit block receives a(n+1)m−1, a(n+1)m, a(n)m, b(n)1 and b(n)0, and provides a(n+2)m as shown in
The “fn” circuit block includes 2 “X Zp” circuit blocks <1>, <2>, 2 “4 bit AD mod 13” circuit blocks <1>, <3>, and a “4 bit AD (16) mod 13” circuit block <2>.
The “X Zp” circuit block <1> computes the product of a(n+1)m−1 and b(n)1, and provides the result, b(n)1a(n+1)m−1, to the “4 bit AD mod 13” circuit block <1>.
The “X Zp” circuit block <2> computes the product of a(n+1)m and b(n)0, and provides the result, b(n)0a(n+1)m, to the “4 bit AD mod 13” circuit block <1>.
The “4 bit AD mod 13” circuit block <1> receives the output, b(n)1a(n+1)m−1, from the X Zp” circuit block <1> and the output, b(n)0a(n+1)m, from the X Zp” circuit block <2>. The output from the “4 bit AD mod 13” circuit block <1> is fed to the “4 bit AD (16) mod 13” circuit block <2> via the inverter IV1.
The “4 bit AD (16) mod 13” circuit block <2> receives the output from the inverter IV1 and ‘1’ to obtain a complement of (b(n)0a(n+1)m+b(n)1a(n+1)m−1).
The 4 bit AD mod 13” circuit block <3> receives a(n)m and the output from the “4 bit AD (16)mod 13” circuit block <2> to obtain a(n+2)m, that is, the output from this “fn” circuit block.
The following description is given to a circuit operative to derive pn−2 in computation of pn=kn−1pn−1+pn−2. Hereinafter, this circuit is referred to as a “pn” circuit block.
The “pn” circuit block is configured based on the relational equations shown in Expression 37.
p
n
=k
n−1
p
n−1
+p
n−2
p
n
=a
(n)
0
+a
(n)
1
x+a
(n)
2
x
2
k
n
=b
(n)
0
+b
(n)
1
x
p
n+1=(b(n)0pn+b(n)1xpn)+pn−1
a
(n+1)
m=(b(n)0a(n)m+b(n)1a(n)m−1)+a(n−1)m [Expression 37]
The “pn” circuit block applies the relations between coefficients shown in Expression 37 to derive an unknown coefficient from the known coefficients. It utilizes the already-obtained coefficients of the polynomial of kn. The “pn” circuit block computes the relation between coefficients, a(n+1)m=(b(n)0a(n)m+b(n)1a(n)m−1)+a(n−1)m.
The “pn” circuit block receives a(n)m−1, a(n)m and a(n−1)m, and provides a(n+1), as shown in
The “pn” circuit block includes 2 “X Zp” circuit blocks <1>, <2>, and 2 “4 bit AD mod 13” circuit blocks <1> and <2> as shown in
The “X Zp” circuit block <1> computes the product of a(n)m−1 and b(n)1, and provides the result, b(n)1a(n)m−1, to the “4 bit AD mod 13” circuit block <1>.
The “X Zp” circuit block <2> computes the product of a(n)m and b(n)0, and provides the result, b(n)0a(n)m, to the “4 bit AD mod 13” circuit block <1>.
The “4 bit AD mod 13” circuit block <1> receives the output, b(n)1a(n)m−1, from the X Zp” circuit block <1> and the output, b(n)0a(n)m, from the X Zp” circuit block <2>, and provides b(n)0a(n)m+b(n)1a(n)m−1 to the “4 bit AD mod 13” circuit block <1>.
The “4 bit AD mod 13” circuit block <2> receives a(n−1)m and the output, b(n)0a(n)m+b(n)1a(n)m−1, from the “4 bit AD mod 13” circuit block <1> to obtain a(n+1)m, that is, the output from the “pn” circuit block.
Subsequently, a circuit for computing fn=knfn+1+fn+2 is described. Hereinafter, this circuit is referred to as an “fn=knfn+1+fn+2” circuit block.
The “fn=knfn+1+fn+2” circuit block receives the coefficients a0-a3 of fn and the coefficients b0-b2 of fn+1, and provides the coefficients c0-c3 of fn+2 and the coefficients β0-β1 of kn as shown in
The “kn” circuit block <1> associates ‘0’, a0, b0 and a1 with a(n+1)m−2, a(n)m−1, a(n+1)m−1 and a(n)m, and associates β(0)0 and β(0)1 with b(n)0 and b(n)1, respectively.
The “kn” circuit block <2> associates b0, a1, b1 and a2 with a(n+1)m−2, a(n)m−1, a(n+1)m−1, and a(n)m, and associates β(1)0 and β(1)1 with b(n)0 and b(n)1, respectively.
The “kn” circuit block <3> associates b1, a2, b2 and a3 with a(n+1)m−2, a(n)m−1, a(n+1)m−1, and a(n)m, and associates β(2)0 and β(2)1 with b(n)0 and b(n)1, respectively.
The “kn” circuit block <4> associates b2, a3, b3 and ‘0’ with a(n+1)m−2, a(n)m−1, a(n+1)m−1 and a(n)m, and associates β(3)0 and β(3)1 with b(n)0 and b(n)1, respectively.
The outputs, β(m)0 and β(m)1 (m=0-3), from the “kn” circuit blocks <1>-<4> are provided as β0 and β1 from the “fn=knfn+1+fn+2” circuit block via the control switches SW controlled by deg (b1, b2, b3)=m, respectively. The β0 and β1 can be utilized also as the inputs to the “fn” circuit blocks <1>-<4>. Here, deg( ) represents the output from the degree decision circuit associated with fn+1.
The “fn” circuit block <1> receives ‘0’, b0, a0, β0 and β1 as a(n+1)m−1, a(n+1)m, a(n)m, b(n)0 and b(n)1, and provides c0 as a(n+2)m.
The “fn” circuit block <2> receives b0, b1, a1, β0 and β1 as a(n+1)m−1, a(n+l)m, a(n)m, b(n)0 and b(n)1, and provides c1 as a(n+2)m.
The “fn” circuit block <3> receives b1, b2, a2, β0 and β1 as a(n+1)m−1, a(n+1)m, a(n)m, b(n)0 and b(n)1, and provides c2 as a(n+2)m.
The “fn” circuit block <4> receives b2, b3, a3, β0 and β1 as a(n+1)m−1, a(n+1)m, a(n)m, b(n)0 and b(n)1, and provides c3 as a(n+2)m.
The outputs from the “fn” circuit blocks <1>-<4> turn to c0-c3, that is, the output from the “fn=knfn+1+fn+2” circuit block.
If one of more of the inputs to the “kn” circuit blocks <1>-<4> and the “fn” circuit blocks <1>-<4> have no corresponding value, ‘0’ is given instead.
The computation results from the “kn” circuit blocks <1>-<4> are switched in accordance with the degree of the polynomial fn+1 representative of the coefficient of the input bn, and utilized as β inside the “fn=knfn+1+fn+2” circuit block and provided. Namely, β output from the “kn” circuit block related to the highest degree of fn+1 is utilized.
Subsequently, a circuit for computing pn+1=knpn+pn−1 is described. Hereinafter, this circuit is referred to as a “pn+1=knpn+pn−1” circuit block.
The “pn+1=knpn+pn−1” circuit block receives the coefficients β0, β1 of kn, the coefficients a0-a2 of pn−1, and the coefficients b0-b2 of pn, and provides the coefficients c0-c2 of pn+1 as shown in
As computations are started with substitutions of p−1=0, p0=1, and for the purpose of utilizing pn+1 mod x3, the maximum degree of pn+1 is 2. Therefore, the “pn+1=knpn+pn−1” circuit block includes 3 “kn” circuit blocks <1>-<3>.
The “pn” circuit block <1> receives ‘0’, b0, a0, β0 and β1 as a(n)m−1, a(n)m, a(n−1)m, b(n)0 and b(n)1, and provides c0 as a(n+1)m, respectively.
The “pn” circuit block <2> receives b0, b1, a1, β0 and β1 as a(n)m−1, a(n)m, a(n−1)m, b(n)0 and b(n)1, and provides c1 as a(n+1)m, respectively.
The “pn” circuit block <3> receives b1, b2, a2, β0 and β1 as a(n)m−1, a(n)m, a(n−1)m, b(n)0 and b(n)1, and provides c2 as a(n+1)m, respectively.
The outputs from the “pn” circuit blocks <1>-<3> are provided as c0-c2, that is, the output from the “pn+1=knpn+pn−1” circuit block.
Subsequently, computing circuits are described for deriving fn and pn−1 through iterative computations from 0 to n.
The computing circuit operative to derive fn includes 3 “16 register” units <1>-<3>, as shown in
The computing circuit operative to derive pn+1 includes “12 register” units <1>-<3>, as shown in
The computations of fn and pn−1 advance at every cycle of the clock cl and the number of cycles of the clock cl counted from 0 corresponds to n of the Euclidean iteration. Finally, the “16 register” unit <1> holds fn, and the “12 register” unit <1> holds pn−1.
In the first clock cycle cl0, the “16 register” units <1>-<3> and the “12 register” units <1>-<3> are set to have such initial values as shown in
Namely, in the computing circuit operative to derive fn, the coefficient of f0=x3 is set in the “16 register” unit <1>, and the coefficient of f1=Ψ(x) in the “16 register” unit <3>. In the computing circuit operative to derive pn−1, p−1=‘0’ is set in the “12 register” unit <1>, and p0=‘1’ in the “12 register” unit <3>. Thereafter, as the clock cycle advances, fn and pn−1 are set in the corresponding register units at the start of the cln cycle.
The “16 register” unit <1> and the “12 register” unit <1> are prevented from receiving the inputs by a signal /ISTOP. Accordingly, data in the “16 register” unit <1> and the “12 register” unit <1> can be determined at that moment. The signal /ISTOP is a signal that is activated if the later-described stop condition on the Euclidean iterative method is satisfied.
Thus, the “16 register” unit <1> and the “12 register” unit <1> can obtain the coefficients of the polynomials as desired.
These coefficients can be fed to the first Hasse differential polynomial generator unit 225 and the second Hasse differential polynomial generator unit 226 to seek the root and the multiplicity of the Hasse differential polynomial.
For the purpose of seeking the signal /ISTOP shown in
The “DEG” circuit block receives a1-a3, and provides b0, b1, that is, a binary representation of the degree as shown in
The “DEG” circuit block includes 4-bit registers <1>-<3>, OR gates G0<1>-<3>, G1<1>-<3>, and an “A to deg DEC.” circuit block as shown in
The 4-bit registers <1>-<3> each hold 3-bit binary data indicative of the coefficients, a1-a3, of the 3rd degree polynomial. The 4-bit binary data indicative of the coefficients a1-a3 are fed to the OR gates G0<1>-<3>, respectively.
The OR gate G0<n> (n=1-3) receives 3-bit binary data indicative of an. The OR gate G0<1> provides ‘0’ if an=0 and ‘1’ if an≠0. In a word, the OR gate G0<n> determines whether an is equal to 0 or other than 0.
The OR gate G1<m> (m=1, 2) receives the outputs from the OR gates G0<m>-<3>. The outputs from the OR gates G1<1>,<2> turn to A1, A2, respectively. In addition, the output from the OR gate G0<3> directly turns to A3. These A1-A3 are fed to the “A to deg DEC.” circuit block.
The “A to deg DEC.” circuit block is a decoder circuit capable of realizing the associated relations between A1-A3 and binary representations of the degrees as shown with T1 in
The “A to deg DEC.” derives b0, b1, that is, the output from the “DEC” circuit block, fromA1-A3 based on the association table shown with T1 in
The final description is given to a circuit operative to decide if the key condition on the Lee metric code is satisfied to stop the iterations, and if the error correctable condition is satisfied, based on fn and pn−1 obtained through the Euclidean iterative method.
This circuit includes 2 “DEG” circuit blocks <1>, <2>, a “4 bit AD(16) mod 13” circuit block <1>, a “4 bit AD mod 13” circuit block <2>, and a part circuit U1 composed of plural logic gates.
The “DEG” circuit block <1> receives fn as a1-a3, and provides deg fn, that is, a binary representation of fn, as b0, b1. This deg fn is fed to the “4 bit AD mod 13” circuit block <2>.
The “DEG” circuit block <2> receives pn−1 as a1-a3, and provides deg pn−1, that is, a binary representation of pn−1, as b0, b1. This deg is fed to the “4 bit AD(16) mod 13” circuit block <1> via the inverter IV1.
The “4 bit AD(16) mod 13” circuit block <1> derives a complement of deg pn−1 from the input, deg pn−1, fed via the inverter IV1 and ‘1’. The complement of deg pn−1 is fed to the “4 bit AD mod 13” circuit block <2>.
The stop condition on the Euclidean iterative method is deg fn-deg pn−1≡S0 (mod 13). Therefore, the “4 bit AD mod 13” circuit block <2> receives deg fn and a complement of deg pn−1 to obtain deg fn-deg pn−1 as a numeral in Zp. This result is fed to the part circuit U1 for comparison with S0 at every bit of the binary representation.
For the purpose of comparing deg fn−deg pn−1 with S0 at every bit, the part circuit U1 computes the exclusive logical sum at every bit. If all bits (deg fn−deg pn−1)j (j=0-3) match (S0)j, the part circuit U1 provides ‘0’ as the signal /ISTOP. In a word, the signal /ISTOP is (deg fn−deg pn−1)0(+) (S0)0□ . . . □ (deg fn−deg pn−1)m(+) (S0)m□ . . . □ (deg fn−deg pn−1)h−1(+) (S0)h−1. Thus, the part circuit U1 can realize if deg fn−deg pn−1 matches S0 or not.
This circuit includes a NAND gate G1, a “4 bit AD mod 13” circuit block, a “4 bit 3≦” circuit block later-described, and an NOR gate G2.
The NAND gate G1 is arranged to provide for the case where the Euclidean iterative method fails to satisfy the stop condition to stop. On the condition that the degree of fn is 1, the degree of fn+1 is 0, and the signal /ISTOP is 1, that is, (deg fn=1) □ (deg fn+1=0) □ (/ISTOPn=1), the NAND gate G1 provides ‘1’ as a signal Ifail.
It is determined based on deg fn+deg pn−1<3 if the error is correctable when the Euclidean iterative method stops.
Therefore, the “4 bit AD mod 13” circuit block is given deg fn and deg pn−1 to derive the sum thereof, deg fn+deg pn−1. This sum, deg fn+deg pn−1, is fed to the “4 bit 3≦” circuit block.
The “4 bit 3≦” circuit block is a circuit operative to determine if the input is equal to or higher than 3 or not. The “4 bit 3≦” circuit block provides ‘1’ as a signal gtn3 if the input, deg fn+deg pn−1, supplied from the “4 bit AD mod 13” circuit block is equal to or higher than 3.
The details of the “4 bit 3≦” circuit block are omitted from the following description though they can be configured similar to the PF0 generator unit U1 in the “4 bit AD mod 13” circuit block shown in
The OR gate G2 receives the output, Ifail, from the NAND gate G1 and the output, gtn3, from the “4 bit 3≦” circuit block and, if Ifail □ gtn3, it provides NG signal indicative of the impossibility of error correction through the Euclidean iterative method.
Thus, the circuit blocks shown in
These circuit blocks utilize the inverse elements of the elements in Zp.
An example is described next on the configuration of a specific decoder.
In particular, a decoder capable of creating the inverse element of an element in Zp is highly required. Accordingly, herein described is a decoder operative to derive the inverse element j−1 of an element j in Zp.
Hereinafter, this decoder is referred to as a “j−1 dec” circuit block.
The inverse element j−1 can be derived from the element j by seeking a relation of j x j−1≡1 (mod 13), then configuring j and j−1 in 4-bit binary representations as shown in Expression 38, thereby forming a conversion decoder.
Input:Ii=(j)i
j=(j)0+(j)12+(j)222+(j)323
Output:Oi=(j−1)i
jj−1≡1(mod p)
j
−1=(j−1)0+(j−1)12+(j−1)222+(j−1)323 [Expression 38]
The “j−1 dec” circuit block can be configured by a general circuit operative to decode 4-bit binary data I to other 4-bit binary data O, setting I=j, O=j−1 as shown in Expression 38 in this case.
The “j−1 dec” circuit block includes 2 NAND gates G1, G2, and 2 NOR gates G3, G4.
First, the NAND gates G1 and G2 are used to pre-decode 4 bits I0-I3 of the input to create signals /A0-/A3, /B0-/B3. Subsequently, the NOR gate G3 is used to create OIm, that is, the input Zp represented by ‘H’ based on a combination of ‘L’ in /Am, /Bm (m=0-3). Finally, the NOR gate G4 is used to collect the inverse elements exhibiting 1 on the m bit from OIm and execute the logical sum at every m. Thus, it is possible to obtain the inverse element of the input. If the input is j=0, then j−1=0 is set.
The following description is given to the Hasse differential polynomial generator units 225 and 226.
The degree of the error searching polynomial obtained through the Euclidean iterative method is the 2nd degree at the maximum. Therefore, the coefficients of a Hasse differential polynomial for obtaining a root of the error searching polynomial and the multiplicity of the root can be derived through a simple operation.
The relation between the representation and the coefficient of a Hasse differential polynomial of φ(x) is as in Expression 39.
When the relational equations shown in Expression 39 are applied to the error searching polynomials λ(x), ν(x) obtained through the Euclidean iterative method, as φ(x)=λ(x), φ(x)=ν(x), they can be represented as in Expression 40 and Expression 41.
[λ(x)][0]=λ(x)=1+λ1x+λ2x2φ[0]0=1φ[0]1=λ1φ[0]2=λ2
[λ(x)][1]=λ1+λ2xφ[1]0=λ1φ[1]1=2λ2
[λ(x)][2]=λ2φ[2]0=λ2 [Expression 40]
[ν(x)][0]=ν(x)=1+ν1x+ν2x2φ[0]0=1φ[0]1=ν1φ[0]2=ν2
[ν(x)][1]=ν1+ν2xφ[1]0=ν1φ[1]1=2ν2
[ν(x)][2]=ν2φ[2]0=ν2 [Expression 41]
As can be found from Expression 40 and Expression 41, the coefficients of the Hasse differential polynomial can be obtained by the permutation of the coefficients obtained through the Euclidean iterative method and the multiplication by 2. These coefficients are stored in the registers.
The following description is given to an operating circuit operative to compute a root and the associated multiplicity of the solution searching polynomial φ(x). Here, φ(x) turns to the error searching polynomial λ(x) or ν(x). Hereinafter, this operating circuit is referred to as a “solution searching polynomial root/multiplicity operating circuit”.
The solution searching polynomial root/multiplicity operating circuit regards the 0-th order differential of the Hasse differential polynomial as φ(x). Unless the Hasse differential polynomial is zero for each element in Zp, the circuit shifts the computation to the next element and increases the number of differential orders while the Hasse differential polynomial keeps zero. When registers are used to hold the number of the first Hasse differential orders while the Hasse differential polynomial is not zero, for each element in Zp, the element when the number of orders is not zero indicates the root, and the number of the remaining orders indicates the multiplicity of the root. In a word, when one with non-‘0’ content is selected from the registers, the value held therein indicates the multiplicity of the root.
Specifically, when the root of the solution searching polynomial φ(x) is denoted with α, and a Hasse differential polynomial of φ(x) at the i-th order with [φ(x)][i], a relational equation is given as in Expression 42 where the multiplicity of α is denoted with n.
The solution searching polynomial root/multiplicity operating circuit seeks n that satisfies Expression 42 for each element α in Zp regardless of whether it is the root of φ(x) or not. In the case of n=0, it means that α is not a root.
The solution searching polynomial root/multiplicity operating circuit scans the elements 1-12 in Zp at the clock ck, increases the number of the Hasse differential orders at the clock cl, and seeks the value of the Hasse differential polynomial in that number of orders at the clock clk. The clock ck is generated when the computed value of the Hasse differential polynomial becomes non-zero so that the process enters the cycle for the next element in Zp.
The solution searching polynomial root/multiplicity operating circuit includes, as shown in
The “Counter (1 to 12)” circuit block is operative on receipt of the clock ck α times to generate an element α in Zp and count up from 1 to 12. The “Counter (1 to 12)” circuit block returns to the initial value of 1 always after it counts up to 12 in cyclic count.
The “Rgstr” register has an initial value of 1 and uses the output from the “X Zp” circuit block <1> to update the holding value on receipt of the clock clkj (j=0-2).
The “X Zp” circuit block <1> multiplies the output a from the “Counter (1 to 12)” circuit block by the value held in the “Rgstr” register <1> to yield αj.
The “Ro i(0-2)/j(0-2)” register unit is a register operative to provide a coefficient φ[i]j of the Hasse differential polynomial on receipt of the clocks cli and clkj.
The “X Zp” circuit block <2> multiplies the output αj from the “X Zp” circuit block <1> by the output φ[i]j from the “Ro i(0-2)/j(0-2)” register unit to yield αjφ[i]j.
Thus, when the clock clk is applied 3 times to the circuit comprising the “Counter (1 to 12)” circuit block, the “Rgstr” register <1>, the “X Zp” circuit block <1>, the “Ro i(0-2)/j(0-2)” register unit, and the “X Zp” circuit block <2>, the value of the Hasse differential polynomial can be obtained. For the purpose of simplifying the control of the clocks ck, cl and clk, the sum of the terms of the value 0 not present in this computation is computed as well. Therefore, the total sum of the clocks clk originally required is almost halved.
The value [φ(x)][i] of the differential polynomial is taken in the “Rgstr” register <1> as ‘0’ in the case of zero and ‘1’ in other cases.
The value held in the “Rgstr” register <1> is output at the timing of the clock clk0 as the clock ckα (α=1-12). The clock ckα is kept until it is reset by the clock clk2.
The “Rgstr” register <1> has an initial value of ‘1’ and accordingly the clock ck1 rises at the first clk0. The clock ck2 and so on rise at clk0 in any one cycle of clk in accordance with the computation result.
The “clock cl gen.” circuit block generates the clock cli in sync with the clock clk0 and resets it to the clock cl0 at every rise of the clock ckα.
The “Counter (0 to 2)” circuit block is reset to 0 by the clock ckα, and counts up at every input of the clock cl and outputs the order of the clock cl minus 1. This output is stored in the “Li (1-12)” register unit.
The “Li (1-12)” register unit has the inputs switchable by the clock ckα, and thus the α-th register can store the associated multiplicity.
After the “Counter (1 to 12)” circuit block completes one flow of counting up, the “Li (1-12)” register unit stores the multiplicities associated with α of the solution searching polynomial φ(x) and, if these are finite, the root a and the corresponding multiplicity can be found.
The decoder unit 228 is described below.
After error searching is finished and a Lee metric code C corrected with the error code E is obtained, it is required to restore it to the data code A in Zp. This operation corresponds to an inverse operation of C=AG with the generator matrix G though an inversion of a matrix requires a large-scale operation. Therefore, the elements of A are sequentially derived from the elements of C. The computation process is shown in Expression 43.
As shown in Expression 43, the relation cj=Σ(j)i+1ai is deformed in turn to sequentially derive a1 from a0, subsequently a2 from a1, . . . , then am. This is the principle of computation.
Both sides are multiplied by the inverse element of the power of j to yield the terms of j on all elements in Zp, which are then summed. At this time, the deformation is executed based on the fact that the sum of all elements in Zp becomes zero.
Expression 44 shows a relational equation of the components of C and the components of A.
The following description is given to a specific operating circuit realized using the relational equation shown in Expression 44.
When seeking am after am−1 is obtained, the parallel use of clocks is executed by computing the sum on j at am in order while seeking c(m−1)j.
Next, a circuit for converting the component of the Lee metric code C every time is described based on c(m−1)j=c(m−2)j−(j)mam−1 shown in Expression 44. Hereinafter, this circuit is referred to as a “c(m−1)j” circuit block.
The “c(m−1)j” circuit block is a circuit operable in sync with the clocks clj, /clj. It receives (j)m, am−1 and the code components c1-c16, and provides c(m−1)j.
The “c(m−1)j” circuit block includes an “X Zp” circuit block, a “4 bit AD(16) mod 13” circuit block <1>, a “4 bit AD mod 13” circuit block <2>, and a “R (1-12)” register unit.
The “X Zp” circuit block yields the product of (j)m and am−1, and provides it to the “4 bit AD(16) mod 13” circuit block <1> via the inverter IV1.
The “4 bit AD(16) mod 13” circuit block <1> seeks a complement of (j)mam−1, that is, the output fed via the inverter IV1 to yield −jmam−1, and provides it to the “4 bit AD mod 13” circuit block <2>.
The “4 bit AD mod 13” circuit block <2> yields the sum of −jmam−1 output from the “4 bit AD (16) mod 13” circuit block <1> and cj (=c(m−2)j) output from the “R (1-12)” register unit in sync with the clock cl. This sum turns to c(m−1)j, that is, the output from the “c(m−1)j” circuit block. The output c(m−1)j from the “4 bit AD mod 13” circuit block <2> is recorded in the j-th register in the “R (1-12)” register unit in sync with the fall of the clock cl.
The subsequent description is given to a circuit for creating am based on am=12−1Σ(jm+1)−1c(m−1)j (j=1-12) shown in Expression 44. Hereinafter, this circuit is referred to as an “am” circuit block.
The “am” circuit block is operable in sync with the clock clj. It receives (j)m+1 and c(m−1)j, and provides am.
The “am” circuit block includes a “j−1 dec” circuit block, 2 “X Zp” circuit blocks <1>, <2>, a “4 bit AD mod 13” circuit block, and a “Rgstr” register.
The “j−1 dec” circuit block converts (j)m+1 to the inverse element thereof, (j)−m+1, and provides it to the “X Zp” circuit block <1>.
The “X Zp” circuit block <1> computes the product of (j)−(m+1) fed from the “j−1 dec” circuit block and cj (=c(m−1)j) output from the “c(m−1)j” circuit block, and provides the product to the “4 bit AD mod 13” circuit block.
The product output from the “4 bit AD mod 13” circuit block is summed through a loop including the “Rgstr” register having an initial value of ‘0’ and the “4 bit AD mod 13” circuit block. The result is fed from the “4 bit AD mod 13” circuit block to the “X Zp” circuit block <2>.
The “X Zp” circuit block <2> computes the product of the sum output from the “4 bit AD mod 13” circuit block and 12−1=12 to yield am, that is, the output from the “am” circuit block.
Next, a computing circuit is described, which includes the before-described “c(m−1)j” circuit block and “am” circuit block to obtain the data code A. Hereinafter, this computing circuit is referred to as an “inverse converting circuit”.
As shown in
The inverse converting circuit comprises a first part circuit U1 for converting the component of the Lee metric code C required for inverse conversion every time, and a second part circuit U2 for seeking the elements of the data code A one by one.
The first part circuit U1 includes a “(j)i (j=1 to 12)” circuit block, a “Rgstr” register, and a “c(m−1)j” circuit block.
The second part circuit U2 includes a “(j)i (j=1 to 12)” circuit block, an “am” circuit block, and a “Li (0-8)” register unit.
Among those, the “(j)i (j=1 to 12)” circuit block is shared with the first part circuit U1. The “(j)i (j=1 to 12)” circuit block is a circuit operative to generate the (m+1)-th and m-th powers of the element j in Zp for use in the “am” circuit block and the “c(m−1)j” circuit block. This circuit block uses m+1, the number of cycles of the clock ck, and, m, a 1-cycle lower number than this number of cycles, as indexes, and uses the number of cycles of the clock cl as j to provide (j)m+1 and (j)m.
The “(j)i (j=1 to 12)” circuit block provides the element in Zp itself to the “am” circuit block and ‘1’ to the “c(m−1)j” circuit block in the 1st cycle of ck. The ‘1’ fed to the “c(m−1)j” circuit block is originally not utilized in computation. Accordingly, it is required to put a thought in preventing ‘1’ from exerting an influence on the operation result. Therefore, for the purpose of removing the influence of ‘1’, the product operation in the first part circuit U1 turns the other input to ‘0’ in the first cycle of the clock ck such that the result of the product operation becomes zero.
The “Rgstr” register in the first part circuit U1 is given a setting of am. The setting of am is provided from the “Rgstr” register at the timing of the clock ck after the clock ck1. Namely, the output am−1 from the “Rgstr” register is provided at the timing of the clock ckm.
In the first clock cycle ck0 of the clock ck, the initial value on the output node in the “Rgstr” register is set at ‘0’ to exert no influence on the computation result as described above.
In sync with the clock ckm, the “c(m−1)j” circuit block receives am−1 and (j)m output from the “Rgstr” register and the “(j)i (j=1 to 12)” circuit block, respectively. The “c(m−1)j” circuit block derives c(m−1)j, that is, the output from the first part circuit U1, from these inputs am−1 and (j)m.
The initial value in the “c(m−1)j” circuit block is the restored code data C, and the setting in the “Rgstr” register at the clock ck0 provides c(−1)j=init C.
The “am” circuit block in the second part circuit U2 obtains am at every 12 cycles of the clock cl in accordance with the computing equation shown in Expression 44. The am created at the “am” circuit block is stored in the m-th register in the “Li (0-8)” register unit at the clock ckm+1 that indicates the start timing of the cycles of the next 12 clocks cl.
After completion of all the clock cycles, the code data A is set and stored in the “Li (0-8)” register unit.
Either when 16-level MLCs are used or when 8-level MLCs are used in NAND flash memories, it has been essential to apply stable, accurate process steps for practical use.
With this regard, the memory controller and memory system according to the present embodiment makes it possible to ensure the reliability of data even if the MLC levels are set insufficiently. In accordance therewith, it is possible to develop further leveling of memory cells.
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms: furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
The above-described embodiment is described mainly assuming the memory controller in the case of h=4, p=13 though other h and p can be used to exert the similar effect as shown in
The use of the MLC NAND flash memory as the memory device is described though the memory device is not limited to this but rather can use multi-level storable memory cells (such as ReRAM), in a system accessible on a page basis, to which the present invention is similarly applicable.
Number | Date | Country | Kind |
---|---|---|---|
2010-273311 | Dec 2010 | JP | national |