This application is based upon and claims the benefit of priority from the prior Japanese Patent Application No. 2010-214357, filed on Sep. 24, 2010, the entire contents of which are incorporated herein by reference.
The embodiment relates to a memory system.
In recent years, mass data is held in mobile devices, for example, in handheld smart terminals and electronic books for fast access. Such technologies have become important increasingly. Further, from the viewpoint of weight-reducing and energy-saving of information devices, integrated circuit memories having no moving parts such as a motor has proceeded rapidly. The importance in this case is how to use up memory cells in performance marginal parts on memory production. In accordance with this, lower prices and larger capacities due to improvements in productivity of memories can be realized. In addition, the reliability of data stored in such the mass-capacity memories becomes important.
Even with no large-scale technical innovation, mass-capacity memories can be realized. One such method is a multi-level technology, which can increase the storage capacity per memory cell. In addition, as the technology of insuring data for marginal memory cells, ECC is known. In particular, as for NAND flash memories, multi-levels of memory cells and ECC system technologies have been matured and almost completed.
Such the NAND flash memories have been used increasingly wider, for example, in mobile terminals and SSDs. Accordingly, it is required to achieve additional technical leaps for the NAND flash memories. For that purpose, it is required to add innovative modifications to the conventional multi-leveling technologies and ECC.
The multi-leveled NAND flash memories have been developed originally based on the specifications without consideration of multi-level. In addition, the ECC is applied to the NAND flash memories based on the technologies for the system of the overall memory. Therefore, the multi-leveling technologies and ECC have not been made optimal as the technology kept with multi-levels of memory cells.
A memory system according to the embodiment comprises a cell array including word lines and plural memory cells operative to store data in accordance with plural different physical levels when selected by the word lines; a register operative to hold first data input from external; and a data converter unit operative to convert the first data held in the register into second data and overwrite the second data in the area of the register for holding the first data, and further operative to convert the second data held in the register into third data to be recorded in the memory cells and overwrite the third data in the area of the register for holding the second data.
With reference to the drawings, memory systems according to the embodiments will now be described below.
A memory system realizes a higher-density storage capacity by finely fabricating a cell array, forming the cell array and so forth in 3-dimensional structures, and thinking physical phenomena for use in memory cells. If the memory system is further required to have a higher density after completion of stable process steps, multi-leveling of memory cells is effective means.
In particular, as for NAND flash memories, multi-leveling proceeds promptly and 3-bit cells capable of storing 8 levels per cell become practical. Much higher multi-leveling of memory cells associates with a sudden deterioration of reliability, however, and suffers a 2-decimal places or more error rate per bit error rate increase at least. Thus, from the problem on the reliability and production yield, the progression of multi-leveling of memory cells has a difficult problem even though it is expected as means for realizing high-capacity file memories.
If the problem on multi-leveling of memory cells can be overcome to achieve effective use, NAND flash memories can be attained to have a higher-density storage capacity while utilizing stable process steps for NAND flash memories for longer terms.
Therefore, the first embodiment, under the following gist, proposes a memory system utilizing a Lee metric code.
(1) An error in a memory cell, in many cases, may be caused by writing an adjacent level in the memory cell and reading an adjacent level from the memory cell. Even in such the case, the assignment of binary bits to the conventional memory cell may cause a larger error quantity. Therefore, the memory system according to the embodiment described below utilizes multi-level storage cells with reliability improved by simplifying the assignment of levels to a memory cell.
(2) A multi-level of the memory cell can be grasped as a height from the base level. In a word, a multi-leveled memory cell is suitable for storing information not digitalized by a binary but digitalized by a prime. Therefore, the memory system according to the embodiment described below is configured with the use of memory cells having physical quantity levels assigned to elements in a finite field Zp of a prime p (hereinafter referred to as “p-adic cells”).
(3) An ECC (Error Correcting Code) effective in the case of the error caused by writing an adjacent level in the memory cell and reading an adjacent level from the memory cell is a Lee metric code in the finite field Zp. Therefore, the memory system according to the embodiment described below utilizes an ECC system that uses a Lee metric code thought to cause no error searching miss.
(4) The configuration of the NAND flash memory using p-adic cells is different from the configuration of the conventional NAND flash memory. Therefore, the embodiment described below proposes an essential idea of the specification about a specific memory system and describes the memory system configured based thereon.
(5) If it is desired to set a larger correctable error quantity, an error correcting method using a Euclidean iterative method may be effective if it is used as the error correcting method using the Lee metric code ECC. Therefore, the following description is given to specific circuits required to realize the memory system using the Euclidean iterative method for error correction.
First, an overview of the Lee metric code is described.
A symbol c representative of the code is an integer shown in Expression 1.
cjεGF(p)=Zp,0≦c<p [Expression 1]
When the metrics of these integers are represented by |cj| as Lee metrics and all Lee metrics |cj| are represented by integers of p/2 or below, the Lee metrics |cj| can be defined as in Expression 2.
0≦c<p/2:|cj|=cj
p/2<c<p:|cj|=p−cj [Expression 2]
As the code C can be considered a row of n (=p−1) symbols cj, it can be represented by C=(c1, c2, . . . , cn), and a metric w(C) of the code C can be defined as the sum of Lee metrics of the symbols cj as in Expression 3.
w(C)=|c1|+|c2|+ . . . +|cn| [Expression 3]
The distance between codes can be defined by the sum of Lee metrics of differences between the symbols corresponding to the code. Here, a difference between 2 symbols c and y (Lee distance) dL(c, y) is given as Expression 4.
dL(c,y)=w(c−y) [Expression 4]
Further, the minimum Lee distance of the code C can be defined by the minimum metric of the metrics w(C) of the code C as shown in Expression 5.
dL(C)=min w(C) [Expression 5]
In this case, the Lee metric code is such a code that has the minimum distance of 2γ between codes having a generator matrix G and a syndrome matrix H shown in Expression 6 and that can correct (γ−1) or lower Lee metric errors.
Here, when the word length of the code C is denoted with n and the word length of data is denoted with k, then γ=n−k where γ represents the redundant word length contained in the code C.
For the purpose of generating the Lee metric code thus configured, the input-converted data is presented by a k-digit, p-adic number. The numerals on the digits of the p-adic number are the elements in Zp and therefore can be used as a data word X of the Lee metric code to obtain a code expression through an operation C=XG based on the generator matrix G. The obtained code word is stored in the memory. Information about the error caused on the stored numeral in Zp is used as a data word Y of the Lee metric code read out of the memory to obtain a syndrome through an operation S=YHt (Ht is a transpose of H) so that the position and quantity of the error can be computed to correct the error.
<Principle of Data Processing, Syndrome Converting Method>
The following description is given to the principle of a data processing method in the NAND flash memory using p-adic cells. Hereinafter, the principle is referred to as a “syndrome converting method”.
The components of the code C held in the p-adic cells are the numerals in Zp and they cause variations on receipt of various disturbances at every component and make a change to the code Y consisting of different components as shown in Expression 7.
Y=(yi),yi=ci+ei,i=1˜p−1 [Expression 7]
The operation to restore C from Y is decoding. Prior to decoding, a syndrome is derived.
After selecting 3-component vectors <m>=<m1, m2, m3> (mi=0 to p−2) from Zp and applying a matrix operation of these vectors and H matrix as in Expression 8, a syndrome <m>S can be derived as elements S<m>+0, S<m>+1, S<m>+ε where ε=γ−1.
After setting <m>S=(S<m>+l) (l=0 to ε), and deriving S0 from <m>+1=0, if |S0|≦ε, then decoding starts.
This is because the generator matrix G and the syndrome matrix H are configured to satisfy GHt=0(mod p), for example. Therefore, the substitution of Y=C+E makes zero the part corresponding to C as shown in Expression 9 so that S can be represented by the components of E.
If <m>=<0>, l=0 and E=(e1, e2, . . . , en), the representation of S<0>+0=S0=Σej can be given so that S0 indicates the total sum of errors as can be found.
If |S0|≦γ−1, an error can be found through the following syndrome converting method. Even if this condition is not satisfied, the value of the error on the code component can be obtained according to the computation. It is not possible to determine whether the error is an error on a true code or an error on an adjacent code. Therefore, it cannot be used in error correction. This is because the requirement for enabling error correction of the Lee metric code includes the minimum value of the Lee metric code between codes being equal to 2γ, and because in this case the Lee metric code of S0 always becomes equal to ε or below, that is, the total sum of Lee metrics of error components becomes equal to ε or below.
If |S0|≦ε, the flow enters the error searching through the syndrome converting method and executes the syndrome conversion at the start. In this case, the flow seeks u=ε(S<m>+0)−1 that makes uS<m>+0≡ε over <m> that leads to S<m>+0≠0, from Expression 10, to obtain a new syndrome u<m>S=(uS<m>+0, uS<m>+1, . . . , uS<m>+ε).
Subsequently, the converted syndrome is used to decode the error. This method is described.
The code stored in the memory cell array contains the error E=(e1, e2, . . . , en) (n=p−1). Accordingly, a virtual error on the new syndrome includes {u(1)<m>e1, u(2)<m>e2, . . . , u(n)<m>ep−1}. These n (=p−1) error components are converted and classified into 2 sets J+ and J− as in Expression 11.
J+={jε(1,2, . . . , n);ej<p/2}
J−={jε(1,2, . . . , n);ej>p/2} [Expression 11]
Namely, it is classified into J+, which is an arrangement of the positions j of the symbols cj, if the error quantity of the symbol is u(j)<m>ej<p/2, and J−, which is an arrangement of the positions j of the symbols cj, if the error quantity of the symbol is u(j)<m>ej>p/2. Polynomials Λ(x), V(x) on Zp are configured on the basis of these sets as in Expression 12.
Thus, the polynomial Λ(x) is such a polynomial that has a reciprocal number of the error component position j in J+ as a root, and that has a Lee metric u(j)<m>ej of that error component as the multiplicity of the root. On the other hand, the polynomial V(x) is such a polynomial that has a reciprocal number of the error component position j in J− as a root, and that has a Lee metric p−u(j)<m>ej of that error component as the multiplicity of the root. Decoding is a process of finally configuring these polynomials only from the information on the syndromes to solve them, thereby obtaining information on errors. In a word, it is required to seek a relation between these polynomials Λ(x), V(x) and the syndromes. When it is configured with a series polynomial having each syndrome u<m>S on a coefficient of the corresponding degree, it is represented with a rational polynomial having the position of the error composition and the value of the virtual error composition in factors thereof as in Expression 13.
From Expression 13, a relational equation shown in Expression 14 can be established among the polynomials Λ(x), V(x) and the syndrome S(x).
Subsequently, the relational equation shown in Expression 14 is used to seek the polynomials Λ(x), V(x) from the syndrome S(x).
From the syndrome S(x), a polynomial Ψ(x) with a degree of γ−1 or lower shown in Expression 15 is sought.
In an expansive expression of the polynomial Ψ(x), a coefficient ψj can be sought from a comparison between coefficients of the homogeneous degree on both sides of the expression shown in Expression 15 through an iterative method using the syndrome uS<m>+j and an already-determined coefficient ψj−1. Coefficients ψ0-ψε of the polynomial Ψ(x) are sought from syndromes uS<m>+0 to uS<m>+ε. The results are shown in Expression 16.
The polynomial Ψ(x) is a polynomial equivalent to Λ(x)/V(x). In this case, the key conditions on mutual primes λ(x) and ν(x) are given as in Expression 17. Therefore, V(x)=1 can be established and Ψ(x) can be applied as Λ(x) itself.
v(x)Ψ(x)≡λ(x)(mod xε+1)
degλ(x)−deg v(x)==ε(mod p)
degλ(x)−deg v(x)≦ε [Expression 17]
The condition, deg λ(x)−deg ν(x)=ε and deg λ(x)+deg ν(x)≦ε, on the degree of the polynomial results in 0≦2 deg ν(x)≦0, which establishes deg ν(x)=0 as can be found. Namely, as V(x)=1 and Λ(x)=Ψ(x) can be established, so Ψ(x) can satisfy this condition. In this case, ε previously determined by Ψ(x) derived from the syndrome, and ε=deg Ψ(x)=deg Λ(x) should be established. If it is established, all key conditions can be satisfied and accordingly Ψ(x) can be used to obtain a solution. On the other hand, if it is not established, no error satisfies the key conditions, resulting in no solution.
This method corresponds to conversions, which collect the positions of all error code components into the set J+, applied to errors. Another viewpoint indicates the presence of the possibility of error correction if conversions are executed such that the total sum of the converted errors reaches ε.
The possibility referred above has two meanings. The first is associated with the requirement in which the positions of all error code components are collected in J+, and the degree of Ψ(x) derived from the syndrome is just equal to ε. The second is associated with the requirement in which the error quantity can be obtained according to the computation even though |S0|≦ε is not satisfied. This condition is added, though, to exclude failed corrections.
The method of solution searching comprises substituting Λ(x)=Ψ(x) and V(x)=1 to find a solution that satisfies the equation from the elements in Zp, and applying an inverse transformation to obtain a true error E from the obtained root and multiplicity.
Next, with respect to the syndrome converting method in accordance with the above principle, an effective condition on error searching is considered.
When a point in a true error code word actually indistinct is represented by E=(e1, e2, . . . , ep−1), in the syndrome converting method the process converts it to a point <m>E=(u(1)<m>e1, u(2)<m>e2, . . . , u(n)<m>ep−1) in a virtual error code word. Note that the component coordinates themselves are unchanged. In this case, the syndrome is represented as in Expression 18. In Expression 18, ji (i=1 to ξ) indicates the component coordinates of E different from zero where j1<j2< . . . <jξ.
If ξ≧2 and <m>≠<m′>, the conversions to the virtual errors are all independent. Because if there is an equal one, u(j)<m>=u′(j)<m′> is required to establish over ξ different j because ji is unchanged through the conversion. This means that Expression 19 shown below can establish over all ji and can establish only if ξ=1.
Thus, according to the syndrome converting method, scanning <m> can configure polynomials of virtual error points different at every different <m>. The case of ξ=1 corresponds to one in which only the conversion of the syndrome multiplied by u is required to ready for virtual errors.
In addition, when paying attention to one virtual error component, for example, u(j)<m>ej=(S<m>+0)−1(j)<m>ej moves through all Zp components except zero under the following condition. In other words, when any one of <m> components fluctuates from 0 to p−2, then
it moves through Zp components 1 to p−1 except 0. Accordingly, the inverse element thereof (S<m>+0)−1(j)<m> also takes all components except zero. In the end, as u(j)<m>ej takes all components except zero once at least, the sufficient condition is to scan any one of <m> components from 0 to p−2.
In the syndrome converting method, the process introduces syndromes, (p−1) diagonal matrixes, and uses 0-th to (p−2)-th powers of these individuals to cause a virtual error. Therefore, the following description is given to the error that can be covered by the conversion to such the virtual error, and to the condition on the syndrome converting method.
When a point in a true error code word is given by E=(e1, e2, . . . , ep−1), a point in a virtual error code word in accordance with the syndrome converting method is given <m>E={u(1)<m>e1, u(2)<m>e2, . . . , u(p−1)<m>ep−1}.
There are ξ error component positions, which are represented by ji (i=1−ξ, j1<j2< . . . <jξ) and replaced as in Expression 21.
In Expression 21, εi (i=1 to p−1) is considered a variable where the sum of εi to εp−1 is fixed to ε.
Therefore, the process computes the number in the case of ξ components that make the sum equal to ε.
As for (ξ−1) components, 1 to p−1 can be selected freely. The final ξ-th value can be selected for the sum of (ξ−1) such that the sum of ξ components is made congruent with ε. Accordingly, the number is equal to (p−1)ξ−1 in the case of selection of the variable.
This result, however, also contains the case where the final selection is required to take 0. Accordingly, this case is excluded. If the number is equal to n(ξ) when the sum is congruent with ε for ξ pieces of 1 to p−1, it is required to exclude the case where the sum reaches ε before the final selection. Therefore, n(ξ)=(p−1)ξ−1−n(ξ−1) establishes, and thus n(ξ) can be obtained as in Expression 22.
ξ:even n(ξ)={(p−1)ξ−1}/p
ξ:odd n(ξ)={(p−1)ξ+1}/p
n(ξ)={(p−1)ξ−(−1)}/p [Expression 22]
When ε is divided into ξ pieces, there is one having components all present between 0 and 2/p, that is, components belong only in J+. If the number of conversions to different virtual errors is higher than the number in the case of n(ξ) pieces, it is possible to make J+ virtual errors. In this case, if the resultant true error can satisfy the condition on error correction of the Lee metric code, it is possible to find solutions through the syndrome converting method without misses.
Subsequently, the flow finds the number of different virtual errors that can be caused from the true error through the syndrome converting method. In the case described above, the degree of the vector <m> is equal to 3. Therefore, <m>=<m1, m2, m3> establishes, and in order to allow the divided components of ε to move all from 1 to p−1, the components are independently scanned from 0 to p−2. In the case of the current specific example, the number of conversions in the case of <m>≠<m′> is equal to (p−1)3.
In comparison with the number n(ξ) when the sum of ξ free variables εi reaches ε,
n(ξ)={(p−1)ξ−(−1)ξ}/p≦(p−1)3 [Expression 23]
is the scan condition without misses. Thus, ξ=4 can satisfy this condition, and the method using 3 diagonal matrixes is possible to execute searching up to ξ=4 without misses.
When j moves from 1 to p−1, an additional m-th power of the matrix that includes a diagonal matrix, that is, a function that moves through all Zp components 1 to p−1 except zero allows the number of conversions to increase at every (p−1)-fold similar to the above.
When ζ matrixes are aligned, the number of conversions reaches (p−1)ζ, which makes the condition without misses always be ξ=ζ+1.
In addition to the above used j, p−j, p−j−1, examples of these functions include a in Zp but a≠1, 2, and aj, p−a3 of which a is the number other than a power of a certain number so that a sufficient number of conversions can be made. Therefore, an appropriate selection of a to make a diagonal matrix can extend the degree of <m> so that ξ can be set always larger by 1 than this degree. Therefore, even if the syndrome converting method is used over all the cases, where the correctable condition on the Lee metric code is satisfied, instead of using the Euclidean iterative method, it is possible to execute error searching without misses.
The number in the case of the conversion grows (p−1)-fold when ξ increases by 1, however, and accordingly the number of searches increases exponentially. Therefore, depending on the case, the use of the conventional Euclidean iterative method shortens the processing time than the syndrome converting method as can be considered. In this case, it is required to execute an optimal selection in accordance with the applied memory system. In particular, in the case of ε=1, 2 and so forth, the use of the syndrome converting method is more advantageous.
<Data Interface in p-adic NAND Flash Memory>
On the basis of the above, the following description is given to a specific example of the NAND flash memory that uses the syndrome converting method for error correction. Hereinafter, this NAND flash memory is referred to as the “p-adic NAND flash memory”. In addition, if no special notice is given, a “memory system” refers to the p-adic NAND flash memory.
First, a comparison/consideration is performed to determine what kind of p-adic cell should be selected.
L: The number of threshold level divisions of the p-adic cell.
p: A prime to be used.
h: A minimum number of bits required for binary representation of Zp.
ε: An overall quantity of Lee metrics of error-correctable errors.
M: A value determined by M=h(p−ε−3), which is the number of bits, that is, the quantity of binary data in a “Binary World” subject to batch processing by ECC.
The “Binary World” refers to an environment outside the memory system according to the present embodiment, for example, an IT instrument that handles data in binary.
δ: The number of digits of M as a 2h-adic number (δ=M/h).
M/(p−1): An index indicative of the number of binary bits that one p-adic cell can store.
Binary M bits are treated as a code word having (p−1) code components, which are stored in (p−1) p-adic cells.
p−1: The number of p-adic cells required for storing a code word of the Lee metric code.
log2 L: The number of bits that can be stored in a p-adic cell when the p-adic cell is made as a binary storing multi-level cell.
ε/(p−1): A proportion of the maximum number of correctable cells when c is used. Or a proportion of the maximum error level length correctable in one p-adic cell.
log2 L/{M/(p−1)}: A redundancy when a memory cell is used as a p-adic cell for storing ECC with the Lee metric code, and the memory cell is regarded as a binary multi-bit cell.
In the table dotted parts are ranges that can be considered practical from the viewpoint of the level of the p-adic cell and the redundancy to binary.
The following description is given to an example of the practical memory system in the case of L=17 and ε=4 shown in
This considers that the number of levels per cell in the practically used NAND flash memory is equal to 16 levels. When E increases by 1, the scale of processing through the syndrome converting method increases almost p-fold. Accordingly, if p=17 is used, the case of ε=2 shown in
Subsequently, a NAND flash memory system is configured for the specific example in the case of L=17 and ε=4. Initially, the specification for the NAND flash memory system is shown. The specification herein shown provides a skeleton of the specification for the use of other p and ε. In this case, the values of the skeleton can be changed to the corresponding values in accordance with the values of p and ε.
The following is the specification for the p-adic NAND flash memory system.
(1) The configuration of the data interface is a ×8 1-byte configuration.
(2) It operates in sync with clocks.
(3) A sector contains 512 bytes as a data transfer unit.
(4) The unit of conversion processing from a binary to a p-adic number includes 50 bits. Hereinafter, the conversion processing from the 50-bit binary to the p-adic number is referred to as a “process”. For the purpose of distinguishing it from other general terms, “process”, it is expressed by “PROCESS”. Therefore, a binary corresponding to 1 sector is converted to a p-adic number through 82 PROCESS.
(5) A page size is determined by 4 sectors (2048 bytes). A block size is not determined particularly though it has a size of 64 pages, for example, which depends on the configuration of the cell array.
(6) Each sector contained in a page can be accessed at random.
(7) A variety of modes are prepared as access methods in a sector. The access modes are roughly divided into two: a “fast mode” and a “random mode”.
The fast mode: A mode in which accesses are made in order of addresses fixed in a sector. In writing, after a lapse of 10 cycles since the beginning of the sector access, a PROCESS starts for processing data in turn. In reading, after completion of 1 PROCESS, a sector access is allowed. In both writing and reading, a PROCESS to a sector and an access to the sector proceed at the same time.
The random mode: An access mode in which a burst length of access to a sector, the burst mode, and the top address of burst can be set.
For example, the burst length may include 4/8/16/32/256 bits and so forth, and the burst mode can be set in an interleave mode/wrap-around mode and so forth.
After completion of PROCESS to the data to be stored in a sector, an access to the sector is allowed.
At the time of writing, PROCESS starts after completion of data storage in the sector access.
In the random mode, an access and a process start after pieces of data to the sector are grouped. Therefore, the access and the PROCESS to the same sector cannot proceed at the same time.
The following description is given to an overview of the memory system capable of realizing the above specification skeleton. This system is a system that directly treats the numerals in Zp as data.
As describe above, the environment outside the memory system, for example, the IT instrument that handles data in binary is referred to as the “binary world”. In contrast, containing a part that handles data in binary such as operational processing, an environment inside the p-adic NAND flash memory system that handles data in Zp is referred to as a “p-adic Zp world”. In this case, the “binary world” can also be referred to as a “p-adic Z2 world”.
The memory system further comprises, as interfaces between the “binary world” and the “p-adic Zp world”, a “binary to p-adic” converter unit operative to convert binary data to Zp data, and a “p-adic to binary” converter unit operative to convert Zp data to binary data.
As the outside of the memory system is the “binary world”, the “binary to p-adic” converter unit receives binary data from the “binary world” and processes it at every 50 bits. Thereafter, the data is stored in p-adic cells, and further provided to external at every 50 bits by the “p-adic to binary” converter unit.
ECC to the data stored in the “p-adic NAND cell” part can be executed easily by applying a Lee metric code to Zp. The present memory system is also possible to execute ECC processing because it comprises the “Lee metric code & ECC” part.
As the memory system comprises the “binary to p-adic” part and the “p-adic to binary” converter unit, users can handle the memory system just as the “binary world” without having consciousness about the “p-adic Zp world” inside the memory system.
Data write and read to the “p-adic NAND cell” part is handled directly with relating the threshold levels to the elements in Zp. There are two such relating methods.
The first is a method in which interfaces are used to deal data write and read to conventionally NAND-connected multi-level cells (hereinafter referred to as “NAND MLC”). It is required to once convert the Zp level (1 to p−1) stored in MLC to page information and so forth of NAND MLC. Therefore, a part, which mediates between the NAND MLC and a peripheral circuit “p-adic Zp world”, is required to have a logic part. In the case of this method, information on Zp stored in MLC spans over page information of plural NAND MLCs and accordingly data transfer needs a longer time.
The second is a method in which a Zp level is directly written as a physical level in a p-adic cell. This method can make data write/read processing faster than the first method.
The present embodiment applies the second method.
The following description is given to data transfer suitable for the above-described memory system specification, and to an interface circuit for making the match with internal p-adic cells.
In the case of the memory system according to the present embodiment, all the results on data processing show up on page registers contained in pages. The p-adic cells communicate data with the page registers. Therefore, how to configure the page registers is the key to the data transfer specification.
At the start, the data interface specification is confirmed once again.
(1) The I/O transfer rate of the memory system is 1 byte/cycle. In a word, the transfer unit is 1 byte.
(2) 50 bits of binary data are converted as a processing unit to a Lee metric code.
(3) 1 sector includes 512 bytes. Therefore, 512 cycles are required to transfer 1 sector of binary data.
(4) 82 PROCESS are required to convert 1 sector of binary data to a Lee metric code. Binary data that can be processed by 82 PROCESS includes 82×50=4100 bits. Therefore, it is required to process 1 sector plus 4 bits.
(5) When binary data is converted to p-adic data, there is an increase to 55 bits per 50 bits. Therefore, 1 sector requires registers corresponding to 82×55=4510 bits.
(6) If binary data is converted to a Lee metric code through the conversion to p-adic data, there is an increase to 80 bits per 50 bits. Therefore, 1 sector requires registers corresponding to 82×80=6560 bits (820 bytes).
In consideration of the above (2), the subsequent description is given to a register configuration that allows data processing at every 50 bits to be executed efficiently.
Data is transferred at a rate of 1 byte/cycle in accordance with the above (1). Therefore, transfer of 50-bit binary data requires 7 cycles at the minimum. When 50-bit binary data is converted to a Lee metric code, there is an increase in size to 80 bits in accordance with the above (6).
Here, 50 bits can be represented by 5×10 bits, and 80 bits by 8×10 bits. Therefore, the memory system according to the present embodiment configures (10×8) register blocks with plural byte-unit registers to execute data processing using the registers thus configured. Thus, it is possible to realize an optimal register configuration and data processing in the memory system. The (10×8) register blocks are described later.
Subsequently, a flow of data processing is described with reference to
The transfer of 50-bit binary data to 1 register block is executed in 10 cycles.
Therefore, if the address order of data transfer has been determined, processing of 1 PROCESS is made enable at every 10 cycles. In PROCESS, however, pieces of data are merged at every I/O and processed. Accordingly, if an I/O is masked and invalidated, the I/O is processed as it holds fixed data.
On the other hand, if the address order of data transfer is at random, it is not possible to determine which part of the registers contained in a sector the data is transferred. Accordingly, on receipt of a command indicative of completion of 1-sector data transfer, the flow decides that preparation of processing-target data is completed, and starts PROCESS.
The PROCESS is treated by regarding (10×8) register blocks as the later-described “D-r” register and utilizing overwrite to the “D-r” register.
At the start, processing-target data stored in the “D-r” register is regarded as h=5, δ=10 binary data (first data) and converted to a p-adic number (second data) (S1 in
Thus, binary data corresponding to 1 PROCESS stored in the registers in a (10×8) or 80-bit register block serving as the “D-r” register is converted to the Lee metric code, then overwritten in the same register block and stored therein.
The repetition of the above PROCESS sequence can treat 512 bytes of sector data in 82 PROCESS. The quantity of data that can be treated in 82 PROCESS is equal to 512 bytes plus 4 bits. Accordingly, 4 fixed dummy bits are added for processing.
The following description is given to the relation between the configuration of register blocks and the burst data transfer.
One register block includes 80 bit registers, in which a (8×5) register group shown with a in
Pieces of data transferred in No. 6-10 cycles are held in another register group shown with b in
Transfer scanning of input byte data advances wave-like as shown with the dashed line in
Pieces of byte data transferred over cycles are merged on processing. Accordingly, the number of cycles used at every bit of byte data differs from one another as can be found. In the case of
Next, the process of data processing in the sector register at the time of data write is described with reference to
The sector register is configured to include 820 registers aligned in 8 columns and has a receiving portion of 820 rows for 102.5-byte data transfer. Of 8 columns, No. 1-5 columns are used in data transfer and the remaining No. 6-8 columns are used in holding the target data after data transfer.
At S1 in
Pieces of data transferred in subsequent No. 511, 512 cycles are divided into 4 bits each and held in registers on No. 817-820 rows, No. 1-4 columns as shown in the S1 lower figure of
Thus, the sector register can store 50-bit binary data corresponding to h=5, δ=10 of a 10-digit, 25-adic number, by 82 PROCESS.
At S2 in
Thus, S1 and S2 in
At S3 in
The Lee metric code is data having components that are equal to (p−1) (=16) elements in Zp. A Lee metric code by 1 PROCESS is 80-bit data resulted from 5-bit, p-adic data multiplied by a G matrix. The Lee metric code by 1 PROCESS is overwritten and stored in one 10-row, 8-column register block. In this case, 1 sector register is entirely filled with the Lee metric code in 82 PROCESS. Thus, the sector register can hold pieces of data to be written in 164-byte pieces of p-adic cells.
Thus, processing in 3 stages shown at S1-S3 in
The data transfer to the sector register is executed in the fast mode and in the random mode as described earlier. In the case of the fast mode, pieces of transferred binary data are filled in the sector register in turn from the top. Therefore, in the fast mode, when pieces of binary data by 1 PROCESS are stored in the 10-row, 8-column register block, PROCESS can be executed sequentially in serial.
On the other hand, in the random mode, it is possible to designate the top address at every burst length to set a data storage location and also possible to select 2 addressing modes: interleave, and wrap-around. In this case, no-data stored areas exist at intervals in the register block. Therefore, in the random mode, it is required to wait a command indicative of completion of binary data storage to the sector register to exert control over starting PROCESS and so forth.
Thus, in the memory system according to the present embodiment, data is overwritten in the sector register twice at the time of conversion of binary data (first data) to p-adic data (second data) and at the time of conversion of the p-adic data (second data) to a Lee metric code (third data).
By the way, in the case of the conventional memory system, there is no processing corresponding to the conversion of binary data to p-adic data, and binary data is directly converted to a code using Reed-Solomon and so forth. Accordingly, overwrite to a register is executed once at the most.
Next, the process of data processing in the sector register at the time of data read is described with reference to
At S1 in
The pieces of data stored in the sector registers are subject to computation processing by ECC, at every data corresponding to 1 PROCESS stored in a 10-row, 8-column register block, beginning from the sector register end, then error-corrected if enable, and left as it is if unable. Then, 82 PROCESS can be applied to complete ECC processing over all data in 1 sector to restore the Lee metric code.
At S2 in
At S3 in
Thereafter, the binary data is read out at every 8-row, 5-column register group in column order at a rate of 1 byte/cycle and provided as data corresponding to 1 sector. With respect to data read from the sector register at the last 2 cycles, 4-bit, 2-column binary data is read out as byte data in 1 cycle, similar to data write, as shown in the lower figure at S3 in
In the fast access mode, data is read out from the top of the sector register. Accordingly, data in the sector register can be read out sequentially with the latency of 1-PROCESS processing. On the other hand, in the random mode, the top of burst in the sector has not been known previously. Accordingly, after completion of 82 PROCESS when binary data corresponding to 1 sector is reproduced in the sector register, a first access is allowed.
Thus, in the memory system according to the present embodiment, data is overwritten in the sector register twice at the time of conversion of a Lee metric code (third data) to p-adic data (second data) executed after ECC processing and at the time of conversion of the p-adic data (second data) to binary data (first data).
By the way, in the case of the conventional memory system, there is no processing corresponding to the conversion of the Lee metric code to p-adic data. Accordingly, overwrite to a register is executed once at the most.
<Assignment of Elements in Zp to p-adic Cell>
The following description is given to an assignment of elements in Zp to p-adic cell levels, which is optimal for storing each code cj of a Lee metric code C=(c1, c2, . . . , c16) in a p-adic cell.
In the present embodiment, as shown in
In the case of p=17, the number of levels required for a p-adic cell is equal to 17, and the number of boundaries (thresholds) between these 17-stage levels is equal to 16.
In the case of c=4, if an error arises only in 1 cell, error correction is possible on failed identification of a level within a range shown with a bold arrow in
In a word, the memory system according to the present embodiment is possible to have a strong error correcting ability to realize error correction of 4 cells among 16 cells substantially in accordance with the assignment way shown in
<Data Read/Write>
An example of a method of data write/read to the p-adic cell array is described next with reference to
Previously, the meanings of symbols shown in
In the case of p=17, the Lee metric code has 16 code components. These codes are denoted with c1-c16. Each code c can be represented by a 5-bit binary h. The bits from the least significant bit through the most significant bit of the binary h are denoted with Cj0-Cj4.
At the time of data write/read, a write voltage/read voltage supplied from a row decoder/word-line driver (Row Dec/WLDriver) serving as part of a data write unit or a data read unit changes the level on a word line WL in 17 stages. At this time, the signals indicative of the levels on the word line WL are denoted with Δ1-Δ17. In addition, the inversion of a bit X is represented by ‘/X’ with additional ‘/’.
At the start, data write to the p-adic cell is described.
The threshold of a NAND cell serving as the p-adic cell is set in accordance with the quantity of electrons injected into the floating gate while a certain voltage is applied between the word line WL and the bit line BL.
At the time of data write, the potential on the word line WL elevates in 16 stages. On the other hand, the bit line BL is connected to a certain power supply Vss. When the p-adic cell changes to have a desired threshold, it shifts to the floating state isolated from the certain voltage Vss. Thus, no voltage is applied to the floating gate such that injection of extra electrons into the floating gate can be prevented.
The present embodiment uses this method to set thresholds in batch over p-adic cells corresponding to the components contained in the code. The code components c1-c16 correspond to elements in Zp and accordingly they are expressed in binary. As for the thresholds serving as levels of the p-adic cell, the lowermost threshold is assigned with the element 0 in Zp, and then the elements in Zp are assigned in ascending order from lowest. The upper limit of a threshold division corresponding to each level L of Zp is denoted with L, the level on the word line WL for injecting electrons by the quantity corresponding to the threshold is denoted with WVL, and the signal for generating the word line level WVL is denoted with ΔL.
If the erased state of the p-adic cell is assumed to have a threshold in the lowermost state, it is not required to change the threshold of the p-adic cell targeted to write the component 0 in Zp therein though it is required to change the threshold of the p-adic cell targeted to write another component in Zp therein.
Therefore, in order to change the threshold of the p-adic cell, the present embodiment elevates the voltage on the word line WL sequentially from lowest to inject electrons into the floating gate. At that time, if the voltage on the word line WL exceeds the voltage corresponding to the threshold to be set in the p-adic cell, the bit line BL connected to that p-adic cell is brought into the floating state, as described above, to prevent extra electron injections. This control is executed in a program setting circuit shown with a in
The program setting circuit includes an NMOS transistor QN1 for connecting the bit line BL to the voltage Vss. The gate of the transistor QN1 is connected to a constant voltage Vcc via a PMOS transistor QP1 controlled by the later-described control signal /p. The gate of the transistor QN1 is also connected to the voltage Vss via 16 paths, aligned in parallel, corresponding to the elements in Zp. These 16 paths each include a circuit of 5 serially connected NMOS transistors. Each path can conduct if /Cj0□/Cj1□/Cj2□/Cj3□Δ1, /Cj0□/Cj1□/Cj2□/Cj3□Δ2, . . . , /Cj0□/Cj1□/Cj2□/Cj3□Δ16.
On data write, the word line level varies from WV0 to WV16. At that time, the program setting circuit exerts control, in accordance with the word line level, so that the bit line BL can be connected to the voltage Vss or isolated from the voltage Vss and brought into the floating state.
Namely, when the word line level is at WVL, that is, when the signal ΔL is made activate, the circuit isolates the bit line BL connected to the p-adic cell to be set at L−1 from the voltage Vss. Thus, the threshold of that p-adic cell is set in the division of L−1. Isolation of the bit line BL from the voltage Vss is controlled by the transistor QN1. The node n1 on the gate of the transistor QN1 is initially precharged via the transistor QP1 that is controlled by the signal /p. The result from decoding with bits of information Cj0-Cj3 representative of the code components and the signals Δ1-Δ16 indicative of the word line levels are used to discharge the node n1, thereby turning off the transistor QN1 to bring the bit line BL in the floating state. As for the p-adic cell targeted to write the element 16 in Zp therein, it is not required to discharge the node n1 but just required to decode the elements 0-15 in Zp using 4 bits of information Cj0-Cj3.
The program setting circuit may be provided at every bit line BL connected to p-adic cells belonging to the same word line WL and corresponding to components of different codes. In this case, the p-adic cell can be set at the threshold level corresponding to Zp of the code component by sequentially elevating the voltage on the word line WL.
For the purpose of setting an accurate threshold level to the p-adic cell, an operation of verify read may be introduced to verify the state of the threshold of the p-adic cell when the word line level varies.
Subsequently, data read from the p-adic cell is described.
At the time of data read, the level of a p-adic cell is sensed from the current Icell flowing in the p-adic cell. This operation is executed at a sense amp unit t-SA provided on the end of the bit line BL.
The sense amp unit t-SA includes a sense amp sa operative to compare the reference current Iref with the cell current Icell; a latch L1 operative to hold the latest comparison result from the sense amp sa; a latch L2 operative to hold the previous comparison result from the sense amp sa; an XOR gate G1 operative to detect a variation in comparison result from the results held in the latches L1 and L2; and an XOR gate G2 operative to detect if the p-adic cell changes to the threshold corresponding to 16 in Zp. The output from the XOR gate G2 provides the output from the sense amp sa.
The reference current Iref is set to a current slightly larger than the cell current when the upper limit of the division of the threshold level and the voltage on the word line WL corresponding to the threshold level are applied to the p-adic cell.
In the sense amp unit t-SA the sense amp sa compares the reference current Iref with the cell current Icell and outputs ‘1’ to the latch L1 if the cell current Icell is larger. The voltage on the word line WL is sequentially varied within a range of WV1-WV16 and, only if the word line level is at ‘0’ on WVL and at ‘1’ on WVL+1, the threshold level of the p-adic cell belongs to the division L. In this case, the p-adic cell holds the element L in Zp.
The comparison result from the sense amp sa is transferred to 2 latches L1 and L2 in turn. If the contents of these latches L1 and L2 are different from each other, the output from the sense amp unit t-SA provides Z=‘1’. This makes it possible to determine the state of the threshold written in the p-adic cell from the word line level WVL.
If Z=‘1’ has been established already when the word line level is at WV1, it is required to determine if the threshold of the p-adic cell belongs to the division 0 or lower. If Z=‘0’ when the word line level is at WV16, it is required to determine if the threshold of the p-adic cell belongs to the division 16 or higher. Therefore, for the purpose of enabling these decisions, the initial states of the latches L1, L2 are set at ‘0”, and a signal Δ17 is generated after the signal Δ16. Thus, it is possible to decide that the threshold level of the p-adic cell is at L−1 from the signal ΔL when the output from the sense amp unit t-SA provides Z=‘1’.
The following description is given to the 5-bit register hanging from each bit line BL and operative to hold Cj0-Cj4.
The register has a circuit as b in
A “Zp→h dec” circuit block shown in
The register shown with b in
These registers may also be utilized to exert control on data write/read. For example, if these registers are used, at the time of data write, to store binary representations of components of the code to be set, they can be utilized for control such that the program setting circuit executes verify and brings the bit line BL in the floating state.
The subsequent description is given to setting of the write voltage/read voltage supplied from the row decoder/word-line driver to the word line WL.
With respect to all p-adic cells to be accessed at the same time, the levels of the p-adic cells are assigned with the values of codes at the same time. Accordingly, it is required to scan all the voltages WV0-WV16 over the word line WL.
In data write (Write), there is no changing method other than one in which the flow elevates the threshold of the p-adic cell in order. Accordingly, the word line voltage is changed from the lower voltage side to the higher voltage side. On the other side, in data read (Read), it is sufficient to shift all levels within the scan as described above.
In the case of
If data write is executed after completion of scan 1, it is required to return the word line voltage to WV0 again and elevate it to WV16 sequentially as shown with the dotted line in scan 2. On the other hand, in the case of data read, the word line voltage may be elevated sequentially as shown with the dotted line in scan 2 or dropped sequentially subsequent to scan 1 as shown with the solid line. As the code data is decided at the time of completion of scan 2, data read can be executed similarly to the time of completion of scan 1.
If scan 1 and scan 2 are applied to access all p-adic cells belonging to the word lines WL, it is required to select different word lines WL in these scan 1 and scan 2. If p-adic cells belonging to the same word line WL are subject to serial read, for example, if the sense amp unit t-SA is shared among plural bit lines BL, it is also possible to select the same word line WL in scan 1 and scan 2.
<Conversion from Binary Data to 17-Adic Number>
The following description is given to a binary/p-adic number (17-adic number) converter circuit required at the entrance and the exit of the memory system according to the present embodiment.
A square shown with “5 res” in
At the start, at the 0-th step (S0 in
Subsequently, 6-bit binary data, of which No. 5-1 bits are the outputs from the previous “5 res” circuit block (the residue represented by 5 bits) and No. 0 bit (the least significant bit) is the most significant bit D84 (=D39) of d8, is input to the next “5 res” circuit block, which generates the residue and a quotient or carry C144 derived from the input binary data divided by 17.
Thereafter, until No. 0 bit of 6-bit binary data input to the “5 res” circuit block reaches the least significant bit D00 (=D0) of d0, 46 “5 res” circuit blocks are used to generate carries C10-C145. Binary data expressed by these generated carries C10-C145 indicates the number of 17s contained in the data D.
The output from the “5 res” circuit block having the input of d0 has a binary representation a0 of a 11-digit, 17-adic number D (a0, a1, . . . , a10).
Subsequently, at the 1st step (S1 in
At the 1st step, numeration of 17 is executed to the carries C10-C145 from C145 on the rightmost side. At the time of the numeration of 17 executed to the carries C141-C145, the flow directly generates the residue and a carry C241 by substituting 0 for the most significant bit of 6-bit input binary data and dividing the input binary data by 17.
Subsequently, 6-bit binary data, of which No. 5-1 bits are the outputs from the previous “5 res” circuit block (the residue represented by 5 bits) and No. 0 bit (the least significant bit) is C140, is input to the next “5 res” circuit block, which generates the residue and a quotient or carry C240 derived from the input binary data divided by 17.
Thereafter, until No. 0 bit of 6-bit binary data input to the “5 res” circuit block reaches the least significant bit C10 of C1x, 41 “5 res” circuit blocks are used to generate carries C20-C241. Binary data expressed by these generated carries C20-C241 indicates the number of primes 172s contained in the data D.
The output from the “5 res” circuit block having the input of C10 has a binary a1 of a 11-digit, 17-adic number D (a0, a1, . . . , a10).
At the subsequent 2nd step (S2 in
Thereafter, the flow advances up to the 10th step (S10 in
The carries C110-C115 at the 10th step are not used in computation.
A consideration is given next to the configuration of the “binary to p-adic” converter unit using a “6 bit mod p” circuit block. The “6 bit mod p” circuit block is a circuit that compares an input A or 6-bit binary data with a prime 17, then provides PF0=‘1’ if A is equal to or higher than 17 and provides the residue Q modulo the prime 17 of A. The details are described later.
Here, the j-th digit is denoted with dj when data is subject to a 10-digit, 25-adic expression. In this case, dj can be indicated in 5-bit binary though the coefficient D of the indication is expressed in common with the coefficient D of other d. For that purpose, sub-indexes are used as shown in Expression 24.
dj=Dj0+Dj12+Dj222+Dj323+Dj424
dj(25)j=D5j25j+D5j+125j+1+D5j+225j+2+D5(j+1)−225(j+1)−2+D5(j+1)−125(j+1)−1 [Expression 24]
The input to the operation at the k-th step, that is, the carry at the previous step (the (k−1)-th step) is Ck0-Ck5(10−k)+k−1, which is the coefficient of a binary expression that has the sub-index as an exponential of 2, and the numeral expressed by this binary indicates the number of 17ks contained in the data.
At the k-th step, the input includes 5 (10−k)+k pieces of binaries (carries Ck0-Ck5(10−k)+k−1) as shown in
The 1st “6 bit mod 17” circuit block <1> receives Ck5(10−k)+k−5 to Ck5(10−k)+k−1 and 0 on the input binary A0-A4, A5, and provides R5(10−(k+1))+k to R5(10−(k+1))+k4 and Ck+15(10−(k+1))+k from the outputs Q0-Q4 and the carry PF0, respectively.
The 2nd “6 bit mod 17” circuit block <2>, not shown, receives a carry Ck5(10−(k+1))+k−1 and the outputs R5(10−(k+1))+k0 to R5(10−(k+1))+k4 from the 1st “6 bit mod 17” circuit block <1> on the input binary A0, A1-A5, and provides R5(10−(k+1))+k−10 to R5(10−(k+1))+k−14 and Ck+15(10−(k+1))+k−1 from the outputs Q0-Q4 and the carry PF0, respectively.
Thereafter, as shown in
Thus, the conversion of binary to p-adic number is executed sequentially from the most significant bit of the carry C as in the schematic diagram shown in
A circuit including 56“6 bit mod 17” circuit blocks thus configured is shown in
As shown in
Next, the “5 res” circuit block shown in
The “6 bit mod 17” circuit block provides the residue Q modulo the prime 17 of the input binary A, and provides ‘1’ from PF0 if the input binary A is equal to or more than 17 and ‘0’ from PF0 if it is lower than 17.
In the case of h=5, p=17, relations shown in Expression 25 can establish among the binary A, the binary Q and the prime p.
a=A0+A12+A222+A323+A424+A525
Q=a(mod p)(a=Q+PF0×p)
Q=Q0+Q12+Q222+Q323+Q424 [Expression 25]
The “6 bit mod 17” circuit block comprises a PF0 generator unit U1, 2 half adders HA1, HA2, and 3 full adders FA1-FA3.
The PF0 generator unit U1 includes serially connected PMOS transistors QP1-QP5 and NMOS transistors QN1-QN2 between the Vcc terminal supplied with a certain voltage and the Vss terminal supplied with the ground voltage. These transistors QP1, QP2, QP3, QP4, QP5, QN1 and QN2 are controlled by A0, A1, A2, A3, A5, A0 and A4, respectively.
The PF0 generator unit U1 also includes a PMOS transistor QP6, 4 NMOS transistors QN3-QN6, and an inverter IV1.
The transistor QP6 is connected between the source of the transistor QP1 and the drain of the transistor QP4 in parallel. The transistors QN3-N5 are connected between the source and drain of the transistor QN1 in parallel. The transistor QN6 is connected between the source of the transistor QN1 and the drain of the transistor QN2 (Vss terminal) in parallel. These transistors QP6, QN3, QN4, QN5 and QN6 are controlled by A4, A2, A3 and A5, respectively.
The inverter IV1 has an input connected to the sources of transistors QN1, QN3-QN6. The output from the inverter IV1 provides the carry PF0.
The half adder HA1 has inputs A0 and PF0, an output Q0, and a carry output C0. The full adder FA1 has inputs C0 and A1, a carry input PF0, an output Q1, and a carry output C1. The full adder FA2 has inputs C1 and A2, a carry input PF0, an output Q2, and a carry output C2. The full adder FA3 has inputs C2 and A3, a carry input PF0, an output Q3 and a carry output C3. The half adder HA2 has inputs C3 and A4, and an output Q.
In accordance with the above configuration, the PF0 generator unit U1 decides if the binary A input to the “6 bit mod 17” circuit block is equal to or more than 17, and provides the result from PF0. If the binary A is equal to or more than 17, the half adders HA1, HA2 and the full adders FA1-FA3 are used to add 15, a complement of the 6-bit binary 17, to the binary A in order to subtract 17 from the binary A.
The following description is given to the core part of the “binary to p-adic” converter unit, that is, the p-adic” circuit block.
The “p-adic” circuit block receives B0-B11, I0-I59, and provides r0-r60 as shown in
Specifically, the inputs I0-I4, I5-I9, . . . , I54-I59 are fed via the control switches SW1 to the “X to p” circuit block as C00-C04, C05-C09, . . . , C054-C059, respectively. These control switches SW1 are controlled by the inputs B1-B10, respectively.
One control switch SW1 includes a transfer transistor TQ operative to connect the input IN with the output OUT, and an NMOS transistor QN operative to pull down the output OUT to the ground voltage. The transfer transistor TQ turns on if the control signal is CNT=‘0’ while the transistor QN turns on if the control signal is CNT=‘1’.
In the case of the control switches SW1, the control signals CNT include /B1-/B10. Therefore, I0-I59 are provided directly as C00-C059 if B=‘1’, and the output turns to ‘0’ independent of the input if B=‘0’. This is effective to prevent the input to the “X to p” circuit block from becoming indefinite even if the inputs I0-I59 to the “p-adic” circuit block are indefinite.
The “X to p” circuit block on receipt of C00-C059 provides R00-R504, C10-C155 as described above.
The outputs C10-C155 from the “X to p” circuit block pass through the control switches SW2 and turn to r5-r60, that is, the outputs from the “p-adic” circuit block. These control switches SW2 are controlled by the inputs B1-B10. Therefore, these control switches SW2 directly pass C10-C155 as r5-r60 if B=‘0’.
The outputs R00-R504 from the “X to p” circuit block pass through the control switches SW3 and turn to r0-r54, that is, the outputs from the “p-adic” circuit block. These control switches SW3 are controlled by B0□/B1 to B10□/B11, respectively. Therefore, the control switches SW3 located between R00 and r0, for example, directly provide R00 as r0 only if B0=‘1’ and B1=‘0’.
B1-B11 for use in control of the control switches SW are timing signals, which are signals that rise sequentially. In sync with this, the paths for the inputs I open at every 5 bits from the lower bit side and the paths for the outputs r switch to the paths for the outputs R.
In order to provide the result at the present step until the flow enters the computation process at the next step, R corresponding to a coefficient A on each digit of a 17-adic number is provided to the later-described external “D-r” register via the control switches SW3 on/off-controlled by the signals resulted from the logical operation with the adjacent timing signals B.
The following description is given to the “binary to p-adic” converter unit including the above-described circuits grouped together.
The “D-r” register is a register controlled by the timing signal B and the clock clk as shown in
The “D-r” register includes a flip-flop FF composed of 2 inverters at every bit. The flip-flop FF receives Dj (j=0-60) via the control switch SW1 and receives rj via the control switch SW2. On the other hand, the flip-flop FF is connected to an inverter IV1 on the output side via the control switch SW3. The output from the inverter IV1 provides Ij.
The control switches SW1-SW3 are controlled by the timing signal B0 and the clock clk. Specifically, the control switch SW1 turns on if /clk□/B0=‘1’, the control switch SW2 if /clk□B0=‘1’, and the control switch SW3 if clk=‘1’, respectively.
D50-D60 not contained in the data input to the “D-r” register are held at ‘0’.
In the initial state of the “D-r” register, a binary D0-D49 is set, and the rest is filled with ‘0’. Thereafter, when B0 rises, data rj is taken in sync with the fall of clk, and the taken rj is provided as Ij in sync with the rise of clk.
The “D-r” register couples with the “p-adic” circuit block to advance the computation step at every timing signal Bj. The state of variations in each clock is shown in
At each computation step, each digit Aj of a 17-adic number is obtained as the output r from the lower side, and this is held at the same timing as that for taking I in the second half of the timing signal Bj.
After completion of all the computation steps, the “D-r” register holds the coefficients Ajm on respective digits when the coefficients a on respective digits of 17-adic data D are converted to binaries.
In the case of p=17, the number of computation steps is equal to 12, and the number of “6 bit mod 17” circuit blocks contained in the “p-adic” circuit block is equal to 56.
<Conversion of 17-Adic Number to 25-Adic Number>
A square shown with “5 add 17” in
At the start, at the 0-th step (S0 in
Subsequently, at the 1st step (S1 in
Thereafter, the same steps as the 0-th step and the 1st step are repeated and, at the 9th step (S9 in
A consideration is given next to the configuration of the “p-adic to binary” converter unit.
It is possible to express data, a coefficient a3 on the digit at the j-th place of a 11-digit, 17-adic representation, in 5-bit binary. For the purpose of bringing the coefficient A of this binary representation into a representation in common with coefficients a on other digits, sub-indexes as shown in Expression 26 are used.
aj=Aj0+Aj12+Aj222+Aj323+Aj424
aj(25)j=A5j25j+A5j+125j+1+A5j+225j+2+A5(j+1)−225(j+1)−2+A5(j+1)−125(j+1)−1 [Expression 26]
The input to the operation at the k-th step, that is, the carry at the previous step (the (k−1)-th step) includes Ck0-Ck5(k+1)−1, of which sub-index of a coefficient in the binary representation is an exponent of 2. The number expressed by this binary indicates the number of 1710−ks contained in the data.
At the k-th step, 5 (k+1) pieces of “6 bit add 17” circuit blocks are used in processing as shown in
The 1st “6 bit add 17” circuit block <1> receives Ck0, Q−10-Q−14 at the carry and the inputs B0-B4, and provides Ck+10, Q00-Q04 from Q0, Q1-Q5, respectively.
The 2nd “6 bit add 17” circuit block <2>, not shown, receives Ck1, Q00-Q04 at the carry and the inputs B0-B4, and provides Ck+11, Q10-Q14 from Q0, Q1-Q5, respectively.
Thereafter, the “6 bit add 17” circuit blocks having the similar inputs and outputs are aligned 5(k+1) pieces in total as shown in
Thus, the conversion of a 17-adic number to a binary is executed sequentially from the least significant bit of a carry C as in a schematic diagram shown in
The circuit including the 5“6 bit add 17” circuit blocks thus configured is referred to as an “a to X” circuit block.
As shown in
The following specific description is given to the “5 add 17” circuit block shown in
FIG. 25 shows a circuit symbol of the “6 bit add 17” circuit block. The “6 bit add 17” circuit block receives a 5-bit binary B0-B4 and a 1-bit carry (carry), and provides a 6-bit binary Q0-Q5. The “6 bit add 17” circuit block adds a prime 17 to the input B if the carry is ‘1’, and provides the result as Q.
In the case of h=5, p=17, the relation shown in Expression 27 can be established between the binary B and the binary Q.
b=B0+B12+B222+B323+B424
Q=b+carry×p
Q=Q0+Q12+Q222+Q323+Q424+Q525 [Expression 27]
The “6 bit add 17” circuit block includes 4 half adders HA1-HA4 and a single full adder FA1.
The half adder HA1 has inputs B0 and carry, an output Q0, and a carry output C0. The half adder HA2 has inputs C0 and B1, an output Q1, and a carry output C1. The half adder HA3 has inputs C1 and B2, an output Q2, and a carry output C2. The half adder HA4 has inputs C2 and B3, an output Q3, and a carry output C3. The full adder FA1 has inputs B4 and carry, a carry input C3, an output Q4, and a carry output Q5.
In accordance with the above configuration, the “6 bit add 17” circuit block adds a prime 17 to the input binary B if carry=‘1’.
Next, the above-described “a to X” circuit blocks are used to configure a circuit for 1 step operative to lower the degree of the 17-adic number by 1. Hereinafter, the circuit is referred to as the “p to X” circuit block. The “p to X” circuit block can be used in all the computation steps in common.
The “p to X” circuit block comprises 10 “a to X” circuit blocks.
The 1st “a to X” circuit block <1> receives part of the inputs to the “p to X” circuit block, that is, Q−10-Q−14, and C90-C94, and provides Q′40-Q′44 and part of the outputs from the “p to X” circuit block, that is, C100-C104.
The 2nd “a to X” circuit block <2> receives Q40-Q44 and part of the inputs to the “p to X” circuit block, that is, C95-C99, and provides Q′90-Q′94 and part of the outputs from the “p to X” circuit block, that is, C105-C109. Among the inputs, Q40-Q44 are signals fed via the control switches SW1, through which the outputs Q′40-Q′44 from the 1st “a to X” circuit block <1> are controlled by the timing signal B9.
The 3rd “a to X” circuit block <3> receives Q90-Q94 and part of the inputs to the “p to X” circuit block, that is, C910-C914, and provides Q′140-Q′144 and part of the outputs from the “p to X” circuit block, that is, C1010-C1014. Among the inputs, Q90-Q94 are signals fed via the control switches SW2, through which the outputs Q′90-Q′94 from the 2nd “a to X” circuit block are controlled by the timing signal B8.
Thereafter, similar connections will be made up to the 10th “a to X” circuit block <10>.
The inputs and outputs of the “a to X” circuit block are connected via the control switches SW in this way, because the connection of the input is switched between the external input and the internal input at every computation step, and for the purpose of preventing the output from the internal circuit from interfering in the case of the external input.
In the case of the circuitry of
Subsequently, when the timing signal B1 also turns to ‘1’, the 9th “a to X” circuit block <9> is activated additionally. This corresponds to the 1st step.
Thereafter, at every sequential rise of the timing signals B2-B9, the “a to X” circuit block required at each step is activated.
The following description is given to the core part of the “p-adic to binary” converter unit, that is, the “binary” circuit block.
As shown in
Specifically, the inputs I5-I54 are fed as C90-C949 to the “p to X” circuit block via the control switches SW1. These control switches SW1 are controlled by the timing signals B1-B10. Therefore, the control switches SW1 pass I5-I54 directly as C90-C949 if B=‘1’ and keep the outputs at ‘0’ independent of the inputs if B=‘0’.
In addition, the input I0-I49 are fed as Q−10-Q444 to the “p to X” circuit block via the control switches SW2, respectively. These control switches SW2 are controlled in accordance with B10^/B9 to B1^/B0, respectively. Therefore, the control switch SW2 located between I0 and Q−10 passes I0 directly as Q−10 only if B10=‘1’, B9=‘0’.
The outputs C100-C1054 from the “p to X” circuit block turn to r0-r54, that is, the outputs from the “binary” circuit block via the control switches SW3. The control switches SW3 are controlled by the timing signals B9-B0. Therefore, the control switches SW3 pass C100-C1054 directly as r0-r54 if B=‘1’.
In accordance with the above circuitry, the “p to X” circuit block responds to each computation step while increasing the bit widths of the input and output 5-bit by 5-bit sequentially. While taking the numerals A on the digits of the 17-adic number from the upper digit sequentially at each computation step, and when all the computation steps are completed, a binary representation of data can be obtained.
As describe earlier, the timing signals B0-B10 are the signals that rise in order. In accordance therewith, the paths to the inputs I and outputs r conduct 5-bit by 5-bit from the upper bit.
The numeral A on each digit of the 17-adic number is initially set in the later-described external “A-r” register, and fed to the “A-r” register via the control switch SW3 on/off-controlled by the adjacent timing signal B such that the path switches selectively until the flow enters the next computation step.
The following description is given to the “p-adic to binary” converter unit including the above-described circuits grouped together.
The “A-r” register is a register controlled by the timing signal B0 and the clock clk and having the inputs r0-r54, A0-A54 and the outputs I0-I54 as shown in
The “A-r” register includes, at every bit, a flip-flop FF composed of 2 inverters. The flip-flop FF receives Aj (j=0-54) via the control switch SW1 and receives rj via the control switch SW2. On the other hand, the output of the flip-flop FF is connected to an inverter IV1 via the control switch SW3. The output from the inverter IV1 provides Ij.
The control switches SW1-SW3 are controlled by the timing signal B0 and the clock clk. Specifically, the control switch SW1 turns on if /clk□/B0=‘1’, the control switch SW2 if /clk□B0=‘1, and the control switch SW3 if clk=‘1, respectively.
The initial state of the “A-r” register includes the digits A0-A54 of the 17-adic number.
Thereafter, after the timing signal B0 rises, rj taken in sync with the fall of the clock clk is provided as Ij in sync with the rise of the clock clk.
The “A-r” register couples with the “binary” circuit block to advance the computation step at every timing signal Bj. The state of variations in each clock is similar to
After completion of all the computation steps, the “A-r” register holds a binary representation Dj of the input, that is, the p-adic number A.
Thus, preparations for configuring the p-adic NAND flash memory have been made.
Next, the configuration of the memory system according to the present embodiment is described along the flow of data processing.
Data processing in the memory system according to the present embodiment can be executed using a Lee metric code. The methods of data processing include the above-described syndrome converting method, and a Euclidean iterative method. A first embodiment describes a memory system using the syndrome converting method for data processing.
The general principle of the syndrome converting method is as described above. The following case is described with specific numerals such as p=17, ε=4, and δ=10 substituted based on the above-described specification.
Data D, that is, binary data to the “binary world” is converted to a 17-adic number at a 17-adic converter unit 101 serving as the entrance to the “p-adic Zp world”, and converted to a code in Zp, that is, data A.
The 17-adic converter unit 101 can be composed of the circuits described with reference to
Subsequently, an encoder unit 102 brings the data A output from the 17-adic converter unit 101 into action with a generator matrix G, thereby converting the data A to a Lee metric code, that is, a code C.
Subsequently, the code C output from the encoder unit 102 is stored in a p-adic cell flash memory 103.
Subsequently, the code held in p-adic cells is read out of the p-adic cell flash memory 103. The code read out of the p-adic cell flash memory 103 is a code Y (fourth data). The code Y corresponds to the code C, which has been deformed by errors caused when the code C is written in p-adic cells for storage, or while it is stored in the p-adic cells, or when it is read out of the p-adic cells.
The read-out code Y is fed to a syndrome generator unit 104. The syndrome generator unit 104 computes a syndrome <m′S in accordance with Expression 28 using a transpose syndrome matrix Ht and the powers of 3 types of diagonal matrixes.
If <m>S=0, no error arises. Then, for the purpose of achieving the final processing step in the “p-adic Zp world”, the code Y is fed to a decoder unit 109. In the case of <m>S≠0, on the other hand, if the first component S0 of <m>S in the case of <m>=(0, 0, 0) satisfies |S0|>5, reliable error correction is impossible. Accordingly, NG signal is provided and the error-containing code Y is fed to the decoder unit 109. In other cases, the syndrome <m>S is fed to a solution searching polynomial generator unit 105.
The solution searching polynomial generator unit 105 derives a solution searching polynomial Ψ(x) from the syndrome <m>S and, if the coefficient at the 4th degree is φ4≠0, then it feeds the φ(x) to a Hasse differential polynomial generator unit 106. In the case of φ4=0, on the other hand, the step of seeking a new syndrome <m>S with changed <m> is repeated to seek a polynomial Ψ(x) again. If the coefficient at the 4th degree of Ψ(x) is still φ4=0 even after repetitions up to <m>=(15, 15, 15), error correction is impossible. Accordingly, NG signal is provided and the error-containing code Y is fed to the decoder unit 109.
The Hasse differential polynomial generator unit 106 derives a Hasse differential polynomial from the input Ψ(x), then computes a root r and the multiplicity n of the root, and feeds them as t=r−1 to an error quantity computing unit 107.
The error quantity computing unit 107 derives the positional coordinates of the error-caused code word from t=r−1 and derives the error quantity et (={u(t)<m>}−1n) from n to obtain the total quantity, Σ|et|, of Lee metrics of the error quantity. If Σ|et|≦4, the flow shifts processing to a code restorer unit 108 in order to restore the code C of the Lee metric code. If Σ|et|≧5, it is dealt as uncorrectable because there is the possibility of failed correction. In this case, the flow repeats processing on and after the syndrome generator unit 104 for the next <m>. In the case of <m>=(15, 15, 15), error correction is impossible. Accordingly, NG signal is provided and the error-containing code Y is fed to the decoder unit 109.
The code restorer unit 108 restores the code C (c1, c2, . . . , c3) in accordance with ct=yt−et. The restored code C is fed to the decoder unit 109.
The decoder unit 109 executes an inverse transformation of the generator matrix G to the code C to obtain a 17-adic code A. This code A is fed to a 25-adic converter unit 110.
The 25-adic converter unit 110 converts the code A to a 25-adic number represented in binary, which is provided as the restored binary data D.
The 25-adic converter unit 110 can be composed of the circuits described with reference to
<Generation of Lee Metric Code, Error Correction, and Restoration of Lee Metric Code>
Next, the procedures of data processing described with reference to
Hereinafter, with respect to an ECC operation in the “p-adic Zp world”, the procedure of encoding data that is input as Zp data to the memory system, and the procedures of decoding the code read out of the memory system to finally obtain the data A are arranged.
The procedure of encoding is as follows.
At the start, the input data D is converted at the “binary to p-adic” converter unit to a 17-adic-represented, 11-digit data word D(h) shown in Expression 29 (S1 in
D(h)=(a0,a1, . . . , a10) [Expression 29]
Subsequently, the data D is multiplied by the generator matrix G to obtain 16 code word components c1-c16 of the code C (S2 in
Finally, the code word component cj is stored in memory cells (S3 in
The procedure of decoding the code Y read out of the p-adic cell array is as follows.
At the start, the code Y read out of the p-adic cell array is read out (S4 in
Y=(y1,y2, . . . ,y16)
Y=C+E,E=(e1,e2, . . . ,e16) [Expression 31]
Subsequently, a syndrome is computed from the code Y (S5 in
Subsequently, as S<0>+0=S0 in the case of <m>=0, the total quantity of Lee metrics of errors cannot reach 5 or higher if |S0|≦4 and accordingly the flow starts the following Procedures 2 and 3. In the case of |S0|>4, as the total quantity of Lee metrics of errors reaches 5 or higher, the flow determines that it is uncorrectable and terminates solution searching as no solution (S6, S7 in
(Procedure 1) For <m> of which components are changed in order, a syndrome <m>S=(S<m>+0, S<m>+1, . . . , S<m>+4) is derived from Expression 32 (S8 in
(Procedure 2) For ε=4, that is, the upper limit of the number of the correctable code word components, u=4 (S<m>+0)−1 is computed first, and then a syndrome u<m>S=(uS<m>+0, uS<m>+1, . . . , uS<m>+4) is derived from the syndrome obtained in Procedure 1 (S9 in
If the degree of Ψ(x) is equal to 4, that is, φ4≠0, then the flow advances to Procedure 3 to obtain a solution (S11 in
(Procedure 3) If it is found in Procedure 2 that a solution can be obtained, processing on and after Procedure 3 is proceeded to obtain the solution.
The process seeks coefficients of the Hasse differential polynomial required for obtaining the multiplicity of the solution of the polynomial Ψ(x) sought in Procedure 2 (S15 in
(Procedure 4) The Hasse differential polynomial obtained in Procedure 3 is given a substitution of the elements 1-16 in Zp to seek an element r that makes zero the 0-th degree differential polynomial (=Ψ(x)). Subsequently, as shown in Expression 35, such the degree n is sought that makes an (n−1)-th differential polynomial zero and an n-th differential polynomial non-zero, for each element r (S16 in
The obtained r is an inverse element of the position number t of the error-caused code component, and the corresponding n is the quantity converted from the caused error quantity et. This process is executed over all the elements r.
(Procedure 5) In Procedure 5, the error quantity is obtained through conversion from the multiplicity n of the solution (S17 in
Until now, the correct code C stored in the p-adic cell array has been obtained. Accordingly, binary data input to the p-adic memory system is sought in accordance with Procedure 6.
(Procedure 6) A pluralistic system of linear equations of a relation AG=C among the codes C and A and the generator matrix G is applied to obtain 11 elements a0-a10 in Zp and A (=a0, a1, . . . , a10). Thus, 11 elements a0-a10 in Zp can be obtained. The obtained elements a0-a10 are used to create data A (=a0, a1, . . . , a10) therefrom (S22 in
Thus, data processing in the “p-adic Zp world” is finished. Subsequently, a conversion is made to restore the data A to binary data at the exit of the “p-adic Zp world”. The data A is converted from the 11-digit, 17-adic representation to a 10-digit, 25-adic representation such that a numeral on each digit of 10 digits is represented in binary. This provides the binary data D input to the p-adic memory system.
Thus, restoration of data is completed.
Next, a computing circuit operative to seek the product of Zp is described. Hereinafter, the computing circuit is referred to as the “X Zp” circuit block.
The “X Zp” circuit block is roughly divided into a circuit for processing a computation step group in the first half, and a circuit for processing a computation step group in the second half.
The circuit for processing the computation step group in the first half includes an AND gate G1, and 4 “5 bit AD mod 17” circuit blocks.
The AND gate G1 yields the logical product of the i-th bit (i=0-4) of the multiplied numeral a and the j-th bit (j=0-4) of the multiplying numeral b, and provides it as Mij.
The “5 bit AD mod 17” circuit block is a circuit operative to seek the sum of 2 numerals in Zp modulo 17. The “5 bit AD mod 17” circuit block has the inputs A0-A4 and B0-B4 and the outputs Q0-Q4. The details are described later.
The 1st “5 bit AD mod 17” circuit block <1> receives M10-M40, ‘0’, M01-M41 at A0-A3, A4 and B0-B4, and provides Q00-Q04 from Q0-Q4, respectively.
The 2nd “5 bit AD mod 17” circuit block <2> receives Q01-Q04, that is, the output from the “5 bit AD mod 17” circuit block <1>, ‘0’, and M02-M42 at A0-A3, A4 and B0-B4, and provides Q10-Q14 from Q0-Q4, respectively.
As described above, the circuit for processing the computation step group in the first half includes the “5 bit AD mod 17” circuit block <1> through the “5 bit AD mod 17” circuit block <4>, of which inputs and outputs are connected in order.
The circuit for processing the computation step group in the second half includes 4“6 bit mod 17” circuit blocks. The “6 bit mod 17” circuit block is the circuit shown in
The 1st “6 bit mod 17” circuit block <1> receives Q20 and Q30-Q34 at A0 and A1-A5, and provides Q40-Q44 from Q0-Q4, respectively.
The 2nd “6 bit mod 17” circuit block <2> receives Q10 and Q40-Q44 at A0 and A1-A5, and provides Q50-Q54 from Q0-Q4, respectively.
As described above, the circuit for processing the computation step group in the second half includes the “6 bit mod 17” circuit block <1> through the “6 bit mod 17” circuit block <4>, of which inputs and outputs are connected in order.
All the circuits operate not in sync with clocks and decide the output Q when the input Mab is given.
Mention is made here of the circuit scale of the “X Zp” circuit block.
If p=17, h=5 as in the present example, the “X Zp” circuit block can be configured with 4“5 bit AD mod 17” circuit blocks and 4“6 bit mod 17” circuit blocks.
There is a requirement for such “X Zp” of 8 circuit blocks.
Next, the “5 bit AD mod 17” circuit block shown in
The “5 bit AD mod 17” circuit block seeks the sum of numerals a and b input from A and B, and provides the residue modulo a prime p of the resultant sum, from Q.
In the case of h=5, p=17, the numerals a, b and the binary representation Q of the residue establish a relation shown in Expression 36.
a=A0+A12+A222+A323+A424
b=B0+B12+B222+B323+B424
p=17=24+1,/p=25−p=23+22+21+1=15
a+b≡Q(mod 17)(a+b=Q+PF0×17)
Q=Q0+Q12+Q222+Q323+Q424 [Expression 36]
The “5 bit AD mod 17” circuit block comprises a PF0 generator unit U1, 3 half adders HA1-HA3, and 7 full adders FA1-FA7.
The PF0 generator unit U1 includes serially connected PMOS transistors QP1-QP4 and NMOS transistors QN1 and QN2 between a Vcc terminal supplied with a certain voltage and a Vss terminal. These transistors QP1, QP2, QP3, QP4, QN1 and QN2 are controlled by S0, S1, S2, S3, S0 and S4, respectively.
The PF0 generator unit U1 additionally includes a PMOS transistor QP5, 3 NMOS transistors QN3-QN5, and an inverter IV1.
The transistor QP5 is connected between the source of the transistor QP1 and the drain of the transistor QP4 in parallel. The transistors QN3-QN5 are connected between the source and drain of the transistor QN1 in parallel. These transistors QP5, PN3, QN4 and QN5 are controlled by S4, S1, S2 and S3, respectively. The inverter IV1 has an input connected to the sources of the transistors QN1, QN3-QN5. The output from the inverter IV1 provides a carry PF0.
The half adder HA1 has inputs A0 and B0, an output S0, and a carry output C0′. The full adder FA1 has inputs A1 and B1, a carry input C0′, an output S1, and a carry output C1′. The full adder FA2 has inputs A2 and B2, a carry input C1′, an output S2, and a carry output C2′. The full adder FA3 has inputs A3 and B3, a carry input C2′, an output S3, and a carry output C3′. The full adder FA4 has inputs A4 and B4, a carry input C3′, an output S4, and a carry output C4′. The half adder HA2 has inputs S0 and PF0, and an output Q0, and a carry output C0. The full adder FA5 has inputs S1 and PF0, a carry input C0, an output Q1, and a carry output C1. The full adder FA6 has inputs S2 and PF0, a carry input C1, an output Q2, and a carry output C2. The full adder FA7 has inputs S3 and PF0, a carry input C2, an output Q3, and a carry output C3. The half adder HA3 has inputs C3 and S4, and an output Q.
In accordance with the above configuration, the PF0 generator unit U1 determines if the sum of binaries A and B input to the “5 bit mod 17” circuit block is equal to or higher than 17. If the sum of binaries A and B is equal to or higher than 17, then the half adders HA1-HA3 and the full adders FA1-FA7 are used to add 15, a complement of 17 in 5-bit binary, to the sum of A and B in order to subtract 17 from the sum of A and B.
Hereinafter, the circuitry of the blocks in the “p-adic Zp world” is described with specific examples shown.
An example of circuitry of the encoder unit 102 is described below.
As shown in
Among these clocks ck and cl, ck is applied to control a “Counter (1 to 6)” circuit block and a “Ri (1-16)” register unit, and cl to control a “Ro (0-16)” register unit and a “Rgstr” register.
The “Counter (1 to 6)” circuit block is a circuit having an initial state of 0. It starts counting the number of clocks again at every rise of the clock ck and provides the number. Namely, of ckj (j=1-16), it provides j to an “X k-times” circuit block at the rise of ckj.
The “Ri (1-16)” register unit includes registers operative to store components cj of the code word C and capable of storing 16 numerals in total. The “Ri (1-16)” register unit stores numerals in the individual registers sequentially in sync with the timing of the rise of ck. Namely, data, that is, the element cj is taken into the register at the timing of the rise of ckj+1. At the rise of ck17, 16 elements cj are taken in the registers. In a word, the code C can be thus stored.
The “X k-times” circuit block is circuit that multiplies the input by the output at every rise of the clock cl. The “X k-times” circuit block multiplies the output by the input j at every rise of cl, 11-times in total. Namely, the rises of cli (i=0-10) bring the output from the “X k-times” circuit block to (j)i+1. This output is fed to the “X Zp” circuit block.
The “Ro (0-10)” register unit includes registers, which can store 11 numerals and, in the initial state, store 11 components a0-a10 of the code A. The “Ro (0-10)” register unit receives the clocks cl, and provides the components a0-a10 of the code A in order at every rise of the clock cl. Namely, on receipt of cli (i=0-10), it provides ai.
The “X Zp” circuit block is a circuit operative to execute a multiplication of the input in Zp. The “X Zp” circuit block receives the output (j)i+1 from the “X k-times” circuit block and the output ai from the “Ro (0-10)” register unit at every clock cli, and provides (j)i+1ai. The output numerals (j)i+1ai are summed in a combination of the “5 bit AD mod 17” circuit block and the “Rgstr” register.
The “5 bit AD mod 17” circuit block is a circuit that seeks the sum of 2 input numerals modulo 17. On the other hand, the “Rgstr” register is a register having an initial state of 0. It blocks any input from the “5 bit AD mod 17” circuit block at every input of the clock cl and feeds the self-holding contents to the “5 bit AD mod 17” circuit block. The connection of the “5 bit AD mod 17” circuit block with the “Rgstr” register as shown in
The “X k-times” circuit block shown in
The “X k-times” circuit block is a circuit operative to compute a power (X)j of the input X, which is controlled by the clock clj (j=1-16) as shown in
The “X k-times” circuit block includes an “X Zp” circuit block, and a “Rgstr” register <1> and a “Rgstr” register <2> operable in sync with the clock cl as shown in
The “Rgstr” register <1> has an input connected to X, and an output connected to one output of the “X Zp” circuit block. The “Rgstr” register <2> has an input connected to the output of the “X k-times” circuit block X, and an output connected to one input of the “X k-times” circuit block. The “Rgstr” register <2> holds ‘1’ in the initial state.
In accordance with this circuitry, the “X k-times” circuit block takes its own output with a 1-cycle delay to obtain the product of the input X and the output (X)j.
The output (X)j is multiplied cumulatively by the input X at every input of the clock cl. The data X is set in the “Rgstr” register <1> before the clock cli (j=1-k) rises, and the “Rgstr” register <2> having an initial state of ‘1’ is brought into sync therewith, thereby obtaining (X)j at the j-th clj.
An example of circuitry of the syndrome generator unit 104 is described below.
First, a circuit for seeking component elements of the syndrome <m>S is described.
Scanning of <m>=(m1, m2, m3) is executed over the components from 1 to 15 each. When each <m> is given, the corresponding component of the syndrome is computed. An equation required for operational processing of the component of the syndrome is shown in Expression 37.
At the start, all the <m>-associated powers of elements in Zp required for computation are sought. In a computing method using that powers in seeking the sum of syndromes, the process seeks the power of each element in Zp and, when the element j in Zp is selected, the process decodes 17-j or 17-j−1 to select the power of the corresponding element, thereby computing the product or the sum.
Therefore, a computing circuit operative to seek the powers of elements in Zp is described.
This computing circuit comprises a selecting circuit U1, a “(j)i (j=1 to 16)” circuit block, a “R1 (0-15)” register unit, a “R2 (0-15)” register unit, and a “R3 (0-15)” register unit.
The selecting circuit U1 includes 16 AND gates G1-G16. The AND gate G1 receives B0 and /B1, and provides a signal “m=0”. The AND gate G2 receives B1 and /B2, and provides a signal “m=1”. From then on, in accordance with similar input/output, the AND gates G3-G16 provide signals “m=2” to “m=15”, respectively. These signals “m=0” to “m=15” turn via the control switches SW1-SW16 to the outputs rξ (ξ=1, 2, 3) from the selecting circuit U1. The signals “m=0” to “m=15” are decoded through the control switches SW1-SW16 at every value of the component of <m>.
As shown in
The output (j)i is latched as data in the “R1 (0-15)” register unit, the “R2 (0-15)” register unit, and the “R3 (0-15)” register unit in appropriate cycles to obtain 3 powers of the elements in Zp.
Determination of the timing of taking data in the “R1 (0-15)” register unit, the “R2 (0-15)” register unit and the “R3 (0-15)” register unit requires the generation and use of signals B0-B16 shown in
The selecting circuit U1 decodes and selects these signals m=0, m=1, . . . , m=15 at every value of the component mξ of <m>, and provides it as a signal rξ, which is effective only in the time of cki of i=mζ.
The “R1 (0-15)” register unit, the “R2 (0-15)” register unit and the “R3 (0-15)” register unit each determine the time of taking data in accordance with the signal j, that is, the logical sum of the signal rξ and the clock cli, and latch data in sync with the clock cl. As a result, at the end of the clock cycle, the mζ-th (ζ=1, 2, 3) powers of the elements in Zp are stored in the “R1 (0-15)” register unit, the “R2 (0-15)” register unit and the “R3 (0-15)” register unit.
Thus, the <m>-associated powers of the elements in Zp are generated.
The subsequent description is given to a computing circuit, applied to each element j in Zp, for computing an error component ε<m>+lj (l=1, 2, 3, 4; j=1-16) converted through the syndrome converting method. The error component is defined as ε<m>+lj=(17-j−1)m3(17-j)m2(j)ml+1yj.
The error component generator circuit comprises a decoder unit U1, and a product operating unit U2.
The decoder unit U1 includes a “Counter (1 to 16)” circuit block, a “17-j dec” circuit block, a “17-j−1 dec” circuit block, a “j2 dec” circuit block, a “j3 dec” circuit block and a “j4 dec” circuit block, and also includes the “R1 (0-15)” register unit, the “R2 (0-15)” register unit and the “R3 (0-15)” register unit used to store the mξ-th powers of the elements in Zp (ξ=1, 2, 3) shown in
The decoder unit U1, “Counter (1 to 16)” circuit block generates j in order at the timing of the clock clj (j=1-16). The j output from the “Counter (1 to 16)” circuit block is fed to the “17-j dec” circuit block, to the “17-j−1 dec” circuit block, to the “j2 dec” circuit block, to the “j3 dec” circuit block and to the “j4 dec” circuit block.
The “17-j dec” circuit block decodes the input j to provide 17-j. The “17-j−1 dec” circuit block decodes the input j to provide 17-j−1. The j2 dec” circuit block decodes the input j to provide j2. The“j3 dec” circuit block decodes the input j to provide j3. The“j4 dec” circuit block decodes the input j to provide j4.
The “R1 (0-15)” register unit provides (j)m1 based on the input j. The “R2 (0-15)” register unit provides (17-j)m2 based on 17-j output from the “17-j dec” circuit block. The “R3 (0-15)” register unit provides (17-j−1)m3 based on 17-j−1 output from the “17-j−1 dec” circuit block.
Thus, the decoder unit U1 can obtain the outputs (j)m1, (17-j)m2, (17-j−1)m3.
The product operating unit U2 comprises a “Ro (1-16)” register unit, and 7 “X Zp” circuit blocks <1>-<7>.
The “Ro (1-16)” register unit includes 16 latches. These 16 latches hold 16 components of the code Y read out of the p-adic cells, that is, the elements y1-y16 in Zp, respectively. The “Ro(1-16)” register unit provides yj, held in the latches, in sync with the clock clj (j=1-16).
The “X Zp” circuit block <1> computes the product in Zp of yj output from the Ro (1-16)” register unit and (j)m1 output from the “R1 (0-15)” register unit, and provides the product.
The “X Zp” circuit block <2> computes the product in Zp of (17-j)m2 output from the “R2 (0-15)” register unit and (17-j−1)m3 output from the “R3 (0-15)” register unit, and provides the product.
The “X Zp” circuit block <3> computes the product in Zp of the output from the “X Zp” circuit block <1> and the output from the “X Zp” circuit block <2>, and provides the product ε<m>+0j.
The “X Zp” circuit block <4> computes the product of <m>+0j output from the “X Zp” circuit block <3> and j, and provides the product ε<m>+lj.
The “X Zp” circuit block <5> computes the product of ε<m>+0j output from the “X Zp” circuit block <3> and j2 output from the “j2 dec” circuit block, and provides the product ε<m>+33.
The “X Zp” circuit block <6> computes the product of ε<m>+0j output from the “X Zp” circuit block <3> and j3 output from the “j3 dec” circuit block, and provides the product ε<m>+3j.
The “X Zp” circuit block <7> computes the product of ε<m>+0j output from the “X Zp” circuit block <3> and j4 output from the “j4 dec” circuit block, and provides the product ε<m>+4j.
With the above-configured product operating unit U2, the selected read data Y can be obtained in sync with the clock clj as ε<m>+0j, ε<m>+lj, ε<m>+2j, ε<m>+3j, ε<m>+4j corresponding to the elements j in Zp.
The subsequent description is given to a computing circuit operative to seek the sum of ε<m>+lj (l=1, 2, 3, 4) where j=1-16. Seeking the sum of ε<m>+lj (l=1, 2, 3, 4) where j=1-16 allows a syndrome S<m>+1 to be obtained. Hereinafter, this computing circuit is referred to as the “syndrome generating circuit”.
The syndrome generating circuit includes a “Rgstr” register and a “5 bit AD mod 17” circuit block.
The syndrome generating circuit yields the sum of ε<m>+lj in sync with the clock clj (j=1-16) through a loop containing the “5 bit AD mod 17” circuit block and the “Rgstr” register to generate the syndrome component S<m>+l (l=0-4).
The following description is given to a circuit block for computing the i-th power of the element j in Zp used in the circuit for seeking jm shown in
The “(j)i (j=1 to 16)” circuit block is controlled by the clocks cki (i=0-15) and clj (j=1-16), and provides (j)i and (j)i+1 in sync with the rise of the clock cl3.
The “(j)i (j=1 to 16)” circuit block is a circuit that computes the 0-th to 15-th powers of all the elements 1-16 in Zp other than zero in order, and holds them in registers.
The “(j)i (j=1 to 16)” circuit block includes an “X Zp” circuit block, a “Counter (1 to 16)” circuit block, and a “R (1-16)” register unit as shown in
The index-decidable clock is cki, and an index i can be decided by the order of the clock cki. On the other hand, the elements in Zp are designated in order from 1 by the clock cl, and the order j of the clock cl indicates the number of elements.
The “Counter (1 to 16)” circuit block is connected to one input of the “X Zp” circuit block. It uses cki as the start signal and counts up at the timing of the rise of clj within a range of 1-16.
The “R (1-16)” register unit includes 16 registers, and it stores inputs i1-i16 in No. 1-16 registers in order at the rise of the clock /clj at in, and provides the contents i1-i16 of No. 1-16 registers in order at the rise of the clock clj at out.
As shown in
As an example of circuitry of the solution searching polynomial generator unit 105, an operating circuit for seeking a solution searching polynomial Ψ(x) is described below. This operating circuit is referred to as the “solution searching polynomial generating circuit”.
A processing equation is shown in Expression 38, which is required in operational processing for deriving a coefficient φj at each degree j of x in the solution searching polynomial Ψ(x).
The solution searching polynomial generating circuit is a circuit for use in the step of error searching with the use of a syndrome u<m>S=(uS<m>+0, uS<m>+1, uS<m>+2, uS<m>+3, uS<m>+4).
The solution searching polynomial generating circuit allows a simple circulation circuit to generate the solution searching polynomial Ψ(x) without using a complicated Euclidean iterative method.
The solution searching polynomial generating circuit comprises a first part circuit U1 for deriving Σφj−lS<m>+i on the right side of the 2nd equation shown in Expression 38, and a second part circuit U2 for deriving u(j)−1 on the right side of the 2nd equation shown in Expression 38 as well.
The first part circuit U1 includes 4 serially connected “Rgstr” registers <1>-<4>, and 4 “X Zp” circuit blocks <1>-<4> connected to the outputs of the Rgstr” registers, respectively.
The “Rgstr” register <1> has an initial value of ‘1’ and other “Rgstr” registers <2>-<4> have an initial value of ‘0’.
The first part circuit U1 is controlled by the clock ck fed to the “Rgstr” registers, and provides the coefficients φj−1, φj−2, φj−3, φj−4 at the respective degrees from the nodes between the “Rgstr” registers at the rise of the j-th clock ckj. A node with no coefficient present is at ‘0’ and accordingly cannot contribute to the product operation with the syndrome components S<m>+1-S<m>+4 executed by the “X Zp” circuit blocks <1>-<4>. The outputs from the X Zp” circuit blocks <1>-<4> are summed 2 by 2 at the “5 bit AD mod 17” circuit blocks, and a ladder of these “5 bit AD mod 17” circuit blocks is applied to finally obtains Σφj−1S<m>+i.
The second part circuit U2 comprises a “Counter (1 to 4)” circuit block, “X Zp” circuit blocks <a>, <b>, and a “j−1 dec” circuit block.
The second part circuit U2 generates (jS<m>+0)−1 from j generated at the “Counter (1 to 4)” circuit block and the syndrome component S<m>+0, in accordance with the clock ckj, at the “X Zp” circuit block <a> and the “j−1 dec” circuit block. Then, the “X Zp” circuit block <b> derives u(j)−1 from the generated (jS<m>+0)−1 and the established ε=4.
An “X Zp” circuit block <c> is used to multiply Σφj−1S<m>+i generated at the first part circuit U1, by u(j)−1, generated at the second part circuit U2, to yield the coefficient φj.
The coefficient φj is a h-bit binary, which is represented in complement to represent a negative number. Therefore, the output from the “X Zp” circuit block <c> is inverted through the inverter IV1, and ‘1’ is added thereto at the “5 bit AD mod 17” circuit block, thereby obtaining a complement representation of 5-bit binary.
The “j−1 dec” circuit block shown in
The solution searching polynomial generating circuit described above is possible to provide the coefficients φ4-φ0 where j=4 on the respective nodes on receipt of 4-times inputs of the clock ck.
An example of circuitry of the Hasse differential polynomial generator unit 106 is described below.
If the degree of Ψ(x) is coincident with 4, it is required to derive the root of this Ψ(x) and the associated multiplicity. Therefore, the following description is given to an operating circuit operative to compute a coefficient of the Hasse differential polynomial required in deriving the multiplicity of the root. Hereinafter, this operating circuit is referred to as the “Hasse differential polynomial coefficient generating circuit”.
A relation between the coefficient of the Hasse differential polynomial, the coefficient, and the coefficient of Ψ(x) is as shown in Expression 39.
In a word, as can be found from Expression 39, the Hasse differential polynomial coefficient generating circuit multiplies the coefficient of Ψ(x) by the binomial coefficient, generates the coefficient at each degree at every differential stage using the clocks ck and cl, and stores all the coefficients in the registers.
The Hasse differential polynomial coefficient generating circuit includes a “i!” table having a record of the factorial i! of each element i in Zp, a “(i!)−1” table having a record of the inverse element, and a “φi” table having a record of the coefficient φi of the solution searching polynomial Ψ(x). As for the i!” table and “(i!)−1” table, they can be suppressed to sufficiently small scales in practice in the case of p=17 as in the present example.
As for the “φi” table, it is created in the Hasse differential polynomial coefficient generating circuit shown in
The Hasse differential polynomial coefficient generating circuit includes, as shown in
The number of clocks of the clock ck corresponds to the number of differential stages, and i in cki takes a value of 1-4. On the other hand, the clock cl corresponds to the degree, and a growth in number of differential stages increases undesired degrees. Therefore, it is not required to generate the clock cl every time by the same number as the clock ck though the circuit of
For the purpose of storing the coefficients of Ψ(x), that is, the coefficients corresponding to Ψ[0], previously in the registers, ck0 is provided as a matter of convenience.
The 5-bit binary representations of these numerals can be used to form a decoder. Examples of the functions, which can scan all elements in Zp without excess/shortage when j varies through all elements 1-16 in Zp, are as shown in the table, which may exclude even powers and factorials in many cases. For example, ai can scan all elements in Zp except zero when j varies if a can satisfy the condition. For example, the cases of a=3, 5, 6, 7, 10 are as shown in
These functions can be utilized to extend the searching range by virtual errors in the syndrome converting method.
As an example of the circuit that configures the decoder, a decoder operative to derive the inverse element j−1 of an element j in Zp is described. Hereinafter, this decoder is referred to as the “j−1 dec” circuit block.
The inverse element j−1 can be derived from the element j by seeking a relation of j×j−1≡1 (mod 17), then configuring j and j−1, 5-bit binary representations as shown in Expression 40 to form a converting decoder thereof.
Input:Ii=(j)i
j=(j)0+(j)12+(j)222+(j)323+(j)424
Output:Oi=(j−1)i
jj−1≡1(mod p)
j−1=(j−1)0+(j−1)12+(j−1)222+(j−1)323+(j−1)424 [Expression 40]
The “j−1 dec” circuit block can be configured by a general circuit operative to decode 5-bit binary data I to other 5-bit binary data O, setting I=j, O=j−1 as shown in Expression 40 in this case.
The “j−1 dec” circuit block includes NAND gates G1 and G2 to decode bits I0-I3 partially 2-bit by 2-bit to create signals /A0-/A3, /B0-/B3, and also includes a 3-input NOR gate G3 to create Om (m=0-4) from /Am, /Bm, I4 (or /I4). In the case of j=0, a setting of j−1=0 is given.
The following description is given to an operating circuit operative to compute the root and the associated multiplicity of the solution searching polynomial Ψ(x). Hereinafter, this operating circuit is referred to as the “solution searching polynomial root/multiplicity operating circuit”.
The solution searching polynomial root/multiplicity operating circuit regards the 0-th stage differential in the Hasse differential polynomial as Ψ(x). Unless the Hasse differential polynomial is zero for each element in Zp, the circuit shifts the computation to the next element. The circuit increases the number of differential stages while the Hasse differential polynomial keeps zero. When registers are used to hold the number of the first Hasse differential stages where the Hasse differential polynomial is not zero, for each element in Zp, the element with the number of stages being not zero indicates the root, and the number of the remaining stages indicates the multiplicity of the root. In a word, when those with non-‘0’ contents are selected from the registers, the held value indicates the multiplicity of the root.
Specifically, when the root of the solution searching polynomial Ψ(x) is denoted with α, and a Hasse differential polynomial of Ψ(x) in the i-th stage with [Ψ(x)][i], a relational equation is given as Expression 41 where the multiplicity of α is denoted with n.
The solution searching polynomial root/multiplicity operating circuit seeks n as in Expression 41 for each element a in Zp regardless of whether it is the root of Ψ(x) or not. In the case of n=0, it means that α is not a root.
The solution searching polynomial root/multiplicity operating circuit scans the elements 1-16 in Zp at the clock ck to seek the number of stages of the Hasse differential at the clock cl, and the value of the Hasse differential polynomial in that number of stages at the clock clk. The clock ck is generated when the computed value of the Hasse differential polynomial becomes non-zero such that the process enters the cycle for the next element in Zp.
The solution searching polynomial root/multiplicity operating circuit includes, as shown in
The “(j)i (j=1 to 16)” circuit block selects an element α on α-times receipt of the clock ck, and provides the j-th power of α on (j−1)-times receipt of the clock clk.
The “Ro i(0-4)/j(0-4)” register unit is a register operative to provide a coefficient φ[i]j of the Hasse differential polynomial on receipt of the clocks cli and clkj.
The “X Zp” circuit block multiplies the output a from the “(j)i (j=1 to 16)” circuit block by the output φ[i]j from the “Ro i(0-4)/j(0-4)” register unit, and provides αjφ[i]j.
Thus, when the clock clk is applied 5-times to the “(j)i (j=1 to 16)” circuit block, the “Ro i(0-4)/j(0-4)” register unit, and the “X Zp” circuit block, the value of the Hasse differential polynomial can be obtained. For the purpose of simplifying the control of the clocks ck, cl and clk, the sum of the terms of the value 0 not present in this computation is computed as well. Therefore, the total sum of the clocks clk originally required is almost halved.
The value [Ψ(x)][i] of the differential polynomial is taken in the “Rgstr” register <1> as ‘0’ in the case of zero and ‘1’ in other cases.
The value held in the “Rgstr” register <1> is output at the timing of the clock clk0 as the clock ckα (α=1-16). The clock ckα is held until it is reset by the clock clk4.
The “Rgstr” register <1> has an initial value of 0 and accordingly the clock ckl rises at the first clk0. The clock ck2 and so forth rise at clk0 in any cycle of clk in accordance with the computation result.
The “clock cl gen.” circuit block generates the clock cli in sync with the clock clk0 and resets it to the clock cl0 at every rise of the clock ckα.
The “Counter (0 to 4)” circuit block is reset to 0 by the clock ckα, and counts up at every input of the clock cl and outputs the order of the clock cl minus 1. This output is stored in the “Li (1-16)” register unit.
The “Li (1-16)” register unit has the inputs, which are switched by the clock ckα, and thus can store the associated multiplicity in the α-th register.
An example of circuitry of the error quantity computing unit 107 is described below.
The following description is given to a converting circuit operative to derive a true error from the root and multiplicity of the Hasse differential polynomial, an α and [n] pair, to execute error correction. This converting circuit is herein referred to as the “code correcting circuit”.
The solution sought through the syndrome converting method, the virtual error n, the error-caused position t, and the true error et have a relation therebetween as in Expression 42.
The numeral (j)<m>=(17-j−1)m3(17-j)m2(j)m1 is utilized in the conversion. This numeral is equal to S<m>+0 when compared with the relational equation S<m>+1 of the syndrome component shown in Expression 43 if Y=(0, 0, . . . , yj=1, . . . , 0, 0) is used as the syndrome component at l=0. This is used to give a representation of T<m>+0 to the syndrome component replaced by l for the t component of Y and 0 for others, which is used as a circuit input in a circuit shown in a block diagram of
The code correcting circuit includes a first part unit U1 operative to seek the error-caused position t, a second part unit U2 operative to seek the true error et, and a third part unit U3 operative to execute code correction in accordance with the true error et.
The first part unit U1 includes a “j−1 dec” circuit block <1≦.
The “j−1 dec” circuit block <1> receives the root α, and provides the error-caused position t.
The second part unit U2 includes 3 “X Zp” circuit blocks <1>-<3>, and a “j−1 dec” circuit block <2>.
The “X Zp” circuit block <1> receives the syndrome component S<m>+0 and 4−1 (=13), and provides u−1, that is, the inverse element of the numeral u found in the conversion.
The “j−1 dec” circuit block <2> receives T<m>+0, and provides {(t)<m>}−1, that is, the inverse element of (t)<m>.
The “X Zp” circuit block <2> receives the multiplicity n and the output u−1 from the “X Zp” circuit block <1>, and provides the product thereof nu−1.
The “X Zp” circuit block <3> receives the output nu−1 from the “X Zp” circuit block <2> and the output {(t)<m>}−1 from the “j−1 dec” circuit block <2>, and provides the true error et.
The third part unit U3 includes 2“5 bit AD mod 17” circuit blocks <1> and <2>.
The “5 bit AD mod 17” circuit block <1> receives /et and ‘1’, and provides −et, that is, the complement of et.
The “5 bit AD mod 17” circuit block <1> receives the code yt read out of the p-adic cells and the output −et from the “5 bit AD mod 17” circuit block <1>, and provides the error-corrected correct code ct.
The code correcting circuit thus configured is possible to restore the correct code ct.
The following description is given to an operating circuit for computing a Lee metric w(E)=Σ|ej| (j=1-16) of an error code word E. Hereinafter, this operating circuit is referred to as the “Lee metric operating circuit”.
The Lee metric operating circuit includes a “Ro (0-16)” register unit, a “5 bit LM mod 17” circuit block, a “5 bit AD mod 17” circuit block, and a “Rgstr” register.
The error code word E derived from the polynomial Ψ(x) is stored as the initial value in the “Ro (0-16)” register unit. From the “Ro (0-16)” register unit, the components ej of E are taken out in order by the clock ckj.
The “5 bit LM mod 17” circuit block computes the associated Lee metrics |ej| from the taken-out components ej. The “5 bit LM mod 17” circuit block provides the Lee metrics |ej| of the components, computed at every clock ckj, to the “5 bit AD mod 17” circuit block.
The “Rgstr” register and the “5 bit AD mod 17” circuit block form a loop, which is used to sum these |ej|. The output from the “5 bit AD mod 17” circuit block when the 16th clock ck rises provides w(E)=Σ|ej|.
If w(E)≦4, a series of error searching are finished, and E can be used to execute correction.
The error E sought through operational processing can be corrected after it is ensured that the associated Lee metric is equal to or below and it can be found that failed correction does not occur.
Therefore, the following description is given to an operational circuit element operative to compute a Lee metric of the element in Zp. Hereinafter, this operational circuit element is referred to as the “5 bit LM 17” circuit block.
As for an element a in Zp represented in 5-bit binary, the associated Lee metric Q=|a| can be expressed by Q=/PF0×a+PF0×(17−a). Here, PF0 becomes ‘1’ if a≧9 and ‘0’ if a<9. Therefore, if a≧9, a derivation of the Lee metric of a just requires a subtraction of a from 17, that is, an addition of the complement of a to 17.
A relation between A and Q in the case of h=5, p=17 is as in Expression 44.
a=A0+A12+A222+A323+A424
Q=|a|(Q=/PF0×a+PF0×(17−a))
Q=Q0+Q12+Q222+Q323+Q424 [Expression 44]
The “5 bit LM 17” circuit block receives a 5-bit binary A0-A4, and provides a 5-bit binary Q0-Q4.
The “5 bit LM 17” circuit block comprises a PF0 generator unit U1, an XOR gate G1, 3 half adders HA1-HA3, and 2 full adders FA1, FA2.
The PF0 generator unit U1 includes, between the Vcc terminal and the Vss terminal, serially connected PMOS transistors QP1-QP4 and NMOS transistors QN1, QN2. These transistors QP1, QP2, QP3, QP4, QN1 and QN2 are controlled by A4, A2, A1, A0, A0, and A3, respectively.
The PF0 generator unit U1 also includes a PMOS transistor QP5, 3 NMOS transistors QN3-QN5, and an inverter IV1.
The transistor QP5 is connected between the source of the transistor QP2 and the drain of the transistor QP4. The transistors QN3 and QN4 are connected between the source and drain of the transistor QN1. The transistor QN5 is connected between the source of the transistor QN1 and the drain of the transistor QN2 (Vss terminal). These transistors QP5, QN3, QN4 and QN5 are controlled by A3, A1, A2 and A4, respectively. The inverter IV1 has an input connected to the sources of the transistors QN1, QN3, QN4 and QN5. The output from the inverter IV1 provides the carry PF0.
The XOR gate G1 receives Aj (j=0-4) and PF0, and provides Bj.
The full adder FA1 has inputs B0 and PF0, a carry input PF0, an output Q0, and a carry output C0. The half adder HA1 has inputs C0 and B1, an output Q1, and a carry output C1. The half adder HA2 has inputs C1 and B2, an output Q2, and a carry output C2. The half adder HA3 has inputs C2 and B3, an output Q3, and a carry output C3. The full adder FA2 has inputs B4 and PF0, a carry input C3, and an output Q4.
In this specific example, if the input reaches 9 or above, the complement of a is added to 17. In the case of PF0=1, the complement of a is created at the XOR gate G1 by inverting Aj, that is, each bit representation of a, to yield Bj, and adding 1 thereto.
As p=17 is 17=(10001)2, this is represented by PF0, and further PF0 is used as 1, then these are added to Bj as the sum of binaries.
The “5 bit LM 17” circuit block operates not in sync with the clocks and, on receipt of inputs, provides the computed Lee metric.
An example of circuitry of the decoder unit 109 is described below.
After error searching is finished and a Lee metric code C corrected with the error code E is obtained, it is required to restore it to the data code A in Zp. This operation corresponds to an inverse operation of C=AG with a generator matrix G though an inversion of a matrix requires a large-scale operation. Therefore, the elements of A are sequentially derived from the elements of C. The computation process is shown in Expression 45.
As shown in Expression 45, the relation cj=Σ(j)i+1ai is deformed in turn to sequentially derive a1 from a0, subsequently a2 from a1, . . . , then am. This is the principle of computation.
Both sides are multiplied by the inverse element of the power of j to yield the terms on all elements in Zp, which are then summed. At this time, the deformation is executed based on the fact that the sum of all elements in Zp becomes zero.
Expression 46 shows a relational equation of the components of C and the components of A.
The following description is given to a specific operating circuit realized using the relational equation shown in Expression 46.
When seeking am after am−1 is obtained, parallel utilization of clocks is executed by computing the sum of j at am in order while seeking c(m−1)j.
At the start, a circuit for converting the component of the Lee metric code C every time is described based on c(m−1)j=c(m−2)j−(j)mam−1 shown in Expression 46. Hereinafter, this circuit is referred to as the “c(m−1)j” circuit block.
The “c(m−1)j” circuit block is a circuit operable in sync with the clocks clj, /clj. It receives (j)m, am−1 and the code components c1-c16, and provides c(m−1)j.
The “c(m−1)j” circuit block includes an “X Zp” circuit block, 2“5 bit AD mod 17” circuit blocks <1>, <2>, and a “R (1-16)” register unit.
The “X Zp” circuit block yields the product of (j)m and am−1, and provides it to the “5 bit AD mod 17” circuit block <1> via the inverter IV1.
The “5 bit AD mod 17” circuit block <1> seeks the complement of (j)mam−1 output through the inverter IV1 to yield −jmam−1, and provides it to the “5 bit AD mod 17” circuit block <2>.
The “5 bit AD mod 17” circuit block <2> yields the sum of −jmam−1 output from the “5 bit AD mod 17” circuit block <1> and cj (=c(m−2)j) output from the “R (1-16)” register unit in sync with the clock cl. This sum turns to c(m−1)j, that is, the output from the “c(m−1)j” circuit block. The output c(m−1)j from the “5 bit AD mod 17” circuit block <2> is recorded in the j-th register in the “R (1-16)” register unit in sync with the fall of the clock cl.
The subsequent description is given to a circuit for creating am based on am=16−1Σ(jm+1)−1c(m−1)j (j=1-16) shown in Expression 46. Hereinafter, this circuit is referred to as the “am” circuit block.
The “am” circuit block is operable in sync with the clock clj. It receives (j)m+1 and c(m−1)j, and provides am.
The “am” circuit block includes a “j−1 dec” circuit block, 2 “X Zp” circuit blocks <1>, <2>, a “5 bit AD mod 17” circuit block, and a “Rgstr” register.
The “j−1 dec” circuit block converts (j)m+1 to the inverse element thereof, (j)−m+1, and provides it to the “X Zp” circuit block <1>.
The “X Zp” circuit block <1> operates the product of (j)−(m+1) fed from the “j−1 dec” circuit block and cj (=c(m−1)j) output from the “c(m−1)j” circuit block, and provides the product to the “5 bit AD mod 17” circuit block.
The products output from the “5 bit AD mod 17” circuit block are summed through a loop including the “Rgstr” register having an initial value of ‘0’ and the “5 bit AD mod 17” circuit block. The result is fed from the “5 bit AD mod 17” circuit block to the “X Zp” circuit block <2>.
The “X Zp” circuit block <2> operates the product of the sum output from the “5 bit AD mod 17” circuit block and 16−1=16 to obtain am, that is, the output from the “am” circuit block.
Finally, a computing circuit is described, which includes the above-described “c(m−1)j” circuit block and “am” circuit block to obtain the data code A. Hereinafter, this computing circuit is referred to as the “inverse transforming circuit”.
As shown in
The inverse transforming circuit comprises a first part circuit U1 for converting the component of the Lee metric code C required for inverse transformation every time, and a second part circuit U2 for seeking the elements of the data code A one by one.
The first part circuit U1 includes a “(ji (j=1 to 16)” circuit block, a “Rgstr” register, and a “c(m−1)j” circuit block.
The second part circuit U2 includes a “(j)i (j=1 to 16)” circuit block, an “am” circuit block, and a “Li (0-10)” register unit.
Among those, the “(j)i (j=1 to 16)” circuit block is shared with the first part circuit U1. The “(j)i (j=1 to 16)” circuit block is a circuit operative to generate the (m+1)-th and m-th powers of the element j in Zp for use in the “am” circuit block and the “c(m−1)j” circuit block. This circuit block applies m+1, the number of cycles of the clock ck and, m, a 1-cycle lower number than this number of cycles, as indexes, and provides (j)m+1 and (j)m.
The “(j)i (j=1 to 16)” circuit block provides the element in Zp itself to the “am” circuit block and ‘1’ to the “c(m−1)j” circuit block in the 1st cycle of ck. The ‘1’ fed to the “c(m−1)j” circuit block is originally not utilized in computation. Accordingly, it is required to put a thought in preventing the ‘1’ from exerting an influence on the operation result. Therefore, for the purpose of removing the influence of the ‘1’, the product operation in the first part circuit U1 turns the other input to ‘0’ at the first cycle of the clock ck such that the result of the product operation becomes zero.
The “Rgstr” register in the first part circuit U1 is given a setting of am. The setting of am is provided from the “Rgstr” register at the timing of the clock ck after the clock ck1. Namely, the output am−1 from the “Rgstr” register is provided at the timing of the clock ckm.
In the first clock cycle ck0 of the clock ck, the initial value of the output node in the “Rgstr” register is set at ‘0’ to exert no influence on the computation result as described above.
In sync with the clock ckm, the “c(m−1)j” circuit block receives am−1 and (j)m output from the “Rgstr” register and the “(j)i (j=1 to 16)” circuit block, respectively. The “c(m−1)j” circuit block derives c(m−1)j, that is, the output from the first part circuit U1, from these inputs am−1 and (j)m.
The initial value of the “c(m−1)j” circuit block is the restored code data C, and the setting in the “Rgstr” register at the clock ck0 provides c(−1)j=init C.
The “am” circuit block in the second part circuit U2 obtains am at every 16 cycles of the clock cl in accordance with the computing equation shown in Expression 46. The am created at the “am” circuit block is stored in the m-th register in the “Li (0-10)” register unit at the clock ckm+1 that indicates the start timing of the cycles of the next 16 clocks cl.
After completion of all the clock cycles, the code data A is set and stored in the “Li (0-10)” register unit.
<Conclusion>
Thus, the first embodiment is described on the p-adic NAND flash memory using the syndrome converting method in error correction with the specific configurations exemplified.
The first embodiment makes it possible to ensure the reliability of data by a strong error correcting ability even if the level setting is insufficient in data write to p-adic cells and data read from p-adic cells.
The first embodiment describes the memory system that applies the syndrome converting method in data (code) processing in the “p-adci Zp world”. In this case, the circuitry used in the memory system can be simplified. An increase in the correctable error quantity, however, results in an increased number of searching repetitions for searching with no miss in error correction, which leads to an increase in processing time.
If the syndrome converting method is used in error correction as in the first embodiment, the number of repetitions of the computation required for error searching is, for example, in the case of p=17, equal to 16×16=256 times at the maximum if the correctable Lee metrics ε=3, and 16 times at the maximum if ε=2. On the other hand, if the correctable Lee metrics ε=4 as described in the first embodiment, it is required to repeat computation processing 16×16×16=4096 times at the maximum on error searching.
From the above point, the syndrome converting method is effective to an error correcting system mounted on a memory system with a smaller error quantity. In contrast, if the error-correctable quantity is larger (such as p=17, ε=4), the use of the conventional Euclidean iterative method rather than the syndrome converting method may reduce the processing time and the circuit scale depending on the case.
Therefore, the second embodiment describes a p-adic NAND flash memory using the Euclidean iterative method in error correction.
The p-adic memory system comprises a 17-adic converter unit 201, an encoder unit 202, a p-adic cell memory unit 203, a syndrome generator unit 204, a solution searching polynomial generator unit 205, a Euclidean iteration processing unit 206, a first Hasse differential polynomial generator unit 207, a second Hasse differential polynomial generator unit 208, a code restorer unit 209, a decoder unit 210, and a 25-adic converter unit 211.
Among those, the processes of processing in the 17-adic converter unit 201, the encoder unit 202, and the p-adic cell memory unit 203 are same as those in the 17-adic converter unit 101, the encoder unit 102, and the p-adic cell flash memory 103 in the p-adic memory system shown in
The code C, Lee-metric-coded at the encoder unit 202, is read out as the error-containing code Y from the p-adic cell flash memory 203. The code Y is fed to the syndrome generator unit 204.
The syndrome generator unit 204 uses the syndrome matrix H and each code word component of the code Y to compute the syndrome S in accordance with S=YHt. In the case of S=0, the code Y contains no error. Accordingly, the code Y is fed to the decoder unit 210 for final processing. On the other hand, if S≠0 and the first component S0 of S is |S0|>5, the error in the code Y cannot be corrected surely. Accordingly, NG signal is provided and the error-containing code Y is directly fed to the decoder unit 210 for first processing. In other cases, the syndrome S is fed to the solution searching polynomial generator unit 205.
The solution searching polynomial generator unit 205 derives a polynomial Ψ(x) from the syndrome S in accordance with the computation equation shown in Expression 47. This polynomial Ψ(x) is fed to the Euclidean iteration processing unit 206.
S=(S0,S1,S3,S4)→Ψ(x)
ψ0=1
ψ1=−S1
ψ2=−(ψ1S1+ψ0S2)/2
ψ3=−(ψ2S1+ψ1S2+ψ0S3)/3
ψ4=−(ψ3S1+ψ2S2+ψ1S3+ψ0S4)/4 [Expression 47]
The Euclidean iteration processing unit 206 tries creation of λ=fn, ν=pn−1 through the Euclidean iterative method. If succeeded to create λ and ν, it provides these polynomials λ(x) and ν(x) to the first and second Hasse differential polynomial generator units 207 and 208. On the other hand, if failed to create λ and ν, it provides NG signal, and feeds the error-containing code Y directly to the encoder unit 210 for first processing. The circuitry of the Euclidean iteration processing unit 206 is detailed later.
The first and second Hasse differential polynomial generator units 207 and 208 derive respective Hasse differential polynomials from the polynomials λ(x) and ν(x), and seek the roots r thereof and the multiplicities n of the roots. The first and second Hasse differential polynomial generator units 207 and 208 provide t=r−1 to the code restorer unit 210 in accordance with the established roots r and multiplicities n.
The code restorer unit 210 derives an error quantity et from the positional coordinates of the error-caused code word and the multiplicities n. It restores the code C=(c1, c2, . . . , c16) of the Lee metric code from ct=yt−et.
Processing on and after the decoder unit 210 and the 25-adic converter unit 211 is same as that in the decoder unit 109 and the 25-adic converter unit 110 shown in
Thus, in the case of the memory system according to the present embodiment, the iteration and complication of processing can be all pushed in the Euclidean iterative method and the entire processing flow can be simplified.
The entire system configuration is roughly divided in 2: a system configuration surrounded by the dashed line shown with a in
The system surrounded by the dashed line includes a p-adic cell flash memory mounted thereon, which can serve as one memory system by itself. Therefore, in the case of this memory system, it is required to develop the p-adic cell flash memory also described in the first embodiment. In accordance therewith, it is also required to modify the level setting method applied to p-adic cells from that in the case of the memory system using the conventional NAND MLC.
The system surrounded by the chain line serves as a memory controller that uses the conventional NAND MLC directly as the memory device. Therefore, this system can be realized only by providing I/O operative to decode the elements in Zp so as to correspond to the levels of MLC at the time of data write. Thus, the conventional NAND MLC can be utilized to construct the memory system that uses the Lee metric code. The memory system that uses the memory controller, however, requires a longer time for mapping of MLC and Zp, and thus cannot provide a faster data processing system. It is still possible, though, to execute data management with increased reliability of data.
Mapping of MLC and Zp has different pages at every group of levels of MLC. Accordingly, unless pieces of data between different pages are managed collectively, the levels of 1 memory cell cannot be continuously associated as the elements in Zp. Therefore, it is required to provide registers corresponding to all pieces of page data for mapping of the elements in Zp and binary representations, and execute a series of data transfers between all pages in the register group and individual pages in the NAND MLC memory device. Therefore, the mapping requires a plurality of logic circuits. This mapping can be regarded as a large register-and-decoder set and accordingly the details are not described. The number of levels in NAND MLC is equal to a power of 2 and the number of elements in Zp is a prime. Therefore, available primes are below p=7 in the case of an 8-level cell, and below p=13 in the case of a 16-level cell. Thus, from the viewpoint of the efficiency of memory cells, it cannot be regarded as an optimal memory system.
Market demands determine which one of the systems should be constructed. In accordance with the description in the first embodiment and a second embodiment shown below, it is possible to realize either of the systems.
<Euclidean Method>
First, as the premise of the memory system using the Euclidean iterative method, general relational equations in the case of the use of the Euclidean method are described collectively where γ=ε+1.
The binary data of the code C recorded in p-adic cells includes h-bit code word symbols, a collection of (p−1) pieces, which may cause variations bit by bit when suffer various disturbances. Thus, the operation required to restore the code C from the code Y is decoding. Prior to decoding, the syndrome is sought at the start.
The syndrome S can be sought as elements S0, S1, . . . , Sγ−1 shown in Expression 48 through an operation S=YHt using a syndrome matrix H.
Here, Ht is a transpose of H. The generator matrix G and the syndrome matrix H are configured to satisfy GHt=0 (mod p). Accordingly, a representation of Y=C+E yields S=EHt. With E=(e1, e2, . . . , ep−1), a syndrome S0 corresponds to the total sum of errors in the code word symbols as can be found. These syndromes S provide the only information on errors and, on the basis of these syndromes S, the correct code C is restored as follows.
Subsequently, the principle of decoding is described. Here, n(=p −1) error symbols are classified into 2 sets J+ and J− as in Expression 49.
J+={jε(1,2, . . . n);ej<p/2}
J−={jε(1,2, . . . n);ej>p/2} [Expression 49]
Namely, it is classified into J+ that is an arrangement of the positions j of the code symbols cj if the error quantity in the symbol is ej<p/2, and J− that is an arrangement of the positions j of the code symbols cj if the error quantity in the symbol is ej>p/2. Polynomials Λ(x), V(x) in the Galois field GF(p) are configured on the basis of these sets as in Expression 50.
Thus, the polynomial Λ(x) is a polynomial that has a reciprocal number of the position j of an error code word symbol in J+ as a root and that has a Lee metric ej of that code word symbol as the multiplicity of the root. On the other hand, the polynomial V(x) is a polynomial that has a reciprocal number of the position j of an error code word symbol in J− as a root and that has a Lee metric p−ej of that code word symbol as the multiplicity of the root. Decoding is a process of finally configuring these polynomials only from the information on the syndrome S1 to solve them, thereby obtaining information on errors. In a word, it is required to seek a relation between these polynomials Λ(x), V(x) and the syndrome S1.
Subsequently, as shown in Expression 51, when it is configured with a series polynomial having each syndrome S1 on a coefficient of the corresponding degree, it is represented with a rational polynomial having the error quantity ej of the symbol, the position j and the value thereof.
From Expression 51, a relational expression shown in Expression 52 can be established among the polynomials Λ(x), V(x) and the syndrome S(x).
Subsequently, the relational expression shown in Expression 52 is used to seek the polynomials Λ(x), V(x) from the syndrome S(x).
From the syndrome S(x), a polynomial Ψ(x) with a degree of γ−1 or lower shown in Expression 53 is sought.
In an expansive expression of the polynomial Ψ(x), a coefficient ψ3 can be sought from a comparison between the coefficients at the homogeneous degree on both sides of the expression shown in Expression 53 through an iterative method using the syndrome Si and an already-determined coefficient ψj−1. The coefficients ψ0 to ψγ−1 of the polynomial Ψ(x) are sought from the syndromes S1 to Sγ−1. The results are shown in Expression 54.
The polynomial Ψ(x) is a polynomial equivalent to Λ(x)/V(x) while the polynomials Λ(x), V(x) are given key conditions shown in Expression 55. Therefore, it can be sought through a Euclid iterative method applied to xγ and the polynomial Ψ(x) to eliminate constant multiples.
V(x)Ψ(x)≡Λ(x)(mod xγ)
degΛ(x)+deg V(x)<γ
Λ(x) and V(x) are coprimes
degΛ(x)−deg V(x)≡S0(mod p) [Expression 55]
Therefore, if the polynomial Ψ(x) can be configured from the syndromes S1 to Sγ−1, the syndrome S0 can be used as the stop condition on the iterative method to seek the polynomials Λ(x), V(x). Namely, the polynomials Λ(x), V(x) sought from the set of (S0, Ψ(x)) through the iterative method can represent the position j and the error quantity ej of an error code word symbol.
<Error Correction through Euclidean Iterative Method>
Next, the Euclidean iterative method is described. The following description is given to an iterative method of seeking a congruence equation, v(x)Ψ(x)≡λ(x)(mod xγ), with the stop condition, deg λ(x)−deg v(x)≡S0(mod p).
The Euclid iterative method is a method of sequentially seeking functions f0, f1, . . . , fn using divisions of polynomials. These quantities have relations shown in Expression 56.
Here, as shown in Expression 57, when pn or qn can be sequentially introduced from a quotient polynomial kn particularly obtained through the process of division, these polynomials can satisfy a simple relation. Accordingly, fn can be represented by f0, f1, pn−1 and qn−1. Then, when f0=xγ, f1=Ψ(x) are established to create a congruence equation at xγ, the stop condition on iteration turns to deg fn−deg pn−1≡S0 (mod p). With regard to n that satisfies the stop condition, if deg fn+deg pn−1<γ, then λ(x)=fn, V(x)=pn−1 can be established.
As γ=ε+1, a computation for error 4 requires around 4 times of iterations, for example. With this regard, the use of the Euclidean iterative method increases the scale of the operating circuit though it can reduce the number of computations particularly lower than the syndrome converting method.
The following description is given to specific circuitry on points greatly different from the syndrome converting method.
<Circuitry of Syndrome Generator Unit>
Next, as an example of circuitry of the syndrome generator unit 204, an operating circuit operative to seek component elements of the syndrome S in batch is described. Hereinafter, this operating circuit is referred to as the “syndrome component element generating circuit”.
The syndrome component element generating circuit is configured based on the equations shown in Expression 57.
The syndrome component element generating circuit includes, as shown in
Data Y read out of the p-adic cell array is stored in the “Ro (1 to p−1)” register as an initial value. The “(j)i (j=1 to p−1)” circuit block generates (j)i. These “Ro (1 to p−1)” register and “(j)i (j=1 to p−1)” circuit block are synchronized with each other by the clock cli and controlled such that (j)i is provided at the same time as the output of the component yj of Y. The “X Zp” circuit block yields the product thereof, (j)iyj, which is summed, in sync with the clock clj (j=1 to p−1), through the loop including the “h bit AD mod p” circuit block and the “Rgstr” register, to create a syndrome component Si. The resultant Si is stored in the i-th register in the Ri (0-ε)” register at the clock cki+1. This process is treated at the clock cki for i=0-ε to obtain all syndrome components, which are then stored in the “Ri (0-ε)” register unit.
An example of circuitry of the Euclidean iteration processing unit 206 is described below.
The first description is given to a circuit operative to derive kn in computation of fn=knfn+1+fn+2. Hereinafter, this circuit is referred to as the “kn” circuit block.
The “kn” circuit block is configured based on the relational equations shown in Expression 58.
fn=knfn+1+fn+2
fn=a(n)0+a(n)1x+ . . . +a(n)j−1xj−1+a(n)jxj
kn=b(n)0+b(n)1x
b(n)1=a(n)j/b(n+1)j−1,b(n)0=(a(n)j−1−b(n)1a(n+1)j−2)/a(n+1)j−1 [Expression 58]
The “kn” circuit block applies the relations between coefficients shown in Expression 58 to derive an unknown coefficient from the known coefficients. On the assumption that the polynomial of kn is at the 1st degree, and that the coefficient is turned to 0 in the process of computation if the coefficient at the highest degree becomes 0 as well, the process advances the procedure of computation sequentially. At this time, the inverse element of 0 is determined 0 to allow the relations between coefficients to establish always.
The “kn” circuit block receives a(n)m−1 and a(n)m, and provides a(n+1)m−1, a(n+1)m−2, b(n)1, and b(n)0 as shown in
The “kn” circuit block includes 3 “X Zp” circuit blocks <1>, <2>, <3>, 2 “h bit AD mod p” circuit blocks <1>, <2>, a “j−1 dec” circuit block, an XNOR gate G1, and an OR gate G2.
The “X Zp” circuit block <1> computes the product of a(n+1)m−2 and b(n)1. The result b(n)1a(n+1)m−2 is fed to the “h bit AD mod p” circuit block <1> via the inverter IV1.
The “h bit AD mod p” circuit block <1> receives the output from the inverter IV1 and ‘1’, and obtains the complement of b(n)1a(n+1)m−2.
The “h bit AD mod p” circuit block <2> receives the output from the “h bit AD mod p” circuit block <1> and a(n)m−1, and provides the result, a(n)m−1−b(n)1a(n+1)m−2, to the “X Zp” circuit block <2>.
The “j−1 dec” circuit block receives a(n+1)m−1, and provides the inverse element thereof, (a(n+1)m−1)−1, to the “X Zp” circuit blocks <2> and <3>.
The “X Zp” circuit block <2> computes the product of the output, a(n)m−1−b(n)1a(n+1)m−2, from the “h bit AD mod p” circuit block <2> and the output, (a(n+1)m−1)−1, from the “j−1 dec” circuit block, to obtain the result b′(n)0.
The “X Zp” circuit block <3> computes the product of a(n)m and the output, (a(n+1)m−1)−1, from the “j−1 dec” circuit block, to obtain the result b(n)1, that is, the output from the “kn” circuit block.
The XNOR gate G1 receives the output, b′(n)0, from the “X Zp” circuit block <2> and the output, b(n)10, from the “X Zp” circuit block <3>. The OR gate G2 receives the output from the “X Zp” circuit block <3> and b′(n)0. In this case, the output from the OR gate G2 turns to the output, b(n)0, from the “kn” circuit block.
The following description is given to a circuit operative to derive fn+2 in computation of fn=knfn+1+fn+2. Hereinafter, this circuit is referred to as the “fn” circuit block.
The “fn” circuit block is configured based on the relational equations shown in Expression 59.
fn=knfn+1+fn+2
fn+2=fn−knfn+1,kn=b(n)0+b(n)1x
fn+2=fn−(b(n)0fn+1+b(n)1xfn+1)
a(n+2)m=a(n)m−(b(n)0a(n+1)m+b(n)1a(n+1)n−1) [Expression 59]
The “fn” circuit block applies the relations between coefficients shown in Expression 59 to derive an unknown coefficient from the known coefficients. It utilizes the already-obtained coefficients of the polynomial of kn.
The “fn” circuit block receives a(n+1)m−1, a(n+1)m, a(n)m, b(n)1 and b(n)0, and provides a(n+2)m as shown in
The “fn” circuit block includes 2 “X Zp” circuit blocks <1>, <2>, and 3 “h bit AD mod p” circuit blocks <1>, <2> and <3>.
The “X Zp” circuit block <1> computes the product of a(n+1)m−1 and b(n)1, and provides the result, b(n)1a(n+1)m−1, to the “h bit AD mod p” circuit block <1>.
The “X Zp” circuit block <2> computes the product of a(n+1)m and b(n)0, and provides the result, b(n)0a(n+1)m, t0 the “h bit AD mod p” circuit block <1>.
The “h bit AD mod p” circuit block <1> receives the output, b(n)1a(n+1)m−1, from the X Zp” circuit block <1> and the output, b(n)0a(n+1)m, from the X Zp” circuit block <2>. The output from the “h bit AD mod p” circuit block <1> is fed to the “h bit AD mod p” circuit block <2> via the inverter IV1.
The “h bit AD mod p” circuit block <2> receives the output from the inverter IV1 and ‘1’, and obtains the complement of (b(n)0a(n+1)m+b(n)1a(n+1)m−1).
The “h bit AD mod p” circuit block <3> receives a(n)m and the output from the “h bit AD mod p” circuit block <2>, and obtains a(n+2)m, that is, the output from this “fn” circuit block.
The following description is given to a circuit operative to derive pn−2 in computation of pn=kn−1pn−1+pn−2. Hereinafter, this circuit is referred to as the “pn” circuit block.
The “pn” circuit block is configured based on the relational equations shown in Expression 60.
pn=kn−1pn−1+pn−2
pn=a(n)0+a(n)1x+ . . . +a(0)j−1xj−1+a(n)jxj
kn=b(n)0+b(n)1x
pn+1=(b(n)0pn+b(n)1xpn)+pn−1
a(n+1)m=(b(n)0a(n)m+b(n)1a(n)m−1)+a(n−1)m [Expression 60]
The “pn” circuit block applies the relations between coefficients shown in Expression 60 to derive an unknown coefficient from the known coefficients. It utilizes the already-obtained coefficients of the polynomial of kn.
The “pn” circuit block receives a(n)m−1, a(n)m and a(n−1)m, and provides a(n+1)m as shown in
The “pn” circuit block includes 2 “X Zp” circuit blocks <1>, <2>, and 2 “h bit AD mod p” circuit blocks <1> and <2> as shown in
The “X Zp” circuit block <1> computes the product of a(n)m−1 and b(n)1, and provides the result, b(n)1a(n)m−1, to the “h bit AD mod p” circuit block <1>.
The “X Zp” circuit block <2> computes the product of a(n)m and b(n)0, and provides the result, b(n)0a(n)m, to the “h bit AD mod p” circuit block <1>.
The “h bit AD mod p” circuit block <1> receives the output, b(n)1a(n)m−1, from the X Zp” circuit block <1> and the output, b(n)0a(n)m, from the X Zp” circuit block <2>, and provides b(n)0a(n)m+b(n)1a(n)m−1 to the “h bit AD mod p” circuit block <1>.
The “h bit AD mod p” circuit block <2> receives a(n−1)m and the output, b(n)0a(n)m+b(n)1a(n)m−1, from the “h bit AD mod p” circuit block <1>, and obtains a(n+1)m, that is, the output from the “pn” circuit block.
Subsequently, a circuit for computing fn=knfn+1+fn+2 is described. Hereinafter, this circuit is referred to as the “fn=knfn+1+fn+2” circuit block.
The “fn=knfn+1+fn+2” circuit block receives a0-aγ, the coefficients of fn; and b0-bγ, the coefficients of fn+1, and provides c0-cγ, the coefficients of fn+2; and β0-βγ, the coefficients of kn as shown in
As computations are started with substitutions of f0=xγ, f1=Ψ(x), the maximum degree of fn is γ, and the maximum degree of fn+2 is also γ because kn may be 1. Therefore, the “fn=knfn+1+fn+2” circuit block includes (γ+1) “kn” circuit blocks <1>-<γ+1>, and (γ+1) “fn” circuit blocks <1>-<γ+1> as shown in
The “kn” circuit block <1> associates ‘0’, a0, b0 and a1 with a(n+1)m−2, a(n)m−1, a(n+1)m−1 and a(n)m, and associates β(1)0 and β(1)1 with b(n)0 and b(n)1, respectively.
The “kn” circuit block <2> associates b0, a1, b1 and a2 with a(n+1)m−2, a(n)m−1, a(n+1)m−1 and a(n)m, and associates β(1)0 and β(1)1 with b(n)0 and b(n)1, respectively.
The subsequent “kn” circuit blocks <3>-<γ+1> have the similar inputs/outputs to the “kn” circuit blocks <1> and <2>. The outputs, β(m)0 and β(m)1 (m=0−γ), from the “kn” circuit blocks <1>-<γ+1> are provided as β0 and β1 from the “fn=knfn+1+fn+2” circuit block via the control switches SW controlled by deg(b1, b2, . . . , bγ)=m. The β0 and β1 can be utilized also as the inputs to the “fn” circuit blocks <1>-<γ+1>. Here, deg( ) represents the output from the degree decision circuit associated with fn+1.
The “fn” circuit block <1> receives ‘0’, b0, a0, β0 and β1 as a(n+1)m−1, a(n+1)m, a(n)m, b(n)0 and b(n)1, and provides c0 as a(n+2)m.
The “fn” circuit block <2> receives b0, b1, a1, β0 and β1 as a(n+1)m−1, a(n+1)m, a(n)m, b(n)0 and b(n)1, and provides c1 as a(n+2)m.
The subsequent “fn” circuit blocks <3>-<γ+1> have the similar inputs/outputs to the “fn” circuit blocks <1> and <2>.
The outputs from the “fn” circuit blocks <1>-<γ+1> turn to c0-cγ, that is, the output from the “fn=knfn+1+fn+2” circuit block.
Subsequently, a circuit for computing pn+1=knpn+pn−1 is described. Hereinafter, this circuit is referred to as the “pn+1=knpn+pn−1” circuit block.
The “pn+1=knpn+pn−1” circuit block receives β0, β1, the coefficients of kn; a0-aγ−1, the coefficients of pn−1; and b0-bγy−1, the coefficients of pn, and provides c0-cγy−1, the coefficients of pn+1 as shown in
As computations are started with substitutions of p−1=0, p0=1, the maximum degree of pn+1 is γ−1, for the purpose of utilizing pn+1 mod xγ. Therefore, the “pn+1=knpn+pn−1” circuit block includes γ “kn” circuit blocks <1>-<γ>.
The “pn” circuit block <1> receives ‘0’, b0, a0, β0 and β1as a(n)m−1, a(n)m, a(n−1)m, b(n)0 and b(n)1, and provides c0 as a(n+1)m, respectively.
The “pn” circuit block <2> receives b0, b1, a1, β0 and β1 as a(n)m−1, a(n)m, a(n−1)m, b(n)0 and b(n)1, and provides c1 as a(n+1)m, respectively.
The subsequent “pn” circuit blocks <3>-<γ> have the similar inputs/outputs to the “pn” circuit blocks <1> and <2>.
The outputs from the “pn” circuit blocks <1>-<γ> are provided as c0-cγ−1, that is, the output from the “pn+1=knpn+pn−1” circuit block.
Subsequently, a description is given to computing circuits operative to derive fn and pn−1 through iterative computations from 0 to n.
The computing circuit operative to derive fn includes 3 “h(γ+1)” register units <1>-<3>, as shown in
The computing circuit operative to derive pn+1 includes “hγ” register units <1>-<3>, as shown in
The computations of fn and pn−1 advance at every cycle of the clock cl and the number of cycles of the clock cl counted from 0 corresponds to n of the Euclidean iteration. Finally, the “h(γ+1)” register unit <1> holds fn, and the “hγ” register unit <1> holds pn−1.
In the first clock cycle cl0, the “h(γ+1)” register units <1>-<3 > and the “hγ” register units <1>-<3 > are given settings of initial values as shown in
In the computing circuit operative to derive fn, the coefficient of f0=xγ is set in the “h(γ+1)” register unit <1> and the coefficient of f1=Ψ(x) in the “h(γ+1)” register unit <3>. In the computing circuit operative to derive pn−1, p−1=‘0’ is set in the “hγ” register unit <1> and p−1=‘1’ in the “hγ” register unit <3 >. Thereafter, as the clock cycle advances, fn and pn−1 are set in the corresponding register units at the start of the cln cycle.
The “h(γ+1)” register unit <1> and the “hγ” register unit <1> are prevented from receiving the inputs by a signal /ISTOP. Accordingly, data in the “h(γ+1)” register unit <1> and the “hγ” register unit <1> can be determined at that moment. The signal /ISTOP is a signal that is activated if the later-described stop condition on the Euclidean iterative method is satisfied.
Thus, the “h(γ+1)” register unit <1> and the “hγ” register unit <1> can obtain the desired resultant coefficients of the polynomials therein.
These coefficients can be fed to the first Hasse differential polynomial generator unit 207 and the second Hasse differential polynomial generator unit 208 to seek the root and the multiplicity of the Hasse differential polynomial.
For the purpose of seeking the signal /ISTOP shown in
The “DEG” circuit block receives a1-aγ, and provides that is, a binary representation of the degree as shown in
The “DEG” circuit block includes “h bit reg” registers <1>-<γ>, OR gates G0<1>-<γ>, G1<1>-<γ>, and an “A to deg DEC.” circuit block as shown in
The “h bit” registers each hold h-bit binary data indicative of the coefficients, a1-aγ, of the γ-th degree polynomial. The h-bit binary data indicative of the coefficients a1-aγ are fed to the OR gates G0<1>-<γ>.
The OR gate G0<n> (n=1-γ) receives h-bit binary data indicative of an. The OR gate G0<1> provides ‘0’ if an=0 and ‘1’ if an≠0. In a word, the OR gate G0<n> determines whether an is equal to 0 or other than 0.
The OR gate G1<m> (m=1 to γ−1) receives the outputs from the OR gates G0<m>-<γ>. The outputs from the OR gates G1<1>-<γ−1> turn to A1-Aγ−1, respectively. In addition, the output from the OR gate G0<γ> directly turns to Aγ. These A1-Aγ are fed to the “A to deg DEC.” circuit block.
The “A to deg DEC.” circuit block is a decoder circuit capable of realizing the associated relations between A1-Aγ and binary representations of the degrees as shown with T1 in
The “A to deg DEC.” circuit block derives b0-bh−1, that is, the output from the “DEC” circuit block, from A1-Aγ based on the association table shown with T1 in
The final description is given to a circuit operative to decide if the key condition on the Lee metric code is satisfied to stop the iterations, and if the error correctable condition is satisfied, based on fn and pn−1 obtained through the Euclidean iterative method.
This circuit includes 2 “DEG” circuit blocks <1>, <2>, 2 “h bit AD mod p” circuit blocks <1>, <2>, and a part circuit U1 composed of plural logic gates.
The “DEG” circuit block <1> receives fn as a1-aγ, and provides deg fn, that is, a binary representation of fn as b0-bn−1. This deg fn is fed to the “h bit AD mod p” circuit block <2>.
The “DEG” circuit block <2> receives pn−1 as a1-aγ, and provides deg pn−1, that is, a binary representation of pn−1 as b0-bh−1. This deg pn−1 is fed to the “h bit AD mod p” circuit block <1> via the inverter IV1.
The “h bit AD mod p” circuit block <1> derives the complement of deg pn−1 from the input, deg pn−1, fed via the inverter IV1 and ‘1’. The complement of deg pn−1 is fed to the “h bit AD mod p” circuit block <2>.
The stop condition on the Euclidean iterative method is deg fn−deg pn−1≡S0 (mod p). Therefore, the “h bit AD mod p” circuit block <2> receives deg fn and the complement of deg pn−1 to obtain deg fn−deg pn−1 as a numeral in Zp. This result is fed to the part circuit U1 for comparison with S0 at every bit of the binary representation.
For the purpose of comparing deg fn−deg pn−1 with S0 at every bit, the part circuit U1 seeks an exclusive logical sum at every bit. If all bits (deg fn−deg pn−1)j (j=0 to h−1) match (S0)j, the part circuit U1 provides ‘0’ as the signal /ISTOP. In a word, the signal /ISTOP is (deg fn−deg pn−1)0 (+) (S0)0□ . . . □(deg fn−deg pn−1)m (+) (S0)m□ . . . □(deg fn−deg pn−1)h−1 (+) (S0)h−1. Thus, the part circuit U1 can realize if deg fn−deg pn−1 matches S0 or not.
This circuit includes a NAND gate G1, a “h bit AD mod p” circuit block, a “h bit γ≦” circuit block later-described, and an NOR gate G2.
The NAND gate G1 is arranged to provide for the case where the Euclidean iterative method fails to satisfy the stop condition to stop. If the degree of fn is equal to 1, the degree of fn+1 is equal to 0, and the signal /ISTOPn in reverse logic of the signal /ISTOP is at 1, that is, under the condition of (deg fn=1)□(deg fn+1=0)□(/ISTOPn=1), the NAND gate G1 provides ‘1’ as a signal Ifail.
It is determined based on deg fn+deg pn−1<γ if the error is correctable when the Euclidean iterative method stops.
Therefore, the “h bit AD mod p” circuit block is given deg fn and deg pn−1 to derive the sum thereof, deg fn+deg pn−1. This sum, deg fn+deg pn−1, is fed to the “h bit γ≦” circuit block.
The “h bit γ≦” circuit block is a circuit operative to determine if the input is equal to or higher than γ or not. The “h bit γ≦” circuit block provides ‘1’ as a signal gtγ if the input, deg fn+deg pn−1, supplied from the “h bit AD mod p” circuit block is equal to or higher than γ.
The details of the “h bit γ≦” circuit block are omitted from the following description though they can be configured similar to the PF0 generator unit U1 in the “h bit AD mod p” circuit block shown in
The OR gate G2 receives the output, Ifail, from the NAND gate G1 and the output, gtγ, from the “h bit γ≦” circuit block and, if Ifail□gtγ, it provides NG signal indicative of the impossibility of error correction through the Euclidean iterative method.
<Conclusion>
Thus, the second embodiment is described on the p-adic NAND flash memory using the Euclidean iterative method in error correction with the specific configurations exemplified.
The second embodiment makes it possible to ensure the reliability of data by a strong error correcting ability even if the level setting is insufficient in data write to p-adic cells and data read from p-adic cells. It is possible to realize faster error correction processing than the first embodiment.
[Others]
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms: furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
Number | Date | Country | Kind |
---|---|---|---|
2010-214357 | Sep 2010 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
6359806 | Nozoe et al. | Mar 2002 | B1 |
7984360 | Sharon et al. | Jul 2011 | B2 |
8156398 | Sommer | Apr 2012 | B2 |
8321760 | Son et al. | Nov 2012 | B2 |
20090024903 | Bueb | Jan 2009 | A1 |
20090198881 | Toda | Aug 2009 | A1 |
20100103735 | Kim et al. | Apr 2010 | A1 |
20100107039 | Toda | Apr 2010 | A1 |
20100211856 | Weingarten | Aug 2010 | A1 |
Number | Date | Country |
---|---|---|
2000-305861 | Nov 2000 | JP |
Entry |
---|
Office Action issued Sep. 25, 2012, in Japanese Patent Application No. 2010-214357 with English translation. |
Japanese Office Action issued Dec. 11, 2012 in Patent Application No. 2010-214357 with English Translation. |
U.S. Appl. No. 13/367,655, filed Feb. 7, 2012, Toda. |
U.S. Appl. No. 13/011,318, filed Jan. 21, 2011, Haruki Toda. |
Ron M. Roth, et al., “Lee-Metric BCH Codes and their Application to Constrained and Partial-Response Channels”, IEEE Transactions on Information Theory, vol. 40, No. 4, Jul. 1994, pp. 1083-1096. |
Number | Date | Country | |
---|---|---|---|
20120075940 A1 | Mar 2012 | US |