1. Field of the Invention
The present invention relates to non-volatile memory devices and particularly to an error detection method and apparatus for use in the non-volatile memory devices.
2. Description of the Prior Art
Flash or non-volatile memory has gained wide acceptance for various applications, particularly because of its non-volatile characteristic of retaining information or data even when power is disconnected. This makes non-volatile memory especially well suited for portable devices that may lose power. Some devices that include flash memory are constructed from electrically-erasable programmable read-only memory (EEPROM) cells.
Rather than use a randomly-addressable scheme such as is common with dynamic-random-access memory (DRAM), many flash memory-based devices use a block-based addressing where a command and an address are transmitted via a data bus and then a block of information is read from or written to the flash memory. Since the data bus is also used to send commands and addresses, fewer pins are needed on the flash-memory chip thereby reducing cost. Thus, flash memory is often used as a mass-storage device rather than a randomly-addressable device.
Typically, in a flash memory device, a microcontroller is employed for controlling information transfer between the flash memory and a host of some type. The microcontroller typically includes ROM with a control program that is read by the internal central processing unit (CPU) of the microcontroller when the microcontroller is booted or powered up. Once initialized with the control program, the CPU can control data transfers between the serial interface and the flash controller.
A popular bus standard is the Multi-Media Card (MMC), the specifications of which are defined and adopted by the industry. An extension of MMC is known as Secure Digital (SD). Various other flash device interfaces, such as Compact Flash (CF), Memory Stick (MS), PCI-Express (PCIE), ATA/IDE, and Serial ATA (SATA), are commonly employed in today's portable multimedia, computer, or communication devices as data storage elements. A controller coupled to the bus would also operationally conform to the foregoing standards.
With the growing popularity of flash memory, the density of flash memory devices (or chips, integrated circuits or semiconductors) is increasing, thereby increasing the rate of defect spots. Even more noteworthy is the increase in the rate of defect spots during the flash manufacturing process in Multi-Level Cell (MLC) memory, which is a certain type of non-volatile memory. Compared with a Single-Level Cell (SLC) process, random error bits occur more often in MLC processes because of the multi-level threshold voltages (and hence smaller noise margin) needed to detect logic levels.
An effective error detection results when using Bose, Ray-Chaudhuri, Hocquenghem (BCH) code. Therefore, an apparatus and method are needed for flash operations to improve the accuracy of information.
BCH codes are multiple-error-correcting block cyclic codes defined over Galois fields. The BCH algorithm is popularly applied in the industry for random error correction purposes, whereas Reed-Solomon algorithms, another error coding/decoding scheme, are more appropriate for burst error correction. Error location/detection is known to be one of the most difficult procedures associated with using BCH.
By way of brief background, a Berlekamp-Massey or Euclidean method is first applied to determine the error location polynomial, and then Chien's method is applied to search for its roots, i.e., the error locations. However, since both implementations require complex hardware and lengthy calculations, the cost of error correction coding (ECC) is increased in the foregoing traditional methods.
A second method utilizes searching in a look-up table. Pre-calculated values are stored in memory, such as ROM, for error searching. This method advantageously increases speed, because the time for performing calculations is saved; however, for long codes, the ROM could occupy expensive silicon real estate and therefore directly increase controller chip cost.
What is needed is flash memory coding and decoding apparatus and method having advantageously lower complexity with simple control signal handling and faster calculation speed that is not influenced by code length.
Briefly, one embodiment of the present invention includes a BCH decoder employed in non-volatile memory applications for determining the number of errors and locating the errors in a page of information. The decoder is disclosed to include a syndrome calculator responsive to a sector of information, the sector including data and overhead, the data being organized into a plurality of data sections and the overhead being organized into a plurality of overhead sections, the syndrome calculator operative to generate a syndrome for each of the data sections, a root finder coupled to receive the calculated syndrome and operative to generate at least two roots, and a polynomial calculator responsive to the at least two roots and operative to generate at least two error addresses, each identifying a location in the data wherein the error lies.
In one embodiment of the present invention, a BCH algorithm is employed to reduce hardware costs associated with the silicon real estate of a non-volatile memory device including ECC and, in particular, to reduce the size of the ROM look-up table employed therein. Additionally, lengthy calculations are eliminated by using Chien's searching algorithm in the ECC. One of the applications of the foregoing is in non-volatile memory systems replacing hard disk drives and transferring information organized in sectors between a host and non-volatile memory. A sector can vary in size and in some applications is 512 bytes of raw data plus additional bytes of overhead.
Typically, six procedures are involved in the BCH decoding process, which is outlined below for the case where information is organized in sectors:
At step (2), the Berlekamp-Massey recursive method or the Euclidean matrix method is typically employed. The complexity of these two methods depends on the code length and is independent of the number of errors, so they are very inefficient from a calculation point of view.
At step (3), in finding or determining the roots of an error polynomial, Chien's search method is typically adopted. Its calculation time also depends on the code length and is independent of the number of errors: because an exhaustive search needs to be performed, the time for this step is constant.
However, the foregoing two methods do not take advantage of the characteristically low error counts of flash memory, and they result in sophisticated hardware and longer calculation times.
One embodiment of the present invention takes advantage of and relies on the low error count associated with flash or non-volatile memory. More specifically, emphasis is placed on being able to correct up to two error counts per code word. The Syndrome result is used to determine an error location, which is known to be the most difficult part of implementing the BCH algorithm.
Referring now to
Sectional organization of the data area 24 and the spare area 26 causes reduction in the code length, in turn, simplifying the BCH method by parallel processing 4 sections (each section being 128 bytes) at the same time with the total error that is correctable being 8 bits per 512 bytes.
In
$Z^2+Z+K=0$; Eq. (1)
In the interest of further clarification, an example of error count using the finder 50 is now presented. The example is intended to be used for flash or non-volatile memory error recovery and in the case where a page is 1028 bits in length with a page being divided or organized into four sections, as previously discussed. Also, in this example, two errors (or error bits) are assumed to be present.
Assume, for the sake of an example, a BCH code that corrects two errors per code word and is based on GF($2^{11}$). The code is BCH(128×8+11×2, 128×8), in which the code length of (128×8+11×2) bits accommodates a quarter of the flash memory page, i.e., 128 bytes, plus 11×2 parity bits (22 bits, fewer than 3 bytes) held in the spare area of a sector or page. Here, 11 is an example of the value 'm' in GF($2^m$). The 128 bytes are the message (or section) unit that is read out from the data area of a sector or page while a quarter of a page of flash memory is decoded; the 3 parity bytes (of which 22 bits are actually used and 2 bits are left as zeroes) are generated by the BCH encoder and reside in the spare area of each page of flash memory.
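The code-length arithmetic of this example can be checked with a short sketch. It assumes, as the example does, that a binary BCH code correcting t errors over GF($2^m$) uses t×m parity bits and that the whole code word must fit in $2^m-1$ bits; the function name `smallest_field_degree` is hypothetical and introduced only for illustration.

```python
def smallest_field_degree(data_bits: int, t: int) -> int:
    """Smallest m such that data_bits + t*m parity bits fit in 2^m - 1 bits.

    Assumption: t*m parity bits for a t-error-correcting binary BCH code,
    as in the example (2 errors, 11 bits of parity per error).
    """
    m = 1
    while (1 << m) - 1 < data_bits + t * m:
        m += 1
    return m


data_bits = 128 * 8                      # one quarter of a 512-byte sector
m = smallest_field_degree(data_bits, t=2)
code_bits = data_bits + 2 * m            # message bits plus parity bits
print(m, code_bits, (1 << m) - 1)        # prints: 11 1046 2047
```

With m=10 the field would only support code words of up to 1023 bits, which is shorter than the 1024 message bits alone, so the loop moves on to m=11.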
The reason 11 is chosen as the 'm' value in GF($2^m$) is as follows: $2^{11}-1=2047$; a quarter of a full 512-byte page is 128 bytes; 128×8+11×2=1046 bits, which exceeds $2^{10}-1=1023$; therefore m=11 is the smallest value that can be chosen. Let us assume r(x) is the received polynomial,
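In this formula, $X_1=\alpha^{i}$
Error location polynomial $X_1=\alpha^{i}$
$\sigma(x)=(x-X_1)(x-X_2)=x^2+\sigma_1 x+\sigma_0$; Eq. (4)
with the two roots being $X_1$ and $X_2$, and $\sigma_1$, $\sigma_0$ being the coefficients of the polynomial;
$\sigma_1=X_1+X_2$; $\sigma_0=X_1 X_2$; Eq. (5)
$S_0=X_1+X_2=\sigma_1$; Eq. (6)
$S_1=X_1^2+X_2^2=\sigma_1^2$; Eq. (7)
$S_2=X_1^3+X_2^3=S_1\sigma_1+S_0\sigma_0$; Eq. (8)
$S_3=X_1^4+X_2^4=S_2\sigma_1+S_1\sigma_0$; Eq. (9)
From the above equations, $\sigma_1$ and $\sigma_0$ can be calculated:
$\sigma_1=(S_1S_2+S_0S_3)/(S_1^2+S_0S_2)$; Eq. (10)
$\sigma_0=(S_2^2+S_1S_3)/(S_1^2+S_0S_2)$; Eq. (11)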
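By way of illustration, Eqs. (8) and (9) can be read as a 2×2 linear system in $\sigma_1$ and $\sigma_0$ and solved by Cramer's rule. The matrix form and the symbol $\Delta$ are introduced here only for this sketch; because addition and subtraction coincide in GF($2^m$), every minus sign becomes a plus sign, which yields Eqs. (10) and (11).

```latex
\begin{aligned}
\begin{pmatrix} S_1 & S_0 \\ S_2 & S_1 \end{pmatrix}
\begin{pmatrix} \sigma_1 \\ \sigma_0 \end{pmatrix}
&=
\begin{pmatrix} S_2 \\ S_3 \end{pmatrix},
\qquad
\Delta = S_1 \cdot S_1 - S_0 \cdot S_2 = S_1^2 + S_0 S_2, \\[4pt]
\sigma_1 &= \frac{S_2 S_1 - S_0 S_3}{\Delta} = \frac{S_1 S_2 + S_0 S_3}{S_1^2 + S_0 S_2}, \\[4pt]
\sigma_0 &= \frac{S_1 S_3 - S_2 S_2}{\Delta} = \frac{S_2^2 + S_1 S_3}{S_1^2 + S_0 S_2}.
\end{aligned}
```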
Syndrome values $S_j$ can be obtained by substituting $\alpha$, $\alpha^2$, $\alpha^3$, $\alpha^4$ into r(x).
Substituting the syndrome values into the equations above yields $\sigma_1$ and $\sigma_0$. Solving the error location polynomial equation $\sigma(x)=0$ gives two roots, which identify the bits where the errors may have occurred. Adding r(x) and e(x) then recovers the original c(x).
Another way to calculate the syndromes is from the minimal polynomials, since the primitive polynomial is $x^{11}+x^2+1$ and $\alpha^{2047}=1$;
expansion and simplification.
If no error occurs, then
$S_0=S_1=S_2=S_3=0$; Eq. (14)
In the case where only a single bit error occurs, assume the error location is $i_1$ and the error bit value is $Y_1$; the values A, B and C are calculated as below:
$A=S_1^2+S_0S_2$; Eq. (15)
$B=S_1S_2+S_0S_3$; Eq. (16)
$C=S_2^2+S_1S_3$; Eq. (17)
Since
$S_0=X_1\neq 0$; Eq. (18)
$S_1=X_1^2\neq 0$; Eq. (19)
$S_2=X_1^3\neq 0$; Eq. (20)
$S_3=X_1^4\neq 0$; Eq. (21)
all $S_i\neq 0$ does not imply four errors, but implies that at least one error has occurred.
But $X_1=S_0$; from the above simple derivation,
$A=X_1^4+X_1^4=0$; Eq. (24)
So A=B=C=0 means that only one error has occurred in this case, and $X_1=S_0(\alpha)$.
If two errors occur in the read data, assume $i_1$ and $i_2$ are the error locations.
$S_0=X_1^1+X_2^1=X_1+X_2$; Eq. (25)
$S_1=X_1^2+X_2^2\neq 0$; Eq. (26)
$S_2=X_1^3+X_2^3\neq 0$; Eq. (27)
$S_3=X_1^4+X_2^4\neq 0$; Eq. (28)
$A=S_1^2+S_0S_2=(X_1^2+X_2^2)^2+(X_1+X_2)(X_1^3+X_2^3)\neq 0$; Eq. (29)
since, under Galois field arithmetic, this expression expands to $A=X_1X_2(X_1+X_2)^2$, which is non-zero whenever $X_1$ and $X_2$ are non-zero and distinct; thus A≠0 indicates that two errors have occurred;
Using the cyclic characteristic of Galois Fields (GF), assume
$x=\sigma_1 z$; Eq. (32)
in order to simplify $\sigma(x)=x^2+\sigma_1x+\sigma_0$; it is then easy to obtain
$\sigma(z)=z^2+z+K$; Eq. (32A)
where $K=\sigma_0/\sigma_1^2$.
Once the roots of the above equation are found, the error locations are recovered by applying again
$x=\sigma_1 z$; Eq. (33)
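The substitution can be written out step by step. The sketch below assumes $\sigma_1\neq 0$, which holds in the two-error case because $\sigma_1=X_1+X_2$ with $X_1\neq X_2$; dividing by $\sigma_1^2$ does not change the roots.

```latex
\begin{aligned}
\sigma(\sigma_1 z) &= (\sigma_1 z)^2 + \sigma_1 (\sigma_1 z) + \sigma_0 \\
                   &= \sigma_1^2 z^2 + \sigma_1^2 z + \sigma_0 \\
                   &= \sigma_1^2 \left( z^2 + z + \frac{\sigma_0}{\sigma_1^2} \right)
                    = \sigma_1^2 \left( z^2 + z + K \right),
\qquad K = \frac{\sigma_0}{\sigma_1^2}.
\end{aligned}
```

Each root $Z$ of $z^2+z+K=0$ therefore corresponds to a root $X=\sigma_1 Z$ of $\sigma(x)$.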
The roots of $\sigma(x)$ are the error locations $X_1$, $X_2$, where the error bits occur.
Most of the BCH decoding effort lies in finding these two roots, which lead to the error bit locations.
In one embodiment of the present invention, requiring less hardware, the following is performed. Assume $Z_1$ and $Z_2$ are roots of $\sigma(z)$,
$Z_1^2+Z_1+K=0$; Eq. (34)
$Z_2^2+Z_2+K=0$; Eq. (35)
Subtracting Eq. (35) from Eq. (34) results in:
$(Z_1^2-Z_2^2)+(Z_1-Z_2)=0$, Eq. (36)
In Galois field operations, "−" is identical to "+",
so there is obtained
$(Z_1^2+Z_2^2)+(Z_1+Z_2)=0$, Eq. (37)
Since
$2Z_1Z_2=Z_1Z_2+Z_1Z_2=0$; Eq. (38)
because two identical terms added together equal zero under Galois operations, there is obtained $(Z_1^2+Z_2^2+2Z_1Z_2)+(Z_1+Z_2)=0$,
$(Z_1+Z_2)^2+(Z_1+Z_2)=0$; Eq. (39)
$(Z_1+Z_2)(Z_1+Z_2+1)=0$; Eq. (40)
which results in $Z_1+Z_2=0$; or
$Z_1+Z_2+1=0$; Eq. (41)
However, $Z_1=Z_2$ is not possible, as the two error locations cannot be the same; thus,
$Z_1=Z_2+1$;
or
$Z_2=Z_1+1$;
or
$Z_1+Z_2=1$; Eq. (42)
These three equations are equivalent and hold at the same time under Galois operations.
Also, the number '1' in Eqs. (41) and (42) corresponds to (100 0000 0000) in GF($2^{11}$), so the bits of $Z_1$ and $Z_2$ at bit position 0 (written leftmost here) must be opposite to each other.
For example, $Z_1$=011 1010 0110; $Z_2$=111 1010 0110;
the leftmost bit (bit position 0) toggles between the two values, in accordance with the above explanation.
Again, $Z_1^2+Z_1+K=0$;
$Z_1(Z_1+1)+K=0$; Eq. (43)
$Z_1(Z_1+1)=K$; Eq. (44)
Assuming:
Then
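$\beta_j$ is only 1 or 0 in the above derivation, so the equalities $\beta_j\beta_j=\beta_j$ and $\beta_j+\beta_j=0$ hold.
The two terms in Eqs. (44) and (45) can be swapped without influencing the final result.
Multiplying the two terms together, we get
$(\beta_{10}\alpha^{20}+\beta_9\alpha^{18}+\beta_8\alpha^{16}+\beta_7\alpha^{14}+\beta_6\alpha^{12})+(\beta_{10}+\beta_5)\alpha^{10}+\beta_9\alpha^9+(\beta_8+\beta_4)\alpha^8+\beta_7\alpha^7+(\beta_6+\beta_3)\alpha^6+\beta_5\alpha^5+(\beta_4+\beta_2)\alpha^4+\beta_3\alpha^3+(\beta_2+\beta_1)\alpha^2+\beta_1\alpha=K$; (46A)
owing to the fact that $\beta_7\beta_7=\beta_7$;
$K_j$ (j=10 . . . 0) are the coefficients of the 11-bit symbol value;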
Substituting $\beta_{10}$ in Eq. (47), we get
$\beta_5=K_{10}+K_0$; Eq. (58)
Adding Eqs. (50) and (53),
$\beta_7=K_3+K_6$; Eq. (59)
From Eq. (51),
$\beta_8+\beta_5=K_5+K_3+K_6$;
From Eq. (53),
$\beta_6+\beta_3=K_6$;
From Eq. (49),
$\beta_9+\beta_8=K_7+K_3+K_6$;
From Eq. (51),
$\beta_8=K_5+K_{10}+K_0+K_3+K_6$; Eq. (60)
From Eq. (48),
$\beta_4=K_5+K_{10}+K_0+K_3+K_6+K_8$; Eq. (61)
From Eq. (52),
$\beta_2=K_4+K_5+K_{10}+K_0+K_3+K_6+K_8$; Eq. (62)
From Eq. (54),
$\beta_1=K_2+K_4+K_5+K_{10}+K_3+K_6+K_8$; Eq. (63)
From Eq. (55),
$\beta_6=K_1+K_2+K_4+K_5+K_{10}+K_3+K_6+K_8$; Eq. (64)
From Eq. (53),
$\beta_3=K_3+K_1+K_2+K_4+K_5+K_{10}+K_8$; Eq. (65)
After all $\beta_j$ are found, $Z_1$ is found; as we know from Eq. (41), $Z_2$ can then be found by adding 1, i.e., (100 0000 0000), to it.
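The XOR-only root finding described above can be sketched in software as follows. This is a minimal illustration under stated assumptions, not the disclosed circuit: field elements are stored as Python integers in which bit j is the coefficient of $\alpha^j$ (so the '1' written above as (100 0000 0000) is the integer 1), the coefficient of $\alpha^0$ in $Z_1$ is taken to be zero, and the coefficient relations in `solve_z` were re-derived for this sketch by reducing $Z_1^2+Z_1$ with $\alpha^{11}=\alpha^2+1$. The names `gf_mul` and `solve_z` are hypothetical.

```python
M = 11
POLY = 0x805  # x^11 + x^2 + 1, the primitive polynomial used in the text


def gf_mul(a: int, b: int) -> int:
    """Multiply two GF(2^11) elements (bit j = coefficient of alpha^j)."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        b >>= 1
        a <<= 1
        if a & (1 << M):
            a ^= POLY  # reduce using alpha^11 = alpha^2 + 1
    return result


def solve_z(K: int) -> int:
    """Return the root Z1 of Z^2 + Z + K = 0 whose alpha^0 coefficient is 0.

    Uses only XORs of the bits of K. Assumes K is a value for which
    roots exist, as is the case when exactly two errors occurred.
    """
    k = [(K >> j) & 1 for j in range(M)]
    b = [0] * M
    b[10] = k[0]
    b[5] = k[10] ^ k[0]
    b[7] = k[3] ^ k[6]
    b[8] = k[5] ^ b[5] ^ b[7]
    b[9] = k[7] ^ b[8] ^ b[7]
    b[4] = k[8] ^ b[8]
    b[2] = k[4] ^ b[4]
    b[1] = k[2] ^ k[0] ^ b[2]
    b[6] = k[1] ^ b[1]
    b[3] = k[6] ^ b[6]
    return sum(bit << j for j, bit in enumerate(b))


# Build K from a known root, then recover both roots from K alone.
z = 0b01100101110
K = gf_mul(z, z) ^ z
z1 = solve_z(K)
z2 = z1 ^ 1                      # Z2 = Z1 + 1
assert gf_mul(z1, z1) ^ z1 == K
assert gf_mul(z2, z2) ^ z2 == K
```

No table look-up or search over the code length is involved; the cost of `solve_z` is a fixed number of XORs regardless of where the errors lie.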
The $X_1$, $X_2$ values are recovered by using Eq. (32),
$X_1=\sigma_1 Z_1$; Eq. (65)(a)
$X_2=\sigma_1 Z_2$; Eq. (65)(b)
and
$e(x)=X_1+X_2$; Eq. (66)
The correct code word c(x) can be obtained from Eq. (2) by adding r(x) and e(x).
As explained above, the error locations $X_1$, $X_2$ need only be calculated from $K_j$, which in turn comes from the syndrome-derived value $\sigma_0/\sigma_1^2$, using very simple exclusive-OR operations. This requires neither ROM silicon, whose area is proportional to the code size, nor complex operations that require a large amount of hardware to implement.
Another, easier approach for a BCH application, used here for theory verification, is GF($2^5$) with m=5, which limits the total c(x) code word to 31 bits with double-bit error correction capability. 10 bits are reserved for parity and 21 bits for the message.
The same equation applies for the error location polynomial, and finding the two roots $\sigma_1 Z_1$ and $\sigma_1 Z_2$ tells exactly where the errors are.
Assume
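$Z_1=\beta_1\alpha+\beta_2\alpha^2+\beta_3\alpha^3+\beta_4\alpha^4$; Eq. (67)
Then
Multiplying the two terms together, we get
$(\beta_4\alpha^8+\beta_3\alpha^6)+(\beta_2+\beta_4)\alpha^4+\beta_3\alpha^3+(\beta_2+\beta_1)\alpha^2+\beta_1\alpha=K$; Eq. (69)
$\alpha^8$=(1011 0)=$1+\alpha^2+\alpha^3$;
$\alpha^6$=(0101 0)=$\alpha+\alpha^3$;
$\alpha^5$=(1010 0)=$\alpha^2+1$; since the generation polynomial is $x^5+x^2+1$
$\alpha^4$=(0000 1);
$\alpha^3$=(0001 0);
$\alpha^2$=(0010 0);
$\alpha$=(0100 0);
$(\beta_4(1+\alpha^2+\alpha^3)+\beta_3(\alpha+\alpha^3))+(\beta_2+\beta_4)\alpha^4+\beta_3\alpha^3+(\beta_2+\beta_1)\alpha^2+\beta_1\alpha=K$; Eq. (70)
$(\beta_2+\beta_4)\alpha^4+\beta_4\alpha^3+(\beta_4+\beta_2+\beta_1)\alpha^2+(\beta_3+\beta_1)\alpha+\beta_4=K$; Eq. (71)
$\beta_2+\beta_4=K_4$; Eq. (72)
$\beta_4+\beta_2+\beta_1=K_2$; Eq. (73)
$\beta_3+\beta_1=K_1$; Eq. (74)
$\beta_4=K_0$; Eq. (75)
We get
$\beta_2=K_4+K_0$; Eq. (76)
$\beta_1=K_2+(K_4+K_0)+K_0=K_2+K_4$; Eq. (77)
$\beta_3=K_1+K_2+K_4$; Eq. (78)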
From the above $Z_1$ and $Z_2$, we can also find $X_1$ and $X_2$ as $\sigma_1 Z_1$ and $\sigma_1 Z_2$,
$e(x)=X_1+X_2$; Eq. (66)
and by adding e(x) to r(x), c(x) is recovered, which is the original encoded data from the flash memory.
Assume that 100 0001, the ASCII representation of "A", is the correct message stored in flash memory, and that a 10-bit parity code is stored in the spare area for 2-bit error decoding purposes.
So
$g(x)=x^{10}+x^9+x^8+x^6+x^5+x^3+1$; Eq. (81)
which is obtained by multiplying $m_1(x)$ and $m_3(x)$.
0 0000 0000 0100 0001 01 1001 0011 is the correct code word c(x), in which the 10 bits in the second column are the parity bits output by the BCH encoder and the 21 bits in the first column are the message bits.
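The parity bits of this code word can be reproduced with a short sketch of systematic encoding, i.e., appending the remainder of $x^{10}\cdot m(x)$ divided by g(x). Systematic encoding by polynomial remainder is an assumption made for this illustration (the text does not detail the encoder), as is the integer representation in which bit j is the coefficient of $x^j$; the names `poly_mod` and `encode` are hypothetical.

```python
G = 0b11101101001  # g(x) = x^10 + x^9 + x^8 + x^6 + x^5 + x^3 + 1, Eq. (81)


def poly_mod(a: int, g: int) -> int:
    """Remainder of a(x) divided by g(x) over GF(2); bit j = coefficient of x^j."""
    while a.bit_length() >= g.bit_length():
        a ^= g << (a.bit_length() - g.bit_length())
    return a


def encode(message: int) -> int:
    """Systematic BCH(31, 21) encoding: 21 message bits followed by 10 parity bits."""
    shifted = message << 10
    return shifted | poly_mod(shifted, G)


codeword = encode(0b1000001)  # ASCII "A"
print(f"{codeword >> 10:021b} {codeword & 0x3FF:010b}")
# prints: 000000000000001000001 0110010011
```

The printed parity field, 01 1001 0011, matches the second column of the code word above, and the code word leaves a zero remainder when divided by g(x).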
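If this c(x) is fed into a circuit, $S_0(\alpha)=r(\alpha)=1+\alpha+\alpha^4+\alpha^7+\alpha^8+\alpha^{10}+\alpha^{16}=0$;
(10000) ← 1
(01000) ← $\alpha$
(00001) ← $\alpha^4$
(00101) ← $\alpha^7$
(10110) ← $\alpha^8$
(10001) ← $\alpha^{10}$
(11011) ← $\alpha^{16}$
(00000) ← 0
$S_1(\alpha^2)=r(\alpha^2)=1+\alpha^2+\alpha^8+\alpha^{14}+\alpha^{16}+\alpha^{20}+\alpha=0$;
$S_2(\alpha^3)=r(\alpha^3)=1+\alpha^3+\alpha^{12}+\alpha^{21}+\alpha^{24}+\alpha^{30}+\alpha^{17}=0$;
$S_3(\alpha^4)=r(\alpha^4)=1+\alpha^4+\alpha^{16}+\alpha^{28}+\alpha+\alpha^{40}+\alpha^2=0$;
Since all syndrome values are zero, E_count=0, i.e., the error count is equal to zero;
Err_adr1=0; Err_adr2=0; there are no errors, so all address indications are zero.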
However, if a single error occurs, assume 0 0000 0000 0101 0001,01 1001 0011 is the r(x) received, where the erroneous bit is the 1 at bit position 14 (0101 instead of 0100 in the message bits); this information is fed into a circuit and there is obtained:
$S_0(x)=r(x) \bmod m_1(x)=x^4+x^3+x^2+1$; Eq. (82)
$S_1(x)=r(x) \bmod m_2(x)=x^4+x^3+x^2+1$; Eq. (83)
$S_2(x)=r(x) \bmod m_3(x)=x^4+x+1$; Eq. (84)
$S_3(x)=r(x) \bmod m_4(x)=x^4+x^3+x^2+1$; Eq. (85)
with mod being the modulo function. $S_0(\alpha)=\alpha^4+\alpha^3+\alpha^2+1=\alpha^{14}$ from Eq. (82); or $S_0(\alpha)=r(\alpha)=\alpha^{16}+\alpha^{14}+\alpha^{10}+\alpha^8+\alpha^7+\alpha^4+\alpha+1=\alpha^{14}$; the two equations end with the same result, even without the minimal polynomial involved,
so the error location is the 14th bit of the received r(x).
$S_1(\alpha^2)=\alpha^8+\alpha^6+\alpha^4+1$;
$S_2(\alpha^3)=\alpha^{12}+\alpha^3+1$;
$S_3(\alpha^4)=\alpha^{16}+\alpha^{12}+\alpha^8+1$;
$A=S_1^2+S_0S_2=0$; Eq. (15)
$B=S_1S_2+S_0S_3=0$; Eq. (16)
$C=S_2^2+S_1S_3=0$; Eq. (17)
From these it can be directly derived that only one error has occurred.
E_count=1;
Err_adr1=$S_0(\alpha)$=Loc 14;
Err_adr2=0;
Inverting bit position 14 of r(x) recovers the original c(x).
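The single-error syndromes of Eqs. (82) and (84) can be checked with a brief sketch that takes the remainder of r(x) modulo each minimal polynomial. Two assumptions are made: integers hold polynomials with bit j as the coefficient of $x^j$, and $m_3(x)$ is taken to be $x^5+x^4+x^3+x^2+1$, the factor for which $m_1(x)\cdot m_3(x)$ equals g(x) of Eq. (81). The helper names are hypothetical.

```python
M1 = 0b100101  # m1(x) = x^5 + x^2 + 1
M3 = 0b111101  # m3(x) = x^5 + x^4 + x^3 + x^2 + 1 (assumed, see lead-in)


def poly_mod(a: int, m: int) -> int:
    """Remainder of a(x) divided by m(x) over GF(2)."""
    while a.bit_length() >= m.bit_length():
        a ^= m << (a.bit_length() - m.bit_length())
    return a


def alpha_pow(n: int) -> int:
    """alpha^n in GF(2^5), by repeated multiplication by alpha modulo m1(x)."""
    x = 1
    for _ in range(n % 31):
        x <<= 1
        if x & 0b100000:
            x ^= M1
    return x


c = (0b1000001 << 10) | 0b0110010011  # correct code word for "A"
r = c ^ (1 << 14)                     # single error at bit position 14

s0 = poly_mod(r, M1)                  # Eq. (82): x^4 + x^3 + x^2 + 1
s2 = poly_mod(r, M3)                  # Eq. (84): x^4 + x + 1
print(f"{s0:05b} {s2:05b}")           # prints: 11101 10011
assert s0 == alpha_pow(14)            # S0 equals alpha^14, i.e. location 14
```

Because c(x) is a multiple of both minimal polynomials, each remainder depends only on the error term $x^{14}$, which is why $S_0$ directly yields the error location.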
Also assume that two errors occurred, at bit positions 14 and 8:
0 0000 0000 0101 0001, 00 1001 0011 is the r(x) received; after circuit analysis, there is obtained:
$S_0(\alpha)=r(\alpha)=\alpha^{16}+\alpha^{14}+\alpha^{10}+\alpha^7+\alpha^4+\alpha+1=\alpha^4$; (where $\alpha^{14}$ is the error term introduced into r(x), for example.)
$K_0=1$; $K_1=0$; $K_2=1$; $K_3=1$; $K_4=1$ from above K values; Substitute these Ki values into
For $Z_1$, $\beta_4=1$, $\beta_3=0$, $\beta_2=0$, $\beta_1=0$, $\beta_0=0$; which is (00001)=$\alpha^4$, and $Z_2$=(10001)=$\alpha^{10}$;
$X_1=\alpha^4\cdot\alpha^4=\alpha^8$; (From Eqs. (65)(a) and (65)(b), the true root must be multiplied by $\sigma_1$)
$X_2=\alpha^4\cdot\alpha^{10}=\alpha^{14}$;
Since A, B and C are not zero, it is implied that the error count is two.
E_count (error count)=2;
Err_adr1 (error address of first error)=0E;
Err_adr2 (error address of second error)=8; which identifies an error located in the parity bits. From the above $X_1$ and $X_2$ values, the order of the two addresses is irrelevant and they can be exchanged.
Inverting the two error bit locations of r(x) recovers c(x); because BCH operates on bit errors rather than on symbol errors, inverting the erroneous bit positions of r(x) recovers the original data.
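The zero-, one- and two-error cases of this GF($2^5$) example can be put together in a small software sketch. It is an illustration under stated assumptions rather than the disclosed hardware: field elements are integers with bit j as the coefficient of $\alpha^j$, multiplication and division use log/antilog tables (a convenience of this sketch only), at most two errors are present, and all function names are hypothetical. The root finder `solve_z` uses only the XOR relations of Eqs. (75) through (78).

```python
M, POLY, N = 5, 0b100101, 31  # GF(2^5), x^5 + x^2 + 1, code length 31

# Antilog/log tables: EXP[i] = alpha^i, LOG[alpha^i] = i.
EXP, LOG = [], {}
x = 1
for i in range(N):
    EXP.append(x)
    LOG[x] = i
    x <<= 1
    if x & (1 << M):
        x ^= POLY


def mul(a, b):
    return 0 if a == 0 or b == 0 else EXP[(LOG[a] + LOG[b]) % N]


def div(a, b):
    return 0 if a == 0 else EXP[(LOG[a] - LOG[b]) % N]


def syndrome(r, j):
    """Evaluate r(x) at alpha^j."""
    s = 0
    for i in range(N):
        if (r >> i) & 1:
            s ^= EXP[(i * j) % N]
    return s


def solve_z(K):
    """Root Z1 of Z^2 + Z + K = 0 with zero alpha^0 coefficient."""
    k = [(K >> j) & 1 for j in range(M)]
    b4 = k[0]                 # Eq. (75)
    b2 = k[4] ^ k[0]          # Eq. (76)
    b1 = k[2] ^ k[4]          # Eq. (77)
    b3 = k[1] ^ k[2] ^ k[4]   # Eq. (78)
    return (b1 << 1) | (b2 << 2) | (b3 << 3) | (b4 << 4)


def locate_errors(r):
    """Return the sorted bit positions of up to two errors in r."""
    S0, S1, S2, S3 = (syndrome(r, j) for j in (1, 2, 3, 4))
    if S0 == S1 == S2 == S3 == 0:
        return []                          # no error
    A = mul(S1, S1) ^ mul(S0, S2)          # Eq. (15)
    B = mul(S1, S2) ^ mul(S0, S3)          # Eq. (16)
    C = mul(S2, S2) ^ mul(S1, S3)          # Eq. (17)
    if A == B == C == 0:
        return [LOG[S0]]                   # single error, X1 = S0
    sigma1, sigma0 = div(B, A), div(C, A)  # Eqs. (10) and (11)
    K = div(sigma0, mul(sigma1, sigma1))
    z1 = solve_z(K)
    z2 = z1 ^ 1                            # Z2 = Z1 + 1
    return sorted(LOG[mul(sigma1, z)] for z in (z1, z2))


c = (0b1000001 << 10) | 0b0110010011       # code word for "A"
print(locate_errors(c))                          # prints: []
print(locate_errors(c ^ (1 << 14)))              # prints: [14]
print(locate_errors(c ^ (1 << 14) ^ (1 << 8)))   # prints: [8, 14]
```

Inverting the reported bit positions in r recovers the code word; with more than two errors the sketch gives no meaningful result.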
The embodiments of the invention have various applications, among which are memory systems. One such application is in the integrated circuit card disclosed in a related application, i.e. U.S. Pat. No. 6,547,130, issued on Apr. 15, 2003, entitled "Integrated circuit card with fingerprint verification capability", the disclosure of which is incorporated herein as though set forth in full.
Exemplary implementation of the foregoing is shown relative to
After step 102, σ0 and σ1 are calculated from the syndromes S0-S3, in accordance with the foregoing equations, for the first section at step 106. Similarly, after step 104, σ0 and σ1 are calculated from the syndromes S0-S3, in accordance with the foregoing equations, for the three remaining sections of the page at step 108.
After step 106, K is calculated at step 110, based on the foregoing equations, for the first section, and after step 108, K is calculated at step 112 for the remaining three sections.
After step 110, Z1 and Z2 are calculated for the first section of the page at step 114, based on the foregoing equations, and are used to calculate X1 and X2 at step 118. Then, at step 122, a value of '1' is placed at each of the error locations X1 and X2, the results are ORed together to form e(x), and e(x) is XORed with R1(x) to recover the first 128 bytes of data, i.e. the first data segment.
After step 112, Z1 and Z2 are calculated for the remaining three sections of the page at step 116, based on the foregoing equations, and are used to calculate X1 and X2 at step 118. Then, at step 124, X1 and X2 are, in the same manner as described above, XORed with R2(x), R3(x) and R4(x) to recover the 2nd, 3rd and 4th 128-byte data sections.
Next, at step 132, A, B and C values are calculated from the calculated syndrome values of step 130, in accordance with Eqs. (15) through (17), respectively. Next, at 134, it is determined whether the syndromes S0(α), S1(α), S2(α) and S3(α) are all equal to zero and, if so, no error is detected at 136. If at 134 it is determined that the syndromes are not all zero, another determination is made at 138 as to whether A, B and C are all equal to zero and, if it is determined that they are not, two errors are detected at step 140. After step 140, at step 144, K is calculated as σ0/(σ1)2 from the A, B and C values. Next, at step 148, Z1 and Z2 are calculated, pursuant, for example, to the circuits of
At 138, if it is determined that the A, B and C values are all equal to zero, the process continues to step 142, where a single error is detected and X1 is set equal to S0. After step 142, step 146 is performed.
The shift registers 154 and the XORs 156 and 158, shown coupled in a manner consistent with that of
$S_2(x)=r(x) \bmod m_2(x)$; in this example, m=11 and $m_0=m_1=m_3=x^{11}+x^2+1$, but $m_2=x^{11}+x^8+x^2+1$; this is the reason for S2 being generated differently, as shown in
The BCH decoder 10, shown in
The host system 1000 is shown to include a host MMC controller 1200, which couples the host system 1000 to the controller 200. The host system 1000 stores information into, or retrieves information from, the flash memory system 1400-1600 through the controller 200. However, as previously noted, the retrieved information may include errors; thus, the BCH decoder determines the number of errors and locates the errors within a page of information stored in the flash memory systems, in a manner consistent with the foregoing
Although the present invention has been described in terms of specific embodiments, it is anticipated that alterations and modifications thereof will no doubt become apparent to those skilled in the art. It is therefore intended that the following claims be interpreted as covering all such alterations and modifications as fall within the true spirit and scope of the invention.
This application is a continuation-in-part (CIP) of the application entitled “Electronic Data Storage Medium with Fingerprint Verification Capability”, U.S. patent application Ser. No. 09/478,720, filed on Jan. 6, 2000 now U.S. Pat. No. 7,257,714, and a continuation-in-part of the application entitled “Flash Memory Controller For Electronic Data Flash Card”, U.S. patent application Ser. No. 11/466,759, filed on Aug. 23, 2006 now U.S. Pat. No. 7,702,831, which is a CIP of “System and Method for Controlling Flash Memory”, U.S. patent application. Ser. No. 10/789,333, filed on Feb. 26, 2004 now U.S. Pat. No. 7,318,117, all of which are incorporated herein as though set forth in full.
Number | Date | Country | |
---|---|---|---|
Parent | 11466759 | Aug 2006 | US |
Child | 11657243 | US | |
Parent | 10789333 | Feb 2004 | US |
Child | 11466759 | US | |
Parent | 09478720 | Jan 2000 | US |
Child | 10789333 | US |