The present application claims priority under 35 U.S.C. § 119(a) to Korean Patent Application No. 10-2018-0043797, filed on Apr. 16, 2018, and Korean Patent Application No. 10-2018-0167946, filed on Dec. 21, 2018, which are incorporated herein by reference in their entireties.
Various embodiments may generally relate to a Reed Solomon decoder and a semiconductor device including the Reed Solomon decoder.
The conventional Reed Solomon decoder includes a syndrome calculation (SC) circuit 10 for calculating a syndrome from a codeword, a key equation solver (KES) circuit 20 for solving a key equation using a syndrome, and a Chien search and error evaluation (CEEE) circuit 30 for performing a Chien search operation and an error evaluation operation using the output of the KES circuit 20 and an error correction circuit 40 for providing an error corrected output data.
The conventional KES circuit 20 outputs an error location polynomial and an error evaluation polynomial from the syndrome. To do this, the calculation is performed while looping 2t times, where 2t corresponds to the number of parity symbol included in the codeword. For example, for a block encoded using Reed Solomon coding having 255 symbols, of which 223 are data symbols and 32 are parity symbols, 2t=32.
For example, if one clock is required for the operation of the SC circuit 10, two clocks for the operation of the CSEE circuit 30, and one clock for the KES circuit 20 to loop once, the latency becomes 2t+3.
Also, the conventional Reed Solomon decoder cannot start a decoding operation for a new codeword during a decoding operation for a previously inputted codeword.
Accordingly, there is a problem that the conventional Reed Solomon decoder has a long latency and low throughput and thus cannot perform a high speed decoding.
In accordance with the present teachings, a Reed Solomon decoder may include a syndrome calculation (SC) circuit configured to calculate a syndrome from a codeword; a key equation solver (KES) circuit configured to calculate an error location polynomial and an error evaluation polynomial from the syndrome; and a Chien search and error evaluation (CSEE) circuit configured to calculate an error location and an error value from the error location polynomial and the error evaluation polynomial, wherein the KES circuit comprises a plurality of sub-KES circuits and each of the plurality of sub-KES circuits, the SC circuit and the CSEE circuit constitutes pipeline stages respectively.
In accordance with the present teachings, a semiconductor device may include an error correction encoder configured to output a codeword by encoding data; a memory cell array configured to store the codeword output from the error correction encoder; and an error correction decoder configured to output error corrected data by decoding a codeword output from the memory cell.
The accompanying figures, where like reference numerals refer to identical or functionally similar elements throughout the separate views, together with the detailed description below, are incorporated in and form part of the specification, and serve to further illustrate embodiments of concepts that include the claimed novelty, and explain various principles and advantages of those embodiments.
The following detailed description references the accompanying figures in describing illustrative embodiments consistent with this disclosure. The embodiments are provided for illustrative purposes and are not exhaustive. Additional embodiments not explicitly illustrated or described are possible. Further, modifications can be made to presented embodiments within the scope of the present teachings. The detailed description is not meant to limit this disclosure. Rather, the scope of the present disclosure is defined in accordance with the presented claims and equivalents thereof.
The Reed Solomon decoder 1 includes a syndrome calculation (SC) circuit 100, a key equation solver (KES) circuit 200, and a Chien search and error evaluation (CSEE) circuit 300.
The Reed Solomon decoder 1 may further include a register 500 for sequentially queuing input codewords and an error correction circuit 400 to output error corrected data produced using a codeword output from the register 500 and an error location and an error value output from the CSEE circuit 300.
The SC circuit 100 outputs a syndrome including syndrome elements S0 to S2t−1 produced using the codeword r(x).
Hereinafter, the codeword may be represented by a codeword polynomial, and the syndrome may be represented by a syndrome polynomial.
The KES circuit 200 outputs an error location polynomial λ(x) and an error evaluation polynomial Ω(x), each produced using the syndrome.
CSEE circuit 300 receives the error location polynomial λ(x) and the error evaluation polynomial Ω(x) and outputs an error location and an error value each produced using the received polynomials.
In this embodiment, the basic operation principle of the SC circuit 100, the KES circuit 200 and the CSEE circuit 300 is similar to that of the conventional Reed Solomon decoder of
For example, the CSEE circuit 300 may be configured to perform a Chien Search algorithm and a Forney algorithm.
Since these algorithms are well known, a detailed description thereof will be omitted.
The SC circuit 100 and the CSEE circuit 300 may have a parallel structure in order to increase the processing speed.
In an embodiment, the KES circuit 200 includes a plurality of sub-KES circuits 210-0 to 210-(t−1) connected in series.
In an embodiment, the KES circuit 200 includes t sub-KES circuits 210-0, 210-1, . . . , 210-(t−1) (where t is a natural number). In an embodiment, t is equal to half the number of parity symbols included in the codeword r(x).
The t sub-KES circuits 210-0 to 210-(t−1) replace the conventional KES circuit 20 which performs the operation by looping 2t times.
At this time, each of the sub-KES circuits 210-0 to 210-(t−1), the SC circuit 100, and the CSEE circuit 300 may respectively constitute a pipeline stage.
For example, if in an embodiment each pipeline stage requires two clock cycles to perform its respective operation, the overall latency may be t+4 clock cycles.
Also, since the Reed Solomon decoder 1 according to an embodiment of the present invention operates in a pipelined manner, a decoding operation for a new codeword can be performed while a decoding operation for a previously input codeword is performed, thereby a throughput may be increased. For example, in the embodiment wherein each pipeline stage requires two clock cycles to perform its respective operation, a new codeword can be decoded every 2 clock cycles.
The register 500 sequentially queues codewords and provides a codeword corresponding to an error location and an error value output from the CSEE circuit 300 to the error correction circuit 400. In an embodiment, the register 500 is configured to queue a number of codewords corresponding to the total number of pipe stages in the SC circuit 100, the KES circuit 200, and the CSEE circuit 300, and to operate as a first-in, first-out (FIFO) queue.
The error correction circuit 400 modifies the codeword from the register 500 according to the error location and the error value to produce and output an error corrected data.
The sub-SC circuit 110 outputs a syndrome element Si and the SC circuit 100 includes a plurality of sub-SC circuit 110 arranged in parallel to output a plurality of syndrome elements Si (i=0, . . . , 2t−1) at the same time, wherein 2t corresponds to a number of parity symbols in the codeword r(x).
Hereinafter, a codeword polynomial is represented by r(x), an error corrected data polynomial is represented by c(x), and an error polynomial is expressed by e(x).
The codeword polynomial r(x) can be expressed as:
r(x)=c(x)+e(x)
If a message polynomial is m(x) and a codeword generating polynomial is g(x), the codeword polynomial can be expressed as:
g(x)=(x−α0) . . . (x−α2t−1)
c(x)=m(x)g(x)
In the above, αi(i=0, . . . , 2t−1) is a root of the primitive polynomial constituting a Galois field.
In a syndrome polynomial, each syndrome element Si is expressed as:
Si=r(αi)=c(αi)+e(αi)=e(αi),i=0, . . . 2t−1
The rth sub-KES circuit 210 receives inputs such as θi(r), δi(r), γ(r) and k(r) from a previous sub-KES circuit (i=0, 1, . . . , 3t). For example, θi(r) are input to ith PE circuit 211-i (i=0, 1, . . . ,3t), δi+1(r) are input to ith PE circuit 211-i (i=0, 1, . . . , 3t−1) and 3tth PE circuit 211-3t receives a fixed value 0 instead of δ3t+1(r). δ0(r), γ(r) and k(r) are input to a control logic 212.
For the 0th sub-KES circuit 210-0, the inputs θi(0) and δi(0) are initialized to Si (i=0, 1, . . . , 2t−1), the inputs θi(0) and δi(0) are initialized to 0 (i=2t, 2t+1, . . . , 3t−2, 3t−1), the inputs θ3t(0) and δ3t(0) are initialized to 1, k(0) is initialized to 0 and the input γ(0) is initialized to 1.
The rth sub-KES circuit 210 provides outputs such as θi(r+1), δi(r+1), γ(r+1) and k(r+1) to a next sub-KES circuit, where θi(r+1) and θi(r+1) are output from ith PE circuit 211-i and γ(r+1) and k(r+1) are output from a control logic 212 (i=0, 1, . . . , 3t).
The (t−1)th sub-KES circuit 210-(t−1) provides coefficients λi(t) of an error location polynomial λ(x) and coefficients Ωi(t) of an error evaluation polynomial Ω(x), where λi(t)=δi+1(t) and Ωi(t)=δi(t) (i=0, 1, . . . , t−1).
Operation of the PE circuits 211-0 to 211-3t is described below.
In an embodiment of the invention, the KES circuit 200 performs a 2-stage unfolded RiBM algorithm disclosed in
This embodiment includes t sub-KES circuits 210-0 to 210-(t−1) in the KES circuit 200. Each sub-KES circuit 210-r (r=0, 1, . . . ,t−1) performs operations corresponding to lines 6 to 35 of the algorithm of
Since the sub-KES circuits 210-0 to 210-(t−1) are connected in series, the rth sub-KES circuit 210-r performs an operation corresponding to the r value iterated on line 5 of the algorithm of
The control circuit 212 controls each of the 3t+1 PE circuits 211-0 to 211-3t to perform a first operation and a second operation. The first operation corresponds to lines 7 to 20 of the algorithm of
The control circuit 212 controls each of the 3t+1 PE circuits 211-0 to 211-3t to perform the first operation and then to perform the second operation. The first operation and the second operation are sequentially performed at each of the 3t+1 PE circuits 211-0 to 211-3t.
The PE circuit 211 includes terminals for inputting and outputting signals necessary for a first operation performed at lines 7, 10 and 16 of the algorithm of
For example, the PE circuit 211 receives inputs such as δi+1(r), θi(r), δ0(r), γ(r), δ′0(r), γ′(r), MC(r) and MC′(r). The PE circuit 211 generates outputs such as δi(r+1), θi(r+1), δ0(r), γ(r), δ′0(r) and γ′(r).
The PE circuit 211 includes a first operation circuit 2111 for a first operation and a second operation circuit 2112 for a second operation. The first operation circuit 2111 includes a first operation block 21111 and a second operation block 21112. The second operation circuit 2112 includes a third operation block 21121 and a fourth operation block 21122.
The detailed block diagram disclosed in
For example, the first operation block 21111 performs an operation corresponding to a line 7 of
Since each operation block is a direct representation of corresponding operation in the algorithm, a detailed description thereof will be omitted.
The control circuit 212 performs a first control operation performed at lines 8, 11, 12, 17 and 18 of the algorithm of
The control circuit 212 includes a first control circuit 2121 for a first control operation and a second control circuit 2122 for a second control operation. The first control circuit 2121 includes a first control block 21211, a second control block 21212 and a third control block 21213. The second control circuit 2122 includes a fourth control block 21221, a fifth control block 21222 and a sixth control block 21223. D flipflops are included in the fifth control block 21222 and the sixth control block 21223 for keeping data at a corresponding pipeline stage.
The signal indicating the determination result at line 8 is denoted as MC(r), and the signal indicating the determination result at line 23 is denoted as MC′(r).
Since the circuit of
For example, the first control block 21211 generates a signal MC(r) which corresponds to a signal indicating the determination result of line 8 of
The operations of line 12 and 27 are related to 2's complement operation to represent negative value. For example, a negative value of k(r) may correspond to a bitwise inversion of k(r) plus 1. Therefore, the bitwise inversion of k(r) may be represented by the negative value of k(r) minus 1 like the line 12 of
Since each control block is a direct representation of corresponding operation in the algorithm, a detailed description thereof will be omitted.
The CSEE circuit 300 includes a Chien Search (CS) circuit 310 and an error evaluation (EE) circuit 320.
The CS circuit 310, which is a circuit that implements a Chien search algorithm, receives the error location polynomial λ(x) output from the KES circuit 200, and calculates and outputs an error location.
The EE circuit 320, which is a circuit that implements a Forney algorithm, receives the error evaluation polynomial Ω(x) output from the KES circuit 200 and the error location output from the CS circuit 310, and calculates and outputs an error value.
Various circuits that implement the Chien search algorithm or the Forney algorithm are known.
In order to improve the operation speed, it is preferable to implement a circuit in a parallel manner.
The semiconductor device 2 includes an input buffer 610 for receiving data, an error correction encoder 620 for encoding the data output from the input buffer 610 according to an error correction algorithm to output a codeword, a memory cell array 630 for storing a codeword output from the error correction encoder 620, an error correction decoder 640 for decoding a codeword output from a memory cell array 630 according to an error correction algorithm and outputting error corrected data, and an output buffer 650 for buffering and outputting data from the error correction decoder 640.
The memory cell array 630 may store data and parity separately. In this case, the memory cell array 630 may include a main cell array 631 for storing data and a parity cell array 632 for storing parity.
In this embodiment, the error correction algorithm includes a Reed Solomon algorithm, wherein the error correction decoder 640 includes the Reed Solomon decoder 1 of
The semiconductor device 2 may be implemented in various embodiments such as a semiconductor memory device, a network device, and the like.
The semiconductor device 2 prevents a bottleneck in the decoding process because the error correction decoder 640 may perform the decoding operation at higher speed.
Since the semiconductor device 2 performs the error correction encoding and decoding functions, there is no need for a separate encoding and decoding device outside of the semiconductor device 2. Thereby an area of the system including the semiconductor device 2 and the manufacturing cost thereof may be reduced.
Although various embodiments have been described for illustrative purposes, it will be apparent to those skilled in the art that various changes and modifications may be made to the described embodiments without departing from the spirit and scope of the disclosure as defined by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2018-0043797 | Apr 2018 | KR | national |
10-2018-0167946 | Dec 2018 | KR | national |
Number | Name | Date | Kind |
---|---|---|---|
5428628 | Hassner | Jun 1995 | A |
7870468 | Vanka | Jan 2011 | B1 |
8365054 | Gasanov et al. | Jan 2013 | B2 |
20010014960 | Ohyama | Aug 2001 | A1 |
20060174181 | Banks | Aug 2006 | A1 |
20090254796 | Hassner | Oct 2009 | A1 |
20170093433 | Ilani | Mar 2017 | A1 |
Number | Date | Country |
---|---|---|
20040045922 | Jun 2004 | KR |
Entry |
---|
M. Zhang, Z. Dai, X. Gao, Y. Huang and F. Gu, “The Design and Implementation of Reed-Solomon Decoder in Broadband Wireless Access,” 2008 4th International Conference on Wireless Communications, Networking and Mobile Computing, Dalian, 2008, pp. 1-4 , doi: 10.1109/WiCom.2008.343. (Year: 2008). |
Chang, et al., “A Reed-Solomon Product-Code (RS-PC) Decoder for DVD Applications,” IEEE International Solid-State Circuits Conference, 1998, vol. 36 (2), pp. 390-391, 470. |
DailyTech—Coalition of 20+ Tech Firms Backs MRAM as Potential DRAM, NAND Replacement, 2018, http://www.dailytech.com/Coalition+of+20+Tech+Firms+Backs+MRAM+as+Potential+DRAM+NAND+Replacement/article33826.htm, pp. 14. |
Compact Disc Interactive Full Functional Specification, Philips Consumer Electroncis B.V., 1994, Chapter 11, pp. 28-37. |
Kim, et al., “Low-latency Reed-Solomon Encoder and Decoder Using Pipelining and Parallel Technique,” Department of Electrical Engineering, Published as a poster paper in the 24th Korean Conference on Semiconductor (“WP1-201.pdf” included in a compressed file located at http://kcs.cosar.or.kr/data_download/KCS_2017.zip ). |
Moon, “Error Correction Coding: Mathematical Methods and Algorithms,” John Wiley & Sons, Inc., 2005, pp. 253-265. |
Nair, et al., “Citadel: Efficiently Protecting Stacked Memory from Large Granularity Failures,” 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014, pp. 51-62. |
Parhi, “VLSI Digital Signal Processing Systems: Design and Implementation,” John Wiley & Sons, Inc., 1999, pp. 119-124. |
Sarwate, et al., “High-speed Architectures for Reed-Solomon Decoders,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2001, vol. 9 (5), pp. 641-655. |
Yang, et al., “Product Code Schemes for Error Correction in MLC NAND Flash Memories,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2012, vol. 20 (12), pp. 2302-2314. |
Number | Date | Country | |
---|---|---|---|
20190319643 A1 | Oct 2019 | US |