Various data processing systems have been developed including storage systems, cellular telephone systems, and radio transmission systems. In such systems data is transferred from a sender to a receiver via some medium. For example, in a storage system, data is sent from a sender (i.e., a write function) to a receiver (i.e., a read function) via a storage medium. As information is stored and transmitted in the form of digital data, errors are introduced that, if not corrected, can corrupt the data and render the information unusable. The effectiveness of any transfer is impacted by any losses in data caused by various factors. Many types of error checking systems have been developed to detect and correct errors in digital data. For example, in perhaps the simplest system, a parity bit can be added to a group of data bits, ensuring that the group of data bits (including the parity bit) has either an even or odd number of ones. When using odd parity, as the data is prepared for storage or transmission, the number of data bits in the group that are set to one are counted, and if there is an even number of ones in the group, the parity bit is set to one to ensure that the group has an odd number of ones. If there is an odd number of ones in the group, the parity bit is set to zero to ensure that the group has an odd number of ones. After the data is retrieved from storage or received from transmission, the parity can again be checked, and if the group has an even parity, at least one error has been introduced in the data. At this simplistic level, some errors can be detected but not corrected.
The parity bit may also be used in error correction systems, including in Low Density Parity Check (LDPC) decoders. An LDPC code is a parity-based code that can be visually represented in a Tanner graph 100 as illustrated in
The connections between variable nodes 110-124 and check nodes 102-108 may be presented in matrix form as follows, where columns represent variable nodes, rows represent check nodes, and a non-zero element a(i,j) from the Galois Field at the intersection of a variable node column and a check node row indicates a connection between that variable node and check node and provides a permutation for messages between that variable node and check node:
By providing multiple check nodes 102-108 for the group of variable nodes 110-124, redundancy in error checking is provided, enabling errors to be corrected as well as detected. Each check node 102-108 performs a parity check on bits or symbols passed as messages from its neighboring (or connected) variable nodes. In the example LDPC code corresponding to the Tanner graph 100 of
A message from a variable node to any particular neighboring check node is computed using any of a number of algorithms based on the current value of the variable node and the last messages to the variable node from neighboring check nodes, except that the last message from that particular check node is omitted from the calculation to prevent positive feedback. Similarly, a message from a check node to any particular neighboring variable node is computed based on the current value of the check node and the last messages to the check node from neighboring variable nodes, except that the last message from that particular variable node is omitted from the calculation to prevent positive feedback. As local decoding iterations are performed in the system, messages pass back and forth between variable nodes 110-124 and check nodes 102-108, with the values in the nodes 102-124 being adjusted based on the messages that are passed, until the values converge and stop changing or until processing is halted.
Delays may be incurred when check nodes are waiting for messages from variable nodes and when variable nodes are waiting for messages from check nodes, which may be magnified by hardware pipeline bottlenecks. Such delays are multiplied in a decoding operation by the number of local iterations performed. A need therefore remains in the art for data decoders which reduce such delays.
The present inventions are related to systems and methods for decoding data, and more particularly to systems and methods for decoding data in a multi-section non-binary LDPC decoder. In some embodiments, an LDPC decoder is disclosed that includes a variable node processor operable to perform variable node updates based at least in part on check node to variable node messages and to generate variable node to check node messages, and a check node processor operable to process the variable node to check node messages in groups across each of a plurality of sections of an H matrix and to generate the check node to variable node messages. By dividing the H matrix associated with the LDPC decoder into sections, check node to variable node messages may be generated multiple times in a single local decoding iteration. In particular, in some embodiments, the H matrix rows are divided, and as the H matrix is processed column by column, check node calculations are performed at the divisions of the H matrix, for example at the start and middle of the H matrix.
The multi-section non-binary LDPC decoder may implement any LDPC decoding algorithm. In some embodiments, the multi-section non-binary LDPC decoder implements a min-sum based decoding algorithm, and in particular, an algorithm in which the lowest LLR values and the next lowest LLR values are identified in variable node to check node messages for each section of the H matrix, and check node to variable node messages are generated based on these values.
This summary provides only a general outline of some embodiments according to the present invention. Many other objects, features, advantages and other embodiments of the present invention will become more fully apparent from the following detailed description, the appended claims and the accompanying drawings.
A further understanding of the various embodiments of the present inventions may be realized by reference to the figures which are described in remaining portions of the specification. In the figures, like reference numerals may be used throughout several drawings to refer to similar components. In the figures, like reference numerals are used throughout several figures to refer to similar components. In some instances, a sub-label consisting of a lower case letter is associated with a reference numeral to denote one of multiple similar components. When reference is made to a reference numeral without specification to an existing sub-label, it is intended to refer to all such multiple similar components.
The present inventions are related to systems and methods for decoding data, and more particularly to systems and methods for decoding data in a multi-section non-binary LDPC decoder. The H matrix associated with the LDPC decoder is divided into sections enabling check node updates to be performed multiple times in a single local decoding iteration. In particular, in some embodiments, the H matrix rows are divided, and as the H matrix is processed column by column, check node calculations are performed at the divisions of the H matrix, for example at the start and middle of the H matrix. The H matrix may be divided at any location, for example in the middle of the rows or any other suitable location, and may be divided once (providing two check node updates per local iteration) or more (providing three or more check node updates per local iteration).
The multi-section non-binary LDPC decoder may implement any LDPC decoding algorithm. In some embodiments, the multi-section non-binary LDPC decoder implements a min-sum based decoding algorithm, and in particular, an algorithm in which the lowest LLR values and the next lowest LLR values are identified in messages from variable nodes to check nodes (referred to herein as V2C messages), and messages from check nodes to variable nodes (referred to herein as C2V messages) are derived from either the lowest LLR or the next lowest LLR values. The selection of the lowest LLR or the next lowest LLR value (referred to herein as the min1 and min2 values) is performed to include only extrinsic inputs, excluding prior round V2C messages from the neighboring variable node for which the C2V message is being prepared, in order to avoid positive feedback. In these embodiments, the lowest and next lowest LLR values are calculated for a portion of the H matrix, check node updates are performed based on these values, and C2V messages are performed based on the recent check node updates for that portion of the H matrix and on previously performed check node updates for other portions of the H matrix. The LDPC decoder then moves on to the next portion of the H matrix, repeating the operation for the next portion of the H matrix until all portions of the H matrix have been processed and C2V messages for the H matrix have been generated. As each group of C2V message are generated, variable node updates may be performed. Thus, check node and variable node updates may be performed multiple times per local decoding iteration, increasing message propagation during each local iteration and improving convergence speed.
Turning to
Analog to digital converter circuit 210 converts processed analog signal 206 into a corresponding series of digital samples 212. Analog to digital converter circuit 210 may be any circuit known in the art that is capable of producing digital samples corresponding to an analog input signal. Based upon the disclosure provided herein, one of ordinary skill in the art will recognize a variety of analog to digital converter circuits that may be used in relation to different embodiments of the present invention. Digital samples 212 are provided to an equalizer circuit 214. Equalizer circuit 214 applies an equalization algorithm to digital samples 212 to yield an equalized output 216. In some embodiments of the present invention, equalizer circuit 214 is a digital finite impulse response filter circuit as are known in the art. In some cases, equalizer 214 includes sufficient memory to maintain one or more codewords until a data detector circuit 220 is available for processing. It may be possible that equalized output 216 may be received directly from a storage device in, for example, a solid state storage system. In such cases, analog front end circuit 202, analog to digital converter circuit 210 and equalizer circuit 214 may be eliminated where the data is received as a digital data input.
Data detector circuit 220 is operable to apply a data detection algorithm to a received codeword or data set, and in some cases data detector circuit 220 can process two or more codewords in parallel. In some embodiments of the present invention, data detector circuit 220 is a Viterbi algorithm data detector circuit as is known in the art. In other embodiments of the present invention, data detector circuit 220 is a maximum a posteriori data detector circuit as is known in the art. Of note, the general phrases “Viterbi data detection algorithm” or “Viterbi algorithm data detector circuit” are used in their broadest sense to mean any Viterbi detection algorithm or Viterbi algorithm detector circuit or variations thereof including, but not limited to, bi-direction Viterbi detection algorithm or bi-direction Viterbi algorithm detector circuit. Also, the general phrases “maximum a posteriori data detection algorithm” or “maximum a posteriori data detector circuit” are used in their broadest sense to mean any maximum a posteriori detection algorithm or detector circuit or variations thereof including, but not limited to, simplified maximum a posteriori data detection algorithm and a max-log maximum a posteriori data detection algorithm, or corresponding detector circuits. Based upon the disclosure provided herein, one of ordinary skill in the art will recognize a variety of data detector circuits that may be used in relation to different embodiments of the present invention. Data detector circuit 220 is started based upon availability of a data set from equalizer circuit 214 or from a central memory circuit 230.
Upon completion, data detector circuit 220 provides detector output 222, which includes soft data. As used herein, the phrase “soft data” is used in its broadest sense to mean reliability data with each instance of the reliability data indicating a likelihood that a corresponding bit position or group of bit positions has been correctly detected. In some embodiments of the present invention, the soft data or reliability data is log likelihood ratio data as is known in the art. Detector output 222 is provided to a local interleaver circuit 224. Local interleaver circuit 224 is operable to shuffle sub-portions (i.e., local chunks) of the data set included as detected output 222 and provides an interleaved codeword 226 that is stored to central memory circuit 230. Interleaver circuit 224 may be any circuit known in the art that is capable of shuffling data sets to yield a re-arranged data set. Interleaved codeword 226 is stored to central memory circuit 230.
Once multi-section non-binary LDPC decoder 240 is available, a previously stored interleaved codeword 226 is accessed from central memory circuit 230 as a stored codeword 232 and globally interleaved by a global interleaver/deinterleaver circuit 234. Global interleaver/deinterleaver circuit 234 may be any circuit known in the art that is capable of globally rearranging codewords. Global interleaver/deinterleaver circuit 234 provides a decoder input 236 into multi-section non-binary LDPC decoder 240. In some embodiments of the present invention, the data decode algorithm is a non-binary non-layer min-sum based low density parity check algorithm. Based upon the disclosure provided herein, one of ordinary skill in the art will recognize other decode algorithms that may be used in relation to different embodiments of the present invention. The multi-section non-binary LDPC decoder 240 may be implemented similar to that described below in relation to
Where the multi-section non-binary LDPC decoder 240 fails to converge (i.e., fails to yield the originally written data set) and a number of local iterations through multi-section non-binary LDPC decoder 240 exceeds a threshold, the resulting decoded output is provided as a decoded output 242 back to central memory circuit 230 where it is stored awaiting another global iteration through data detector circuit 220 and multi-section non-binary LDPC decoder 240. Prior to storage of decoded output 242 to central memory circuit 230, decoded output 242 is globally deinterleaved to yield a globally deinterleaved output 244 that is stored to central memory circuit 230. The global deinterleaving reverses the global interleaving earlier applied to stored codeword 232 to yield decoder input 236. Once data detector circuit 220 is available, a previously stored deinterleaved output 244 is accessed from central memory circuit 230 and locally deinterleaved by a deinterleaver circuit 246. Deinterleaver circuit 246 rearranges decoder output 250 to reverse the shuffling originally performed by interleaver circuit 224. A resulting deinterleaved output 252 is provided to data detector circuit 220 where it is used to guide subsequent detection of a corresponding data set received as equalized output 216.
Alternatively, where the decoded output converges (i.e., yields the originally written data set) in the multi-section non-binary LDPC decoder 240, the resulting decoded output is provided as an output codeword 254 to a deinterleaver circuit 256. Deinterleaver circuit 256 rearranges the data to reverse both the global and local interleaving applied to the data to yield a deinterleaved output 260. Deinterleaved output 260 is provided to a hard decision output circuit 262. Hard decision output circuit 262 is operable to yield a hard decision output 264, for example based on the argmina of the total LLR values.
In a conventional non-binary min-sum LDPC decoder with GF(q) and with p check node rows in the parity check matrix, check node processing involves both forward and backward recursions that incur long latency since they require about q2 additions and comparisons in each of p−2 basic steps. To perform both forward and backward recursions, numerous intermediate messages are stored, requiring a large memory, and messages are sorted when combining the results of forward and backward recursions. In contrast, the min-sum based decoding of non-binary LDPC codes used in some embodiments of the multi-section non-binary LDPC decoder 240 provides low-complexity decoding that does not require forward and backward recursions, sorting or dynamic programming. By including message normalization and modification of the search space, searching over various local configurations is reduced to the simple recursive processing of a single message vector.
Check nodes in a min-sum based non-binary LDPC decoder receive incoming messages from connected or neighboring variable nodes and generate outgoing messages to each neighboring variable node to implement the parity check matrix for the LDPC code, an example of which is graphically illustrated in the Tanner graph of
Both V2C and C2V messages are vectors, each including a number of sub-messages with LLR values. Each V2C message vector from a particular variable node will contain sub-messages corresponding to each symbol in the Galois Field, with each sub-message giving the likelihood that the variable node contains that particular symbol. For example, given a Galois Field GF(q) with q elements, V2C and C2V messages will include at least q sub-messages representing the likelihood for each symbol in the field. Message normalization in the simplified min-sum decoding is performed with respect to the most likely symbol. Thus, the V2C and C2V vector format includes two parts, an identification of the most likely symbol and the LLR for the other q−1 symbols, since the most likely symbol has LLR equal to 0 after normalization.
Generally, the C2V vector message from a check node to a variable node contains the probabilities for each symbol d in the Galois Field that the destination variable node contains that symbol d, based on the prior round V2C messages from neighboring variable nodes other than the destination variable node. The inputs from neighboring variable nodes used in a check node to generate the C2V message for a particular neighboring variable node are referred to as extrinsic inputs and include the prior round V2C messages from all neighboring variable nodes except the particular neighboring variable node for which the C2V message is being prepared, in order to avoid positive feedback. The check node thus prepares a different C2V message for each neighboring variable node, using the different set of extrinsic inputs for each message based on the destination variable node.
In the simplified min-sum based decoding applied in some embodiments of the multi-section non-binary LDPC decoder 240 disclosed herein, the check nodes calculate the lowest sub-message min1(d), the index idx(d) of min1(d), and the next lowest sub-message min2(d), or minimum of all sub-messages excluding min1(d), for each nonzero symbol din the Galois Field based on all extrinsic V2C messages from neighboring variable nodes. In other words, the sub-messages for a particular symbol d are gathered from messages from all extrinsic inputs, and the min1(d), idx(d) and min2(d) is calculated based on the gathered sub-messages for that symbol d. For a Galois Field with q symbols, the check node will calculate the min1(d), idx(d) and min2(d) sub-message for each of the q−1 non-zero symbols in the field except the most likely symbol. The min1(d), idx(d) and min2(d) values are stored in a memory for use in calculating the C2V message, requiring much less memory than the traditional non-binary LDPC check node processor that stores each intermediate forward and backward message.
When applied to multi-section LDPC decoding, the min1(d), idx(d) and min2(d) calculations are performed for each portion of the H matrix, for example across the left side and right side of an H matrix divided into two sections. This is illustrated in
Turning to
Turning to
Turning to
Turning to
Turning to
The multi-section non-binary LDPC decoder 400 is provided with LLR values from an input channel 402, which may be stored in an LLR memory 404. Stored values 406 are provided to an adder/subtractor array 410, also referred to as a variable node processor or variable node unit (VNU) or as a portion of a VNU. The adder/subtractor array 406 updates the perceived value of symbols based on the value from input channel 402 and on C2V message vectors 412. The adder/subtractor array 410 yields an external LLR output 414 to a check sum calculation circuit 416, which generates a parity check output 420. For example, check sum calculation circuit 416 may include multiplexers and XOR circuits to calculate parity check equation ν·HT=0 over GF(q), where νεGF(q)N, and where ν is a codeword vector and HT is the transform of the H matrix for the LDPC decoder. The adder/subtractor array 410 also yields an external LLR output 422 to a normalization/saturation circuit 424, which generates a hard decision output 426.
The adder/subtractor array 410 performs an update function, adding C2V message vectors 412 to symbol values, and generates V2C message vectors 430 setting forth the updated likelihood or LLR value for each element in the Galois Field for each symbol in the data set. The V2C message vectors 430 are provided to a normalization/scaling/saturation circuit 432 which scales the LLR values and converts them to normalized V2C message vectors 434. The normalized V2C message vectors 434 contain a hard decision (an indication of the most likely GF element), and LLR values for the remaining GF elements for each symbol, each normalized to the hard decision. For example, in a GF(4) LDPC decoder, the normalization/scaling/saturation circuit 432 takes the four LLR data values for each symbol, identifies the highest LLR data value of the four values, and normalizes the four LLR data values to the value of the highest LLR data value. An example of this is shown using the following example symbol:
In this example, the normalization/scaling/saturation circuit 432 selects the LLR data value ‘22’ corresponding to the hard decision ‘10’. Next, the LLR data values corresponding to hard decision values ‘00’, ‘01’, ‘10’ and ‘11’ are normalized to LLR data value ‘22’ by subtracting ‘22’ from each of the LLR data values to yield the following normalized symbol:
The LLR values may also be scaled in normalization/scaling/saturation circuit 432, multiplying each of the normalized LLR data values by a scaling factor. The scaling factor may be user programmable. As an example, with a scaling factor of 0.5, the normalized V2C message vectors 434 might include the following scaled symbol based on the current example:
The V2C message vectors 434 are provided to a rearranger 436 which shuffles messages on the boundaries at message edges, randomizing noise and breaking dependencies between messages, and yielding rearranged V2C message vectors 440 and 442. The rearranged V2C message vectors 440 and 442 are provided to barrel shifters 444 and 446, respectively, which shift the symbol values in the rearranged V2C message vectors 440 and 442 to generate the next circulant sub-matrix, yielding shifted LLR values 450 and 452. In some embodiments, the code structure of the codeword provided at input channel 402 has a code structure matrix of the following form:
where each of PI,J are p×p circulants with weight 1, or permutations of the identity matrix, and the circulant size L is the row weight. The following is an example of a p×p circulant representative of PI,J:
The barrel shifters 444 and 446 are operable to shift the currently received circulant to an identity matrix. Such an identity matrix may be as follows:
Barrel shifter 444 provides shifted output 450, which contains the magnitude and sign of the hard decision HD. Barrel shifter 446 provides shifted output 452, which contains the magnitudes of the remaining LLR values, normalized to the hard decision HD. The shifted output 450 is provided to a parity/HD computation circuit 454 which calculates the accumulative sign for the hard decisions in shifted output 450, storing the resulting sign values 456 for each non-zero element of the portion of the H matrix being processed, and the hard decisions, in a parity/HD memory 460.
The shifted output 452 is provided to an LLR comparison circuit 462, which calculates the first minimum LLR value or sub-message min1(d), (i.e., the lowest LLR value), the index idx(d) of min1(d) (i.e., the location in the row corresponding to the first minimum LLR data value), and the second minimum LLR value or sub-message min2(d), (i.e., the second lowest LLR value) or minimum of all sub-messages excluding min1(d), for each nonzero symbol din the Galois Field based on all extrinsic V2C messages in the portion of the H matrix being processed. In other words, the sub-messages for a particular symbol d are gathered from messages from all extrinsic inputs for the portion of the H matrix being processed, and the min1(d), idx(d) and min2(d) is calculated based on the gathered sub-messages for that symbol d. For a Galois Field with q symbols, the check node will calculate the min1(d), idx(d) and min2(d) sub-message for each of the q−1 non-zero symbols in the field except the most likely symbol, the hard decision HD.
Again, columns in the H matrix represent variable nodes, rows represent check nodes, and non-zero values in the H matrix indicate a connection between the column and row at the non-zero intersection. In general, the multi-section min-sum based decoding algorithm identifies the lowest extrinsic input value to a check node from each connected variable node in the portion or section of the H matrix being processed, for each non-zero element of the Galois Field except the most likely symbol or HD, by finding the lowest and next lowest LLR value for each non-zero Galois Field element other than the HD among the connected variable nodes (or non-zero row values) in the portion of the H matrix being processed.
Because the H matrix is divided into multiple portions or sections, for example a left side and a right side, the min1(d), idx(d) and min2(d) values are alternately calculated for each side of the H matrix so that check node updates can be performed twice per iteration, once early in processing of the left side based on newly identified min1—r(d), idx_r(d) and min2—r(d) values for the right side and on previously identified min1—l(d), idx_l(d) and min2—l(d) values for the left side, and again early in processing of the right side based on newly identified min1—l(d), idx_l(d) and min2—l(d) values for the left side and on previously identified min1—r(d), idx_r(d) and min2—r(d) values for the right side.
Identification of the lowest and next lowest LLR value is performed in the LLR comparison circuit 462, with the results 464 (in a two-section LDPC decoder) divided into left side results 466 and right side results 470, for example by switch 472. The left side results 466 (or min1—l(d), idx_l(d) and min2—l(d) values) are stored in a left register array 474. The right side results 470 (or min1—r(d), idx_r(d) and min2—r(d) values) are stored in a right register array 476. The register arrays 474 and 476 for a two-section LDPC decoder store left and right sets of min1(d), idx(d) and min2(d) values for each non-zero GF element other than the HD at each check node or row of the H matrix. In some embodiments of a GF(4) decoder, there is a set of three min1(d), idx(d) and min2(d) registers for the left side of the H matrix and three min1(d), idx(d) and min2(d) registers for the right side of the H matrix, for each check node or row of the H matrix. With a code structure matrix having three rows, the left register array 474 and right register array 476 would each store three sets of first minimum LLR data value, second minimum LLR data value, index value as shown in the example below:
Before starting the LLR compare process in the LLR comparison circuit 462 for each side of the H matrix, the left register array 474 or right register array 476 is reset to an initial value, for example zero. As the first non-zero LLR values are received when processing each column in that side of the H matrix, they overwrite the initial zero values. As processing of each column in that side of the H matrix continues, if the LLR value for a non-zero GF element is lower than the value in the min1(d) register, the min1(d) register is updated with the LLR value for the non-zero GF element, the previous value in the min1(d) register is copied into the min2(d) register as the next lowest value, and the idx(d) register is updated with the index of the current working column. If the LLR value for the non-zero GF element was greater than the value in the min1(d) register but lower than the value in the min2(d) register, the min2(d) register is updated with the LLR value for the GF element. As each column is processed, this LLR comparison is performed for the sets of min1(d), idx(d) and min2(d) registers for each check node or row in the column.
At the end of processing a section of the H matrix in the LLR comparison circuit 462, a select network 478 performs a check node update based on the stored left side results 480 (or min1—l(d), idx_l(d) and min2—l(d) values) stored in the left register array 474 and on the stored right side results 482 (or min1—r(d), idx_r(d) and min2—r(d) values) stored in the right register array 476. The check node updates are based on newly identified values for one side (e.g., 474) and previously identified values for the other side (e.g., 476), with the side having the newly identified values alternating between the left side and the right side as the H matrix is processed, as disclosed above with respect to
The LLR comparison circuit 462, register arrays 474 and 476, and select network 478 may be collectively referred to as a check node processor or check node unit (CNU). The simplified min-sum based CNU disclosed herein and which may be used in some embodiments of a multi-section non-binary LDPC decoder is also referred to as a compression circuit. The select network 478 selects as output 484 either the min1(d) or min2(d) to be used in the C2V message 412 such that only extrinsic values are selected. If the current column index is equal to the index of the minimum value, meaning that the C2V message is being prepared for a variable node that provided the min1(d) value, then the value to be used in the C2V message 412 is the second minimum value min2(d). Otherwise, the value to be used in the C2V message 412 is the first minimum value min1(d). The select network 478 also considers the min1(d) or min2(d) values from both sides of the H matrix, selecting the lowest LLR value from them both. When processing the left side of the H matrix, this may be accomplished according to the equation:
sel[d]=min(((circ_idx==idx—l[d])?min2—l[d]:min1—l[d]),min1—r[d]) (Eq 1)
where d is the GF element index, where circ_idx is the index of the working column, that is, the index of the variable node for which the C2V message is being generated, and where idx 1 is the column index of the min1—l(d) value. For a GF(4) decoder, d=0, 1, 2. The outer min statement selects the extrinsic minimum LLR value from either the left or the right side. Because in this instance the left side of the H matrix is being processed, the index circ_idx of the working column is in the left side, and min1—r(d) cannot have come from the variable node at the working column and is therefore from an extrinsic input. In contrast, the min1—l(d) may have come from the variable node at the working column, so the idx_l(d) is compared with the working column index circ_idx. If they are equal, then the min1—l(d) is not an extrinsic input and the min2—l(d) value is used rather than min1—l(d). Notably, the equation may be adapted to select from among more than two portions if the H matrix is further divided.
When processing the right side of the H matrix, the select network 478 selects the lowest LLR value from both sides of the H matrix according to the equation:
sel[d]=min(min1—l[d],((circ_idx==idx—r[d])?min2—r[d]:min1—r[d])) (Eq 2)
where idx_r is the column index of the min1—r(d) value for the right side. Again, the outer min statement selects the extrinsic minimum LLR value from either the left or the right side. Because in this instance the right side of the H matrix is being processed, the index circ_idx of the working column is in the right side, and min1—l(d) cannot have come from the variable node at the working column and is therefore from an extrinsic input. In contrast, the min1—r(d) may have come from the variable node at the working column, so the idx_r(d) is compared with the working column index circ_idx. If they are equal, then the min1—r(d) is not an extrinsic input and the min2—r(d) value is used rather than min1—r(d).
The R values or LLR values for each element of the Galois Field making up a portion of the C2V message vectors 412 are calculated based on the sel[d] in the select network 478, for example according to equations 3-5 in a GF(4) LDPC decoder:
R[0]=min(sel[0],sel[1]+sel[2]) (Eq 3)
R[1]=min(sel[1],sel[0]+sel[2]) (Eq 4)
R[2]=min(sel[2],sel[0]+sel[1]) (Eq 5)
For an LDPC decoder with more Galois Field elements, there would be additional equations for the extra R terms.
The hard decision HD and sign to be used in the C2V message 412 is provided at the output 486 of parity/HD memory 460, with the sign or parity calculated as the XOR of the cumulative sign and the current sign of the symbol. The R HD value to be used as the hard decision value in the C2V message vectors 412 may be calculated according to equation 6:
R
HD=sgn—l XOR sgn—r XOR Q_HD (Eq 6)
where sgn_l and sgn_r are sign or parity values for each portion of the H matrix, and Q_HD is the previous hard decision in the V2C message vectors 440, combined in XOR operations. Again, if the H matrix were divided into more than two portions, equation 6 would have additional sign terms. The hardware used to implement equations 1-6 may be shared and used for each portion of the H matrix, or may be duplicated for each portion of the H matrix.
The output 486 of parity/HD memory 460 and the output 484 of select network 478 are provided to barrel shifters 488 and 490, respectively, which shift the hard decisions and their signs in output 486 and the C2V message values in output 484 to yield shifted hard decisions and signs 492 and shifted C2V message values 494, respectively, shifting between circulant sub-matrices. The shifted C2V message values 494 and shifted hard decisions and signs 492 are combined and processed in an inverse rearranger 496 which combines the inputs 492 and 494 and which reverses the effect of rearranger 436 to yield C2V message vectors 412. The combining portion of inverse rearranger 496 is also referred to herein as a data decompression circuit, and reassembles rows to yield an approximation of the original data.
Turning to
Turning to
The selection is performed in a group of three selector circuits 602, 604 and 606, controlled by index inputs 610, 612 and 614 cycling through the variable nodes or columns in the H matrix. Each of the three selector circuits 602, 604 and 606 may be a selector circuit 700 as illustrated in
Output=(index1==index2)?B:A (Eq 7)
where the output 712 is equal to the second input 704 or input B if the indexes 706 and 710 are equal, otherwise output 712 is equal to the first input 702 or input A.
The first selector circuit 602 has an idx_l[0] input 616, index input 610, min1—l(0) input 620 and min2—l(0) input 622, yielding output 624 according to Equation 7. The second selector circuit 604 has an idx_l[1] input 626, index input 612, min1—l(1) input 630 and min2—l(1) input 632, yielding output 634. The third selector circuit 606 has an idx_l[2] input 636, index input 614, min1—l(2) input 640 and min2—l(2) input 642, yielding output 644. Outputs 624, 634 and 644 thus provide the lowest LLR input from the left side of the H matrix, unless the index inputs (e.g., 616 and 610) are equal, indicating that the lowest LLR value is from the same variable node for which the C2V message is being generated, in which case the next lowest LLR input from another variable node is provided. Thus, only extrinsic inputs are used in the generation of a C2V message, avoiding V2C messages from current variable nodes.
Output 624 is provided to a minimum selector circuit 646, along with the min1—r(0) value 650, which selects the lower of the value at output 624 and the min1—r(0) value 650, yielding output 652 (or sel[0] according to Equation 1). Output 634 is provided to a minimum selector circuit 654, along with the min1—r(1) value 656, which selects the lower of the value at output 634 and the min1—r(1) value 656, yielding output 660 (or sel[1] according to Equation 1). Output 644 is provided to a minimum selector circuit 662, along with the min1—r(2) value 664, which selects the lower of the value at output 644 and the min1—r(2) value 664, yielding output 668 (or sel[2] according to Equation 1).
The R values 670, 672 and 674 are generated by the select network 600 according to equations 3-5 based on the sel[k] values 652, 660 and 668. The sel[1] value 660 and sel[2] value 668 are added in adder 676 to yield sum output 680. A minimum selector circuit 682 yields as R[0] output 670 the lesser of sel[0] value 652 and sum output 680. The sel[0] value 652 and sel[2] value 668 are added in adder 684 to yield sum output 686. A minimum selector circuit 688 yields as R[1] output 672 the lesser of sel[1] value 660 and sum output 686. The sel[0] value 652 and sel[1] value 660 are added in adder 690 to yield sum output 692. A minimum selector circuit 694 yields as R[2] output 674 the lesser of sel[2] value 668 and sum output 692.
In some embodiments, the C2V message vectors 412 are formed with the output 484 of the select network 478 (which may include R values, e.g., 670, 672 and 674) as LLR values and with the XOR of the parity symbol and the hard decision in the output 486 of the parity/HD memory 460 as the new hard decision.
When sharing a single select network 600 when processing the left and right sides of an H matrix, the left and right inputs to the select network 600 can be swapped. This may be accomplished in some embodiments using multiplexers to supply the appropriate inputs with left side values or right side values.
Turning to
The selection is performed in a group of three selector circuits 802, 804 and 806, controlled by index inputs 610, 612 and 614 cycling through the variable nodes or columns in the H matrix. Each of the three selector circuits 802, 804 and 806 may be a multi-selection selector circuit 900 as illustrated in
Turning back to
The R values 870, 872 and 874 are generated by the select network 800 according to equations 3-5 based on the sel[k] values 826, 846 and 866. The sel[1] value 846 and sel[2] value 866 are added in adder 876 to yield sum output 880. A minimum selector circuit 882 yields as R[0] output 870 the lesser of sel[0] value 826 and sum output 880. The sel[0] value 826 and sel[2] value 866 are added in adder 884 to yield sum output 886. A minimum selector circuit 888 yields as R[1] output 872 the lesser of sel[1] value 846 and sum output 886. The sel[0] value 826 and sel[1] value 846 are added in adder 890 to yield sum output 892. A minimum selector circuit 894 yields as R[2] output 874 the lesser of sel[2] value 866 and sum output 892.
Turning now to
Check node processing functions, shown in blocks 1020-1026, may be performed partly or fully in parallel with the variable node processing functions of blocks 1004 and 1012, or can be done at different times. In the check node processing, V2C messages are processed for a first section of an H matrix. (Block 1020) In the example min-sum based LDPC decoder, this includes identifying the lowest and next lowest LLR values for the first section of an H matrix. C2V messages are generated. (Block 1022) In some embodiments, there is an update delay period between the time at which block 1020 completes and block 1022 is started. V2C messages are processed for a second section of the H matrix. (Block 1024) C2V messages are generated. (Block 1024) Again, there may be an update period between the completion of block 1024 and the start of block 1026. The C2V messages generated in blocks 1022 and 1026 are based on some embodiments on the results of both blocks 1020 and 1024, with the results from blocks 1020 and 1024 alternatingly having the most recently updated values.
In some embodiments, the C2V messages are generated in blocks 1022 and 1026 according to Equations 1-6. The index of the current working column in the H matrix is compared with the column index of the lowest LLR value for the section of the H matrix being processed to yield an indication of whether the lowest LLR value is from the variable node at the current working column. If not, the lowest LLR value is selected as the lowest extrinsic LLR value for that section of the H matrix. If so, the next lowest LLR value is selected as the lowest extrinsic LLR value for that section of the H matrix. The lesser of the lowest extrinsic LLR value for that section of the H matrix and of the lowest LLR value for a reminder of the H matrix is selected as the lowest extrinsic LLR value for the H matrix. The LLR values in the C2V messages are calculated based on the extrinsic LLR value for the H matrix. The hard decision in the C2V messages are based on combining the parity symbol for each section of the H matrix with the hard decision in the previous V2C message in an XOR operation.
The multi-section non-binary LDPC decoder disclosed herein enhances decoding performance, particularly when decoding iterations are limited. Bit error performance by performing check node updates during local decoding iterations.
Low Density Parity Check (LDPC) technology is applicable to transmission of information over virtually any channel or storage of information on virtually any media. Transmission applications include, but are not limited to, optical fiber, radio frequency channels, wired or wireless local area networks, digital subscriber line technologies, wireless cellular, Ethernet over any medium such as copper or optical fiber, cable channels such as cable television, and Earth-satellite communications. Storage applications include, but are not limited to, hard disk drives, compact disks, digital video disks, magnetic tapes and memory devices such as DRAM, NAND flash, NOR flash, other non-volatile memories and solid state drives.
Although the multi-section non-binary LDPC decoder disclosed herein is not limited to any particular application, several examples of applications are presented in
In a typical read operation, read/write head assembly 1120 is accurately positioned by motor controller 1112 over a desired data track on disk platter 1116. Motor controller 1112 both positions read/write head assembly 1120 in relation to disk platter 1116 and drives spindle motor 1114 by moving read/write head assembly 1120 to the proper data track on disk platter 1116 under the direction of hard disk controller 1110. Spindle motor 1114 spins disk platter 1116 at a determined spin rate (RPMs). Once read/write head assembly 1120 is positioned adjacent the proper data track, magnetic signals representing data on disk platter 1116 are sensed by read/write head assembly 1120 as disk platter 1116 is rotated by spindle motor 1114. The sensed magnetic signals are provided as a continuous, minute analog signal representative of the magnetic data on disk platter 1116. This minute analog signal is transferred from read/write head assembly 1120 to read channel circuit 1102 via preamplifier 1104. Preamplifier 1104 is operable to amplify the minute analog signals accessed from disk platter 1116. In turn, read channel circuit 1102 decodes and digitizes the received analog signal to recreate the information originally written to disk platter 1116. This data is provided as read data 1122 to a receiving circuit. As part of decoding the received information, read channel circuit 1102 processes the received signal using a multi-section non-binary LDPC decoder. Such a multi-section non-binary LDPC decoder may be implemented consistent with that disclosed above in relation to
It should be noted that storage system 1100 may be integrated into a larger storage system such as, for example, a RAID (redundant array of inexpensive disks or redundant array of independent disks) based storage system. Such a RAID storage system increases stability and reliability through redundancy, combining multiple disks as a logical unit. Data may be spread across a number of disks included in the RAID storage system according to a variety of algorithms and accessed by an operating system as if it were a single disk. For example, data may be mirrored to multiple disks in the RAID storage system, or may be sliced and distributed across multiple disks in a number of techniques. If a small number of disks in the RAID storage system fail or become unavailable, error correction techniques may be used to recreate the missing data based on the remaining portions of the data from the other disks in the RAID storage system. The disks in the RAID storage system may be, but are not limited to, individual storage systems such storage system 700, and may be located in close proximity to each other or distributed more widely for increased security. In a write operation, write data is provided to a controller, which stores the write data across the disks, for example by mirroring or by striping the write data. In a read operation, the controller retrieves the data from the disks. The controller then yields the resulting read data as if the RAID storage system were a single disk.
Turning to
It should be noted that the various blocks discussed in the above application may be implemented in integrated circuits along with other functionality. Such integrated circuits may include all of the functions of a given block, system or circuit, or a portion of the functions of the block, system or circuit. Further, elements of the blocks, systems or circuits may be implemented across multiple integrated circuits. Such integrated circuits may be any type of integrated circuit known in the art including, but are not limited to, a monolithic integrated circuit, a flip chip integrated circuit, a multichip module integrated circuit, and/or a mixed signal integrated circuit. It should also be noted that various functions of the blocks, systems or circuits discussed herein may be implemented in either software or firmware. In some such cases, the entire system, block or circuit may be implemented using its software or firmware equivalent. In other cases, the one part of a given system, block or circuit may be implemented in software or firmware, while other parts are implemented in hardware.
In conclusion, the present invention provides novel systems, devices, methods and arrangements for decoding data in a multi-section non-binary LDPC decoder. While detailed descriptions of one or more embodiments of the invention have been given above, various alternatives, modifications, and equivalents will be apparent to those skilled in the art without varying from the spirit of the invention. Therefore, the above description should not be taken as limiting the scope of the invention, which is defined by the appended claims.