The present invention is related to techniques for decoding low density parity check (LDPC) codes and, more particularly, to configurable techniques for decoding quasi-cyclic low density parity check (LDPC) codes.
Errors can occur when information is transmitted between a transmitter and a receiver. Error correction codes, such as Low Density Parity Check (LDPC) codes, are often used to detect and correct such errors. LDPC codes are block codes based on a parity check matrix, H. See, for example, R. G. Gallager, “Low-Density Parity-Check Codes,” IRE Trans. Inform. Theory, vol. IT-8, pp. 21-28 (January 1962). LDPC codes are being proposed or suggested for use in a variety of transmission systems, such as satellite communications, wireless transmissions, fiber optics, and a variety of storage media, including hard disk drives, optical disks, and magnetic bands.
A given LDPC code is defined by a parity check matrix, H. A non-zero entry of the parity check matrix defines a parity check used to detect and correct errors in the received codeword. An LDPC parity check matrix is said to be sparse. In other words, there are a small number of non-zero entries in the matrix relative to the size of the matrix. If the parity check matrix, H, has dimension (n-k, n), a codeword is said to be n bits long with k information bits and n-k parity check bits. A parity check matrix for an (n, k) code has n columns and n-k rows.
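By way of illustration only, the parity checks defined by H can be evaluated as a syndrome over GF(2). The following Python sketch is not part of the disclosed decoder; the matrix H, the function names and the test vectors are hypothetical and chosen only to show the (n-k, n) dimensions discussed above. The value of k is computed on the assumption that the rows of H are linearly independent.

```python
def syndrome(H, c):
    """Evaluate every parity check: one bit per row of H, computed modulo 2."""
    return [sum(h * b for h, b in zip(row, c)) % 2 for row in H]


def is_codeword(H, c):
    """A word is a valid codeword when all parity checks evaluate to zero."""
    return not any(syndrome(H, c))


# Hypothetical parity check matrix with n-k = 3 rows and n = 6 columns.
H = [
    [1, 1, 0, 1, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [1, 0, 1, 0, 0, 1],
]
n = len(H[0])       # codeword length
k = n - len(H)      # information bits, assuming independent rows

valid = [1, 0, 1, 1, 1, 0]
corrupted = [0, 0, 1, 1, 1, 0]   # first bit flipped

print(n, k, is_codeword(H, valid), syndrome(H, corrupted))
```

A non-zero syndrome only indicates which checks fail; locating and correcting the erroneous bits is the task of the decoding algorithm.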
Quasi-cyclic LDPC codes combine some of the advantages of random and structured code constructions. Encoding of random LDPC codes is typically an “order of n^2” (O(n^2)) operation. Quasi-cyclic LDPC codes combine good error rate performance with the opportunity for simplified encoding and decoding. As a result, quasi-cyclic LDPC codes have been proposed for the IEEE 802.16e standard.
LDPC decoders have traditionally been designed for a specific parity check matrix, H. Thus, the block length that the decoder processes and the rate of the code are fixed for the particular architecture. A need therefore exists for LDPC decoders that can support multiple code block lengths and code rates. A further need exists for LDPC decoders that can support a variable parity check matrix.
Generally, methods and apparatus are disclosed for block and rate independent decoding of LDPC codes. The disclosed LDPC decoders support multiple code block lengths and code rates, as well as a variable parity check matrix. The disclosed LDPC decoders decode LDPC codes that are based on a parity check matrix having a plurality of sub-matrices, wherein each row and column of the plurality of sub-matrices has a single non-zero entry. For example, each of the plurality of sub-matrices, I_m^l, is an m by m identity matrix cyclically shifted by l.
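As a minimal sketch of this structure, the following Python fragment expands a small table of shift values into a full parity check matrix built from cyclically shifted identity matrices. The function names, the table `base` and the direction of the shift (each one moved to the right with wrap-around) are assumptions made for illustration; the text above does not fix a shift convention.

```python
def shifted_identity(m, shift):
    """m by m identity matrix with each row's one moved `shift` columns right."""
    return [[1 if col == (row + shift) % m else 0 for col in range(m)]
            for row in range(m)]


def expand(base, m):
    """Replace every shift value in `base` by an m by m shifted identity."""
    H = []
    for base_row in base:
        blocks = [shifted_identity(m, s) for s in base_row]
        for i in range(m):
            # Row i of the block row is the concatenation of row i of each block.
            H.append([bit for block in blocks for bit in block[i]])
    return H


# Hypothetical 2 by 3 table of shifts, expanded with m = 3.
base = [[0, 1, 2],
        [2, 0, 1]]
H = expand(base, 3)

for row in H:
    print(row)
```

Because every sub-matrix contributes exactly one non-zero entry per row and per column, each row of the expanded matrix has as many ones as the table has columns, and each column has as many ones as the table has rows.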
According to one aspect of the invention, each of the sub-matrices has at least one associated Phi-node, wherein each Phi-node comprises a memory device having a plurality of memory elements, wherein one or more of the memory elements may be selectively disabled. In one implementation, the plurality of memory elements comprises m_max memory elements, so that a code with submatrix dimension up to m_max by m_max can be realized.
According to another aspect of the invention, the Phi-nodes may be selectively disabled. For example, one or more rows or columns of Phi-nodes may be selectively disabled. In one implementation, a Phi-node is selectively disabled by setting a memory associated with the Phi-node to 0.
According to yet another aspect of the invention, the Phi-nodes may be selectively disabled at run-time, and at least one Phi-node further comprises a multiplexer. In this manner, a variable parity check matrix is provided. Again, the Phi-node may be selectively disabled, for example, by setting a memory associated with the Phi-node to 0. A plurality of Phi-nodes are connected to a row summer, and the multiplexer is a one-to-many multiplexer that selects the column to which the at least one Phi-node is connected.
A more complete understanding of the present invention, as well as further features and advantages of the present invention, will be obtained by reference to the following detailed description and drawings.
The present invention provides LDPC decoders that can support multiple code block lengths and code rates. According to another aspect of the invention, LDPC decoders are provided that can support a variable parity check matrix.
It is noted that while the parity check matrix 100 of
LDPC codes can be classified in a number of ways. A parity check matrix is cyclic if each row (or column) is the previous row (or column) shifted one place across (or down) with wrap-around. In addition, parity check matrices can be said to have a quasi-cyclic structure.
The parity check matrix 300 describes a rate ½ code (H has dimension (n-k, n) and rate=k/n). If m equals 1, then the parity check matrix 300 becomes:
and a (n, k) equal to (6, 3) code is provided. When m is equal to 3, however, the resultant parity check matrix 400 is shown in
All the Phi-nodes operate in parallel, in lock-step, processing their respective list of m inputs sequentially. Thus, the present invention provides an architecture where doubling the code length only doubles the processing time. This also means that for a given throughput, the clock speed is independent of the code length, so a very large code length can be used at no throughput cost.
In decoding a received codeword, the message most likely to have been sent is determined.
The decoding algorithm of
The exemplary decoding algorithm 500 of
In the log domain, when taking a “sum,” the product of the signs is used to properly perform the multiplication, as illustrated by 600-2.
The decoding algorithm of
It is noted that the decoding algorithm 600 of
As shown in
Configurable Decoder Architecture for Quasi-Cyclic LDPC Codes
Each submatrix has dimension m by m. Thus, each submatrix has only m ones present. Calculations are only performed for positions where there is a one in the parity check matrix. Therefore, each Phi-node is responsible for m signed numbers. Similarly, each row and column summer operates on m elements.
As shown in
The row and column summers 930, 940 consist of compression routines (Index->Float and Float->Index) to minimize the transfer of data, adders sufficient to add all the inputs, and subtractors to subtract off the initial Phi-node value.
The circuitry within the Phi-node 910 can be replicated to support as many levels of parallelism as required. For example, greater parallelism can be achieved in the architecture 900 by widening the data paths so that two or more numbers can be transferred in one clock cycle. This requires the appropriate duplication of the row/column summers 930, 940 and Φ(x) calculation units.
While an actual parity check matrix may have dimensions several thousand by several thousand, the parity check matrix can be divided into a much smaller matrix composed of many submatrices. Using the architecture 900 of
1. From the received data, compute the log likelihood ratios (LLRs), L_j, and store them in the column summers 940. Copy the magnitude of each L_j value into the appropriate x_i memory within the Phi-nodes 910, i.e., x_i=|L_j|.
2. Each Phi-node 910 calculates x_i←Φ(x_i) (i.e., the result is written back into x_i).
3. The row summers 930 sum the x_i in the appropriate row, i.e., r_j←Σx_i.
4. The sum and sign (results) are returned to the Phi-node and the Phi-node value is subtracted, i.e., x_i←r_j−x_i.
5. Each Phi-node calculates x_i←Φ(x_i) again. The Phi-nodes now contain the result of equation 600-2.
6. The column summers 940 sum the appropriate columns and add the initial LLRs, i.e., c_j←(Σx_i)+L_j.
7. The sum and sign are returned to the Phi-node and the Phi-node stored value is subtracted, i.e., x_i←c_j−x_i. This is the result of equation 600-3.
Steps 2 through 5 correspond to ‘Iteration Step 1’ in
Steps 2 through 7 are repeated for each value of i until either a valid codeword is reached, at which point the algorithm terminates, or the maximum number of iterations has been completed.
By increasing the parallelism factor, more than one of the values of i in {0, . . . , m−1} is operated on at a time.
It is noted that the architecture 900 performs a parity check. The sign bits of c_j are used to form the codeword, and on the next step these are loaded back into the Phi-nodes (preserving the sign). Calculating Φ(x) also preserves the sign, and when the next row operation is performed, the sign bits are multiplied together. If the product is zero, the row satisfies the parity equation. Thereafter, a test is performed to determine whether all rows are zero, i.e., whether a valid codeword has been reached.
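The numbered steps above can be sketched in software as follows. This is a behavioral illustration only, not the disclosed hardware: it assumes the commonly used function Φ(x) = −ln(tanh(x/2)), which is not defined in this passage, it clamps the argument of Φ to keep the arithmetic finite, it keeps the sign of each Phi-node value as a separate bit, and it tests the parity equations directly on the signs of the column sums. The matrix H, the LLR values and all names are hypothetical.

```python
import math


def phi(x):
    """Assumed Phi(x) = -ln(tanh(x/2)); the clamp limits are an arbitrary choice."""
    x = min(max(x, 1e-9), 30.0)
    return -math.log(math.tanh(x / 2.0))


def decode(H, llr, max_iter=20):
    # One Phi-node value per one in H, indexed by (row, column).
    edges = [(r, c) for r, row in enumerate(H) for c, h in enumerate(row) if h]
    val = {e: llr[e[1]] for e in edges}              # step 1
    bits = [1 if L < 0 else 0 for L in llr]
    for _ in range(max_iter):
        sign = {e: 1 if val[e] < 0 else 0 for e in edges}
        mag = {e: phi(abs(val[e])) for e in edges}   # step 2
        row_sum = [0.0] * len(H)
        row_sign = [0] * len(H)
        for r, c in edges:                           # step 3
            row_sum[r] += mag[(r, c)]
            row_sign[r] ^= sign[(r, c)]              # product of signs as XOR
        for e in edges:                              # steps 4 and 5
            m = phi(row_sum[e[0]] - mag[e])
            val[e] = -m if row_sign[e[0]] ^ sign[e] else m
        col = list(llr)                              # step 6
        for r, c in edges:
            col[c] += val[(r, c)]
        bits = [1 if cj < 0 else 0 for cj in col]
        if all(sum(h * b for h, b in zip(row, bits)) % 2 == 0 for row in H):
            return bits, True                        # valid codeword reached
        for e in edges:                              # step 7
            val[e] = col[e[1]] - val[e]
    return bits, False


# Hypothetical (6, 3) code; a positive LLR is taken to favor a zero bit.
H = [[1, 1, 0, 1, 0, 0],
     [0, 1, 1, 0, 1, 0],
     [1, 0, 1, 0, 0, 1]]
llr = [0.5, 2.0, -2.0, -2.0, -2.0, 2.0]   # first value has the wrong sign
decoded, ok = decode(H, llr)
print(decoded, ok)
```

In this example the weakly received first bit is overruled by the two parity checks in which it participates, so the loop ends as soon as the hard decisions satisfy every row of H.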
Variable Block Length
According to one aspect of the invention, the architecture 900 allows for a variable block length by having the Phi-nodes responsible for differing amounts of data, in accordance with the value m of the submatrices. Thus, in one exemplary implementation, the register 1030 in each Phi-node 910 has sufficient memory, m_max, to store, for example, 96 elements. The memory elements that are actually used may be controlled, for example, by a switch. It is noted that, as used herein, a “switch” can include any hardware or software device capable of selectively enabling the memory elements. Any code with submatrix dimension up to m_max by m_max can be realized.
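A software analogue of this arrangement might look like the following sketch, in which a fixed store of 96 elements is allocated once and a configured value of m determines how many elements are in use. The class name, its methods and the use of a simple count as the “switch” are hypothetical and are offered only to illustrate the idea.

```python
class PhiNodeMemory:
    """Fixed-size store of which only the first m elements are enabled."""

    def __init__(self, m_max=96):
        self.cells = [0.0] * m_max   # capacity fixed when the node is built
        self.m = m_max               # number of enabled elements

    def configure(self, m):
        """Enable the first m elements; m may not exceed the capacity."""
        if not 1 <= m <= len(self.cells):
            raise ValueError("m must be between 1 and m_max")
        self.m = m

    def active(self):
        """Return only the elements that take part in decoding."""
        return self.cells[:self.m]


node = PhiNodeMemory()
node.configure(24)           # decode a code whose submatrices are 24 by 24
print(len(node.active()))    # 24
```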
Variable Rate
According to another aspect of the invention, the architecture 900 provides a variable rate code decoder by having the ability to “turn off” extra Phi-nodes (for example, by setting the memory 1030 to 0). In this manner, whole rows and columns can be turned off. For example, the (6, 3) code specified by
can be transformed into a (5, 3) code by turning off all the Phi-nodes in one column.
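The effect of turning off Phi-nodes can be illustrated on the parity check matrix itself: a disabled Phi-node behaves like a zero entry, and a column (or row) in which every Phi-node is disabled drops out of the code. The sketch below uses a hypothetical 3 by 6 matrix rather than the matrix referred to above, and the function names are likewise assumptions.

```python
def disable_column(H, col):
    """Model turning off every Phi-node in one column by zeroing its entries."""
    return [[0 if c == col else h for c, h in enumerate(row)] for row in H]


def effective_matrix(H):
    """Drop the all-zero columns and rows left behind by disabled Phi-nodes."""
    keep = [c for c in range(len(H[0])) if any(row[c] for row in H)]
    return [[row[c] for c in keep] for row in H if any(row)]


# Hypothetical parity check matrix with 3 rows and 6 columns.
H = [[1, 1, 0, 1, 0, 0],
     [0, 1, 1, 0, 1, 0],
     [1, 0, 1, 0, 0, 1]]

H_short = effective_matrix(disable_column(H, 5))
print(len(H_short), len(H_short[0]))   # 3 5
```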
Variable Parity Check Matrix
According to a further aspect of the invention, the parity check matrix can optionally be configurable at runtime by turning off Phi-nodes and adding a column selector or multiplexer to each Phi-node. Phi-nodes can be turned off, for example, when every possible slot in the parity check matrix is populated, i.e., each of the n*(n-k) spaces specified by the parity check matrix. This, however, requires more gates. A column selector or multiplexer for each Phi-node can be implemented, for example, by hardwiring 8 Phi-nodes to each row summer, and then adding a 1-to-4 multiplexer to each Phi-node to select which of 4 columns the Phi-node will be connected to. For example, by having the first Phi-node (in each row) map to columns 1 to 4, the second Phi-node map to columns 3 to 6, and so on, many different parity check matrices can be instantiated.
The parity check matrices are low density (sparse). Thus, a combination of these two methods is highly effective in being able to implement many different parity check matrices. This method can be used in conjunction with the variable rate method to create different rate codes as well.
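The combination of the two methods can be sketched for a single row as follows. Each of the 8 Phi-nodes is given either a selector value (0 to 3) or is turned off, and the Phi-node at position p is assumed to reach columns 2p+1 to 2p+4, which extends the “columns 1 to 4, columns 3 to 6” pattern described above; that stride, the 0-based indexing, the function name and the rejection of two Phi-nodes selecting the same column are assumptions of this illustration.

```python
def row_from_selectors(selectors, stride=2, width=4):
    """Build one row of H from per-Phi-node multiplexer settings.

    selectors[p] is None when Phi-node p is turned off; otherwise it picks
    one of `width` columns starting at column stride * p (0-based).
    """
    n_cols = stride * (len(selectors) - 1) + width
    row = [0] * n_cols
    for p, sel in enumerate(selectors):
        if sel is None:
            continue                       # disabled Phi-node: no entry
        if not 0 <= sel < width:
            raise ValueError("selector out of range")
        col = stride * p + sel
        if row[col]:
            raise ValueError("two Phi-nodes select the same column")
        row[col] = 1
    return row


# Three of the eight Phi-nodes enabled, the rest turned off.
settings = [0, None, 3, None, None, None, None, 1]
row = row_from_selectors(settings)
print([c for c, bit in enumerate(row) if bit])   # [0, 7, 15]
```

Changing only the selector values and the set of enabled Phi-nodes yields a different sparse row without any change to the wiring between Phi-nodes and row summers.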
A plurality of identical die are typically formed in a repeated pattern on a surface of a wafer. Each die includes a device described herein, and may include other structures or circuits. The individual die are cut or diced from the wafer, then packaged as an integrated circuit. One skilled in the art would know how to dice wafers and package die to produce integrated circuits. Integrated circuits so manufactured are considered part of this invention.
It is to be understood that the embodiments and variations shown and described herein are merely illustrative of the principles of this invention and that various modifications may be implemented by those skilled in the art without departing from the scope and spirit of the invention.