The present invention relates generally to error-correcting codes, and more specifically to constructing large-girth quasi-cyclic low-density parity-check (QC-LDPC) codes.
Error-Correcting Codes
A fundamental problem in the field of data storage and communication is constructing efficient error-correcting codes. Unless specified otherwise, any reference to a “code” in the following description should be understood to refer to an error-correcting code.
One very important error-correcting code is a linear block error-correcting code.
This code encodes a block of k information symbols using a block of N symbols, where N>k. The additional N−k symbols are used to correct corrupted signals when they are received over a noisy communication channel or retrieved from faulty storage media. The parameter k is called the “dimension” of the code.
A block of N symbols that satisfies all the constraints of the code is called a “code-word,” and the corresponding block of k information symbols is called an “information block.”
A linear code can be represented by a parity check matrix. The parity check matrix representing a binary [N, k] code is a matrix of zeros and ones, with M rows and N columns. The N columns of the parity check matrix correspond to the N symbols of the code. The number of linearly independent rows in the matrix is N−k.
LDPC Codes
There has recently been a great deal of interest in codes that can be decoded using iterative methods. One particularly important iteratively-decodable code is a low-density parity-check (LDPC) code, first described in 1963 by Gallager, see T. Richardson and R. Urbanke, “The Renaissance of Gallager's Low-Density Parity-Check Codes,” IEEE Communications Magazine, vol. 41, pp. 126-131, August 2003, and U.S. Pat. No. 6,633,856 to Richardson et al. on Oct. 14, 2003, “Methods and apparatus for decoding LDPC codes.”
Many LDPC codes use random constructions. For example, Gallager's original binary LDPC code is defined in terms of a parity check matrix, which consists only of 0's and 1's, where a small number of 1's are placed randomly within the matrix according to a pre-defined probability distribution. In general, iterative decoders work well for codes with a parity check matrix that has a relatively small number of non-zero entries, whether that parity check matrix has a random or regular construction.
LDPC codes are often represented using “Tanner graphs,” which are equivalent to parity check matrices. In a Tanner graph, columns of the parity check matrix are represented as “bit nodes” while rows of the parity check matrix are represented as “check nodes.” The Tanner graph is a bi-partite graph that contains a connection between a bit node and a check node if there is a 1 in the parity check matrix at the corresponding row and column position.
QC-LDPC Codes
LDPC codes based on random constructions have the disadvantage that the wiring complexity involved in fabricating a hardware decoder is prohibitive. For this reason, the more regular “quasi-cyclic” LDPC (QC-LDPC) codes have been developed. An example of a QC-LDPC code is described by R. M. Tanner, D. Sridhara, and T. Fuja, “A Class of Group-Structured LDPC Codes,” Proc. International Symposium on Communication Theory and Applications, Ambleside, U.K., July 2001, US Patent Publications 20060109821, “Apparatus and method capable of a unified quasi-cyclic low-density parity-check structure for variable code rates and sizes,” and US Patent Publication 20050149845 “Method of constructing QC-LDPC codes using qth-order power residue.”
The regular structure of QC-LDPC codes enables efficient and high-speed very large scale integration (VLSI) implementations. For this reason, a number of wireless communications standards use QC-LDPC codes, e.g., the IEEE 802.16e, 802.11n standards and DVB-S2 standards.
In general, QC-LDPC codes are in the form of a parity-check matrix H that has a special structure, which makes the codes very convenient for hardware implementation. The parity check matrix is constructed out of square p×p sub-matrices. These sub-matrices either consist of all zeroes, or they are circulant permutation matrices.
Circulant permutation matrices are matrices with a single 1 in each row, where the column in which the 1 is located is shifted by one from row to row. The following matrix is an example of a circulant permutation matrix with p=6:
This matrix is called “I(2)” because when the rows and columns are counted starting at position 0, the 1 in the 0th row is in column 2. The permutation matrix I(0) is the identity matrix. If the value of the index t in I(t) is greater than or equal to p, then the matrix wraps around, so that for p=6, we have I(2)=I(8)=I(14) etc. One special notation that is generally used in describing QC-LDPC codes, and which is used herein, is that the p×p all-zeroes matrix is denoted I(−1).
A (J, L) QC-LDPC code with sub-matrices of size p×p is defined by its parity check matrix
where 0≦j≦J−1, 0≦l≦L−1, and pj,l is an integer in the range −1≦pj,l≦p−1. For a parity check matrix of size M by N, N=pL and m=pJ.
A specific QC-LDPC parity check matrix can be constructed form a corresponding J×L dimensional base matrix B. The entries in the base matrix are the values pj,l for the corresponding entry of the parity check matrix:
When all-zeroes sub-matrices are used, the resulting LDPC code can be an “irregular” LDPC code. An irregular LDPC code is one for which different columns or rows in the parity check matrix can contain different numbers of ones.
Girth of LDPC Codes
Depending on the application, LDPC codes are designed to optimize performance in either the “water-fall” (SNR near the code threshold) or “error-floor” (higher SNR) regime, or both. Low error floors are particularly important for applications that have extreme reliability demands, such as data storage devices and communication systems.
Some theoretical analyses of error floor problems for LDPC codes argue that error events are caused by “trapping sets,” see for example, T. J. Richardson, “Error Floors of LDPC Codes,” Proc. 41st Annual Allerton Conference on Communications, Control, and Computing 2003. Trapping sets result from clusters of short cycles in the Tanner graph of the code. One way to remove trapping sets that involve short cycles is to carefully design the clustering of short cycles in the Tanner graph.
A simpler approach uses codes with larger girths. The “girth” of a code is the length of a shortest cycle in the code graph. By removing short cycles, a large number of cycles are removed, which has the effect of lowering (improving) the error floor.
There is considerable prior art on optimizing girth in LDPC codes. One code uses a progressive-edge growth (PEG) method, see X. Y. Hu, E. Eleftheriou, and D. M. Arnold, “Regular and Irregular Progressive Edge-Growth Tanner Graphs,” IEEE Trans. Info. Theory, vol. 51, pp. 386-398, 2005.
There has also been work on applying the PEG method to QC-LDPC codes, see Z. Li, B. V. K. V. Kumar, “A Class of Good Quasi-cyclic low-density parity-check Codes Based on Progressive Edge Growth Graph,” Signals, Systems, and Computers, Conference Record of the 38th Asilomar Conference, vol. 2, pp. 1990-1994, 2004.
High-girth QC-LDPC codes can also be constructed using a random “guess-and-test” method, see M. P. C. Fossorier, “Quasi-cyclic low-density parity-check codes from circulant permutation matrices,” IEEE Trans. Inform. Theory, vol. 50, no. 8, pp. 1788-1793, August 2004.
However, there is room for improvement in those methods, in terms of the girth that can be achieved for a code of particular block-length and dimension. Therefore, an improved method for constructing high-girth QC-LDPC codes is desired.
The embodiments of the invention provide a method for constructing high-girth quasi-cyclic low-density parity-check (QC-LDPC) codes. The method uses a hill-climbing optimization procedure. The hill-climbing procedure greedily adjusts parameters of the code to find a code of short length that meets specified code and girth parameters.
Given a girth, the method can construct QC-LDPC codes of shorter block-length in less time compared with a prior art PEG or “guess-and-test” methods. The time difference is significant for QC-LDPC codes with large base matrices.
As shown in
It is possible for the channel to corrupt the code-word to a signal y[n] 103. The signal is then passed to a decoder 140, which attempts to output a reconstruction z[a] 104 of the information block u[a] 102 using the QC-LDPC code 150.
Finding Cycles in QC-LDPC Codes
p1=0, p2=1, p3=2 and p4=1 in the matrix 211, and
p1=0, p2=p3=p4=1 in the matrix 212.
The first subgraph results in cycles of length four, as shown with dashed lines 231, while the latter results in a cycle 232 of length twelve.
We now describe a correspondence between a choice of shifts and the length of a resulting cycle in the Tanner graph of the code. Each row of the parity-check matrix corresponds to a check node in the Tanner graph, and each column to a bit node in the Tanner graph. Cycles correspond to a path through nodes, alternating between check and bit nodes.
As shown in
For a cycle, the path ends at the same bit node from which the path started. It is necessary, but not sufficient, for the path, when viewed at the base matrix level, to form a cycle. In other words, it is required that the path does not pass through any all-zeroes sub-matrices, denoted in the base matrix by a −1. We assume that potential paths satisfy this condition. However, because each circulant permutation sub-matrix corresponds to p parity nodes and p variable nodes, this is not sufficient.
The path could end at a different bit node in the same circulant permutation sub-matrix, thus not completing a cycle. What is sufficient is if, when the path returns, it returns to the same column of the circulant sub-matrix from which the path started. For example, in the subgraph 201, this happens for a cycle of length four. However, with a slightly different choice of circulant shifts of the subgraph 202, this only happens after a cycle of length twelve.
We can now specify the conditions on the parameter pj,l that result in a cycle. We determine the differences between the pj,l for neighboring permutation matrices along a given path, where neighbors are on the same row. In
p
1
−p
2
+p
3
−p
4
mod p=0, (4)
which is satisfied for p1=0, p2=1, p3=2, p4=1, but is not satisfied for p1=0, p2=p3=p4=1. Each cycle in the QC-LDPC code is necessarily related to p−1 other cycles obtained by the p−1 possible cyclic shifts in the circulant sub-matrices.
The same logic extends to longer cycles. As for a four-cycle pass through four elements of the base matrix arranged in a rectangle, an arbitrary cycle of length 2i in the Tanner graph of the code passes through 2i elements of the base matrix denoted by the ordered series
(j0, l0), (j1, l0), (j1, l1), . . . , (ji−1, li−1), (j0, li−1), (5)
where 1≦k<i, jk≠jk−1, lk≠lk−1, ji−1≠j0, and li−1≠l0. This ordered series can be considered a “potential” cycle of length 2i. Actually, the series only corresponds to a cycle if the elements traversed in the base matrix satisfy the generalization of Equation (4). To define this generalization, we use the following notation. We define
Δjx,jy(l)=pjx,l−pjy,l.
A necessary and sufficient condition for the code to have girth of at least 2(i+1) is that
for all m, 2≦m≦i, all jk, 0≦jk≦J−1, all jk+1, 0≦jk+1≦J−1, and all lk, 0≦lk≦L−1, with j0=jm, jk≠jk+1, and lk≠lk+1.
For every dimension pair (J, L) and girth g (minimum-length cycle of graph), there exists a pmin (or equivalently Nmin), such that when p<pmin or N<Nmin, no parity check matrices exists that satisfies Equation (6).
We provide a method that constructs a high girth QC-LDPC code of dimensions (J, L), and a girth g. The method returns a base matrix B and a value of p such that the specified code constructed from the base matrix has the girth and p is equal to pmin, or at least as close as possible to pmin.
One prior art method uses a “guess-and-test” method, see Fossorier, “Quasi-cyclic low-density parity-check codes from circulant permutation matrices,” IEEE Trans. Inform. Theory, vol. 50, no. 8, pp. 1788-1793, August 2004. For the guess-and-test method, (J−1)(L−1) integers between 0 and p−1 are randomly and independent identically distributed for non-zero elements of the base matrix B until a set is found such that Equation (6) is satisfied.
Fossorier shows that for codes of girth 8 or larger, all non-zero values in the base matrix must be distinct. We consider his method for those girths to be one where the (J−1)(L−1) integers are selected randomly in the range from 1 to p−1, such that all the integers are distinct. The problem with the guess-and-test method is that it is time-consuming, especially for large base matrices and where N is close to Nmin.
Hill-Climbing Optimization Procedure
Our method is a “hill-climbing” optimization procedure. Hill climbing is an optimization technique that can be used to solve problems that have many solutions but where some solutions are better than others. The procedure starts with a random initial solution to the problem, and iteratively makes small changes to improve the solution.
We start with a randomly selected base matrix B. We also have a corresponding cost matrix C of costs. Then, we change the base matrix B by making a “move.” The move changes the value of a single element (a pj,1) in the base matrix. We select the move that maximizes a reduction in the costs. The costs are a function of the number of cycles of length less than the girth that remain in the base matrix B.
When we no longer can change any single value of the base matrix to a value that further reduces the costs, and thus the number of undesired shorter cycles, the method terminates.
The procedure uses local hill-climbing where the cost function defined by the cost matrix C is the weighted sum of undesired cycles, and cycles of different weights have different weights in the cost function.
Our method tracks the number of cycles of each length in the current code, and the resulting number of cycles if we change each possible element in the base matrix B to other possible values.
The method becomes more complicated when we search for codes with large girths, because of the many possible ways that cycles can form in the Tanner graph.
Cost Matrix
The cost matrix C indicates the costs of changing any value in the base matrix B to have any other possible value, in terms of the weighted sum of the number of cycles. Thus, for any base matrix B, there is a corresponding cost matrix
where cj,l=[cj,l,0, cj,l,1, . . . , cj,l,p−1] is a cost, and cj,l,z is the cost for assigning element pj,l in the base matrix B to the value z for 0≦z≦p−1.
Let Si denote the set of all possible and distinct length-2i potential cycles represented by the ordered series, and |Si| denote the number of all the elements in the set Si. Therefore Si={si1, si2, . . . , si|Si|} with
s
ik[(j0(k), l0(k)), (j1(k), l0(k)), (j1(k), l1(k)), . . . , (ji−1(k), li−1(k)), (j0(k), li−1(k))]
for 1≦k≦|Si|. The girth is g, and a weight vector is w=[w2, w3, . . . , wg/2−1], where wi is the weight or cost for length-2i cycles.
For the base matrix, the corresponding cost matrix C is based on the following. For each potential cycle, we go through each of the elements of the base matrix in the potential cycle. We check, for each element, which value would cause a cycle, assuming all other base matrix elements in the potential cycle remain the same, and we mark that value of the element.
For example, for a potential six-cycle, we know that a cycle exists if and only if p1−p2+p3−p4+p5−p6 mod p=0, where p1 through p6 are the elements of the potential six-cycle. So, if the current summed value of p1−p2+p3−p4+p5−p6 mod p is 1, then we know that the marked values for p1, p3, and p5 are one less than the current value, and the marked values for p2, p4, and p6 are one greater than the current value.
This is relatively uncomplicated for potential cycles including 2i distinct elements of the base matrix. It becomes more complicated if some elements of the base matrix in the potential cycle appear twice. This can occur for cycles of size eight and larger and occurs, e.g., in the subgraph 202 of
Formally, we determine the cost matrix as follows.
Step 1
Determine the cost matrix cj,l,z=0 for
0≦j≦J−1, 0≦l≦L−1, and 0≦z≦p−1.
Step 2
For 2≦i≦g/2−1,
is unique in sik, then determine the marked value
is not unique in sik and occurs for the first time, and α mod 2=0, determine the marked value
is not unique in sik and occurs for the first time, and (p−α)mod 2=0, then determine the additional marked value
Step 3
For 0≦j≦J−1, 0≦l≦L−1, and 0≦z≦p−1, take the weighted sum
With this method for determining the cost matrix, the hill-climbing method is relatively straightforward to describe.
We start with a random base matrix B and a corresponding cost matrix C, and keep selecting the move of changing a value of a single base matrix element that maximizes the reduction in the costs while breaking ties randomly, until we find a base matrix and the corresponding cost matrix that has a zero cost for each of the current values of the elements of the base matrix. At this point, we return the base matrix B, or if we end in a local optimum, which has positive costs, we indicate a failure condition.
We specify 311 the girth g and a weight vector w=[w2, w3, . . . , wg/2−1]. The weight vector w has a weight for each type of cycle having a length less than the girth g. The weight vector represents the “costs” of different types of cycles.
We randomly select 320 an initial base matrix B according to the initial parameters of the code.
We determine 330 a cost matrix C corresponding to the base matrix B and the specified girth g and weight vector w. This step was described in detail above.
We generate 340 a gain matrix G using the cost matrix C. For 0≦j≦J−1 and 0≦l≦L−1, let
Here, {tilde over (c)}j,l is a minimum possible cost at position j, l in the base matrix B, {tilde over (z)}j,l is the value of the element pj, l giving that cost, and c′j,l is the cost associated with the current value of the element pj,l. The gain matrix G=[gj,l] is generated as
g
j,l
=c′
j,l
−{tilde over (c)}
j,l.
The gain matrix represents the value of a best move of an element at each position j, l in the base matrix.
Next, we determine
If there are multiple elements j, l, which all achieve the same maximal gain, ties are broken randomly. We check 350 whether the maximum gain gmax is positive.
If true, we update 360 the base matrix B by selecting a maximum gain step pjmax,lmax={tilde over (z)}jmax,lmax and repeat the step of determining 330 the cost matrix C.
Otherwise if false, we check 370 if all the costs cj.l.p
Then, the QC-LDPC code, i.e., the parity check matrix H 150, can be used in an encoder to encode 120 the information block u[a] to be transmitted through the channel 130 to the decoder, and to reconstruct the information block. The encoded information block can also be written to storage media for later retrieval.
The graphs in
As shown in the graphs that success rate is at least three orders of magnitude greater than the prior art method.
It is to be understood that various other adaptations and modifications can be made within the spirit and scope of the invention. Therefore, it is the object of the appended claims to cover all such variations and modifications as come within the true spirit and scope of the invention.