The present invention generally pertains to forward error correction. In particular, the present invention relates to structured Low-Density Parity-Check (LDPC) codes.
In a typical communication system, forward error correction (FEC) is often applied in order to improve robustness of the system against a wide range of impairments of the communication channel.
Referring to
In many modern communication systems, FEC uses Low-Density Parity-Check (LDPC) codes that are applied to a block of information data of finite length.
One way to represent LDPC codes is by using so-called Tanner graphs, in which N symbol nodes correspond to the bits of the codeword and M check nodes correspond to the set of parity-check constraints that define the code. Edges in the graph connect symbol nodes to check nodes.
LDPC codes can also be specified by a parity check matrix H of size M×N. In the matrix H, each column corresponds to one of the symbol nodes while each row corresponds to one of the check nodes. This matrix defines an LDPC block code (N, K), where K is the information block size, N is the length of the codeword, and M is the number of parity check bits. M=N−K. A general characteristic of the LDPC parity check matrix is the low density of non-zero elements that allows utilization of efficient decoding algorithms. The structure of the LDPC code parity check matrix is first outlined in the context of existing hardware architectures that can exploit the properties of these parity check matrices.
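The relationship between the matrix H, the Tanner graph, and the sizes M, N and K can be illustrated with a minimal sketch. The 3×6 matrix below is a hypothetical toy example chosen for illustration, not a matrix from this disclosure, and the helper names are assumptions.

```python
import numpy as np

# Hypothetical toy parity check matrix (M=3 check nodes, N=6 symbol nodes).
H = np.array([
    [1, 1, 0, 1, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [1, 0, 1, 0, 0, 1],
], dtype=np.uint8)

M, N = H.shape
K = N - M  # information block size, from M = N - K (H has full rank here)

def is_codeword(H, c):
    """True when every parity check (row of H) is satisfied over GF(2)."""
    return not ((H @ c) % 2).any()

def tanner_edges(H):
    """List (check node, symbol node) pairs, one per non-zero element of H."""
    return [(int(m), int(n)) for m, n in zip(*np.nonzero(H))]

c_good = np.array([1, 1, 0, 0, 1, 1], dtype=np.uint8)  # satisfies all three checks
c_bad = np.array([1, 0, 0, 0, 0, 0], dtype=np.uint8)   # violates checks 0 and 2
```

Each row of H is one check node, each column is one symbol node, and each non-zero element is one edge of the Tanner graph.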
In order to accommodate various larger code rates without redesigning the parity check matrix, and therefore without significantly changing the base hardware wiring, expansion of a base parity check matrix is one of the common approaches. This may be achieved, for example, by replacing each non-zero element by a permutation matrix of the size of the expansion factor.
One problem often faced by the designer of LDPC codes is that the base parity check matrices are designed to follow some assumed degree distribution, which is defined as the distribution of column weights of the parity check matrix. Column weight in turn equals the number of 1's in a column. It has been shown that irregular degree distributions offer the best performance on the additive white Gaussian noise channel. However, the base parity check matrix does not exhibit any structure in its Hd portion to indicate the final matrix after expansion. The number of sub-matrix blocks, corresponding to the number of sub-iterations in the layered decoding algorithm, may become large. Since the maximum number of rows that can be processed in parallel equals the number of rows in the sub-matrix block, the overall throughput may be impacted.
Another problem is that in order to maintain the performance such as coding gain as high as possible, there are different requirements such as to select the largest suitable codeword from the available set of codewords and then properly adjust the amount of shortening and puncturing; use as few of the modulated symbols as possible; and keep the overall complexity at a reasonable level.
Some attempts have been made to enhance the throughput by reducing the number of rows of the base parity matrix, and consequently the number of blocks of rows in the expanded parity check matrix, by combining rows as a method to increase the code rate without changing the degree distribution. However, the derived high rate matrix is still relatively large, since in order to allow row combining, the original low rate base parity matrix usually has a large number of rows. The decoding time also becomes a function of the code rate: the higher the code rate, the fewer layers in the layered decoding and, in general, the less decoding time.
Other existing approaches for shortening and puncturing of the expanded matrices may preserve the column weight distribution, but may severely disturb the row weight distribution of the original matrix. This, in turn, causes degradation when common iterative decoding algorithms are used. This adverse effect strongly depends on the structure of the expanded matrix.
Therefore, there is an unmet need for a method and a system to design structured base parity check matrices that, in combination with expansion, allow high throughput and low latency to be achieved while preserving the simple encoding feature of the expanded codeword.
There is further an unmet need for a method and a system to enable flexible rate adjustments by using shortening, or puncturing, or a combination of shortening and puncturing; and at the same time the code rate is approximately the same as the original one, and the coding gain is preserved.
In accordance with a first aspect of the present invention there is provided a method for constructing a low-density parity-check (LDPC) code having a structured parity check matrix comprising the steps of: a) constructing a structured base parity check matrix having a plurality of sub-matrices, the sub-matrices are selected from a group consisting of permutation matrix, pseudo-permutation matrix, and zero matrix; and b) expanding the structured base parity check matrix into an expanded parity check matrix.
Preferably, the sub-matrices in the plurality of sub-matrices have the same size.
Preferably, a majority of the plurality of sub-matrices has the same size, and a small subset of the sub-matrices is constructed by concatenating smaller permutation sub-matrices, pseudo-permutation matrices or zero matrices.
Preferably, the base parity check matrix is in the form of H=[Hd|Hp], Hd being a data portion of the parity check matrix, Hp being a parity portion of the parity check matrix.
Preferably, the expanding step further comprises the steps of: replacing each non-zero member of the sub-matrices by a permutation matrix or a pseudo-permutation matrix; and replacing each zero member of the sub-matrices by a zero matrix.
Preferably, the parity portion of the structured base parity check matrix comprises a dual diagonal.
In accordance with another aspect of the present invention there is provided a method for decoding a data stream encoded using the LDPC code comprising the steps of: a) receiving a set of input values corresponding to variable nodes of the structured parity check matrix; and b) estimating a probability value of the variable nodes based on the plurality of parity checks contained within a block of parity checks corresponding to a row of sub-matrices of the base parity check matrix, over the blocks of the base parity check matrix.
Preferably, the estimating step is repeated until a termination criterion is reached.
In accordance with another aspect of the present invention there is provided a device for decoding a data stream encoded using LDPC code, comprising: a) intra-layer storage elements for receiving a set of input values corresponding to variable nodes of the structured parity check matrix, and for storing the updated variable nodes information; b) a read network for delivering the information from the intra-layer storage elements to the processing units; c) processing units for estimating a probability value of the variable nodes based on a plurality of parity checks contained within a block of parity checks corresponding to a row of sub-matrices of the base parity check matrix; d) inter-layer storage for storing additional information from sub-matrices concatenated using sub-matrices selected from a group consisting of permutation matrix, pseudo-permutation matrix, and zero matrix; and e) a write network for delivering the results from processing units to the intra-layer storage elements.
In accordance with another aspect of the present invention there is provided a method for constructing a low-density parity-check (LDPC) code having a structured parity check matrix comprising the steps of: a) constructing a structured base parity check matrix H=[Hd|Hp], Hd being a data portion of the parity check matrix, Hp being a parity portion of the parity check matrix; b) selecting a parity portion of the structured base parity check matrix so that when expanded, an inverse of the parity portion of the expanded parity check matrix is sparse; and c) expanding the structured base parity check matrix into an expanded parity check matrix.
In accordance with another aspect of the present invention there is provided a method for constructing a low-density parity-check (LDPC) code having a structured parity check matrix, the method comprising the steps of: a) constructing a base parity check matrix H=[Hd|Hp] having a plurality of elements, Hd being a data portion of the parity check matrix, Hp being the parity portion of the parity check matrix; b) expanding the base parity check matrix into an expanded parity check matrix by replacing each non-zero element of the plurality of elements by a shifted identity matrix, and each zero element of the plurality of elements by a zero matrix; wherein the base parity check matrix has a coding rate selected from the group consisting of R=½, ⅔, ¾, ⅚, and ⅞; and accordingly is of the size selected from the group consisting of 12×24, 8×24, 6×24, 4×24, and 3×24.
Preferably, the base parity check matrix has a coding rate of R=¾, and is:
More preferably, the base parity check matrix is expanded by expansion factors L between 24 and Lmax=96, and is represented by the expanded parity check matrix:
wherein −1 represents an L×L all-zero square matrix, the integer sij represents a circularly shifted L×L identity matrix, and the amount of the shift s′ij is determined as follows:
In accordance with another aspect of the present invention there is provided a method for encoding variable sized data using low-density parity-check (LDPC) code and transporting the encoded variable sized data in modulated symbols, the method comprising the steps of: a) calculating a minimum number of modulated symbols capable of transmitting a data packet; b) selecting an expanded parity check matrix having a proper codeword size suitable for transmitting the data packet; c) calculating a number of shortening Nshortened bits to be used during transmission of the data packet; and d) calculating a number of puncturing Npunctured bits to be used during transmission of the data packet.
Preferably, the method further comprises the steps of: a) constructing one or more than one structured base parity check matrix H=[Hd|Hp], Hd being a data portion of the parity check matrix, Hp being a parity portion of the parity check matrix; and b) expanding the one or more than one structured base parity check matrix into one or more than one expanded parity check matrix, each of the one or more than one expanded parity check matrix having a different codeword size for use in the selecting step.
Preferably, the method further comprises the steps of: a) determining a performance criterion of the shortened and punctured expanded parity check matrix; b) adding an additional symbol to transmit the encoded data packet in the case when performance criterion is not met; and c) recalculating the amount of puncturing Npunctured bits.
Preferably, the method further comprises the steps of: a) selecting Nshortened variable nodes from the expanded parity check matrix; b) ensuring a uniform or a close to uniform row weight distribution after removing columns corresponding to the selected Nshortened variable nodes; and c) ensuring a new column weight distribution as close as possible to an original column weight distribution after removing the columns corresponding to the selected Nshortened variable nodes from the selected expanded parity check matrix.
Preferably, the selecting Nshortened variable nodes step further comprises the step of selecting variable nodes belonging to consecutive columns in the selected expanded parity check matrix.
Preferably, the ensuring a new column weight distribution step further comprises the step of prearranging columns of the data portion Hd of the selected expanded parity check matrix.
Preferably, the method further comprises the steps of: a) determining a performance criterion of the shortened and punctured expanded parity check matrix; b) adding an additional symbol to transmit the encoded data packet in the case when the performance criterion is not met; and c) recalculating the amount of puncturing Npunctured bits.
Preferably, the method further comprises the steps of: a) selecting Nshortened variable nodes from the expanded parity check matrix; b) ensuring a uniform or a close to uniform row weight distribution after removing columns corresponding to the selected Nshortened variable nodes; and c) ensuring a new column weight distribution as close as possible to an original column weight distribution after removing the columns corresponding to the selected Nshortened variable nodes from the selected expanded parity check matrix.
More preferably, the selecting step further comprises the step of selecting variable nodes belonging to consecutive columns in the selected expanded parity check matrix.
More preferably, the ensuring step further comprises the step of prearranging columns of the data portion Hd of the selected expanded parity check matrix.
Preferably, the method further comprises the steps of a) selecting Npunctured variable nodes from the selected expanded parity check matrix; b) ensuring each of the selected Npunctured variable nodes is connected to fewest possible check nodes; and c) ensuring that all of the selected Npunctured nodes are connected to most possible check nodes.
Preferably, the performance criterion is selected from the group consisting of a threshold for Npunctured, a threshold for Nshortened, a threshold for normalized shortening to puncturing ratio, qnormalized, and a combination thereof; wherein qnormalized is defined as:
qnormalized=(Nshortened/Npunctured)/[R/(1−R)].
More preferably, the threshold for qnormalized is set to be in the range of 1.2-1.5.
More preferably, the threshold for qnormalized is set to be equal to 1.2.
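The normalized shortening to puncturing ratio can be evaluated with a short sketch. The function names are hypothetical, and the direction of the comparison (the criterion is treated as met when qnormalized is at or above the threshold) is an assumption that the text above does not state explicitly.

```python
def q_normalized(n_shortened, n_punctured, rate):
    """(Nshortened / Npunctured) / (R / (1 - R)), as defined above."""
    if n_punctured == 0:
        # Assumption: with no puncturing the ratio is treated as unbounded.
        return float("inf")
    return (n_shortened / n_punctured) / (rate / (1.0 - rate))

def meets_criterion(n_shortened, n_punctured, rate, threshold=1.2):
    """Assumed reading: the criterion holds when q_normalized >= threshold."""
    return q_normalized(n_shortened, n_punctured, rate) >= threshold

# Illustrative numbers only: R = 3/4, 90 shortened bits, 20 punctured bits.
q = q_normalized(90, 20, 0.75)   # (90/20) / (0.75/0.25) = 4.5 / 3 = 1.5
```

Dividing by R/(1−R) normalizes the ratio by the proportion of information bits to parity bits in the unmodified code, so that codes of different rates can be compared against the same threshold.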
In accordance with another aspect of the present invention there is provided a method of shortening low-density parity-check (LDPC) code comprising the steps of: a) selecting variable nodes in a parity check matrix; b) ensuring a uniform or a close to uniform row weight distribution after removing the selected variable nodes; and c) ensuring a new column weight distribution as close as possible to an original column weight distribution after removing the columns corresponding to the selected variable nodes.
Preferably, the method further comprises the step of selecting variable nodes that belong to consecutive columns in the parity check matrix.
Preferably, the method further comprises the step of prearranging columns of the data portion of parity check matrix.
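The effect of shortening by consecutive columns on the weight distributions can be checked with a minimal sketch. The helper names and the toy matrix are hypothetical; the sketch only measures the weights after column removal and does not implement any particular column prearrangement.

```python
import numpy as np

def shorten_consecutive(H, start, n_shortened):
    """Drop n_shortened consecutive columns of H, beginning at column index start."""
    return np.delete(H, slice(start, start + n_shortened), axis=1)

def weight_profile(H):
    """Return (row weights, column weights) of a 0/1 matrix."""
    return H.sum(axis=1), H.sum(axis=0)

# Toy 3 x 9 matrix built from three 3 x 3 permutation sub-matrices.
I3 = np.eye(3, dtype=int)
H = np.hstack([I3, np.roll(I3, 1, axis=1), I3])

H_short = shorten_consecutive(H, 0, 3)      # remove one whole block column
row_w, col_w = weight_profile(H_short)
```

Because a permutation sub-matrix has exactly one 1 in every row, removing a whole block of consecutive columns lowers every row weight in that block row by one, which keeps the row weight distribution uniform in this example.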
In accordance with another aspect of the present invention there is provided a method of puncturing a low-density parity-check (LDPC) code comprising the steps of: a) selecting variable nodes in a parity check matrix; b) ensuring that each of the selected variable nodes is connected to fewest possible check nodes; and c) ensuring that all of the selected variable nodes are connected to most possible check nodes.
Preferably, the method further comprises the step of selecting variable nodes belonging to consecutive columns in the parity check matrix.
The invention and the illustrated embodiments may be better understood, and the numerous objects, advantages, and features of the present invention and illustrated embodiments will become apparent to those skilled in the art by reference to the accompanying drawings, and wherein:
FIGS. 26a, 26b and 26c are matrices for use in relevant encoding methods and systems.
Reference will now be made in detail to some specific embodiments of the invention including the best modes contemplated by the inventors for carrying out the invention. Examples of these specific embodiments are illustrated in the accompanying drawings. While the invention is described in conjunction with these specific embodiments, it will be understood that it is not intended to limit the invention to the described embodiments. On the contrary, it is intended to cover alternatives, modifications, and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. The present invention may be practiced without some or all of these specific details. In other instances, well-known process operations have not been described in detail in order not to unnecessarily obscure the present invention.
Efficient decoder architectures are enabled by designing the parity check matrix, which in turn defines the LDPC code, around some structural assumptions: structured LDPC codes.
One example of this design is that the parity check matrix comprises sub-matrices in the form of binary permutation or pseudo-permutation matrices.
The term “permutation matrix” is intended to mean square matrices with the property that each row and each column has one element equal to 1 and other elements equal to 0. The identity matrix, a square matrix with ones on the main diagonal and zeros elsewhere, is a specific example of a permutation matrix. The term “pseudo-permutation matrix” is intended to include matrices that are not necessarily square, and that may have row(s) and/or column(s) consisting of all zeros. It has been shown that, using this design, significant savings in wiring, memory, and power consumption are possible while still preserving the main portion of the coding gain. This design enables various serial, parallel, and semi-parallel hardware architectures and therefore various trade-off mechanisms.
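The two definitions can be expressed as simple predicates. This is a sketch under one assumption: a pseudo-permutation matrix is read here as a 0/1 matrix with at most one 1 in each row and each column, which is one interpretation consistent with the definition above. The function names are hypothetical.

```python
import numpy as np

def is_permutation(A):
    """Square 0/1 matrix with exactly one 1 in every row and every column."""
    A = np.asarray(A)
    return bool(A.ndim == 2 and A.shape[0] == A.shape[1]
                and np.isin(A, (0, 1)).all()
                and (A.sum(axis=0) == 1).all()
                and (A.sum(axis=1) == 1).all())

def is_pseudo_permutation(A):
    """0/1 matrix, not necessarily square, with at most one 1 per row and column
    (assumed interpretation; all-zero rows and columns are allowed)."""
    A = np.asarray(A)
    return bool(A.ndim == 2 and np.isin(A, (0, 1)).all()
                and (A.sum(axis=0) <= 1).all()
                and (A.sum(axis=1) <= 1).all())

identity = np.eye(4, dtype=int)
shifted = np.roll(identity, 1, axis=1)   # circularly shifted identity
truncated = identity[:, :3]              # 4 x 3, last row is all zeros
```

Under these predicates the identity and any circularly shifted identity are permutation matrices, while the truncated 4×3 matrix is only a pseudo-permutation matrix.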
This structured code also allows the application of layered decoding, also referred to as layered belief propagation decoding, which exhibits improved convergence properties compared to a conventional sum-product algorithm (SPA) and its derivations. Each iteration of the layered decoding consists of a number of sub-iterations that equals the number of blocks of rows (or layers).
LDPC code parity check matrix design also results in a reduction in encoder complexity. Classical encoding of LDPC codes is more complex than encoding of other advanced codes used in FEC, such as turbo codes. In order to ease this complexity it has become common to design systematic LDPC codes with the parity portion of the parity check matrix containing a lower triangular matrix. This allows simple recursive encoding. One simple example of a lower triangular matrix is a dual diagonal matrix as shown in
Referring to
where $d=[d_0 \ldots d_{K-1}]^T$ is the block of data bits and $p=[p_0 \ldots p_{M-1}]^T$ are the parity bits. A codeword is any binary, or in general, non-binary, N-vector c that satisfies:
$Hc = H_d d + H_p p = 0$
Thus, a given data block d is encoded by solving the binary equation $H_d d = H_p p$ for the parity bits p. In principle, this involves inverting the M×M matrix $H_p$ to resolve p:
$p = H_p^{-1} H_d d$ [equation 1]
$H_p$ is assumed to be invertible. If the inverse of $H_p$, $H_p^{-1}$, is also low density, then the direct encoding specified by the above formula can be done efficiently. However, with the dual diagonal structure of Hp 32, encoding can be performed as a simple recursive algorithm:
where in0 is the index of the column in which row 0 contains a “1”
where in1 is the index of the column in which row 1 contains a “1”
where inM-1 is the index of the column in which row M−1 contains a “1”.
In these recursive expressions hr,c are non-zero elements (1 in this exemplary matrix) of the data portion of the parity check matrix, Hd 31. The number of non-zero elements in rows 0, 1, . . . , M−1, is represented by k0, k1, . . . , kM-1, respectively.
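The recursive encoding can be sketched as follows, assuming a dual diagonal Hp with ones on the main diagonal and on the diagonal immediately below it, so that row 0 involves only p0 and row i involves p(i−1) and pi. The function names and the random test data are hypothetical.

```python
import numpy as np

def dual_diagonal(M):
    """M x M matrix with ones on the main diagonal and the diagonal just below it
    (assumed form of the dual diagonal parity portion)."""
    return np.eye(M, dtype=np.uint8) + np.eye(M, k=-1, dtype=np.uint8)

def encode_dual_diagonal(Hd, d):
    """Solve Hd d + Hp p = 0 over GF(2) row by row, without inverting Hp."""
    M = Hd.shape[0]
    s = (Hd @ d) % 2                 # per-row sum of the data bits selected by Hd
    p = np.zeros(M, dtype=np.uint8)
    p[0] = s[0]                      # row 0 contains only p0
    for i in range(1, M):
        p[i] = (p[i - 1] + s[i]) % 2 # row i contains p[i-1] and p[i]
    return p

rng = np.random.default_rng(0)       # arbitrary seed, illustration only
M, K = 4, 8
Hd = rng.integers(0, 2, size=(M, K), dtype=np.uint8)
d = rng.integers(0, 2, size=K, dtype=np.uint8)
p = encode_dual_diagonal(Hd, d)

H = np.hstack([Hd, dual_diagonal(M)])
c = np.concatenate([d, p])
syndrome = (H @ c) % 2               # all zeros when c is a valid codeword
```

Each parity bit depends only on the data bits in its own row and on the previously computed parity bit, which is why no matrix inversion is needed.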
One desirable feature of LDPC codes is that they support various required code rates and block sizes. A common approach is to have a small base parity check matrix defined for each required code rate and to support various block sizes by expanding the base parity check matrix. Since it is usually required to support a range of block sizes, a common approach is to define expansion for the largest block size and then apply other algorithms which specify expansion for smaller block sizes. Below is an example of a base parity check matrix:
In this example the base parity check matrix is designed for the code rate R=½ and its dimensions are (Mb×Nb)=(6×12). Assume that the codeword sizes (lengths) to be supported are in the range N=[72, 144], with increments of 12, i.e. N=[72, 84, . . . , 132, 144]. In order to accommodate those block lengths the parity check matrix needs to be of the appropriate size (i.e. the number of columns match N, the block length). The number of rows is defined by the code rate: M=(1−R)N. The expansion is defined by the base parity check matrix elements and the expansion factor L, which results in the maximum block size. The conventions used in this example, for interpreting the numbers in the base parity check matrix, are as follows:
The following example shows a rotated identity matrix where the integer specifying rotation is 5:
Therefore, for the largest block (codeword) size of N=144, the base parity check matrix needs to be expanded by an expansion factor of 12. That way the final expanded parity check matrix, to be used for encoding and generating the codeword of size 144, is of the size (72×144). In other words, the base parity check matrix was expanded Lmax=12 times (from 6×12 to 72×144). For block sizes smaller than the maximum, the base parity check matrix is expanded by a factor L<Lmax. In this case expansion is performed in a similar fashion, except that now matrices IL and 0L are used instead of ILmax and 0Lmax, respectively. Integers specifying the amount of rotation of the appropriate identity matrix, IL, are derived from those corresponding to the maximum expansion by applying some algorithm. For example, such an algorithm may be a simple modulo operation:
$r_L = r_{L_{max}} \bmod L$
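The modulo rule can be sketched in a few lines. The function name and the list of rotation values are hypothetical, and the treatment of negative entries as markers for zero blocks is an assumed convention.

```python
def scale_shift(r_lmax, L):
    """Derive the rotation for expansion factor L from the one defined for Lmax."""
    if r_lmax < 0:
        # Assumed convention: negative entries mark all-zero blocks and are kept.
        return r_lmax
    return r_lmax % L

# Hypothetical rotations defined for Lmax = 12, rescaled for L = 8.
shifts_for_lmax = [0, 5, 11, 7, -1]
shifts_for_l8 = [scale_shift(r, 8) for r in shifts_for_lmax]
```

Only rotations that are greater than or equal to L change under this rule; here the rotation 11 becomes 3, while 0, 5 and 7 are unchanged.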
An example of such a matrix is shown in
The expansion may be done for example by replacing each non-zero element with a permutation matrix of the size of the expansion factor. One example of performing expansion is as follows.
Hp is expanded by replacing each “0” element by an L×L zero matrix, 0L×L, and each “1” element by an L×L identity matrix, IL×L, where L represents the expansion factor.
Hd is expanded by replacing each “0” element by an L×L zero matrix, 0L×L, and each “1” element by a circularly shifted version of an L×L identity matrix, IL×L. The shift order, s (number of circular shifts, for example, to the right) is determined for each non-zero element of the base parity check matrix.
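The expansion step can be sketched as a block substitution. The sketch assumes that the base matrix is written as a table of shift values in which −1 denotes a zero block and a non-negative integer s denotes an identity matrix circularly shifted s positions to the right; the 2×3 base matrix and the function names are hypothetical.

```python
import numpy as np

def shifted_identity(L, s):
    """L x L identity with every row circularly shifted s columns to the right."""
    return np.roll(np.eye(L, dtype=np.uint8), s, axis=1)

def expand(base, L):
    """Replace -1 by an L x L zero block and s >= 0 by a shifted identity."""
    blocks = [[np.zeros((L, L), dtype=np.uint8) if s < 0
               else shifted_identity(L, s % L)
               for s in row] for row in base]
    return np.block(blocks)

base = [[0, -1, 2],
        [1, 0, -1]]          # hypothetical 2 x 3 base matrix of shift values
H = expand(base, 4)          # 8 x 12 expanded parity check matrix
```

A shift value of 0 reproduces the unshifted identity used for the Hp portion above. Every non-zero block contributes exactly one 1 to each of its rows and columns, so the row and column weights of the expanded matrix follow directly from the number of non-zero blocks in each block row and block column.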
It should be apparent to a person skilled in the art that these expansions can be implemented without the need to significantly change the base hardware wiring.
The simple recursive algorithm described earlier can still be applied in a slightly modified form to the expanded parity check matrix. If hi,j represent elements of the Hd portion of the expanded parity check matrix, then parity bits can be determined as follows:
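$p_0 = h_{0,0}d_0 + h_{0,1}d_1 + h_{0,2}d_2 + \ldots + h_{0,11}d_{11}$,
$p_1 = h_{1,0}d_0 + h_{1,1}d_1 + h_{1,2}d_2 + \ldots + h_{1,11}d_{11}$,
$p_2 = h_{2,0}d_0 + h_{2,1}d_1 + h_{2,2}d_2 + \ldots + h_{2,11}d_{11}$,
$p_3 = p_0 + h_{3,0}d_0 + h_{3,1}d_1 + h_{3,2}d_2 + \ldots + h_{3,11}d_{11}$,
$p_4 = p_1 + h_{4,0}d_0 + h_{4,1}d_1 + h_{4,2}d_2 + \ldots + h_{4,11}d_{11}$,
$p_5 = p_2 + h_{5,0}d_0 + h_{5,1}d_1 + h_{5,2}d_2 + \ldots + h_{5,11}d_{11}$,
$p_6 = p_3 + h_{6,0}d_0 + h_{6,1}d_1 + h_{6,2}d_2 + \ldots + h_{6,11}d_{11}$,
$p_7 = p_4 + h_{7,0}d_0 + h_{7,1}d_1 + h_{7,2}d_2 + \ldots + h_{7,11}d_{11}$,
$p_8 = p_5 + h_{8,0}d_0 + h_{8,1}d_1 + h_{8,2}d_2 + \ldots + h_{8,11}d_{11}$,
$p_9 = p_6 + h_{9,0}d_0 + h_{9,1}d_1 + h_{9,2}d_2 + \ldots + h_{9,11}d_{11}$,
$p_{10} = p_7 + h_{10,0}d_0 + h_{10,1}d_1 + h_{10,2}d_2 + \ldots + h_{10,11}d_{11}$,
$p_{11} = p_8 + h_{11,0}d_0 + h_{11,1}d_1 + h_{11,2}d_2 + \ldots + h_{11,11}d_{11}$.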
However, when the expansion factor becomes large, then the number of columns with only one non-zero element, i.e. 1 in the example here, in the Hp becomes large as well. This may have a negative effect on the performance of the code.
One remedy for this situation is to use a slightly modified dual diagonal Hp matrix. This is illustrated with reference to
The parity check equations now become:
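$h_{0,0}d_0 + h_{0,1}d_1 + \ldots + h_{0,11}d_{11} + p_0 + p_3 = 0$ [equation 2]
$h_{1,0}d_0 + h_{1,1}d_1 + \ldots + h_{1,11}d_{11} + p_1 + p_4 = 0$ [equation 3]
$h_{2,0}d_0 + h_{2,1}d_1 + \ldots + h_{2,11}d_{11} + p_2 + p_5 = 0$ [equation 4]
$h_{3,0}d_0 + h_{3,1}d_1 + \ldots + h_{3,11}d_{11} + p_0 + p_3 + p_6 = 0$ [equation 5]
$h_{4,0}d_0 + h_{4,1}d_1 + \ldots + h_{4,11}d_{11} + p_1 + p_4 + p_7 = 0$ [equation 6]
$h_{5,0}d_0 + h_{5,1}d_1 + \ldots + h_{5,11}d_{11} + p_2 + p_5 + p_8 = 0$ [equation 7]
$h_{6,0}d_0 + h_{6,1}d_1 + \ldots + h_{6,11}d_{11} + p_6 + p_9 = 0$ [equation 8]
$h_{7,0}d_0 + h_{7,1}d_1 + \ldots + h_{7,11}d_{11} + p_7 + p_{10} = 0$ [equation 9]
$h_{8,0}d_0 + h_{8,1}d_1 + \ldots + h_{8,11}d_{11} + p_8 + p_{11} = 0$ [equation 10]
$h_{9,0}d_0 + h_{9,1}d_1 + \ldots + h_{9,11}d_{11} + p_0 + p_9 = 0$ [equation 11]
$h_{10,0}d_0 + h_{10,1}d_1 + \ldots + h_{10,11}d_{11} + p_1 + p_{10} = 0$ [equation 12]
$h_{11,0}d_0 + h_{11,1}d_1 + \ldots + h_{11,11}d_{11} + p_2 + p_{11} = 0$ [equation 13]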
Now by summing up equations 2, 5, 8, and 11, the following expression is obtained:
$(h_{0,0}+h_{3,0}+h_{6,0}+h_{9,0})d_0 + (h_{0,1}+h_{3,1}+h_{6,1}+h_{9,1})d_1 + \ldots + (h_{0,11}+h_{3,11}+h_{6,11}+h_{9,11})d_{11} + p_0+p_3+p_0+p_3+p_6+p_6+p_9+p_0+p_9 = 0$
Since only p0 appears an odd number of times in the equation above, all other parity check bits cancel except for p0, and thus:
$p_0 = (h_{0,0}+h_{3,0}+h_{6,0}+h_{9,0})d_0 + (h_{0,1}+h_{3,1}+h_{6,1}+h_{9,1})d_1 + \ldots + (h_{0,11}+h_{3,11}+h_{6,11}+h_{9,11})d_{11}$
Likewise:
$p_1 = (h_{1,0}+h_{4,0}+h_{7,0}+h_{10,0})d_0 + (h_{1,1}+h_{4,1}+h_{7,1}+h_{10,1})d_1 + \ldots + (h_{1,11}+h_{4,11}+h_{7,11}+h_{10,11})d_{11}$
$p_2 = (h_{2,0}+h_{5,0}+h_{8,0}+h_{11,0})d_0 + (h_{2,1}+h_{5,1}+h_{8,1}+h_{11,1})d_1 + \ldots + (h_{2,11}+h_{5,11}+h_{8,11}+h_{11,11})d_{11}$
After determining p0, p1, p2 the other parity check bits are obtained recursively:
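$p_3 = h_{0,0}d_0 + h_{0,1}d_1 + \ldots + h_{0,11}d_{11} + p_0$
$p_4 = h_{1,0}d_0 + h_{1,1}d_1 + \ldots + h_{1,11}d_{11} + p_1$
$p_5 = h_{2,0}d_0 + h_{2,1}d_1 + \ldots + h_{2,11}d_{11} + p_2$
$p_6 = h_{3,0}d_0 + h_{3,1}d_1 + \ldots + h_{3,11}d_{11} + p_0 + p_3$
$p_7 = h_{4,0}d_0 + h_{4,1}d_1 + \ldots + h_{4,11}d_{11} + p_1 + p_4$
$p_8 = h_{5,0}d_0 + h_{5,1}d_1 + \ldots + h_{5,11}d_{11} + p_2 + p_5$
$p_9 = h_{6,0}d_0 + h_{6,1}d_1 + \ldots + h_{6,11}d_{11} + p_6$
$p_{10} = h_{7,0}d_0 + h_{7,1}d_1 + \ldots + h_{7,11}d_{11} + p_7$
$p_{11} = h_{8,0}d_0 + h_{8,1}d_1 + \ldots + h_{8,11}d_{11} + p_8$ [equation 14]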
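The encoding order implied by equations 2 to 14 can be checked numerically. In the sketch below the parity portion is rebuilt from the parity bits that appear in equations 2 to 13; the data portion Hd and the data bits are random placeholders, and the function and variable names are hypothetical.

```python
import numpy as np

# Parity bits appearing in each check, read off equations 2 to 13 (rows 0 to 11).
HP_ROWS = [(0, 3), (1, 4), (2, 5), (0, 3, 6), (1, 4, 7), (2, 5, 8),
           (6, 9), (7, 10), (8, 11), (0, 9), (1, 10), (2, 11)]

def encode_modified(Hd, d):
    """Compute p0..p11 for the modified dual diagonal structure over GF(2)."""
    s = (Hd @ d) % 2                  # s[r] = sum over j of h[r,j] d[j]
    p = np.zeros(12, dtype=np.uint8)
    for i in range(3):                # p0, p1, p2: sum of four check rows each
        p[i] = (s[i] + s[i + 3] + s[i + 6] + s[i + 9]) % 2
    for i in range(3):                # equation 14, in dependency order
        p[i + 3] = (s[i] + p[i]) % 2
        p[i + 6] = (s[i + 3] + p[i] + p[i + 3]) % 2
        p[i + 9] = (s[i + 6] + p[i + 6]) % 2
    return p

rng = np.random.default_rng(1)        # arbitrary seed, illustration only
Hd = rng.integers(0, 2, size=(12, 12), dtype=np.uint8)
d = rng.integers(0, 2, size=12, dtype=np.uint8)
p = encode_modified(Hd, d)

Hp = np.zeros((12, 12), dtype=np.uint8)
for r, cols in enumerate(HP_ROWS):
    Hp[r, list(cols)] = 1
syndrome = (Hd @ d + Hp @ p) % 2      # all zeros when every check is satisfied
```

The last three checks (equations 11 to 13) are not used to compute any parity bit in the recursion, yet they are satisfied automatically because p0, p1 and p2 were obtained by summing four equations each.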
The present invention provides a method and a system enabling high throughput, low latency implementation of LDPC codes, while preserving the simple encoding feature at the same time.
In accordance with one embodiment of the present invention, a general form is shown in
The data portion (Hd) may also be placed on the right side of the parity (Hp) portion of the parity check matrix. In the most general case, columns from Hd and Hp may be interchanged.
Parity check matrices constructed according to the embodiments of the present invention support both regular and irregular types of the parity check matrix. Not only may the whole matrix be irregular (non-constant weight of its rows and columns), but its constituents Hd and Hp may also be irregular, if such a partition is desired.
If the base parity check matrix is designed with some additional constraints, then base parity check matrices for different code rates may also be derived from one original base parity check matrix in one of two ways:
Row-combining or row-splitting, with the specific constraints defined above, allows efficient coding of a new set of expanded derived base parity check matrices. In these cases the number of layers may be as low as the minimum number of block rows (layers) in the original base parity check matrix.
Hp, present
Where T is the transform describing the base parity check matrix expansion process and m is the size of the permutation matrices. For m=1, Hp of the present invention defines the form of the prior art Hp (dual diagonal with the odd-weight column), i.e.
Hp,present
A further pair of parity portions with sub-matrices 905, 906 illustrate cases where these first and last columns, respectively, have only one sub-matrix each.
The two parity portions with sub-matrices 907, 908 in
However, in order to solve the weight-1 problem, the sub-matrices 99 (shown hatched) in each example have the weight of all columns equal to 2, except the last one, which has weight equal to 1.
One of the characteristics of the base parity check matrix expansion of the present invention is that the expanded base parity check matrix inherits structural features from the base parity check matrix. In other words, the number of blocks (rows or columns) that can be processed in parallel (or serial, or in combination) in the expanded parity check matrix equals the number of blocks in the base parity check matrix.
Referring to
The base parity check matrix 100 of
It can be seen that expanded parity check matrix 110 has inherited structural properties of its base parity check matrix 100 from
The sub-matrices of the present invention are not limited to permutation sub-matrices, pseudo-permutation sub-matrices or zero sub-matrices. In other words, the embodiments of the present invention are not restricted to the degree distribution (distribution of column weights) of the parity check matrix, allowing the matrix to be expanded to accommodate various information packet sizes and to be designed for various code rates. This generalization is illustrated through the following examples.
In the context of parallel row processing, layered belief propagation decoding is next briefly described with reference to
A high level architectural block diagram is shown in
In order to support a more general approach in accordance with an embodiment of the present invention, the architecture of
By exercising careful design of the parity check matrix, the additional inter-layer storage 155 in
The iterative parallel decoding process is best described as a read-modify-write operation. The read operation is performed by a set of permuters, which deliver information from memory modules to corresponding processing units. Parity check matrices designed with the structured regularity described earlier allow efficient hardware implementations (e.g., fixed routing, use of simple barrel shifters) for both read and write networks. Memory modules are organized so as to provide extrinsic information efficiently to processing units.
Processing units implement block (layered) decoding (updating iterative information for a block of rows) by using any known iterative algorithms (e.g. Sum Product, Min-Sum, Bahl-Cocke-Jelinek-Raviv (BCJR)).
Inverse permuters are part of the write network that performs the write operation back to memory modules.
Such parallel decoding is directly applicable when the parity check matrix is constructed based on permutation, pseudo-permutation or zero sub-matrices.
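The read-modify-write flow of layered decoding can be illustrated in software. The sketch below uses the min-sum update as one of the iterative algorithms mentioned above; it is a simplified model under stated assumptions (positive LLR means bit 0, every row has weight of at least 2, rows are processed sequentially rather than by parallel hardware units), and the function name, toy matrix and LLR values are hypothetical.

```python
import numpy as np

def layered_min_sum(H, llr, layers, max_iter=20):
    """H: (M, N) 0/1 array; llr: channel LLRs; layers: list of row-index lists."""
    M, N = H.shape
    total = np.asarray(llr, dtype=float).copy()  # accumulated LLR per variable node
    msg = np.zeros((M, N))                       # last check-to-variable message per edge
    hard = (total < 0).astype(int)
    for _ in range(max_iter):
        for rows in layers:                      # one sub-iteration per layer
            for r in rows:                       # rows of a layer could run in parallel
                cols = np.flatnonzero(H[r])
                # Read: information from all other layers (exclude this row's own message).
                ext = total[cols] - msg[r, cols]
                sign = np.where(ext < 0, -1.0, 1.0)
                mag = np.abs(ext)
                new = np.empty_like(ext)
                for k in range(len(cols)):       # Modify: min-sum check node update
                    new[k] = np.prod(sign) * sign[k] * np.delete(mag, k).min()
                msg[r, cols] = new
                total[cols] = ext + new          # Write: updated variable node values
        hard = (total < 0).astype(int)
        if not ((H @ hard) % 2).any():           # stop when all parity checks hold
            break
    return hard

H = np.array([[1, 1, 0, 1, 0, 0],
              [0, 1, 1, 0, 1, 0],
              [1, 0, 1, 0, 0, 1]])
llr = np.array([-0.5, 2.0, 2.0, 2.0, 2.0, 2.0])  # all-zero codeword, bit 0 received weakly wrong
decoded = layered_min_sum(H, llr, layers=[[0], [1], [2]])
```

In this toy case the first layer already corrects the unreliable bit, and the later layers of the same iteration work with the corrected value, which is the convergence benefit attributed to layered decoding.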
To encode using sub-matrices other than permutation, pseudo-permutation or zero sub-matrices, one embodiment of the present invention uses special sub-matrices. A sub-matrix can also be constructed by concatenation of smaller permutation or pseudo-permutation matrices. An example of this concatenation is illustrated in
Parallel decoding is applicable with the previously described modification to the methodology; that is, when the parity check matrix includes sub-matrices built by concatenation of smaller permutation matrices.
It can be seen that for the decoding layer 171a first processing unit receives information in the first row 179 from bit 1 (according to S21), bit 6 (S22), bit 9 (S23), bit 13 (S124), bit 15 (S224), bit 21 (S26), and bit 24 (S29). Other processing units are loaded in a similar way.
For layered belief propagation type decoding algorithms, the processing unit inputs extrinsic information accumulated by all other layers, excluding the layer currently being processed. Thus, the prior art implementation described using
This is illustrated in
For simplicity,
The improvement in throughput and the reduction in latency in accordance with an embodiment of the present invention are further illustrated by the following example.
The LDPC codes can be decoded using several methods. In general, iterative decoding is applied. The most common is the sum-product algorithm (SPA) method. Each iteration in SPA comprises two steps:
It has been shown that better performance, in terms of the speed of convergence, can be achieved with layered decoding. In layered decoding only row variables are updated for a block of rows, one block row at a time. The fastest approach is to process all the rows within a block of rows simultaneously.
The following is a comparison of the achievable throughput (bit rate) of two LDPC codes: one based on the existing method for expanding matrix, as described in
T=(K×F)/(C×I),
where K is the number of information bits, F is the clock frequency, C is the number of cycles per iteration, and I is the number of iterations. Assuming that K, F, and I are fixed and, for example, equal to K=320 bits, F=100 MHz, and I=10, the only difference between the existing method and the present invention is derived from C, the factor which is basically a measure of the level of allowed parallelism. It can be seen, by comparing
Cexisting=16 and Cpresent=4.
Using these numbers in the formula gives:
Tmax,existing=200 Mbps
Tmax,present=800 Mbps
As expected, the maximum throughput is 4 times greater. All the desirable features of the code design in terms of efficient encoding are preserved. For example, without degradation in performance, the encoding algorithm as described earlier with respect to
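The throughput comparison above can be reproduced with a short calculation. The following is a minimal sketch; the function name is illustrative, and the value C=4 for the present method is an assumption inferred from the stated fourfold gain over C=16.

```python
def max_throughput_bps(k_bits, clock_hz, cycles_per_iteration, iterations):
    """Evaluate T = (K x F) / (C x I) in bits per second."""
    return (k_bits * clock_hz) / (cycles_per_iteration * iterations)


# Values taken from the example: K = 320 bits, F = 100 MHz, I = 10.
K_BITS = 320
CLOCK_HZ = 100_000_000
ITERATIONS = 10

t_existing = max_throughput_bps(K_BITS, CLOCK_HZ, 16, ITERATIONS)
# C = 4 is inferred from the stated 4x improvement (assumption).
t_present = max_throughput_bps(K_BITS, CLOCK_HZ, 4, ITERATIONS)

print(t_existing / 1e6, "Mbps")
print(t_present / 1e6, "Mbps")
```

Because K, F, and I are held fixed, the throughput ratio equals the inverse ratio of the cycle counts, which is why C alone captures the gain from added parallelism.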
Furthermore, when a scalable solution is desired, the size of the expanded LDPC parity check matrix is designed to support the maximum block size. The existing solutions do not scale well with respect to the throughput for various block sizes. For example, using the existing method for layered decoding, processing of short and long blocks takes the same amount of time. This is caused by the fact that, for shorter blocks, not all processing units are used, resulting in proportionally lower achieved throughput.
The following example is based on the same example as before by comparing matrices as described earlier in
The following table compares the computed results.
It can be seen from the table that the embodiment of the present invention provides constant throughput independent of the codeword size, whereas in the case of the existing method the throughput for the smaller blocks drops considerably. The reason is that while the embodiment of the present invention fully utilizes all available processing resources irrespective of block size, the existing method utilizes all processing units only in the case of the largest block, and a fraction of the total resources for other cases.
The example here, illustrating the throughput improvement for shorter blocks, also leads to the conclusion that reduced latency is achieved with the embodiment of the present invention. When large blocks of data are broken into smaller pieces, the encoded data is split among multiple codewords. If one places a shorter codeword at the end of a series of longer codewords, then the total latency depends primarily on the decoding time of the last codeword. According to the table above, short blocks require proportionally less time to be decoded (as compared to the longer codewords), thereby allowing reduced latency to be achieved by encoding the data in suitably short blocks.
In addition to the full hardware utilization illustrated above, embodiments of the present invention allow hardware scaling, so that short blocks can use proportionately fewer hardware resources if an application requires it.
Furthermore, utilization of more efficient processing units and memory blocks is enabled. Memory can be organized to process a number of variables in parallel. The memory can therefore be partitioned in parallel.
The present invention provides new LDPC base parity matrices, expanded matrices based on the new base parity matrices, and methods for use thereof.
The locations of non-zero matrices for rate R in an exemplary matrix are chosen, so that
An example of R=¾ base parity check matrix design using criteria a) to d) is:
The rate R=¾ matrix definition, built on such a base parity check matrix, covers expansion factors L in the range from 24 to Lmax=96 in increments of 4. Right circular shifts of the corresponding L×L identity matrix, sij′, are determined as follows:
The present invention further enables flexible rate adjustments by the use of shortening, or puncturing, or a combination thereof. Block length flexibility is also enabled through expansion, shortening, or puncturing, or combinations thereof.
Any of these operations can be applied to the base or expanded parity check matrices.
Referring to
The data packet 201 of length L is divided into segments 208. These segments are in turn encoded using an LDPC code (N, K). The information block K 202 may be optionally pruned to K′ 204; and the parity check bits M may be pruned to M′ 205. The term “pruning” is intended to mean applying code shortening by sending fewer information bits than possible with a given code (K′<K). The term “puncturing” is intended to mean removing some of the parity bits and/or data bits prior to sending the encoded bits to the modulator block and subsequently over the channel. Pruned codewords may be concatenated 206 in order to accommodate the encoded data packet, and the resulting stream 207 is padded with bits 209 to match the boundaries 210 of modulated symbols before being sent to the modulator. The amount of shortening and puncturing may be different for the constituent pruned codewords. The objectives here are:
From objective (a) above it follows that, in order to use a small number of codewords, efficient shortening and puncturing operations need to be applied. However, those operations have to be implemented in a way that neither compromises the coding gain advantage of LDPC codes nor lowers the overall transmit efficiency unnecessarily. This is particularly important when using the special class of LDPC parity check matrices that enable a simple encoding operation, for example, the one described in the previous embodiments of the present invention. These special matrices employ either a lower triangular, a dual-diagonal, or a modified dual-diagonal structure in the parity portion of the parity check matrix. An example of a dual-diagonal matrix is described earlier in
Work to achieve efficient puncturing has been done using the “rate compatible” approach. One or more LDPC parity check matrices are designed for the low code rate application. By applying the appropriate puncturing of the parity portion, the same matrix can be used for a range of code rates which are higher than the original code rate, as the data portion in relation to the codeword increases. These methods predominantly target applications where adaptive coding (e.g. hybrid automatic repeat request, H-ARQ) and/or unequal bit protection is desired.
Puncturing may also be combined with code extension to mitigate the problems associated with “puncturing only” cases. The main problem that researchers are trying to solve here is to preserve an optimum degree distribution through the process of modifying the original parity check matrix.
However, these methods do not directly address the problem described earlier: apply shortening and puncturing in such a way that the code rate is approximately the same as the original one, and the coding gain is preserved.
One method attempting to solve this problem specifies shortening and puncturing such that the code rate of the original code is preserved. The following notation is used:
Npunctured—Number of punctured bits,
Nshortened—Number of shortened bits.
Shortening to puncturing ratio, q, is defined as: q=Nshortened/Npunctured. In order to preserve the same code rate, q has to satisfy the following equation:
qrate=R/(1−R), where R is the code rate of the original code.
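The rate-preserving condition can be checked numerically. With R=K/N, requiring (K−Nshortened)/(N−Nshortened−Npunctured)=K/N leads to Nshortened/Npunctured=K/(N−K)=R/(1−R), which agrees with the normalization of q used later in this description. The following is a minimal sketch; the function names and the example code size (N=1152, K=864) are assumptions chosen only for illustration.

```python
from fractions import Fraction


def effective_rate(n, k, n_shortened, n_punctured):
    """Code rate after removing shortened info bits and punctured bits."""
    return Fraction(k - n_shortened, n - n_shortened - n_punctured)


def rate_preserving_ratio(n, k):
    """Shortening-to-puncturing ratio q that keeps the rate at K/N."""
    return Fraction(k, n - k)  # equals R / (1 - R)


# Hypothetical rate 3/4 code: N = 1152, K = 864.
N, K = 1152, 864
q = rate_preserving_ratio(N, K)  # 3 shortened bits per punctured bit
n_punctured = 20
n_shortened = int(q * n_punctured)

print(q, effective_rate(N, K, n_shortened, n_punctured))
```

Exact fractions are used so that the equality of the two rates can be compared without floating-point rounding.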
Two approaches are prescribed for choosing which bits to shorten and which to puncture to reach a shortening and a puncturing pattern.
Two approaches for shortening and puncturing of the expanded matrices are described in Dale Hocevar and Anuj Batra, “Shortening and Puncturing Scheme to Simplify LDPC Decoder Implementation,” Jan. 11, 2005, a contribution to the informal IEEE 802.16e LDPC ad-hoc group, the entirety of which is incorporated herein by reference. These matrices are generated from a set of base parity check matrices, one base parity check matrix per code rate. The choice depends on the code rate, i.e. on the particular parity check matrix design.
The method may preserve the column weight distribution, but may severely disturb the row weight distribution of the original matrix. This, in turn, causes degradation when common iterative decoding algorithms are used. This adverse effect strongly depends on the structure of the expanded matrix.
This suggests that this approach fails to prescribe general rules for performing shortening and puncturing, and has an unnecessary restriction for a general case such as the one described in
In general, the amount of puncturing needs to be limited. Extensive puncturing beyond certain limits paralyzes the soft decision decoder. Prior art methods, none of which specify a puncturing limit or alternatively offer some other way for mitigating the problem, may potentially compromise the performance significantly.
In accordance with another embodiment of the present invention, the above-described shortcomings may be addressed by:
This embodiment of the present invention may be beneficially applied to both the transmitter and the receiver. Although developed for wireless systems, embodiments of the invention can be applied to any other communication system which involves encoding of variable size data packets by a fixed error correcting block code.
The advantage of this invention can be summarized as providing an optimal solution to the above described problem given the range of the system parameters such as the performance, power consumption, and complexity. It comprises the following steps:
At step 213, the minimum number of modulated symbols Nsym
Both the encoder and the decoder may be presented with the same input parameters in order to be able to apply the same procedure and consequently use the same codeword size, as well as other relevant derived parameters, such as the amount of shortening and puncturing for each of the codewords, number of codewords, etc.
In some cases only the transmitter (encoder) has all the parameters available, and the receiver (decoder) is presented with some derived version of the encoding procedure parameters. For example, in some applications it is desirable to reduce the initial negotiation time between the transmitter and the receiver. In such cases the transmitter initially informs the receiver of the number of modulated symbols it is going to use for transmitting the encoded bits rather than the actual data packet size. The transmitter performs the encoding procedure differently, taking into consideration the receiver's abilities (e.g. using some form of higher layer protocol for negotiation). Some of the requirements are relaxed in order to counteract deficiencies of the information at the receiver side. For example, the use of additional modulated symbols to enhance performance may always be in place, may be bypassed altogether, or may be assumed for certain ranges of payload sizes, e.g. indirectly specified by the number of modulated symbols.
One example of the use of such an encoding procedure is in an OFDM based transceiver, which may be used in IEEE 802.11n. In this case the reference to the number of bits per modulated symbol translates into the number of bits per OFDM symbol. In this example, the AggregationFlag parameter specified in 802.11n is used to differentiate between the case when both the encoder and the decoder are aware of the actual data packet size (AggregationFlag=0) and the case when the packet size is indirectly specified by the number of required OFDM symbols (AggregationFlag=1).
An exemplary algorithm in accordance with one embodiment of the present invention, together with its parameters, is now described:
Algorithm Parameters
Algorithm Input
Algorithm Output:
Algorithm Procedure
Each of those features will now be described in more detail.
(a) General Rules for Shortening and Puncturing
Much effort has been spent on designing LDPC parity check matrices such that the derived codes provide optimum performance. Examples include: T. J. Richardson et al., “Design of Capacity-Approaching Irregular Low-Density Parity-Check Codes,” IEEE Transactions on Information Theory, vol. 47, February 2001 and S. Y. Chung, et al., “Analysis of Sum-Product Decoding of Low-Density Parity-Check Codes Using a Gaussian Approximation,” IEEE Transactions on Information Theory, vol. 47, February 2001, both of which are incorporated herein by reference. These papers show that, in order to provide optimum performance, a particular variable node degree distribution should be applied. Degree distribution refers here to the distribution of the column weights in a parity check matrix. This distribution, in general, depends on the code rate and the size of the parity check matrix, or codeword. It is desirable that the puncturing and shortening pattern, as well as the number of punctured/shortened bits, are specified in such a way that the variable node degree distribution is preserved as much as possible. However, since shortening and puncturing are qualitatively different operations, different rules apply to them, as will now be explained.
(b) Rules for Shortening
Shortening of a code is defined as sending fewer information bits than possible with a given code, K′<K. The encoding is performed by: taking K′ bits from the information source, presetting the rest (K−K′) of the information bit positions in the codeword to a predefined value, usually 0, computing M parity bits by using the full M×N parity check matrix, and finally forming the codeword to be transmitted by concatenating K′ information bits and M parity bits. One way to determine which bits to shorten in the data portion of the parity check matrix, Hd (31 in
3 3 3 8 3 3 3 8 3 3 3 8
When discarding columns, the aim is to ensure that the ratio of ‘8’s to ‘3’s remains close to the optimal value, 1:3 in this case. Obviously, the ratio cannot be exactly 1:3 when one to three columns are removed. In such circumstances, the removal of 2 columns might result in, e.g.:
3 3 8 3 3 8 3 3 3 8
giving a ratio of ˜1:2.3, and the removal of a third column, one with weight ‘8’, might result in:
3 3 3 3 8 3 3 3 8
thus giving a ratio of 1:3.5, which is closer to 1:3 than would be the case if the third removed column had weight ‘3’, which results in:
8 3 3 3 8 3 3 3 8
giving a ratio of 1:2.
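The ratios in this example can be recomputed directly from the column-weight lists. The following is a minimal sketch; the function name and the use of exact fractions are illustrative choices, and the ratio is expressed as the number of weight-3 columns per weight-8 column.

```python
from fractions import Fraction


def threes_per_eight(column_weights):
    """Number of weight-3 columns per weight-8 column."""
    return Fraction(column_weights.count(3), column_weights.count(8))


TARGET = Fraction(3)  # the 1:3 ratio of the unshortened example

original = [3, 3, 3, 8, 3, 3, 3, 8, 3, 3, 3, 8]
two_removed = [3, 3, 8, 3, 3, 8, 3, 3, 3, 8]
third_is_8 = [3, 3, 3, 3, 8, 3, 3, 3, 8]
third_is_3 = [8, 3, 3, 3, 8, 3, 3, 3, 8]

for name, weights in [("original", original),
                      ("two removed", two_removed),
                      ("third removed has weight 8", third_is_8),
                      ("third removed has weight 3", third_is_3)]:
    ratio = threes_per_eight(weights)
    print(name, float(ratio), "distance from target:", float(abs(ratio - TARGET)))
```

Comparing the distance from the target for the last two lists shows why removing the weight-8 column is the better third choice in this example.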
It is also important to preserve approximately constant row weight throughout the shortening process.
An alternative to the above-described approach is to prearrange columns of the part of the parity check matrix, such that the shortening can be applied to consecutive columns in Hd. Although perhaps suboptimal, this method keeps the degree distribution of Hd close to the optimum. However, the simplicity of the shortening pattern, namely taking out the consecutive columns of Hd, gives a significant advantage by reducing complexity. Furthermore, assuming the original matrix satisfies this condition, approximately constant row weight is guaranteed. An example of this concept is illustrated in
After rearranging the columns of the Hd part of the original matrix, the new matrix takes on the form 221 shown in
In the case of a regular column parity check matrix, or more generally, approximately regular, or regular and approximately regular only in the data part of the matrix, Hd, the method described in the previous paragraph is still preferred compared to the existing random or periodic/random approach. The method described here ensures approximately constant row weight, which is another advantage from the performance and the implementation complexity standpoint.
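The shortening procedure defined at the start of this section (take K′ information bits, preset the remaining K−K′ positions, compute parity with the full code, and transmit only the K′ information bits plus parity) can be sketched as follows. This is an illustration under stated assumptions: `toy_parity` is a hypothetical stand-in for a real LDPC encoder, and the shortened positions are assumed to be the last K−K′ information positions, which corresponds to the consecutive-column variant described above.

```python
def toy_parity(info_bits):
    """Hypothetical 2-bit parity function standing in for an LDPC encoder."""
    even, odd = 0, 0
    for i, bit in enumerate(info_bits):
        if i % 2 == 0:
            even ^= bit
        else:
            odd ^= bit
    return [even, odd]


def shorten_encode(info_bits, k_full, encode_parity, fill=0):
    """Encode K' <= K information bits with a code designed for K bits."""
    k_prime = len(info_bits)
    if k_prime > k_full:
        raise ValueError("more information bits than the code supports")
    # Preset the unused information positions to a known value (usually 0).
    padded = list(info_bits) + [fill] * (k_full - k_prime)
    # Parity is computed over the full-length information block.
    parity = encode_parity(padded)
    # Only the K' real information bits and the parity bits are transmitted.
    return list(info_bits) + list(parity)


codeword = shorten_encode([1, 0, 1], k_full=6, encode_parity=toy_parity)
print(codeword)
```

At the receiver, the preset positions are known with certainty, so a decoder can reinsert them before running the full-length decoding algorithm.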
(c) Puncturing
Puncturing of a code is defined as removing parity bits from the codeword. In a wider sense, puncturing may be defined as removing some of the bits, either parity bits or data bits or both, from the codeword prior to sending the encoded bits to the modulator block and subsequently over the channel. The operation of puncturing increases the effective code rate. Puncturing is equivalent to a total erasure of the bits by the channel. The soft iterative decoder assumes a completely neutral value corresponding to those erased bits. When the soft information used by the decoder is the log-likelihood ratio, this neutral value is zero.
Puncturing of LDPC codes can be given an additional, somewhat different, interpretation. An LDPC code can be presented in the form of the bipartite graph of
Each variable node 231 is connected 234 by edges, for example 233, to all the check nodes 232 in which that particular bit participates. Similarly, each check node (corresponding to a parity check equation) is connected by a set of edges 237 to all variable nodes corresponding to bits participating in that particular parity check equation. If a bit is punctured, for example node 235, then all the check nodes connected to it, those connected by thicker lines 236, are negatively affected. Therefore, if a bit chosen for puncturing participates in many parity check equations, the performance degradation may be very high. On the other hand, since the only way that the missing information (corresponding to the punctured bits) can be recovered is from the messages coming from the check nodes those punctured bits participate in, the more of those there are, the more successful the recovery may be. Faced with contradictory requirements, the optimum solution can be found somewhere in the middle. These general rules can be stated as follows:
Some of these trade-offs can be observed from
In
It can be seen from the
The matrix in
As discussed previously, in the case where the preservation of the exact code rate is not mandatory, the shortening-to-puncturing ratio can be chosen such that it guarantees preservation of the performance level of the original code. Normalizing the shortening-to-puncturing ratio, q, as follows:
qnormalized=(Nshortened/Npunctured)/[R/(1−R)],
means that q becomes independent of the code rate, R. Therefore, qnormalized=1, corresponds to the rate preserving case of combined shortening and puncturing. However, if the goal is to preserve performance, this normalized ratio must be greater than one: qnormalized>1. It was found through much experimentation that qnormalized in the range of 1.2-1.5 complies with the performance preserving requirements.
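The normalized ratio and the reported performance-preserving range can be evaluated with a few lines of code. The following is a minimal sketch; the function names are illustrative, and the bit counts in the usage lines are hypothetical values for a rate 3/4 code.

```python
from fractions import Fraction


def normalized_ratio(n_shortened, n_punctured, rate):
    """q_normalized = (Nshortened / Npunctured) / (R / (1 - R))."""
    return Fraction(n_shortened, n_punctured) / (rate / (1 - rate))


def in_reported_range(q_norm, low=Fraction(6, 5), high=Fraction(3, 2)):
    """True if q_normalized lies in the 1.2 to 1.5 range quoted above."""
    return low <= q_norm <= high


R = Fraction(3, 4)
q_rate_preserving = normalized_ratio(60, 20, R)  # hypothetical bit counts
q_more_shortening = normalized_ratio(84, 20, R)  # hypothetical bit counts

print(q_rate_preserving, in_reported_range(q_rate_preserving))
print(float(q_more_shortening), in_reported_range(q_more_shortening))
```

A value of exactly one reproduces the rate-preserving case, while values above one shorten more than strictly needed and so lower the effective code rate in exchange for preserved performance.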
In the case of a column regular parity check matrix, or more generally, approximately regular, or regular and approximately regular only in the data part of the matrix, Hd, the method described above in accordance with one embodiment of the present invention is still preferred over the existing random or periodic/random approach, since the present invention ensures approximately constant row weight, which provides another advantage from both the performance and the implementation complexity standpoints.
A large percentage of punctured bits paralyzes the iterative soft decision decoder. In the case of LDPC codes this is true even if puncturing is combined with some other operation such as shortening or extending the code. One could conclude this by studying the matrix 250 of
Ppuncture=100×(Npuncture/M),
then it can be seen that the matrix 250 from
Some of the embodiments of the present invention may include the following characteristics:
The system, apparatus, and method as described above are preferably combined with one or more matrices shown in the
The matrices in
A first group of matrices (
A further matrix (
The rate R=¾ matrices (
The rate R=⅚ matrix (
The two rate R=⅚ matrices (
s′=floor{s(L/96)},
where s is the right circular shift corresponding to the maximum codeword size (for L=Lmax=96), and it is specified in the matrix definitions.
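The shift scaling rule s′=floor{s(L/96)} can be sketched as follows. This is a minimal illustration; the function name is hypothetical, integer arithmetic is used to avoid floating-point rounding, and entries of the matrix definition that are not shift values (for example, markers for zero sub-matrices) are assumed to be handled elsewhere.

```python
L_MAX = 96  # expansion factor of the maximum codeword size


def scaled_shift(s, expansion_factor):
    """Right circular shift for a smaller expansion factor L.

    Computes floor(s * L / 96) using integer division.
    """
    if not 0 <= s < L_MAX:
        raise ValueError("s must be a shift defined for L = 96")
    if not 0 < expansion_factor <= L_MAX:
        raise ValueError("expansion factor must not exceed 96")
    return (s * expansion_factor) // L_MAX


for L in (24, 28, 96):
    print(L, [scaled_shift(s, L) for s in (0, 50, 95)])
```

Since s is less than 96, the scaled shift is always less than L, so it remains a valid circular shift of the L×L identity matrix.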
The invention can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations thereof. Apparatus of the invention can be implemented in a computer program product tangibly embodied in a machine-readable storage device for execution by a programmable processor; and method actions can be performed by a programmable processor executing a program of instructions to perform functions of the invention by operating on input data and generating output. The invention can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. Each computer program can be implemented in a high-level procedural or object-oriented programming language, or in assembly or machine language if desired; and in any case, the language can be a compiled or interpreted language. Suitable processors include, by way of example, both general and special purpose microprocessors. Generally, a processor will receive instructions and data from a read-only memory and/or a random access memory. Generally, a computer will include one or more mass storage devices for storing data files. Storage devices suitable for tangibly embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM disks. Any of the foregoing can be supplemented by, or incorporated in, ASICs (application-specific integrated circuits). Further, a computer data signal representing the software code, which may be embedded in a carrier wave, may be transmitted via a communication network.
Such a computer readable memory and a computer data signal are also within the scope of the present invention, as well as the hardware, software and the combination thereof.
While particular embodiments of the present invention have been shown and described, changes and modifications may be made to such embodiments without departing from the true scope of the invention.
This application claims the benefits of U.S. Provisional Applications No. 60/617,902, filed Oct. 12, 2004; 60/627,348, filed Nov. 12, 2004; 60/635,525, filed Dec. 13, 2004; 60/638,832, filed Dec. 22, 2004; 60/639,420, filed Dec. 27, 2004; 60/647,259, filed Jan. 26, 2005; 60/656,587, filed Feb. 25, 2005; and 60/673,323, filed Apr. 20, 2005.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/CA2005/001563 | 10/12/2005 | WO | 00 | 7/29/2008 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2006/039801 | 4/20/2006 | WO | A |
Number | Date | Country | |
---|---|---|---|
20090259915 A1 | Oct 2009 | US |