1. Field
Certain aspects of the present disclosure generally relate to low density parity check codes and, more particularly, to a method for constructing a parity check matrix, encoding and decoding.
2. Background
With the explosive growth in multimedia and broadband services over wired and wireless networks, significant effort has been made to apply highly efficient error correcting coding to data transmission over noisy and impaired channels. Low Density Parity Check (LDPC) codes have emerged as one of the most promising error correcting codes due to their offering of higher speeds with significantly lower complexity by taking advantage of the natural parallelism of LDPC codes. In fact, LDPC coded were the first to allow data transmission close to the theoretical limit, e.g., the Shannon limit.
A Low Density Parity Check (LDPC) codes is an error correcting code that is used to detect and correct errors introduced during transmission over a noisy and impaired channel. A binary LDPC code is a block error-correcting code based on a sparse Parity Check Matrix (PCM) H, i.e. matrix H contains mostly 0's and only a small number of 1's or equivalently H has low density of 1's. An (N, K) LDPC code is a linear block code whose PCM HM×N contains M rows where M=N−K and N columns. A regular (N, K, Wc, Wr) LDPC code is a linear block code for which the PCM HM×N contains exactly Wc1's per column and exactly Wr=WcN/M 1's per row, where the low density constraints implies that Wr<<N and Wc<<M. The code rate is Rate=K/N=1−M/N=1−Wc/Wr. If the number of ones in each row or column is not constant than such codes are called irregular LDPC codes.
An LDPC code can be defined in both matrix form and graphical form. An LDPC code can be graphically defined by a Tanner bipartite graph corresponding to the PCM HM×N. Not only do such graphs provide a complete representation of the code, they also describe the decoding algorithm explained in more detail below. A Tanner bipartite graph is essentially a visual representation of the PCM HM×N. A M×N PCM HM×N defines a code in which the N bits of each codeword satisfy a set of M parity-check constraints. The Tanner graph contains N bit-nodes (also called variable nodes); one for each bit, and M check-nodes (also called parity nodes); one for each of the parity check equations. The check-nodes are connected via edges (also called arcs) to the bit nodes they check. Specifically, a branch connects check-node i to bit-node j if and only if the i-th parity check equation involves the j-th bit, or more succinctly, if and only if Hi,j=1. The graph is said to be bipartite because there are two distinct types of nodes, bit-nodes and check-nodes, and there are no direct connection between any two nodes of the same type.
An LDPC code may also be defined using a generator matrix GN×K. A message (also called dataword) dM×1 comprising M bits is encoded into a codeword as follows
cN×1=GN×KdK×1
Alternatively, the dataword dM×1 can be encoded into a codeword cN×1 using the PCM HM×N by solving for the constraints specified in the following equation
HM×NcN×1=0M×1
An LDPC encoded data stream comprising one or multiple codewords is typically transmitted over a noisy and/or impaired channel. A received word corresponding to a transmitted codeword may be contaminated with errors. An LDPC decoder is used to detect and/or correct the errors. LDPC decoding is based on iterative decoding using a message-passing algorithm as an alternative to an optimal yet highly complex maximum-likelihood decoding. Received words are processed iteratively over a Tanner graph wherein messages are exchanged iteratively between bit nodes and parity nodes until a stopping criterion is satisfied.
Conventional LDPC PCMs are random in nature which leads to fully parallel LDPC encoders and decoders. Fully parallel LDPC decoding means that all the messages to and from parity nodes have to be computed at every iteration in the decoding process. This leads to large complexity, increased power and increased cost. Serializing part of the decoder by sharing a number of parity node processing elements (PNPE) is one option for reducing some of the overhead involved; however, serializing part of the decoder would result in stringent memory requirements to store the messages and in an interconnection complexity bottleneck network, i.e. complex interconnects and multiplexing between Variable Nodes Processing Elements (VNPEs), PNPEs and memory.
Further, if different coding rates are to be supported, then the encoder and decoder become even more complex in terms of memory size and architecture, speed, interconnect and multiplexing complexity.
Therefore, there is a need in the art for a method of high speed multi-rate LDPC encoding and decoding that avoids the drawbacks of the standard LDPC encoding and standard message-passing decoding algorithms.
Certain aspects provide a method for wireless and wired communications. The method generally includes encoding at least one of the fields of a data stream with one an LDPC encoder wherein the corresponding LDPC matrix comprises at least on matrix block that admits a cyclic sub-block shift registers with a set of fixed permuters, and transmitting the spread data stream.
Certain aspects provide a method for wireless and wired communications. The method generally includes receiving a data stream wherein at least one of the fields LDPC encoded, and decoding the encoded fields using an LDPC decoder comprising one or multiple a cyclic sub-block shift registers with a set of fixed permuters.
So that the manner in which the above-recited features of the present disclosure can be understood in detail, a more particular description, briefly summarized above, may be had by reference to aspects, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only certain typical aspects of this disclosure and are therefore not to be considered limiting of its scope, for the description may admit to other equally effective aspects.
Various aspects of the disclosure are described more fully hereinafter with reference to the accompanying drawings. This disclosure may, however, be embodied in many different forms and should not be construed as limited to any specific structure or function presented throughout this disclosure. Rather, these aspects are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art. Based on the teachings herein one skilled in the art should appreciate that the scope of the disclosure is intended to cover any aspect of the disclosure disclosed herein, whether implemented independently of or combined with any other aspect of the disclosure. For example, an apparatus may be implemented or a method may be practiced using any number of the aspects set forth herein. In addition, the scope of the disclosure is intended to cover such an apparatus or method which is practiced using other structure, functionality, or structure and functionality in addition to or other than the various aspects of the disclosure set forth herein. It should be understood that any aspect of the disclosure disclosed herein may be embodied by one or more elements of a claim.
The word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any aspect described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects.
Although particular aspects are described herein, many variations and permutations of these aspects fall within the scope and spirit of the disclosure. Although some benefits and advantages of the preferred aspects are mentioned, the scope of the disclosure is not intended to be limited to particular benefits, uses, or objectives. Rather, aspects of the disclosure are intended to be broadly applicable to different wireless technologies, system configurations, networks, and transmission protocols, some of which are illustrated by way of example in the figures and in the following description of the preferred aspects. The detailed description and drawings are merely illustrative of the disclosure rather than limiting, the scope of the disclosure being defined by the appended claims and equivalents thereof.
The techniques described herein may be used for various broadband wireless and wired communication systems, including communication systems that are based on a Single Carrier (SC) transmission and Orthogonal Frequency Division Multiplexing/multiple Access OFDM(A). Aspects disclosed herein may be advantageous to systems employing Ultra Wide Band (UWB) signals including millimeter-wave signals, Code Division Multiple Access (CDMA) signals, and OFDM. However, the present disclosure is not intended to be limited to such systems, as other coded signals may benefit from similar advantages.
A variety of algorithms and methods may be used for transmissions in the wireless communication system 100 between the SAPs 104 and the STAs 106 and betweens STAs 106 themselves. For example, signals may be sent and received between the SAPs 104 and the STAs 106 in accordance with CDMA technique and signals may be sent and received between STAs 106 in according with OFDM technique. If this is the case, the wireless communication system 100 may be referred to as a hybrid CDMA/OFDM system.
A communication link that facilitates transmission from a SAP 104 to a STA 106 may be referred to as a downlink (DL) 108, and a communication link that facilitates transmission from a STA 106 to a SAP 104 may be referred to as an uplink (UL) 110. Alternatively, a downlink 108 may be referred to as a forward link or a forward channel, and an uplink 110 may be referred to as a reverse link or a reverse channel. When two STAs communicate directly with each other, a first STA will act as the master of the link, and the link from the first STA to the second STA will be referred to as downlink 112, and the link from the second STA to the first STA will be referred to as uplink 114.
A BSS 102 may be divided into multiple sectors 112. A sector 116 is a physical coverage area within a BSS 102. SAPs 104 within a wireless communication system 100 may utilize antennas that concentrate the flow of power within a particular sector 116 of the BSS 102. Such antennas may be referred to as directional antennas.
The wireless device 202 may include a processor 204 which controls operation of the wireless device 202. The processor 204 may also be referred to as a central processing unit (CPU). Memory 206, which may include both read-only memory (ROM) and random access memory (RAM), provides instructions and data to the processor 204. A portion of the memory 206 may also include non-volatile random access memory (NVRAM). The processor 204 typically performs logical and arithmetic operations based on program instructions stored within the memory 206. The instructions in the memory 206 may be executable to implement the methods described herein.
The wireless device 202 may also include a housing 208 that may include a transmitter 210 and a receiver 212 to allow transmission and reception of data between the wireless device 202 and a remote location. The transmitter 210 and receiver 212 may be combined into a transceiver 214. An antenna 216 may be attached to the housing 208 and electrically coupled to the transceiver 214. The wireless device 202 may include one or more wired peripherals 224 such as USB, HDMI, or PCIE. The wireless device 202 may also include (not shown) multiple transmitters, multiple receivers, multiple transceivers, and/or multiple antennas.
The wireless device 202 may also include a signal detector 218 that may be used in an effort to detect and quantify the level of signals received by the transceiver 214. The signal detector 218 may detect such signals as total energy, energy per subcarrier per symbol, power spectral density and other signals. The wireless device 202 may also include a digital signal processor (DSP) 220 for use in processing signals.
The various components of the wireless device 202 may be coupled together by a bus system 222, which may include a power bus, a control signal bus, and a status signal bus in addition to a data bus.
Data 306 to be transmitted are shown being provided as input to an LDPC encoder 308. The LDPC encoder encodes the data 306 by adding redundant bits. The LDPC encoder 308 outputs an encoded data stream 310. The encoded data stream 310 is input to the mapper 314. The mapper 314 may map the encoded data stream onto constellation points. The mapping may be done using some modulation constellation, such as binary phase-shift keying (BPSK), quadrature phase-shift keying (QPSK), 8 phase-shift keying (8PSK), quadrature amplitude modulation (QAM), constant phase modulation (CPM), etc. Thus, the mapper 312 may output a symbol stream 314, which may represents one input into a block builder 310. Another input in the block builder 310 may be comprised of one or multiple of spreading codes produced by a spreading-codes generator 318.
The block builder 310 may be configured for partitioning the symbol stream 314, into sub-blocks and creating OFDM/OFDMA symbols or single carrier sub-blocks. The block builder may append each sub-block by a guard interval, a cyclic prefix or a spreading sequence from the spreading codes generator 318. Furthermore, the sub-blocks may be spread by one or multiple spreading codes from the spreading codes generator 318.
The output 320 may be pre-pended by a preamble 322 generated from one or multiple spreading sequences from the spreading codes generator 324. The output stream 326 may then be converted to analog and up-converted to a desired transmit frequency band by a radio frequency (RF) front end 328 which may include a mixed signal and an analog section. An antenna 330 may then transmit the resulting signal 332.
The transmitted signal 332 is shown traveling over a wireless channel 334. When a signal 332′ is received by an antenna 330′, the received signal 332′ may be down-converted to a baseband signal by an RF front end 328′ which may include a mixed signal and an analog portion. Preamble detection and synchronization component 322′ may be used to establish timing, frequency and channel synchronization using one or multiple correlators that correlate with one or multiple spreading codes generated by the spreading code(s) generator 324′.
The output of the RF front end 326′ is input to the block detection component 316′ along with the synchronization information from 322′. When OFDM/OFDMA is used, the block detection block may perform cyclic prefix removal and fast Fourier transform (FFT). When single carrier transmission is used, the block detection block may perform de-spreading and equalization.
A demapper 312′ may perform the inverse of the symbol mapping operation that was performed by the mapper 312 thereby outputting soft or hard decisions 310′. The soft or hard decisions 310′ are input to the LDPC decoder which provides an estimate data stream 306′. Ideally, this data stream 306′ corresponds to the data 306 that was provided as input to the transmitter 302.
The wireless system 100 illustrated in
In the descriptions that follow, certain example parameters, values, etc., are used; however, it will be understood that the disclosure described herein is not necessarily limited by these examples. Accordingly, these examples should not be seen as limiting the disclosure in any way. Further, the embodiments of an LDPC encoder and an LDPC decoder described herein can be applied to many different types of systems implementing a variety of protocols and communication techniques. Accordingly, the embodiments should not be seen as limited to a specific type of system, architecture, protocol, air interface, etc. unless specified.
In order to illustrate the operation of LDPC codes, the following PCM example is provided
As can be seen, the PCM H3×6 is low density, or sparse. A dataword d3×1 may be encoded into a codeword c6×1 such that H3×6c6×1=03×1. The encoding may be done in a systematic way, i.e. the codeword c6×1 is partitioned into two vectors, the dataword d3×1 and a parityword p3×1 as follows
Expanding the constraint H3×6c6×1=03×1, we obtain
Solving for the parityword bits {pm} for m=1, 2, 3 we obtain
p1=d1
p2=d1+d3
p3=d1+d2
Thus, for example, for the data word d3×1=[010]T where “T” is the transpose operator, the parityword is p3×1=[001]T and the codeword is c6×1=[010001]T.
The normalized received vector, denoted r6×1, may be a multilevel signal corresponding to the transmitted codeword c6×1, and may be modeled as
where w6×1 is a vector modeling the channel noise, imperfections, impairments and interference, and σ2 is the variance of the noise samples {wn}. The first parity check node or equation, corresponding to the first row of PCM H3×6 will check received samples r1, r3, and r5. The second parity check node, i.e., the second row of H3×6, checks for received samples r2, r4, and r6, and the third parity check node checks samples r1, r2, and r6. The first and second parity check equations are considered to be orthogonal, because they involve mutually exclusive sets of samples.
In an LDPC decoder, the operations of the parity check nodes and variable nodes may be implemented by processor elements as indicated above. An LDPC decoder is than an iterative decoder that implements a message passing algorithm defined by PCM H3×6.
As shown by R. G. Gallager, “Low-Density Parity-check Nodes,” IRE Trans. Inform. Theory, vol. IT-8, pp. 21-28, January 1962, the parity node messages produced by PNPE 402-1 can be computed as follows
Thus, PNPE 402-1 can be configured to implement the above equations or any approximation to the above equations such as the min-sum approximation, the scaled min-sum approximation, the offset min-sum approximation, etc.
v1k=r1+Ek(1→1)+Ek(3→1)
The iterative decoder may be stopped according to a stopping criterion such as if the hard decisions taken on the multilevel variables {vnk} with n=1, 2, . . . , 6 meet the parity check equations, i.e. H3×6v6×1k=03×1, or if a certain defined number or iterations is surpassed.
The message-passing decoder for a binary code with a PCM HM×N can be summarized by introducing the index sets Cn={m: Hm,n=1} and Rm={n: Hm,n=1}. The index set Cn is the set of all row indexes for which Hm,n=1, i.e. the index of all parity check equations involving variable node number n, and the index set Rm is the set of all column indexes for which Hm,n=1, i.e. the index of all variable nodes involved in the mth parity check equation. Let Ek(m→n) denote the parity node message from PCNE number m to VNPE number n during the kth iteration, and let vnk denote an estimate of the nth a-posteriori LLR of bit number n after k iterations. Using the compact notation n=1:N which means n=1, 2, . . . , N, the message passing decoder may be summarized as follows
In the min-sum approximation, the computation of the edge messages Ek(m→n) are replaced by
where the sign function is +1 if its argument is ≧0 and −1 if its argument is <0, and min is the minimum. The min-sum approximation may be further modified by replacing the minimum in the above equation with
where α is a scaling factor often chosen to be less than one and β is an offset.
It will be understood that the decoder described above may be implemented using hardware and/or software configured appropriately and that while separate PNPEs and VNPEs are described, these processors may be implemented by a single processor, such as an Application Specific Integrated Circuit (ASIC); however, as mentioned above, implementation of an LDPC processor such as that described in
The embodiments described below allow for more practical implementation of an LDPC encoder and decoder. For example, according to one aspect of the disclosure, a highly structured PCM HM×N, where different sub-blocks have cyclic sub-block shift registers representations wherein different memory components are connected by fixed permuters, and may be used to facilitate encoding and to solve the memory, interconnect and multiplexing bottlenecks at the decoder.
According to one aspect of the disclosure, the received vector rN×1 is partitioned into L block vectors of length N/L each, and the PCM matrix HM×N(1/2) is partitioned accordingly into Lr×Lc block matrices. For simplicity, we shall assume in the following block column matrices, i.e. Lr=1, and Lc=L,
where “T” is the transpose operator, the received vector rN×1 is written into matrix form r(N/L)×L, and the first block vectors r(N/L)×1l for l=1:L contain successive blocks of N/L elements of vector rN×1. The following exemplary parameters shall be used to clarify the construction procedure, rate=½, N=67, and L=2; therefore M=K=N/2=336 and N/L=336.
The processing of rM×1Left and rM×1Right use the same procedures and therefore we shall use block vector xM×1 to refer to either one of them whenever no distinction between the two is required, and the construction and partitioning of HM×MLeft and HM×MRight use the same procedures and therefore we shall use block matrix GM×M to refer to either one of them whenever no distinction between the two is required.
Next, according to one aspect of the disclosure, block vector xM×1 is portioned into P sub-block vectors of length l=M/P, and bock matrix GM×M is partitioned into P×P square sub-block matrices.
where, as above, the sub-block vectors xI×1p for p=1:P, contain successive block of I elements of vector xM×1, and are collected together into matrix xI×P. In the following, an example value of 4 for the parameter P shall be used for illustration purposes of the decoder operation and construction of PCM HM×N(1/2). Therefore, for the above example I=84.
According to one aspect of the disclosure, a block matrix GM×M, may be fully specified by one of the block rows, for example the first block row, i.e. the sub-blocks GI×I1,p with p=1:P, along with a set of fixed permuters connecting each of the vectors xI×1p with p=1:P in a sub-block shift-register. This is further explained below.
According to one aspect of the disclosure, each of the sub-block vectors xI×1p with p=1:P is partitioned into Q element vectors of length S=I/Q each, and each of the sub-block matrices GI×Ip
with p,pr,pc=1:P. In order to clarify this last partitioning, we shall use for illustration purposes non-zero elementary sub-matrices constructed from cyclic shift of an identity matrix. Let J be the left cyclic shift of the identity matrix of size S×S defined by,
The matrix J has the following properties, J0=I, Je=J×Je-1, and JS=I where I is the identity matrix. In order to accommodate the 0 matrix, we define J−=0. Alternatively, the matrix J can be defined as the right cyclic shift of the identity matrix.
According to another aspect of the disclosure, a block matrix GM×M, may be fully specified by a block column, for example the first block column, i.e. the sub-blocks GI×1p,1 with p=1:P, along with a set of fixed permuters connecting the element vectors xS×1p,q with q=1:Q, within a sub-vector xl×1p with p=1:P in an element block shift-register. This is further explained below.
In the following, an example value of 2 for the parameter Q shall be used for illustration purposes of the decoder operation and construction of PCM HM×N(1/2). Therefore, for the above example S=42, and
According to another aspect of the disclosure, the non-zero element matrices may be chosen as permutation matrices. As an example, for S=4, there are 24 possible permutations, i.e. S!=S×(S−1)× . . . ×1=24. These are listed below in the matrix perm(4)
The different permutation vectors are separated by a semi-column, i.e. “;” in the above matrix. So if the chosen permutation is number 11, i.e. vector [2, 4, 3, 1], than the non-zero element matrix may be constructed by placing a 1 in the 2nd column of the first row, a 1 in the 4th column of the 2nd row, a 1 in the 3rd column of the third row and so on
According to one aspect of the disclosure, the sub-block rows in each block matrix, such as block column matrices HM×MLeft and HM×MRight illustrated in
According to one embodiment of the present disclosure, the variable nodes in the variable nodes processing elements are stored in L memory blocks (L=2 in the above example), wherein each memory block contains P memory sub-blocks (P=4 in the above example), and wherein each memory sub-block contain Q memory banks (Q=2 in the above example), and wherein each memory bank comprises S memory cells (S=84 in the above example), and wherein each memory cell comprises B-bits where B is the bit width allocated to the variable nodes. The first memory block is capable of storing r84×4Left, and the second memory bank is capable of storing r84×4Right. A memory sub-block is capable of storing a column of r84×4Left or r84×4Right, and a memory bank is capable of storing an element vector such as r42×11.
According to one embodiment of the present disclosure, the variable nodes in the variable nodes processing elements are initialized with the above partitioned received vector, rotated according to the first block row of block column matrices HM×MLeft and HM×MRight. This is further illustrated in
According to one aspect of the disclosure, a square block matrix G, such as block column matrix HM×MLeft or block column matrix HM×MRight, is constructed in such a way that pairs of memory banks in two memory sub-blocks are connected by fixed permuters. The construction details are provided in reference to
The initialized memory banks are shown again in
Memory bank 702-1 in the first memory sub-block 718-1 is connected to memory bank 704-2 in the second memory sub-block 718-2 via fixed permuter 706-1, memory bank 704-2 in the second memory sub-block 718-2 is connected to memory bank 704-3 in the third memory sub-block 718-3 via fixed permuter 708-2, memory bank 704-3 in the third memory sub-block 718-3 is connected to memory bank 702-4 in the fourth memory sub-block 718-4 via fixed permuter 708-3, and memory bank 702-4 in the fourth memory sub-block 718-4 is connected back to memory bank 702-1 in the first memory sub-block 718-1 via fixed permuter 706-4. Therefore, the set of memory banks 702-1, 704-2, 704-3, and 702-4 form a cyclic vector shift register wherein each successive pair of vector registers, i.e. memory banks, are connected via a fixed permuter. In the same way, the set of memory banks 704-1, 702-2, 702-3, and 704-4 form a cyclic vector shift register wherein each successive pair of vector registers, i.e. memory banks, are connected via a fixed permuter. The set of permuters used are, 708-1, 706-2, 706-3, and 708-4. The set of two cyclic vector shift registers, i.e. vector shift register comprising memory banks 702-1, 704-2, 704-3, and 702-4 and permuters 706-1, 708-2, 708-3, and 706-4, and vector shift register comprising memory banks 708-1, 706-2, 706-3, and 708-4 and permuters 708-1, 706-2, 706-3, and 708-4, shall be referred to collectively as cyclic sub-block shift register.
The cyclic sub-block shift registers 720-1 corresponds to block matrix HM×MLeft and cyclic sub-block shift registers 720-2 corresponds to block matrix HM×MRight. The two cyclic sub-block shift registers 720-1 and 720-2 are completely independent and therefore in what follows, it is enough use one of them for exemplary clarifications.
According to one aspect of the disclosure, a block matrix G, such as block column matrix HM×MLeft or block column matrix HM×MRight, may be fully constructed from a cyclic sub-block shift registers. Explanation is provided in reference to 720-1 in
The initial values in the memory banks 702-1, 702-2, 702-3, 702-4, 704-1, 704-2, 704-3 and 704-4 correspond to the entries in the first block row in block matrix HM×MLeft in
Therefore, the first step of the construction of a block matrix G, is to specify an “entry matrix” aQ×P of Q×P non-zero elements chosen randomly from the set {0, 1, . . . , S−1}. In the example of
The second step is to place each of the columns of aQ×P in the corresponding sub-block in matrix HM×MLeft in
The first element, i.e. 40 is always associated with r1, and the second element, i.e. 25 is always associated with r2. However, there are two different ways to place the first them in memory sub-block 702-1. These are
In the above example Q=2. So in general, there are Qperm=Q! ways in placing the elements of a column of aQ×P in the corresponding sub-block in matrix G, or equivalently in the corresponding memory sub-block. Therefore, the placements for all the rows of matrix aQ×P can be fully specified by selecting a placement vector b1×P with elements chosen randomly form the set {1, 2, . . . , Qperm}. In the example considered here, Q=2, there are 2 possible permutation patterns [1 2] and [2 1]. And therefore, the placement vector in
b1×P=[1 1 1 1]
On the other hand, the placement vector in
b1×P=[2 1 2 1]
The third step, in reference to the cyclic sub-block shift register in
where the entries in the last column, i.e. e1,4 and e2,4 are to be computed. The first connection element specifies how the memory banks 702-1 and 704-1 in the first memory sub-block 718-1 are connected to memory banks 702-2 and 704-2 in the second memory sub-block 718-2. There are two possibilities. The first possibility is to connect 702-1 to 702-2 and 704-1 to 704-2 (unlike what is shown in
f1×P=[2 1 2 f4]
where the last element f4 is to be computed as follows.
Consider memory bank 702-1. When 702-1 travels through the vector shift register, it travels through 706-1, 704-2, 708-2, 704-3, 708-3, and 702-4. In order for 702-1 to go back to its place, 702-4 should be connected back to 702-1 and not 704-1. Therefore, in one aspect of the disclosure, the connection between the last memory sub-block back to the first memory sub-block which specifies the last element of connection vector f1×P is chosen in such a way that each of the memory banks in the first memory sub-block should go back to their places after traveling through their respective paths in the cyclic sub-block shift register. In the circuit 720-1 in
Once the last element of the connection vector f1×P is computed as shown above, the last vector of permuter matrix eQ×P is computed as follows. Consider what happens to memory bank 702-1 when it travels through the set of fixed permuters in its path within the cyclic sub-block shift register. Memory block 702-1 with content J40r1 travels first through permuter 706-1, and its value change to J27×J40r1=J67r1. Since JS=I, so in the example of
J16×J18×J23×J27=J84=J2×42=I
Therefore, according to another aspect of the disclosure, the sum of the exponents of the fixed permuters along a path within a cyclic sub-block shift register representing a block matrix G, should be an integer multiple of S. If permutation matrices are used instead of cyclic shift matrices Jn, than according to another aspect of the disclosure, the product of the fixed permuters, A(P)×A(P−1)× . . . ×A(1) where A(p) is the pth permuter matrix, along a path within a cyclic sub-block shift register representing a block matrix G, should be the identity.
The sum of the exponents in the set of permuters in the path of 704-1 is
6+41+13+24=84=2×42
The fourth step is the construction of the remainder block rows of the block matrix. In reference to
The content of cyclic sub-block shift register at different clock cycles, i.e. 720-1, 752, 764, and 776 may be used to construct the block matrix HM×MLeft in
The second block row 616 in
According to another aspect of the disclosure, a PCM HM×M of a given rate, may be partitioned into a set of LR×LC block matrices G(M/L
In the example provided above, the block matrices were block row matrices.
According to one aspect of the disclosure, some of the non-zero entries may be masked, i.e. set to zero, in order to ensure that the PCM is full rank. As an example, the PCM
is not full rank. The block matrix HM×MRight is masked as shown in
As an example, the original non-zero entries 574 and 576 in sub-block 552 in
A dataword dM×1 may be encoded into a codeword cN×1 such that HM×N(1/2)cN×1=0M×1. The encoding may be done in a systematic way, i.e. the codeword cN×1 is partitioned into two vectors, the dataword dM×1 and a parityword pM×1 as follows
Expanding the constraint HM×N(1/2)cN×1=0M×1, we obtain
HM×MLeftdM×1=HM×MRightpM×1
The first multiplication uM×1=HM×MLeftdM×1 is facilitated using the cyclic sub-block shift register structure of HM×MLeft, where the multiplication is achieved in 4 clock cycles corresponding to the four shift register updates 720-1 in
pM×1=(HM×MRight)−1uM×1
The inverse of HM×MRight can be easily computed in terms of matrix J. The above multiplication can use the same shift register using well known techniques.
According to one aspect of the disclosure, a higher coding rate PCM may be obtained from a lower coding rate PCM by combining multiple permuted rows of the lower coding rate PCM into a single row of the higher coding rate PCM. In a preferred embodiment of the disclosure, the multiple rows to be combined into a single row are chosen from the same block.
As an example, different rows of PCM
of rate ½, are combined in
of rate ⅝. In
In another aspect of the present disclosure, a higher coding rate PCM may be obtained from one or multiple high rate block matrices, wherein each high rate block matrix is obtained by combining multiple permuted rows of the lower coding rate block matrix of a low rate PCM into a single row of the higher coding rate PCM. Furthermore, the rows of the high rate block matrices may be shuffled, cyclic shifted by a multiple of sub-blocks and masked.
is obtained from the constituent block matrices. Therefore, it should be obvious that a multitude of coding rates may be obtained by forming different higher rate blocks matrices form a lower rate block matrices using combining, permutations, shuffling, sub-block cyclic shifting, masking and deleting rows.
As another example, if we delete the last row of PCM H(M×3/4)×M(3/4) constructed above, we obtain a PCM with 3 rows and 16 columns. The resulting PCM has a coding rate of 13/16. It can be easily shown that all rates Z/16 can be generated from the original matrix with Z=8, 9, . . . , 15.
According to one aspect of the disclosure, the variable nodes in VNPEs in the decoder may be stored in cyclic sub-blocks shift register wherein the cyclic sub-block shift register comprise fixed permuters connecting different memory banks to each other. An example decoder for rate ½ for the constructed PCM in the above example,
is shown in
The outputs of the lower memory banks of 1190, i.e. memory banks 1118, 1120, 1122, and 1124 are connected to a second bank of S PNPEs. Similarly, the outputs of the lower memory banks of 1192, i.e. memory banks 1158, 1160, 1162, and 1164 are connected to the same second bank of S PNPEs. Each of the S PNPEs in the second bank 1184 has 8 inputs and produces 8 parity messages such as those shown in 1188-1 to 1188-8.
When an entry in the PCM is masked, the masked entry may be replaced by a high positive number when inputted to a PNPE in order not to affect the functionality of the PNPE. As an example, using the min-sum algorithm, a PNPE computes the signs, the first minimum and second minimum from which the parity messages are computed. If an entry is masked and substituted with a high positive number, the overall parity (sign) remains unaffected as well as the first minimum and second minimum and therefore the parity messages.
The produced parity messages 1186-1 to 1186-8 and 1188-1 to 1188-8 may be pipelined and are fed back through a set of fixed permuters to be added (not shown) to the content of the memory banks within the cyclic sub-blocks shift register 1190 and 1192. This is best illustrated in
At each clock cycle, sub-block row comprising two element rows of the example PCM HM×N(1/2) are processed, and that is the reason why there are two PNPE banks in
The decoder in
Therefore, as shown above, according to one aspect of the disclosure, an LDPC decoder using cyclic sub-blocks of shift register is used with a set of fixed permuters to update the variable and parity nodes and provide an estimate of the data.
The example decoder in
At 1444, the baseband modulated data stream is demodulated using a multi-rate LDPC decoder comprising at least one cyclic sub-block shift registers for the variable nodes and a set of fixed permuters as the one illustrated in
The various operations of methods described above may be performed by any suitable means capable of performing the corresponding functions. The means may include various hardware and/or software component(s) and/or module(s), including, but not limited to a circuit, an application specific integrated circuit (ASIC), or processor. Generally, where there are operations illustrated in Figures, those operations may have corresponding counterpart means-plus-function components with similar numbering. For example, blocks 1402-1408, and 1442-1446, illustrated in
As used herein, the term “determining” encompasses a wide variety of actions. For example, “determining” may include calculating, computing, processing, deriving, investigating, looking up (e.g., looking up in a table, a database or another data structure), ascertaining and the like. Also, “determining” may include receiving (e.g., receiving information), accessing (e.g., accessing data in a memory) and the like. Also, “determining” may include resolving, selecting, choosing, establishing and the like.
The various operations of methods described above may be performed by any suitable means capable of performing the operations, such as various hardware and/or software component(s), circuits, and/or module(s). Generally, any operations illustrated in the Figures may be performed by corresponding functional means capable of performing the operations.
The various illustrative logical blocks, modules and circuits described in connection with the present disclosure may be implemented or performed with a general purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array signal (FPGA) or other programmable logic device (PLD), discrete gate or transistor logic, discrete hardware components or any combination thereof designed to perform the functions described herein. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any commercially available processor, controller, microcontroller or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
The steps of a method or algorithm described in connection with the present disclosure may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in any form of storage medium that is known in the art. Some examples of storage media that may be used include random access memory (RAM), read only memory (ROM), flash memory, EPROM memory, EEPROM memory, registers, a hard disk, a removable disk, a CD-ROM and so forth. A software module may comprise a single instruction, or many instructions, and may be distributed over several different code segments, among different programs, and across multiple storage media. A storage medium may be coupled to a processor such that the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor.
The methods disclosed herein comprise one or more steps or actions for achieving the described method. The method steps and/or actions may be interchanged with one another without departing from the scope of the claims. In other words, unless a specific order of steps or actions is specified, the order and/or use of specific steps and/or actions may be modified without departing from the scope of the claims.
The functions described may be implemented in hardware, software, firmware or any combination thereof If implemented in software, the functions may be stored as one or more instructions on a computer-readable medium. A storage media may be any available media that can be accessed by a computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer. Disk and disc, as used herein, include compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk, and Blu-ray® disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers.
Thus, certain aspects may comprise a computer program product for performing the operations presented herein. For example, such a computer program product may comprise a computer readable medium having instructions stored (and/or encoded) thereon, the instructions being executable by one or more processors to perform the operations described herein. For certain aspects, the computer program product may include packaging material.
Software or instructions may also be transmitted over a transmission medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of transmission medium.
Further, it should be appreciated that modules and/or other appropriate means for performing the methods and techniques described herein can be downloaded and/or otherwise obtained by a user terminal and/or base station as applicable. For example, such a device can be coupled to a server to facilitate the transfer of means for performing the methods described herein. Alternatively, various methods described herein can be provided via storage means (e.g., RAM, ROM, a physical storage medium such as a compact disc (CD) or floppy disk, etc.), such that a user terminal and/or base station can obtain the various methods upon coupling or providing the storage means to the device. Moreover, any other suitable technique for providing the methods and techniques described herein to a device can be utilized.
It is to be understood that the claims are not limited to the precise configuration and components illustrated above. Various modifications, changes and variations may be made in the arrangement, operation and details of the methods and apparatus described above without departing from the scope of the claims.
The techniques provided herein may be utilized in a variety of applications. For certain aspects, the techniques presented herein may be incorporated in a base station, a mobile handset, a personal digital assistant (PDA) or other type of wireless device that operate in UWB part of spectrum with processing logic and elements to perform the techniques provided herein.
Number | Name | Date | Kind |
---|---|---|---|
20060291571 | Divsalar | Dec 2006 | A1 |
20070157061 | Lee | Jul 2007 | A1 |
20070162815 | El-Khamy | Jul 2007 | A1 |
20080028272 | Richardson | Jan 2008 | A1 |
20080028274 | Lin | Jan 2008 | A1 |
20080072122 | Nimbalker | Mar 2008 | A1 |
20080155385 | Jeong | Jun 2008 | A1 |
20080189589 | Park | Aug 2008 | A1 |
20080263425 | Lakkis | Oct 2008 | A1 |
20090106625 | Jun | Apr 2009 | A1 |
20090113276 | Radosavljevic | Apr 2009 | A1 |
20090125780 | Taylor et al. | May 2009 | A1 |
20100070818 | Ulriksson | Mar 2010 | A1 |
20100122142 | Sun | May 2010 | A1 |
20110307755 | Livshitz | Dec 2011 | A1 |
Number | Date | Country | |
---|---|---|---|
20100287438 A1 | Nov 2010 | US |