This invention relates to communication such as wireless communication and in particular coding in such communication.
In wireless systems messages, i.e. a sequence of symbols drawn from a signaling set, are transmitted in coded form. That is, the message is reduced to binary symbols in a series of codewords. The codewords are grouped into frames that are ultimately transmitted. The process of converting a message into a frame or series of frames for transmission is generally denominated coding.
Since any wireless network is subject to noise and other conditions (e.g. interference) influencing the transmitted signal, frames are often not received, are sufficiently distorted so that the encoded message cannot be decoded, or are decoded incorrectly. A failure to decode is recognized by detection algorithms, such as special parity check algorithms, that operate on a series of parity symbols derived from and appended to the transmitted codewords. Some errors in a received frame are correctable using error correction algorithms. To address the remaining uncorrected errors, expedients such as automatic repeat request (ARQ) schemes are employed. In these approaches, if a frame is not received or an unresolvable error is detected at the receiver, a message (generally denominated a negack) is sent to the transmitter requesting retransmission.
Many coding techniques have been developed for transforming a message into a frame or series of frames that has a further improved probability of reception and correct decoding. One such approach has been denominated low-density parity-check (LDPC) codes, a class of linear block codes. In LDPC, the binary symbols representing a message are each associated with a variable node. Thus as shown in the illustrative LDPC code of
The LDPC coding scheme also requires that only a defined codeword be transmitted. A word is a codeword only if it has an appropriate length (8 bits in the example) and satisfies specific parity checks, i.e., the sum modulo 2 at each check node of the associated variable nodes is 0 (or some other fixed value). (Thus for an eight variable node, 4 check node scheme there are at least $2^8/2^4 = 16$ codewords.) In the illustration of
It is, however, desirable to transmit the fewest number of bits that yield upon reception an acceptable error rate. For LDPC codes a puncturing approach is often employed to increase transmitted bit rate while maintaining an acceptable error rate. In puncturing only a portion of a LDPC codeword sequence is transmitted at each transmission interval. Thus
The use of LDPC codes with puncturing has proven beneficial for the transmission of wireless messages. Nevertheless, not all LDPC codes perform equally well. (The typical metric of performance is throughput as measured by the average number of user data bits accepted at the receiver in the time required for transmission of a single bit.) Generally irregular LDPC codes after optimization perform better than regular codes. However, among irregular codes performance varies greatly. Additionally, the computational complexity involved in decoding varies substantially among such codes. As a further complicating factor, LDPC codes generally do not perform well on transmission channels having substantial interference and fail when the capacity of the channel is smaller than the rate of the code.
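The codeword test and the puncturing step described above can be illustrated with a minimal sketch. The 4×8 parity-check matrix `H` below is a hypothetical example chosen for illustration; it is not the code shown in the figures, and the helper names are assumptions.

```python
from itertools import product

# Hypothetical 4x8 parity-check matrix (an assumption, not the patent's code).
# Each row is a check node; each column is a variable node.
H = [
    [1, 1, 1, 0, 1, 0, 0, 0],
    [1, 1, 0, 1, 0, 1, 0, 0],
    [1, 0, 1, 1, 0, 0, 1, 0],
    [0, 1, 1, 1, 0, 0, 0, 1],
]

def is_codeword(word, H=H):
    """True if the modulo-2 sum at every check node is 0."""
    return all(sum(h * b for h, b in zip(row, word)) % 2 == 0 for row in H)

def all_codewords(H=H):
    """Enumerate every 8-bit word and keep those passing all parity checks."""
    n = len(H[0])
    return [w for w in product((0, 1), repeat=n) if is_codeword(w, H)]

def puncture(codeword, positions):
    """Keep only the bits sent in one transmission interval."""
    return [codeword[i] for i in positions]

codewords = all_codewords()
# Send five of the eight bits in a first interval; the rest are withheld.
first_interval = puncture(codewords[1], [0, 1, 2, 3, 4])
```

Because the four checks in this particular matrix are linearly independent, the enumeration finds exactly 2^8/2^4 codewords; a matrix with dependent checks would yield more, which is why the text says "at least".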
Various other codes have been developed in the hope of improving throughput. One such robust class designed for, and most often applied to, optical communication systems is Raptor codes. Such codes have been thought to have the potential for performing better than LDPC codes when the communication channel is noisy. In a Raptor code an LDPC or turbo (as described in Raptor Codes, Amin Shokrollahi Digital Fountain Technical Report DF-2003-06-001) codeword is further encoded. A probability, Ωd is assigned to each integer, d, where d corresponds to an integer from 1 to the number of bits in the LDPC codeword frame. A series of numbers, d, is chosen before each codeword frame transmission. The choosing algorithm is designed such that the likelihood of choosing a specific number is commensurate with its assigned probability. The number d, chosen is employed as the number of distinct bits of the LDPC codeword sequence chosen at random that are summed with subsequent transmission of such sums in a stream. Since the transmitter and receiver are synchronously running the same version of a random number generator for d, the receiver knows the sequence of d's chosen at the transmitter and which corresponding d bits of the LDPC code bits are chosen. With the knowledge of the chosen d, decoding is attempted upon reception.
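The summing step can be sketched as follows, assuming the transmitter and receiver share a seed for the same pseudo-random generator. The function names, the seed value and the short degree distribution `omega` are hypothetical; degrees beyond the length of `omega` are simply given probability zero here.

```python
import random

def _draw_positions(rng, n_bits, omega):
    """Draw a degree d according to omega, then d distinct bit positions."""
    degrees = list(range(1, len(omega) + 1))
    d = rng.choices(degrees, weights=omega, k=1)[0]
    return rng.sample(range(n_bits), d)

def raptor_symbols(ldpc_bits, omega, count, seed):
    """Transmitter: return `count` pairs (positions, modulo-2 sum of those bits)."""
    rng = random.Random(seed)
    out = []
    for _ in range(count):
        positions = _draw_positions(rng, len(ldpc_bits), omega)
        value = 0
        for i in positions:
            value ^= ldpc_bits[i]
        out.append((positions, value))
    return out

def receiver_positions(n_bits, omega, count, seed):
    """Receiver: regenerate the same position choices from the shared seed."""
    rng = random.Random(seed)
    return [_draw_positions(rng, n_bits, omega) for _ in range(count)]

ldpc_bits = [1, 0, 1, 1, 0, 0, 1, 0]      # illustrative LDPC codeword bits
omega = [0.1, 0.5, 0.3, 0.1]              # illustrative probabilities for d = 1..4
sent = raptor_symbols(ldpc_bits, omega, count=20, seed=7)
known_at_receiver = receiver_positions(len(ldpc_bits), omega, count=20, seed=7)
```

The receiver never needs the positions to be transmitted: running the same generator with the same seed reproduces them, which is the synchronization property the paragraph relies on.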
In a Raptor scheme, the likelihood of decoding depends on the parameters of 1) signal transmission intensity, and 2) the transmitted number of bits (representing sums) per underlying LDPC frame. If decoding is not achieved for the parameter chosen, a new series of sums are formed for the non-decoded bits by choosing a new series of d's. The process of choosing a series of d's as well as a corresponding number of bits and sending the corresponding sums is continued until reception and decoding is accomplished.
Thus in the Raptor approach puncturing is not employed. Instead, an expedient involving assigned probabilities is used. The efficacy of the chosen Raptor code scheme depends on a variety of variables. For example, the chosen set of Ωds, the underlying LDPC code, transmission intensity and the frame size all affect the effectiveness of the code.
It is possible to improve transmission throughput by dynamically adjusting transmission parameters in response to feedback from the receiver of information providing a measure of statistical channel quality. For example, in a punctured LDPC transmission in each transmission interval the power of the transmission and/or the fraction of codeword bits transmitted is dynamically adjusted in response to a receiver feedback signal indicative of the SNR at the receiver and/or the bit erasure rate.
Similarly, in another embodiment of the invention, a Raptor code frame is transmitted in intervals. In a particular interval 1) the power employed in, and 2) the fraction of the frame bits transmitted is dynamically adjusted based on a feedback signal that is indicative of statistical signal quality. Additionally, the throughput of a transmitted Raptor code is improved, especially in the presence of dynamic adjustment responsive to feedback, by a judicious choice of Ωd's.
Thus in accordance with another embodiment of the invention, Raptor codes are adapted for efficient encoding and transmission, i.e. throughput within 30% of the Shannon limit by an appropriate choice of Ωd's. The Shannon limit is defined by:
where ν is the signal-to-noise ratio, so that the Shannon capacity (in bits) for BPSK (Binary Phase Shift Keying) using the alphabet −1, 1 is:
This adaptation of Ωd's depends on maximizing through linear programming over X and over all choices of Ω (consistent with the constraints) the minimum value of K satisfying the inequality:
together with the constraints that Ωd is greater than or equal to 0, Ω1 is much less than 1, and the sum of all Ωd is equal to 1. (In formula (1) d, as defined previously is the number of randomly chosen variable node bits to be summed, D is the number of bits in the codeword, and X is a number in the interval from p to 1 inclusive, where p is the fraction of bit erasures that on average is decodable from the underlying LDPC code with the algorithm, such as belief propagation decoding, being employed in the communication system.)
As discussed, irrespective of the specific Ωd choice, use of a Raptor code in a hybrid ARQ system is used with particular advantage through dynamically responding to a feedback signal with retransmission at a responsive signal intensity with a responsive number of bits. Such intensity and transmitted bit number are determined in an advantageous embodiment by the condition:
where Rj is the number of summed bits sent in the jth interval divided by the number of bits in the underlying LDPC code frame, γ(j) is the Bhattacharyya noise on the channel during the jth transmission attempt, c0 is a parameter defined infra, and P(θ) is the probability of obtaining a 1 for transmission in the Raptor code for a codeword of fractional weight θ. Most of these variables are determinable before operation. However, the variable γ(j) is derivable from the channel quality information obtained from the feedback signal. The values for intensity and transmitted bit number used in actual code transmission should typically be within 25 percent of values derivable from formula (2).
It is also possible to improve the efficacy to at least 30% of the Shannon limit of a punctured LDPC encoding scheme in wireless communications by a dynamic choice of 1) transmitted power and 2) fraction of frame bits transmitted in each interval of the puncturing scheme based on a feedback signal. In one advantageous embodiment, the formula:
yields suitable values where c0 is a constant determined by the choice of LDPC scheme, γ(j) is the Bhattacharyya noise after the jth transmission and αj is the fraction of bits sent in the jth interval of the puncturing transmission. Since γ(j) is dependent on the transmission power that is derived from the feedback signal formula (3) yields a boundary condition for both the power employed and the fraction of bits transmitted during a specific puncturing interval. Generally deviations of 25 percent from the values derivable from formula (3) still yield advantageous results.
Although by use of the invention Raptor codes are adapted with particular advantage to wireless communication networks, the channel condition of such network determines if use of punctured LDPC is, nevertheless, more advantageous. Generally use of a punctured LDPC approach is preferable for channels having relatively high SNR's. The improved Raptor codes of the invention, significantly, operate efficiently at SNR levels that preclude punctured LDPC use.
The invention involves methods associated with punctured LDPC or Raptor code transmission. In punctured LDPC during a series of transmission intervals a portion of a codeword is sent. Thus, as shown in
A measure of channel quality is a quantity that is relatable to the probability that a transmission is received and the received transmission decoded. Exemplary of a measure of channel quality is the signal-to-noise ratio measured at the receiver. Alternatively, another measure is the fraction of codeword bits that are not discerned by the receiver after appropriate processing.
If a negack and/or a feedback signal indicative of channel quality is received, a transmission of a further fraction of the codeword bits is sent during a second interval. The transmission power and/or the fraction of codeword bits transmitted are adjusted based on the feedback signal and the second interval transmission at 46 is sent. The transmission is answered with either 1) an ack, or 2) a negack and/or a feedback signal. The sequence is continued until the codeword is decoded at the receiver or a decision to continue on to the next codeword is made.
A similar inventive approach is taken for transmitting Raptor encoded information. A frame of information (generally 100 to 10,000 bits) is encoded at 51. In a first transmission interval, 52, a fraction of the frame bits are sent. In an analogous manner to the LDPC approach, the receiver sends back at 53 either 1) an ack, or 2) a negack and/or signal feedback that is a measure of the channel quality. If an ack is received, the process at 54 is begun on the next frame. If a negack with a feedback signal, 55, is received, a transmission in the next interval is prepared. The power and/or fraction of frame bits to be transmitted is chosen at 56 based on the feedback signal. The transmission intervals are continued with feedback until decoding of the frame is achieved or it is decided to continue to the next frame.
Thus in either Raptor or punctured LDPC, the power and/or bit fraction is dynamically adjusted at least during some intervals based on feedback that is a measure of channel quality. By such dynamic adjustment, it is possible to achieve a throughput that is within 35%, preferably 20%, most preferably within 10% of the Shannon limit. Thus, for example, the information transmission rate, or power level, is increased in response to a feedback signal indicating an increase in SNR or decrease in the symbol erasure rate. Similarly the information transmission rate or power level is decreased in response to a feedback signal indicating a decrease in SNR or increase in the symbol erasure rate.
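The interval-by-interval exchange common to both schemes can be sketched as a simple loop. Everything named here is hypothetical: `receiver` stands in for the channel plus the ack/negack and feedback path, and `choose_params` stands in for whatever rule selects the bit fraction and power from the feedback. The toy receiver is only a placeholder and does not model real decoding.

```python
def harq_send(receiver, choose_params, max_intervals=8):
    """Send one frame over successive intervals until an ack arrives.

    receiver(fraction, power) -> (acked, feedback)    hypothetical callback
    choose_params(j, feedback) -> (fraction, power)   hypothetical policy
    Returns the number of intervals used, or None if the frame is abandoned.
    """
    feedback = None
    for j in range(1, max_intervals + 1):
        fraction, power = choose_params(j, feedback)
        acked, feedback = receiver(fraction, power)
        if acked:
            return j
    return None

def make_toy_receiver(needed):
    """Placeholder receiver: acks once accumulated fraction*power reaches `needed`."""
    state = {"total": 0.0}
    def receiver(fraction, power):
        state["total"] += fraction * power
        return state["total"] >= needed, {"snr_db": 3.0}
    return receiver

def fixed_policy(j, feedback):
    """Placeholder policy that ignores feedback: a quarter of the bits at unit power."""
    return 0.25, 1.0

intervals_used = harq_send(make_toy_receiver(0.6), fixed_policy)
abandoned = harq_send(make_toy_receiver(10.0), fixed_policy, max_intervals=4)
```

A real policy would replace `fixed_policy` with one that reads the feedback measure and adjusts the fraction and power accordingly.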
In a specific embodiment relating to Raptor codes, throughput is enhanced by using one or both of two expedients. The first expedient, whether or not dynamic adjustment is employed, involves a suitable choice of Ωd's. In particular, these Ωd's are derivable by using linear programming to maximize over X and over all choices of Ω, the value of K constrained as follows:
(In the above formula d is the number of randomly chosen variable nodes to be summed, D is the number of bits in the codeword, and X is a number in the interval from p to 1 inclusive where p is the maximum fraction on average of bit erasures of the underlying LDPC code decodable with the algorithm such as belief propagation decoding, being employed in the communication system.) Additionally, this formula has the further constraints that the sum of the Ωd's equal 1 and all Ωd's are greater than or equal to 0. It is possible to use conventional linear programming algorithms such as the simplex algorithm or Karmarkar algorithm to achieve such solution.
Generally, a solution is obtainable using about 100 or more inequality equations. Each such inequality is obtainable by substituting a different value of X into formula (1). Thus, for example, in one embodiment values are chosen to divide the interval into 100 equal parts. Nevertheless, values of X need not necessarily be chosen by an equal partition of the interval. Although about 100 or more inequalities is typically adequate to derive an acceptable solution, use of significantly more inequalities, typically up to 1000, is not precluded. Although use of more than 1000 inequalities is acceptable, the obtained results generally do not justify the additional computation time.
In implementing the inventive Raptor codes, it is not necessary to use precisely the values derived. Improvement over conventional encoding systems is still obtainable if the Ωd's vary from values derivable from formula (1). Typically, it is possible to modify Ωd's corresponding to derivable values of 0.05 or less by plus or minus 30 percent from the derivable values. Similarly, for Ωd's greater than 0.05 variations up to plus or minus 10 percent are acceptable.
In any Raptor code, as previously discussed, bits are randomly chosen from the underlying LDPC encoded information and as discussed such sums are transmitted. The signal intensity used for such transmission and the number of sums transmitted for each underlying LDPC frame encoded to a corresponding Raptor frame both affect code efficiency. In a second expedient for improving a Raptor code, dynamic adjustment is employed. In an advantageous embodiment, choice of intensity and fraction of summed bits transmitted are guided by formula (2):
In this formula, Rj is the number of summed bits sent in the jth interval divided by the number of bits in the underlying LDPC code frame, γ(j) is the Bhattacharyya noise on the channel during the jth transmission attempt, c0 (to be defined infra) is a parameter dependent on the code weight spectrum and ΠΩ is defined by:
where p(θ) is the probability of obtaining a 1 for transmission in the Raptor code for a codeword of fractional weight, θ, and is obtained as:
with d defined as before, and $\binom{d}{j}$ is the standard binomial coefficient d-choose-j.
Significantly, most parameters are determinable without feedback. However, the γ(j) is a quantity that is dynamically determinable from the feedback measure of channel quality. For example, γ(j) is dependent on the SNR through the relation $\gamma = e^{-P/(2\sigma^2)}$
In practice, as previously discussed, the Raptor code frame is transmitted during the first interval. If upon reception decoding is not possible a second transmission is made using the values as discussed above with the transmission attempt, j, equal to 2. Similarly, if the second transmission is not decodable a third transmission attempt is made with the intensity of transmission and the number of Raptor bits transmitted using the above technique with j equal to 3. The procedure continues until decoding is accomplished or further transmission is not desirable.
In accordance with the applicant's invention it is not only possible to improve Raptor codes but also possible to improve the performance of LDPC codes using a dynamic feedback puncturing technique. In this technique the transmitted power and/or the fraction of LDPC frame bits transmitted in each puncturing interval is dynamically controlled. Advantageous values of intensity and fraction of LDPC frame bits are derivable from the feedback information using the formula:
where αj is the fraction of bits sent in the jth interval of the puncturing transmission, c0 is the same parameter as employed in formula (2) (to be defined infra), and γ(j) is the Bhattacharyya noise after the jth transmission. Thus formula (3) yields a boundary condition for power employed and fraction of bits transmitted during the jth interval of puncturing based on feedback. In particular γ(j) is derivable from a measure of statistical channel quality. For example, the transmission power is related to SNR through the relation $\gamma = e^{-P/(2\sigma^2)}$
As discussed the values used depend on the acceptable range of parameters derived from formula (3) and goals of the transmission system. However, the operation parameters employed need not be precisely in the range of derivable values for αj and γ(j) determined from formula (3). It is generally acceptable to deviate from the values derivable from formula (3) by 10 percent. Typically such deviation generally does not unacceptably degrade the improvement in system efficacy that is achievable.
In the calculations related to formula (2) and formula (3) the parameter c0 is employed. (In the context of this invention, c0 is denominated the ensemble spectrum parameter.) This parameter is derived from the code weight spectrum of the LDPC code employed in the puncturing process relative to formula (3) and the underlying LDPC or Turbo code for the Raptor code relative to formula (2). A typical ensemble spectrum is shown in
By use of the subject invention both punctured LDPC codes and Raptor codes are improved. Nevertheless, punctured LDPC codes generally become ineffective as channel noise increases. Through the use of the subject inventive Raptor code transmission acceptable operation is achievable even for transmission channels with excessive noise for adequate operation of a punctured LDPC code transmission. Thus a system is possible that uses punctured LDPC code at lower channel noise levels to gain the advantage of higher throughput. However, when noise levels increase so that this advantage is substantially diminished use of the inventive Raptor code transmission is advantageously implemented.
The following addendum is hereby made part of this specification and is included to provide details concerning the derivation of formulae used herein.
Throughout the addendum we suppose that the channel is a Binary Input Symmetric Channel (BISC). Input is taken from one of two discrete symbols and the channel is additive noise (discrete or continuous). Furthermore we assume that the channel is known only at the receiver and that the goal is to maximize the throughput. Consequently, we organize an IR-HARQ protocol as follows: Initially the transmitter sends only as many codeword symbols as necessary to ensure a high probability of successful ML decoding over a high SNR channel. If the decoding fails, the receiver sends a NACK and the channel information to the transmitter. Taking into account the channel information of the past transmission(s), the transmitter sends only as many additional codeword symbols as necessary to insure a high probability of successful ML decoding assuming a high SNR channel during the current transmission.
The ability of Raptor codes to produce, for a given set of k information symbols, as many codeword symbols as needed for their successful decoding is what makes these codes of interest for use in HARQ schemes. In this addendum, we first study the spectra of Raptor codes and the ML decoding error rates for HARQ schemes based on Raptor codes. As in the case of LDPC codes, we assume that the channel is known only at the receiver and the goal is to maximize the throughput. With that in mind, we organize an IR-HARQ scheme based on Raptor codes in a very similar fashion as in the HARQ scheme with LDPC codes. Taking into account the channel information of the past transmission(s), the transmitter generates and then sends only as many codeword symbols as necessary to insure a high probability of successful ML decoding assuming a high SNR channel during the current transmission. Consequently, the central question is to determine the minimum number of symbols which should be generated at each transmission and the minimum power at which they should be transmitted to ensure a low error rate. This question is answered in this paper.
In Section II we begin with a spectrum analysis of the LDPC code ensembles which we use in the HARQ schemes. In Section III we analyze ML decoding of LDPC codes for HARQ and describe an IR-HARQ protocol with random transmission assignments for this HARQ scheme. In this section we also provide results for belief-propagation (BP) decoding of the same codes. In Sections IV and V, we focus on Raptor codes: in Section IV we give an ML analysis and propose an IR-HARQ protocol, and in Section V we provide several results on BP decoding of Raptor codes for HARQ. In Section VI we give a comparison between HARQ schemes employing LDPC and Raptor codes.
We begin with several results regarding the spectrum of regular LDPC codes. The results given in this section are used as a foundation for both LDPC and Raptor code ensemble analysis.
We study ensembles of regular binary LDPC codes whose k×n parity check matrices have r as the sum of each row and c as the sum of each column where
The code rate of such codes is R≧1−ζ.
Generally, for a binary linear code C we denote the weight enumerator by Ah (i.e., the number of codewords of weight h in this code is Ah). For a code ensemble [C](n), the average number of codewords of normalized weight θ=h/n is denoted by Āθ[C](n). To analyze the performance of HARQ schemes based on LDPC codes, we are interested in the asymptotic behavior of the left tail of the ensemble spectrum, namely
where 0<θ0 ≦1, and in the quantity known as the ensemble noise threshold [3], which we here define as follows:
We next show how these two quantities can be bounded. Our derivations are based on the results of Litsyn and Shevelev, IEEE Transactions on Information Theory, Vol. 48, 2002 and on certain results from the theory of large deviations. Only the main steps are presented here; details will be given in the Appendix to this addendum.
A. The Left Tail of the Ensemble Spectrum
We first derive upper bounds on the number of code words of small weight in the ensemble. Let ρ be the unique positive root of the following equation:
Note that ρ→0 as θ→0. Then the following holds for sufficiently large n:
Theorem I: There is a constant C independent of θ, and an n0, so that for n>n0,
Proof See Appendix I.
Note that for all sufficiently small θ and c ≧2, we can use the above upper bound to obtain
Using an estimate for ρ (see Appendix I) for sufficiently small θ, we find there is an ε>0 so that the RHS of the above inequality is upper-bounded by
$C\left[\theta^{(c/2-1)} \cdot ((r-1)(1+\varepsilon))^{c/2}\right]^{n\theta}$. (2)
Consequently,
or, equivalently,
where $a = [(r-1)(1+\varepsilon)]^{c/(c-2)}$.
We next investigate the left tail of the spectrum
and show that it converges to 0 as n→∞ for a suitable choice of ε0. Note that we must choose c≧3, and that it suffices to show the result for c=3, since the individual terms decrease monotonically as c is increased for a given (ha)/n. By considering x log x, we observe that the sequence $[(ha)/n]^h$ decreases as h increases provided $(ha)/n < e^{-1}$. Thus
and consequently
B. The Ensemble Noise Threshold
The noise threshold c0[C](n) for a fixed n is
We can also equivalently write
where
aθΔ log
Now, by applying the result of Thm. 1, we obtain
where H(θ)=−(θ log θ+(1−θ) log(1−θ)) is the binary entropy function.
The RHS of (5) has a unique maximum in (0, ½). Therefore, we can obtain an upper bound on c0[C] by differentiation with respect to θ of the RHS of (5). Recall that there is a dependency between ρ and θ expressed by (53). We can find the maximizing θ* numerically. For instance, in the case of r=5, c=3, we find that θ*=0.3189 and c0[C]≦0.6966. We will come back to this bound later in Section III-D.
Recall that we are mainly considering the scenario when the channel is known only at the receiver and the goal is to maximize the throughput. Therefore, in IR-HARQ schemes based on LDPC codes, the idea is to transmit, at each transmission, only as many codeword symbols as necessary to ensure a high probability of successful ML decoding on an ideal channel, taking into account the information about the overhead and the channel state information during the past transmissions. We first analyze the ML performance of IR-HARQ schemes averaged over certain ensembles of LDPC codes and all possible transmission assignments (or puncturing patterns) of the mother code bits. We then test our results on HARQ schemes based on practical finite-length LDPC codes with rate compatible random puncturing.
A. The ML Decoding Analysis for LDPC Codes over Parallel Channels
We first consider a binary input memoryless channel with output alphabet Y and transition probabilities W(y|0) and W(y|1), y ∈ Y. When a codeword $x \in C \subset \{0, 1\}^n$ has been transmitted, the probability that the ML detector finds codeword x′ at Hamming distance h from x more likely can be bounded as follows:
$P_e(x, x') \le \gamma^h$, (6)
where γ is the Bhattacharyya noise parameter defined as
$\gamma = \sum_{y \in Y} \sqrt{W(y|0)\,W(y|1)}$
if Y is discrete and as
$\gamma = \int_{Y} \sqrt{W(y|0)\,W(y|1)}\,dy$
if Y is a measurable subset of R.
Generally, for an (n, k) binary linear code C with the weight enumerator Ah, we have the well known union-Bhattacharyya bound on the ML decoder word error probability
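In its usual form the union-Bhattacharyya bound sums $A_h \gamma^h$ over the nonzero weights h = 1, …, n. A small numerical sketch follows; the (7,4) Hamming code weight enumerator is used only as a familiar illustration and is not one of the codes studied here, and the function name is an assumption.

```python
def union_bhattacharyya_bound(weight_enumerator, gamma):
    """Return sum over h >= 1 of A_h * gamma**h.

    weight_enumerator maps a Hamming weight h to the number A_h of codewords
    of that weight; the all-zero codeword (h = 0) is excluded from the sum.
    """
    return sum(a * gamma ** h for h, a in weight_enumerator.items() if h > 0)

# Weight enumerator of the (7,4) Hamming code: 1 + 7x^3 + 7x^4 + x^7.
HAMMING_7_4 = {0: 1, 3: 7, 4: 7, 7: 1}

bound = union_bhattacharyya_bound(HAMMING_7_4, gamma=0.1)
```

The bound is only informative when it is below 1; for γ close to 1 the sum approaches the number of nonzero codewords.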
Recall that, for a code ensemble [C](n), the average number of codewords of weight h in [C](n) is denoted by Āh[C](n). The bound on the ML decoder word error probability averaged over the ensemble is obtained by averaging the (additive) union bound:
Now, from the results of Section II, we know that there is θ*, such that 0<θ*<1, and
and, for sufficiently large n,
Āh[C](n) ≦ n exp(h c0[C]) for nθ* < h ≦ n. (9)
Therefore, for sufficiently large n, we have
Thereby, when
γ<exp(−c0[C]), (11)
we have
We now assume that the channel varies during the transmission of a single codeword, namely, channel transition probabilities at time i are Wi(b|0) and Wi(b|1), b ∈ Y. When codeword $x \in \{0, 1\}^n$ has been transmitted, the probability that the ML detector finds codeword $x' \in \{0, 1\}^n$ more likely can be bounded as follows:
where we denote
It is easy to see that
Note that when xi=x′i, the corresponding factor $\sum_{b \in Y} \sqrt{W_i(b|x_i)\,W_i(b|x'_i)}$ in the product (12) equals 1 and can be omitted. When xi≠x′i, the corresponding factor $\sum_{b \in Y} \sqrt{W_i(b|x_i)\,W_i(b|x'_i)}$ equals the Bhattacharyya noise parameter γi of the channel at time i:
Therefore, the bound (12) can be written as
Note that when all γi have the same value γ (time-invariant channel case), the above bound reduces to the well known $\gamma^h$ bound (6), where h is the Hamming distance between x and x′.
We now assume that the codewords of the mother code are transmitted in m transmissions, and the decoding is performed after the last transmission has been received. This will help us to later analyze an IR-HARQ protocol with at most m transmissions. Let I={1, . . . , n} denote the set indexing the bit positions in a codeword. For the m transmissions, set I is partitioned in m subsets I(j), for 1≦j≦m. During the j-th transmission, only bits at positions i where i ∈ I(j) are transmitted. We assume that the channel is slowly time-varying, namely that Wi(y|0) and Wi(y|1) remain constant for all bits at positions i taking part in the same transmission. Consequently, the Bhattacharyya noise parameter for transmission j depends only on j:
γi = γ(j) for all i ∈ I(j).
Let hj=dH(x, x′, I(j)) denote the Hamming distance between sequences x and x′ over the index set I(j). The bound (13) can be written as
In the case of only two transmissions, we have
$P_e(x, x') \le \gamma(1)^{d_H(x, x', I(1))}\,\gamma(2)^{h - d_H(x, x', I(1))}$
where h is the Hamming distance between x and x′.
Let Ah
Further direct analysis of this expression seems formidable, even in the case of only two transmissions for which we have
We thus resort to finding the expected performance over all possible transmission assignments where a bit of a mother code is assigned to transmission j with probability αj, αj>0, Σjαj=1. The expected (and asymptotic as n→∞) number of bits assigned to transmission j equals αjn. Such a scheme can actually be implemented as follows:
We are interested in the expected performance of the mother code under this probabilistic model. If each bit of a codeword with Hamming weight h is randomly assigned to transmission j with probability αj, then the probability that the sub-word corresponding to the j-th transmission has weight hj for 1≦j≦m is given by
Therefore, for a given codeword with Hamming weight h, the expected value of Ah
and consequently, the expected value of the union bound (14) is
We define the average Bhattacharyya noise parameter seen by the mother code as
Then, we have
Therefore, when
we have
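The averaging step in this subsection can be checked numerically. The sketch below assumes that the sub-word weights follow a multinomial distribution with parameters αj, and that the averaged Bhattacharyya parameter is the weighted sum Σj αj γ(j); both are reconstructions from the surrounding text rather than quotations of the omitted displays. Under those assumptions the expected value of Πj γ(j)^{hj} for a codeword of weight h collapses to the h-th power of the average.

```python
from itertools import product
from math import factorial, isclose, prod

def multinomial_pmf(counts, alphas):
    """Probability that a weight-h word splits into sub-word weights `counts`."""
    coeff = factorial(sum(counts))
    for c in counts:
        coeff //= factorial(c)
    return coeff * prod(a ** c for a, c in zip(alphas, counts))

def expected_pairwise_bound(h, alphas, gammas):
    """Average of prod_j gamma_j**h_j over all random transmission assignments."""
    total = 0.0
    for counts in product(range(h + 1), repeat=len(alphas)):
        if sum(counts) == h:
            weight = prod(g ** c for g, c in zip(gammas, counts))
            total += multinomial_pmf(counts, alphas) * weight
    return total

alphas = [0.5, 0.3, 0.2]    # illustrative assignment probabilities (sum to 1)
gammas = [0.4, 0.2, 0.1]    # illustrative per-transmission noise parameters
h = 6                       # illustrative codeword weight

gamma_bar = sum(a * g for a, g in zip(alphas, gammas))
averaged = expected_pairwise_bound(h, alphas, gammas)
```

This is the multinomial theorem in disguise, and it is what allows the multi-transmission union bound to be treated like the single-channel bound with γ replaced by the average.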
B. An IR-HARQ Protocol
We consider an IR-HARQ scheme with at most m transmissions where a bit is assigned to transmission j with probability αj. Transmission j takes place if transmission j−1 fails. The rates αj may be predetermined (e.g., specified by a standard) or determined based on current network conditions. In both cases, we are interested in evaluating performance after j transmissions, 1≦j≦m. In the latter case, we are interested in determining the parameters αj to achieve some required performance.
To ensure that the upper bound (8) on the probability of error of the ML decoder approaches 0 on a channel with the Bhattacharyya noise parameter γ, as n→∞, it is sufficient and necessary that the condition (11) holds. Therefore, in HARQ schemes, the mother code is chosen so that this condition is satisfied for the worst probable channel realization.
We now assume that the decoding after transmission j−1 failed. On the average, nαj bits will participate in the j-th transmission, and the remaining (1−α1− . . . −αj)×n bits of the mother code will not be transmitted. We assume that they are transmitted over a really bad channel, i.e., a channel with γ(j+1)=1, and compute
Our goal is to guarantee limn→∞
Condition (19) can be written in a form which clearly shows the tradeoff between the rate of the j-th transmission code and the signal power:
To satisfy the above lower bound on the product of αj and 1−γ(j), the transmitter can either increase the code redundancy αj or increase the signal power which results in a decrease of γ(j) and increase of 1−γ(j). An increase in redundancy results in the lower throughput of the user while an increase in the power results in a higher interference level experienced by other users in the network. Since γ(j) is positive, there is a minimum redundancy requirement:
Note that this condition ensures that the probability of error of the ML decoding is bounded by O(n^(−1/2)) for high SNR. In the case of predetermined αj (as is sometimes the case in practice), the required signal power is specified by
In this protocol, equations (20), (21), and (22) constitute j-th transmission rules after transmission j−1 fails.
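The power side of this tradeoff can be illustrated numerically. The sketch below assumes BPSK signalling over an AWGN channel, for which the Bhattacharyya noise parameter has the standard closed form γ = exp(−Es/N0). That closed form and the function name `bhattacharyya_bpsk_awgn` are assumptions made for illustration; they are not taken from equations (20) to (22).

```python
import math


def bhattacharyya_bpsk_awgn(es_n0_db):
    """Bhattacharyya parameter gamma = exp(-Es/N0) for BPSK over AWGN.

    Assumption: standard closed form for a binary-input AWGN channel.
    """
    es_n0 = 10.0 ** (es_n0_db / 10.0)  # convert dB to a linear ratio
    return math.exp(-es_n0)


if __name__ == "__main__":
    # Raising the signal power lowers gamma and therefore raises 1 - gamma.
    for snr_db in (-6.0, -3.0, 0.0, 3.0):
        gamma = bhattacharyya_bpsk_awgn(snr_db)
        print(f"Es/N0 = {snr_db:5.1f} dB  gamma = {gamma:.4f}  1 - gamma = {1.0 - gamma:.4f}")
```

Under this assumption, a fixed redundancy αj can be traded against signal power by choosing the SNR at which 1−γ(j) reaches the value the transmission rule requires.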
C. Upper Bounds on Throughput for BP Decoding of LDPC Codes in High SNR Region
We now turn to belief-propagation (BP) decoding. We first examine the maximum throughputs that can be supported by randomly punctured LDPC code ensembles decoded using BP. To do so, we study the code performance over an ideal channel, i.e., a very high SNR channel. The following results are described in a variety of papers. In random puncturing, the bits which have not been transmitted can be considered as erasures. For ensembles of very long LDPC codes, successful decoding is obtained provided the rate of erasures is below the iterative decoding threshold p. This latter quantity is defined for single parameter families of channels with parameter θ as follows.
Definition 1: Let Pe∞(l) be the expected fraction of incorrect messages passed in iteration l under the condition that the graph does not contain any cycles of length 2l or less. Then the iterative decoding threshold is defined to be
θ* can be determined as the largest p ∈ [0, 1] for which
x = pλ(1 − ρ(1 − x)) (23)
has no other root than 0 for x ∈ [0, p]. Here λ(x) ≐ Σ_i λ_i x^(i−1) and ρ(x) ≐ Σ_j ρ_j x^(j−1) are the generator polynomials for the degree distributions of the variable and check nodes, respectively. That is, λ_i denotes the fraction of edges connected to symbol nodes of degree i and ρ_j denotes the fraction of edges connected to check nodes of degree j.
For example, consider a regular (3, 5) LDPC code, so that the variable and check node edge distribution polynomials are λ(x) = x^2 and ρ(x) = x^4. The rate of this code is R(3,5)=0.4. By optimizing for x ∈ (0, 1] we find that the largest p for which
x = p(1 − (1 − x)^4)^2 (24)
has no root other than 0 is p(3,5)=0.5175702, which is the iterative decoding threshold in this case. It follows that a punctured version of this code ensemble can attain a maximum throughput of T(3,5)=0.4/(1.0−0.5175702)=0.82914 over an ideal channel. For regular (3, 15) and (3, 30) codes we similarly find that T(3,15)=0.8/(1.0−0.167518)=0.96098 and T(3,30)=0.9/(1.0−0.082835)=0.98128472. Thus we may expect high rate LDPC mother codes to provide high throughputs when the channel SNR is high, but we see that very high throughputs are not achievable when the LDPC mother code rate is low. This agrees with the results presented in Li and Narayanan, Int. Conf. on Comms., Internet and Information Technology (CIIT), November 2002, and is confirmed by our simulation results presented next.
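The thresholds and throughput limits quoted above can be reproduced with a short numerical sketch. It uses the fact that the largest p for which the fixed-point equation has no nonzero root equals the minimum of x/λ(1−ρ(1−x)) over x ∈ (0, 1]. The grid search and the function names are illustrative choices, not the procedure used to obtain the quoted figures.

```python
def bec_threshold(dv, dc, grid=100_000):
    """BEC iterative decoding threshold of a regular (dv, dc) LDPC ensemble.

    Minimises x / (1 - (1 - x)**(dc - 1))**(dv - 1) over a grid on (0, 1],
    i.e. finds the largest p for which x = p * lambda(1 - rho(1 - x))
    has no root other than 0.
    """
    best = float("inf")
    for i in range(1, grid + 1):
        x = i / grid
        best = min(best, x / (1.0 - (1.0 - x) ** (dc - 1)) ** (dv - 1))
    return best


def punctured_throughput(dv, dc):
    """Maximum throughput R / (1 - p) of the punctured ensemble on an ideal channel."""
    rate = 1.0 - dv / dc
    return rate / (1.0 - bec_threshold(dv, dc))


if __name__ == "__main__":
    for dv, dc in ((3, 5), (3, 15), (3, 30)):
        print((dv, dc), round(bec_threshold(dv, dc), 6), round(punctured_throughput(dv, dc), 5))
```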
D. LDPC Code Examples
We consider the IR-HARQ schemes on an additive white Gaussian noise channel (AWGN) with Binary Phase Shift Keying signalling. In
Rp=1, 0.975, 0.95, 0.925, . . . , R . (25)
In other words, after sending the first k=Rn bits, in the subsequent transmission (if it is necessary, i.e., if a codeword is not achieved after 50 decoding iterations) we send additional (1−0.975)n randomly selected parity bits, and decoding is attempted again. This procedure is repeated until an acknowledgement (ACK) is received or until all n symbols are sent.
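The incremental schedule can be sketched as follows. This is a minimal illustration, not the simulation code behind the figures: `try_decode` is a hypothetical callback standing in for 50 iterations of BP decoding, and throughput is taken to be k divided by the number of bits sent when decoding succeeds, which is an assumption about how the plotted throughput is defined.

```python
def cumulative_bits(n, k, step=0.025):
    """Cumulative number of transmitted bits after each transmission."""
    increment = max(1, round(step * n))    # extra parity bits per retransmission
    counts = list(range(k, n, increment))  # k, k + increment, ...
    counts.append(n)                       # the last transmission completes the codeword
    return counts


def run_ir_harq(n, k, try_decode, step=0.025):
    """Return k / bits_sent at the first successful decoding, or 0.0 on failure.

    try_decode(bits_sent) is a hypothetical stand-in for the BP decoder.
    """
    for bits_sent in cumulative_bits(n, k, step):
        if try_decode(bits_sent):          # ACK received: stop sending
            return k / bits_sent
    return 0.0                             # all n symbols sent without an ACK


if __name__ == "__main__":
    # Toy decoder that succeeds once at least 5000 bits have arrived.
    print(run_ir_harq(n=10000, k=4000, try_decode=lambda sent: sent >= 5000))
```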
First, in
For comparison, in FIGS. 6 and 7 we also plot the Binary Phase Shift Keying capacity of the AWGN channel and the performance of one optimized irregular LDPC code with the code rate R=0.5. The irregular mother code was designed based on the optimized edge degree polynomials given by λ(x) = Σ_i λ_i x^(i−1) = 0.21991x + 0.23328x^2 + 0.02058x^3 + 0.08543x^5 + 0.06540x^6 + 0.04767x^7 + 0.01912x^8 + 0.08064x^18 + 0.22798x^19 and ρ(x) = Σ_i ρ_i x^(i−1) = 0.64854x^7 + 0.34747x^8 + 0.00399x^9. Its block length was set to n=10000. From
An upper bound to the noise threshold c0[C] for the regular-(3,5) ensemble was computed in Section II to be 0.6966. Since the minimal redundancy requirement for the first transmission is α1 > 1 − exp(−c0[C]), we can compute a conservative estimate of this quantity based on the upper bound on c0[C]. We obtain α1=0.514 and the corresponding rate rp=r/α1<0.7973. Although this result gives only the necessary condition on the minimal α1, our simulations show that it predicts the saturation point very well. Indeed, looking at
Another way to estimate the saturation point is to use directly the results of Section III-C. Recall that the upper bound on the throughput for the regular-(3,5) code was evaluated to be T(3,5)=0.82914. Naturally, this bound is not tight for the finite-length codes presented in
In
Recall that to obtain a Raptor codeword, the information sequence of k symbols is pre-coded by a high rate block code. Here LDPC codes will be used for pre-coding. The Raptor codeword symbols are then obtained based on the n resulting symbols by means of a probability distribution Ω on the numbers 1, . . . , n. The probability generating function of this distribution is
Each codeword symbol is obtained independently, by first sampling this distribution to obtain a number d, and then adding the values of d randomly chosen information symbols. Note that Ω represents the degree distribution of the codeword symbols, and thus can be used to determine the degree distribution of the corresponding bipartite variable/check graph used for BP decoding. In an IR-HARQ scheme based on Raptor codes, the idea is to generate and transmit, at each transmission, only as many codeword symbols as necessary to ensure a high probability of successful ML decoding on an ideal channel, taking into account the information about the overhead and the channel state information during the past transmissions. Thus, we first analyze the ML performance of HARQ schemes based on Raptor codes.
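The symbol generation step described above can be sketched as follows. This is a minimal illustration under stated assumptions: symbols are bits, "adding" is addition modulo 2 (XOR), the d positions are drawn without repetition from the pre-coded word, and the function names are hypothetical. The distribution `OMEGA` is the LT degree distribution used in the Raptor code examples of this description.

```python
import random

# LT degree distribution: degree d -> probability Omega_d.
OMEGA = {1: 0.05, 2: 0.5, 3: 0.05, 4: 0.25, 6: 0.05, 8: 0.1}


def lt_symbol(precoded, omega, rng):
    """Generate one codeword symbol from the pre-coded bits.

    Requires every degree in omega to be at most len(precoded).
    """
    degrees = list(omega)
    weights = [omega[d] for d in degrees]
    d = rng.choices(degrees, weights=weights, k=1)[0]  # sample the degree
    positions = rng.sample(range(len(precoded)), d)    # d distinct positions
    value = 0
    for i in positions:
        value ^= precoded[i]                           # sum modulo 2
    return value


def lt_encode(precoded, omega, count, seed=0):
    """Generate `count` codeword symbols; more can be produced on demand."""
    rng = random.Random(seed)
    return [lt_symbol(precoded, omega, rng) for _ in range(count)]


if __name__ == "__main__":
    word = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 0, 1, 0, 0, 1, 0]
    print(lt_encode(word, OMEGA, count=8))
```

Because each symbol is generated independently, a transmitter following this sketch only produces the symbols it actually sends in a given transmission.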
A. The ML Decoding Analysis for Raptor Codes over Parallel Channels
When a codeword x of length-N Raptor code has been transmitted over the channel with the Bhattacharyya noise parameter γ, the probability that the ML detector finds codeword x′ at Hamming distance w from x more likely can be bounded as
Pe(x,x′)≦γ^w.
If an (n, k) binary LDPC code ensemble [C](n) with the weight enumerator Ah is used as the precode in the Raptor scheme with the degree distribution Ω, then the number of Raptor codewords of weight w is given by
where p(h/n) denotes the probability of 1 in the Raptor codeword when the input LDPC codeword has normalized weight h. It is easy to see the following:
The Union-Bhattacharyya bound on the ML decoder word error probability for Raptor code ensembles can therefore be expressed as
Note that γ<p(h/n)·γ+1−p(h/n)<1. Therefore, the above expression can be bounded in the manner of (10), as follows
where a is the spectrum of the LDPC code as defined by equation (4) and T = {1/n, . . . , (n−1)/n, 1}. It is interesting to compare the expression (28) with the corresponding expression (10) for the LDPC codes without the LT coding, which can be obtained from (28) merely by substituting γ^h in place of [p(h/n)·γ+1−p(h/n)]^N. Since γ<[p(h/n)·γ+1−p(h/n)], the LT code has the effect of making the channel noisier according to the original weight of the LDPC codeword.
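The noisier effective channel can be illustrated numerically. The sketch below evaluates p(θ)·γ+1−p(θ) for an LDPC codeword of normalized weight θ. The closed form used for p(θ) is an assumption, not the expression from the text: it treats each Raptor symbol as the modulo-2 sum of d positions drawn independently, so that the symbol equals 1 with probability (1−(1−2θ)^d)/2, averaged over the degree distribution Ω. The function names are hypothetical.

```python
# LT degree distribution used in the Raptor code examples: degree -> probability.
OMEGA = {1: 0.05, 2: 0.5, 3: 0.05, 4: 0.25, 6: 0.05, 8: 0.1}


def prob_one(theta, omega):
    """Probability that a Raptor symbol is 1 for input normalized weight theta.

    Assumption: XOR of d independently drawn positions (large-n approximation).
    """
    return sum(w * (1.0 - (1.0 - 2.0 * theta) ** d) / 2.0 for d, w in omega.items())


def effective_noise(theta, gamma, omega):
    """Per-symbol factor p(theta) * gamma + 1 - p(theta) in the union bound."""
    p = prob_one(theta, omega)
    return p * gamma + 1.0 - p


if __name__ == "__main__":
    gamma = 0.5
    for theta in (0.05, 0.1, 0.3, 0.5):
        print(theta, round(prob_one(theta, OMEGA), 4), round(effective_noise(theta, gamma, OMEGA), 4))
```

Low-weight LDPC codewords give small p(θ), so their effective noise factor is close to 1, which is the sense in which the LT stage makes the channel noisier for those codewords.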
In the time-varying case, when a codeword x of length-(N1+N2) Raptor code has been transmitted over the channel with the Bhattacharyya noise parameter γ1 during the first N1 symbol intervals and the channel with the Bhattacharyya noise parameter γ2 during the following N2 symbol intervals, the probability that the ML detector finds codeword x′ at Hamming distance w1 from x over the first N1 bits and Hamming distance w2 from x over the second N2 bits more likely can be bounded as
Pe(x,x′)≦γ1^w1 γ2^w2.
If an (n, k) binary LDPC code C with the weight enumerator Ah is used as the precode in the Raptor scheme with the degree distribution Ω, then the number of Raptor codewords of weight w is given by
where p(h/n) denotes the probability of 1 in the Raptor codeword when the input LDPC codeword has normalized weight h. The Union-Bhattacharyya bound on the ML decoder word error probability for Raptor codes can therefore be expressed as
Similarly, when a codeword x of length-(N1+N2+ . . . +Nm) Raptor code has been transmitted over the channel with the Bhattacharyya noise parameter γ1 during the first N1 symbol intervals, the channel with the Bhattacharyya noise parameter γ2 during the following N2 symbol intervals, and so on, the channel with the Bhattacharyya noise parameter γm during the last Nm symbol intervals, then the ML decoder word error probability can be bounded as
B. An IR-HARQ Protocol
In Section IV-A, we derived the following bound (see (29)):
We will use the inequality (1−x)≦e^(−x) to bound [1−p(θ)(1−γ)]^N, which is tight in the low SNR region (γ close to 1). Taking into account the definition of the noise threshold (3), we obtain
Therefore, when the rate of the Raptor code satisfies
By using the same bounding techniques in the time-varying case (see (30)), when a codeword x of length-(N1+N2+ . . . +Nm) Raptor code has been transmitted over the channel with the Bhattacharyya noise parameter γ1 during the first N1 symbol intervals, the channel with the Bhattacharyya noise parameter γ2 during the following N2 symbol intervals, and so on, the channel with the Bhattacharyya noise parameter γm during the last Nm symbol intervals, we obtain the following result:
where Rlj=n/Nj. Therefore, when
we have
Condition (31) can be written in a form which clearly shows the tradeoff between the rate of the j-th transmission code and the signal power:
To satisfy the above lower bound on the product of Rlj−1 and 1−γ(j), the transmitter can either increase the code redundancy Rlj−1 or increase the signal power, which results in a decrease of γ(j) and an increase of 1−γ(j). An increase in redundancy results in a lower throughput for the user, while an increase in power results in a higher interference level experienced by other users in the network. Since γ(j) is positive, there is a minimum redundancy requirement:
Note that this condition ensures that the probability of error of the ML decoding is bounded by O(n^(−1/2)) for high SNR. In the case of predetermined αj (as is sometimes the case in practice), the required signal power is specified by
Equations (32), (33), and (34) constitute j-th transmission rules after transmission j−1 fails. It is interesting to compare them with their counterparts for LDPC codes given by equations (20), (21), and (22).
A. Ensemble Bounds on the Iterative Decoding Threshold
We turn to the question of designing a Raptor code for Hybrid ARQ. A design must specify the parity check degree polynomial Ω as well as the inner code. As our simulation results will show subsequently, schemes based on Raptor codes have superior performance at low SNR over conventional systematic punctured LDPC codes. In fact the throughput of punctured LDPC codes falls off rapidly as the SNR falls to the point where the channel capacity corresponds to the rate of the mother code. No universal Ω can be found which is capacity achieving over a range of SNRs.
Given the superior performance at low SNR, we will concentrate on the high SNR performance of Raptor codes. In this case, puncturing an LDPC code performs better than sending random parity check bits, as in Raptor codes, as far as hybrid ARQ throughput is concerned. In fact, the performance over such channels may be approximated by the performance over an ideal channel. Consideration of transmission over an ideal channel is also important because lower bounds for ensemble code performance are determined for Raptor codes and LDPC codes once throughput is given for the ideal channel. We begin by discussing this ensemble lower bound.
The lower bound on ensemble performance of infinite length graph based codes provides a limit on the iterative decoding threshold. (It is well defined in cases where worse channels can be obtained by physical degradation of the channel with a smaller parameter value.)
We turn to the bound itself which applies to any physically degradable BISC:
Theorem 2 (Khandekar): Suppose the Binary Erasure Channel with erasure probability p is within the iterative decoding threshold for an ensemble of codes. Then so is any other BISC with Bhattacharyya parameter
The above bound can be used to determine a lower bound on the iterative decoding threshold for Raptor codes over a Binary Input Channel with AWGN noise. To do so, first consider the Raptor codes being used over an ideal channel. Take the outer code to be an ensemble of length n regular LDPC codes with rate RL and with an iterative decoding threshold p0 for the binary erasure channel.
Definition 2: Define κΩ(p0) to be the Raptor threshold rate. This is the smallest fraction of LT symbols per LDPC symbol needed to determine all but a fraction p0 of the LDPC symbols in the limit as n→∞.
Now suppose that the same Raptor code is used over a BIAWGN channel with Bhattacharyya noise γ. Since the fraction of parity check symbols must be inflated by 1/(1−p) for a BEC with parameter p, Theorem 2 implies that we will be within the iterative decoding threshold for this channel provided we transmit at least
LT parity check symbols per LDPC symbol. The rate of the Raptor code in bits per channel therefore exceeds
which depends only on the Raptor threshold rate and the Bhattacharyya noise of the channel.
The bound (36) may be taken as an approximate lower bound to the throughput which can be achieved by using Hybrid ARQ if the channel is fixed to be BIAWGN with Bhattacharyya noise γ. It may be an overestimate for finite length codes, as they typically have worse performance than their corresponding infinite length ensembles. This is balanced by the fact that the codeword is transmitted piecemeal, with an attempt at decoding at each stage in Hybrid ARQ. This gives better throughput performance than one shot decoding of the complete codeword.
As we mentioned earlier, this lower bound is determined via the performance of the Raptor code over an ideal channel. It should be noted that Etesami and Shokrollahi, supra, have obtained a corresponding upper bound for the performance of Raptor codes (and other graph based codes). This too leads to a performance bound in connection with a further BEC. Its erasure probability is given as 1−E[tanh(Z/2)], where Z is the distribution of the Log Likelihood Ratio between 0 and 1 over the channel given that 0 was transmitted. This bound coincides with the lower bound for an ideal channel. Furthermore, this bound is also determined once κΩ(p0) is given.
We now turn to obtaining an upper bound, over the ideal channel, for the throughput for the set of design choices (p0, Ω), where p0 is the iterative decoding threshold for the BEC for ensembles of the outer code (not necessarily LDPC) and Ω is the degree polynomial for the LT parity checks.
B. Linear Programming Bounds
To fix things we will consider Raptor codes with a regular LDPC inner code. Suppose the variable and check node edge distribution polynomials of the LDPC code are λ(x), ρ(x). For ensembles of graph based codes, recall that the iterative decoding threshold p0 for the BEC can be determined as the largest p for which (23) has no other root than x=0 in [0, p]. A punctured version of an LDPC code ensemble can thus attain a maximum throughput
over an ideal channel.
For Raptor codes the fraction of erased bits x0 remaining after the first round of iterative decoding depends on the choice of degree polynomial Ω. In this case the edge distribution of the check nodes is given by w(x)=Ω′(x) and the data (LDPC code bits) nodes are Poisson P(α), where α is the average number of bits in a random parity check. Hence λ(x)=e^(α(x−1)). x0 is determined as the largest root in (0,1) of
x = e^(−κΩ′(1−x)), (37)
where κ is the number of parity checks per LDPC code bit generated. In this case the throughput is
For Raptor codes we can thus upper bound the throughput performance over an ideal channel for a given underlying LDPC code by minimising κΩ(p0) over all possible choices of Ω. It is actually more convenient to maximise 1/κΩ, as we may then obtain a bound using a linear program. (Other linear programming constructions are described in Etesami and Shokrollahi, supra.) For convenience we write κ for κΩ.
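The residual erasure fraction x0 defined by the fixed-point equation x = e^(−κΩ′(1−x)) can be found numerically. The sketch below iterates the map starting from x=1; since the map is increasing in x, the iterates decrease monotonically to the largest root in [0, 1]. The degree distribution is the one used in the Raptor code examples of this description, and the function names and the values of κ are illustrative assumptions.

```python
import math

# LT degree distribution: degree d -> probability Omega_d.
OMEGA = {1: 0.05, 2: 0.5, 3: 0.05, 4: 0.25, 6: 0.05, 8: 0.1}


def omega_prime(x, omega):
    """Derivative of the generating function Omega(x) = sum_d Omega_d x**d."""
    return sum(d * w * x ** (d - 1) for d, w in omega.items())


def residual_erasure(kappa, omega, tol=1e-12, max_iter=100_000):
    """Largest root in [0, 1] of x = exp(-kappa * Omega'(1 - x)).

    If Omega_1 = 0 the iteration stays at x = 1, i.e. BP decoding never starts.
    """
    x = 1.0
    for _ in range(max_iter):
        x_next = math.exp(-kappa * omega_prime(1.0 - x, omega))
        if abs(x_next - x) < tol:
            return x_next
        x = x_next
    return x


if __name__ == "__main__":
    for kappa in (0.5, 1.0, 2.0):
        print(kappa, residual_erasure(kappa, OMEGA))
```

In this sketch, BP decoding of the Raptor ensemble hands over to the outer code successfully when the returned value does not exceed its threshold p0, so the smallest such κ plays the role of κΩ(p0).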
As we have just discussed, for the Raptor ensemble to be BP decodable it is necessary that
x ≧ e^(−κΩ′(1−x)), x ∈ [p0, 1] (38)
so that all roots are in (0, p0). Rewriting this constraint in terms of 1/κ, we may form the objective,
We also have the constraints Ωd≧0 and ΣdΩd=1. Note that (39) is linear in the coefficients, and we obtain a finite linear program by working with polynomials of maximum length D and discretising the objective finely over the given interval. The problem is made an LP by using the standard construction,
where x_k are the chosen constraint points, x_1=p0, . . . , x_K=1.
Practical Raptor codes have further constraints which make the bound tighter. For example, a certain fraction of single degree nodes is needed in order to start BP decoding. This tightens the bound: the information rate over an ideal channel is reduced, as there will be a consequent fraction of single node repeats which are entirely redundant.
Table I lists the throughput bounds for 3 Raptor codes using regular LDPC codes as outer codes. Bounds for ensembles of other codes such as irregular LDPC codes and Turbo codes may also be obtained. Fixing Ω1=0.01 tightens the bound to be TRaptor=0.586 for ensembles of (3,5) codes. In
In fact for every p>0 we will on the average decode a smaller fraction of symbols than this and
This picture changes if we replace BP decoding with, say, joint ML decoding of the Raptor code (including the underlying LDPC precode). In this case capacity can be approached using choices of Ω which would restrict throughput if employed with BP decoding.
C. Raptor Code Examples
In this section we present simulation results for IR-HARQ based on Raptor codes decoded using BP. In FIG. 10 we show the results for three Raptor codes with the same LT degree distribution Ω(x) = 0.05x + 0.5x^2 + 0.05x^3 + 0.25x^4 + 0.05x^6 + 0.1x^8 and three different regular LDPC precodes (with λ(x)=x^2 and ρ(x)=x^4, ρ(x)=x^14 and ρ(x)=x^29). The degree distribution polynomial Ω(x) was chosen in an ad-hoc manner; the choice was based on the results presented by Etesami and Shokrollahi, “Raptor Codes on Symmetric Channels”, preprint, 2005. We should note here that similar results are obtained when the degree distribution polynomial Ω(x) is derived using the linear programming optimization procedure for high SNRs presented in Section V-B.
First, we observe from FIG. 10 that we benefit from using high rate LDPC precodes in Raptor coding schemes. In other words, as we increase the rate of the precode (from 0.4 to 0.8 and then to 0.9) higher throughputs are achievable at all SNRs of interest. (For high SNRs, this can also be concluded from
As predicted by the analysis in Section V-B, the curves saturate at high SNRs. Again, the predictions are accurate for very long block codes under the assumption that the number of BP decoding iterations is sufficiently large. For finite length codes, these bounds can be used in estimating the throughput limits, but they are not tight. For example, the bound on the throughput for the regular-(3,15) LDPC precode from Section V (see Table I) is T=0.8699. The simulation from
In Section IV we discussed the theoretical results obtained for the two different schemes. We now turn to the comparison between the simulation results for IR-HARQ schemes based on LDPC and Raptor codes. In
Our final comment is on the difference between the encoding/decoding complexities of these schemes. Although the encoding and decoding complexities of Raptor codes are higher than the corresponding complexities of the underlying LDPC codes (and generally higher than the complexities of punctured LDPC codes) they have the property that we only need to encode as many parity bits as we need to send in the initial transmission or in subsequent re-transmission(s). On the other hand, for punctured codes we need to encode all parity bits, even though we may send only a small fraction of them.
Let Λ_{κ,n} denote the set of binary κ×n matrices, and Λ_{κ,n}^{c,r} denote the set of binary κ×n parity-check matrices whose column weights are given by the vector c=(c1, . . . , cn) and row weights by r=(r1, . . . , rκ). If ci=c, ∀ i, and ri=r, ∀ i, the code ensemble with parity check matrices Λ_{κ,n}^{c,r} is referred to as a regular ensemble; otherwise, the ensemble is referred to as irregular. We will deal only with the former case and assume that r>2. We define the parameter ξ as
for regular code ensembles, and denote the set of the corresponding parity check matrices by Λ_n^{ξ,r}.
Counting the number of codewords of weight w and the form
in the ensemble is equivalent to counting the number of matrices in Λ_n^{ξ,r} whose row sum over the first w columns is even. Furthermore, a permutation of the columns of such a matrix ensures that the same permutation of 1^w 0^(n−w) is then a codeword. Therefore, the average number of weight w codewords in the ensemble is at most (n choose w) times the number of matrices in Λ_n^{ξ,r} whose row sum over the first w columns is even.
For nθ=w=1, 2, . . . , n−1, define Λ_{n,θ}^{ξ,r} ⊂ Λ_n^{ξ,r} as
By the definition of an ensemble,
is the probability of such a matrix. By constructing an upper bound on P_{n,θ}^{ξ,r} we will obtain an upper bound on the spectrum of the code itself.
To enumerate Λ_{n,θ}^{ξ,r} for r even, we further define L_{n,θ}^{ξ,r} ⊂ Λ_n^{ξ,r} to be the set of binary matrices with the first m0 rows having sum 0, the next m2 rows having sum 2, and so on, the last mr rows having sum r. Thus L_{n,θ}^{ξ,r} contains all possibilities for the first w columns of a matrix in Λ_{n,θ}^{ξ,r} given the row sums in increasing order. Similarly, we define R_{n,θ}^{ξ,r} to be the set of corresponding matrices complementing L_{n,θ}^{ξ,r} to form Λ_{n,θ}^{ξ,r}, with the first m0 rows summing to r, the next m2 rows summing to r−2, and so on. We have
where the sum is over all feasible combinations of the row sums, i.e., m0, m2, . . . , mr satisfying
A similar result holds for r odd, with (42) being over even arguments, so that the final term is for 2j=r−1 instead of 2j=r.
To find bounds on |Λ_{n,θ}^{ξ,r}|, |L_{n,θ}^{ξ,r}|, and |R_{n,θ}^{ξ,r}|, we will use the following result on the number of zero-one matrices with prescribed column and row weights. See Litsyn and Shevelev, supra.
Lemma 1: The number N_{c,r} of zero-one matrices with row weight distributions r and column weight distributions c can be bounded as follows:
Theorem 3 of Litsyn and Shevelev, supra, provides an asymptotic lower bound for |Λ_n^{ξ,r}|:
for sufficiently large n>n0, which is of course independent of θ.
We now establish the following theorem:
Theorem 3: For fixed r, c ∈ N, r even, there are a constant C2 and an n0, both independent of θ, such that
whenever n>n0. The sum is taken with values constrained as
m0+m2+ . . . +mr=ξn, 2m2+4m4+ . . . +rmr=ξθrn (46)
Proof: We first apply Lemma 1 to upper bound |L_{n,θ}^{ξ,r}| and |R_{n,θ}^{ξ,r}|. We then use (44) to lower bound |Λ_n^{ξ,r}| for sufficiently large n0 and all n>n0. The result follows on substituting into (41) and using the definition of P_{n,θ}^{ξ,r} given in (40).
Again a similar result holds for r odd with the changes indicated above.
We introduce next a simple method for evaluating the sum in (45) based on a set of results from large deviation theory. Let
hence p = {p_j}_{j=0}^{r} represents a probability vector. Multiplying both the numerator and denominator of the expression in (45) by 2^(rξn), we obtain
where the constraints (46) hold. The expression above can be bounded and its log-asymptotics can be assessed in terms of Sanov's theorem, stated below.
Theorem 4: Let {X1, . . . , Xn} be i.i.d. random variables with probability mass function Q(x) over a bounded set of K elements. Let F ⊂ P be a set of probability distributions. Then
Q^n(F) = Q^n(F∩P) ≦ (n+1)^K 2^(−nD(P*∥Q)), (49)
where
Therefore, the problem of estimating the probability in (12) reduces to finding a probability mass function qi that minimizes Σi qi log(qi/pi), such that
with probabilities pi defined in (47). By using the Lagrangian multiplier method, with the multiplier function
one can show that the unique optimizing distribution for both r even and r odd is of the form
and zero for odd values of i, where p is the unique positive root of the equation
It follows that
In the upper bound for P_{n,θ}^{ξ,r} there remains the binomial factor (nrξ choose nθrξ), which we may upper bound using Stirling's formula (see for example [2, p. 530]):
There is a corresponding upper bound for the remaining term in the spectrum,
Using Theorem 3 together with (45), (49), (54), (55), (56), we obtain the following upper bound on the spectrum:
Theorem 5: There is a constant C3, independent of θ with 0<θ<1, and an n0 such that for n>n0
where p is the unique positive root of (53).
In order to apply the above result to bounding the left tail of the spectrum, we must investigate the behavior of p near 0, which we examine using (53). As θ approaches 0, so does p with
Since (r choose 2)p^2=rθ/2+O(θ^2),
for all sufficiently small θ and c≧2. We thus have that
Using our estimate for p, for sufficiently small θ there is an ε>0 so that the RHS is upper bounded by
Thus the tail of the spectrum for some θ0>0 is upper bounded by
where a=[(r−1)(1+ε)]^(c/(c−2)). We now show that this converges to 0 as n→∞ for a suitable choice of θ0. We must choose c≧3, and note that we may show the result for c=3 only, as the individual terms decrease monotonically as c is increased provided (ja)/n<1. Also, by considering x log x we observe that the sequence ((ja)/n)^j decreases as j increases provided (ja)/n<e^(−1). Thus
and so converges to 0 at least as fast as O(n^(−1/2)).
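The monotonicity claim used in this last step, that ((ja)/n)^j decreases in j as long as (ja)/n stays below 1/e, can be checked numerically. The values a=70 and n=10000 below are arbitrary illustrative choices, not values from the derivation, and the function name is hypothetical.

```python
import math


def tail_terms(a, n):
    """Terms ((j * a) / n) ** j for all j >= 1 with (j * a) / n < 1 / e."""
    j_max = int(n / (a * math.e))
    return [((j * a) / n) ** j for j in range(1, j_max + 1)]


if __name__ == "__main__":
    terms = tail_terms(a=70.0, n=10000)
    decreasing = all(later < earlier for earlier, later in zip(terms, terms[1:]))
    print(len(terms), decreasing, terms[0], sum(terms))
```

The check reflects the derivative of j·log(ja/n), which is log(ja/n)+1 and is negative exactly when (ja)/n<1/e.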
Number | Date | Country | |
---|---|---|---|
20070260957 A1 | Nov 2007 | US |