The present invention relates to the field of error correction codes such as the ones employed for wireline and wireless digital communication systems and in data storage systems. More particularly, the present invention relates to iterative decoding methods that implement the Maximum A Posteriori (MAP) Algorithm.
A digital communication system of both wireless and wireline type usually comprises a transmitter, a communication channel and a receiver as known by those skilled in the art.
The communication channel of such system is often noisy and introduces errors in the information being transmitted such that the information received at the receiver is different from the information transmitted.
In general, in order to correct an error of the channel transmitted through wireline and wireless environments, the digital communication system uses a protection method by which the transmitter performs a coding operation using an error correction code and the receiver corrects the error produced by the noisy channel.
Various coding schemes have been proposed and developed for performing correction of errors introduced by the channel. In particular, turbo codes, which are Forward Error Correction (FEC) codes, are capable of achieving better error performance than conventional codes. Furthermore, turbo codes can achieve exceptionally low error rates in a low signal-to-noise ratio environment.
For this reason, such turbo codes can usefully be employed in wireless communications, for example in the more recent CDMA wireless communication standard.
A detailed description of turbo coding and decoding schemes can be found in “Near Shannon limit error-correcting coding and decoding: Turbo-codes”, Berrou et al., Proc., IEEE Int'l Conf. On communications, Geneva, Switzerland, 1993, pp. 1064-1070, Particularly, turbo codes are the parallel concatenation of two or more recursive systematic convolutional codes separated by interleavers.
As known by those skilled in the art, decoding of turbo codes is often complex and involves a large amount of complex computations. Turbo decoding is typically based on a Maximum A Posteriori (MAP) algorithm which operates by calculating the maximum a posteriori probabilities for the data encoded by each constituent code.
While it has been recognized that the MAP algorithm is the optimal decoding algorithm for turbo codes, it is also recognized that implementation of the MAP decoding algorithm is very difficult in practice because of its computational complexities.
To reduce such complexities, approximations and modifications to the MAP algorithm have been developed. These include a Max-Log-MAP algorithm and a Log-MAP algorithm.
The Max-Log-MAP and Log-MAP algorithms are described in detail in “A Comparison of Optimal and Sub-Optimal MAP Decoding Algorithms Operating in the Log Domain”, Robertson et al., IEEE Int'l Conf. on Communications (Seattle, Wash.), June, 1995,
To reduce the computational complexity of the MAP algorithm, the Max-Log-MAP and Log-MAP algorithms perform the entire decoding operation in the logarithmic domain. In fact, in the logarithmic domain, multiplication operations become addition operations, thus simplifying numeric computations involving multiplication.
The object of the present invention is to provide an improved method for performing iterative decoding in accordance with a Log-MAP algorithm in a digital system using a turbo code.
Particularly, the inventive method comprises the steps of:
Furthermore, among such performed calculations at least one and no more than two calculations are performed by the use of the look-up table for implementing the Log-MAP decoding algorithm and the remaining calculations are performed implementing a Max-Log-MAP decoding algorithm.
Another object of the invention is a receiving apparatus which comprises a decoder for performing iterative decoding of information coded by an error correction code.
The decoder comprises:
The characteristics and the advantages of the present invention will be understood from the following detailed description of an exemplary non-limiting embodiment thereof with reference to the annexed figures, in which:
Particularly, the communication system 100 comprises a transmitter 101, a communication channel 102 and a receiver apparatus 103. The transmitter 101 comprises a source 104 of digital data and an encoder 105 to encode data in order to make the transmission more robust against errors introduced by the channel 102. Furthermore, the transmitter 101 comprises a modulator 106 to translate the encoded bits into signals suitable to be transmitted through the channel 102.
Instead, the receiver apparatus 103 comprises a demodulator 108 for translating the received signals into bit likelihoods or soft values indicating the probability of a received bit to be a 0 or 1 logic. Subsequently, such soft values are elaborated by a decoder 109 that retrieves the source bits.
In order to avoid errors introduced by the noisy channel 102, the data bits to be transmitted can be, advantageously, encoded using a PCCC (Parallel Concatenated Convolutional Coding) turbo code known in the art.
Particularly, employing the PCCC turbo code in the digital communication system 100, data bits to be transmitted over the communication channel 102 are encoded as an information sequence (also called systematic information) and as two or more parity sequences (also called parity information).
Particularly, such encoder 200 comprises a first 201 and a second 202 encoder, for example, substantially equal each other. The encoders 201 and 202 are chosen among the Recursive Systematic Convolutional codes (RSC) and are described in “Near Shannon limit error-correcting coding and decoding: Turbo-codes”, Berrou et al., Proc., IEEE Int'l Conf. On communications, Geneva, Switzerland, 1993, pp. 1064-1070 and in U.S. Pat. No. 6,298,463 B1 (Bingeman et al.), herein enclosed by reference.
With reference to a first information bit sequence dk which corresponds to the data to be transmitted, the PCCC encoder 200 generates a systematic sequence of bits Xk, a first parity sequence Y1k and a second parity sequence Y2k.
As shown in
On the contrary, a second information bit sequence dk1 corresponds to the first information bit sequence dk after elaboration made by an interleaver 203, Such second information bit sequence dk1 represents the input of the second encoder 202 which produces the second parity sequence Y2k at its output.
The interleaver 203 in the PCCC turbo code encoder 200 operates, as it is known, to interleave the information bits supplied to the same encoder 200, Such interleaving operation can be accomplished, for example, storing all input information bits consecutively in the rows of a matrix of storage locations and then reading consecutively the columns of the same matrix to access the stored bits (row-column interleaver).
It should be observed that, typically, in the PCCC encoder 200, the interleaver 203 is embedded into the encoder structure to form an overall concatenated code. In more detail, the interleaver 203 is employed to decorrelate the decoding errors produced in the first convolutional code (i.e. the first encoder 201) from those coming from the second convolutional code (i.e. the second encoder 202).
The systematic sequence of bits Xk, the first Y1k and second Y2k parity sequences produced by the PCCC encoder 200 are then multiplexed to form a code word. After encoding, the code word is modulated according to techniques known in the art and transmitted over the noisy communication channel 102, either wired or wireless.
In more detail, such decoder 300 comprises a first 301 and a second 302 decoder for receiving and decoding the soft values xk, y1k and y2k produced by the demodulator 108 of the receiver apparatus 103. Such soft values xk, y1k and y2k correspond to the systematic bit Xk, the first parity bit Y1k and the second parity bit Y2k, respectively, which have been demodulated and then filtered and sampled by, for example, a suitable Filter/Match/Sample unit not shown in
Moreover, the PCCC turbo decoder 300 can be constructed as an application specific integrated circuit (ASIC) or as a field-programmable gate-array (FPGA) or a digital signal processor (DSP) software or using other suitable technologies known by one skilled in the art.
As can be seen, the soft values xk and y1kcorresponding to the systematic bit Xk and the first parity bit Y1k, represent the inputs of the first decoder 301. The output of the first decoder 301 is a first log-likelihood ratio Λ1e(dk) to be sent to the second decoder 302 through a first interleaver 303. Such first interleaver 303 is analogous to the interleaver 203 of the PCCC encoder 200.
Moreover, a first interleaved log-likelihood ratio Λ1e(dk1) represents a first input of the second decoder 302 and the soft value y2k corresponding to the second parity bit Y2k represent a second input for the same decoder 302.
Furthermore, a second interleaved log-likelihood ratio Λ2e(dk1) produced at the output of the second decoder 302 represents the input of a deinterleaver 304 in order to generate a deinterleaved log-likelihood ratio Λ2e(dk) to be sent to the first decoder 301 as a feedback signal.
As known by those skilled in the art, the log-likelihood ratios Λ1e(dk) and Λ2e(dk1) produced at the output of the first 301 and second 302 decoders are the so called “extrinsic information”. Such extrinsic information Λ1e(dk), Λ2e(dk1) each one feeding the successive decoder in an iterative fashion is used as an estimation of the “a priori probability” of the logic value (0 or 1) of information bits. The first interleaver 303 and the deinterleaver 304 reorder the extrinsic information Λ1e(dk) and Λ2e(dk1), respectively, before each exchange between the first 301 and 302 second decoders.
The reliability of the estimation increases for each iterative decoding step and determines the number of iterations to be carried out. Finally, a hard decision on the information bits is taken at the end of the last iteration. Particularly, at the last step of iteration, a further deinterleaver 304′ deinterleaves the second log-likelihood ratio Λ2e(dk1) produced by the second decoder 302 and the decoded bits are provided at the output of a threshold circuit 305,
According to the described example, the first 301 and second 302 decoders are identical for the PCCC Turbo decoder 300 of the invention. Such decoders 301 and 302 can operate using different algorithms of the type Maximum A Posteriori (MAP).
As example, the first 301 and second 302 decoders can implement a MAP algorithm, such as the one proposed by Bahl et al in “Optimal decoding of linear codes for minimizing symbol error rate”, IEEE Trans. Inform. Theory, vol. IT-20, pp. 248-287, March 1974, herein enclosed by reference.
Particularly, the first 301 and the second 302 decoders can employ the MAP algorithms in the forms “Log-MAP” and “Max-Log-MAP” algorithms, which are described in “A Comparison of Optimal and Sub-Optimal MAP Decoding Algorithms Operating in the Log Domain”, Robertson et al., IEEE Int'l Conf. on Communications (Seattle, Wash.), June, 1995, herein enclosed by reference.
More particularly, as it will be described hereinafter, the first decoder 301 can perform at least one calculation in accordance with the “Log-MAP” algorithm.
Such first decoder 301 may comprise a processor unit 400 for performing the decoding processing and a memory unit 500 which can be written and read by the processor unit 400. In more detail, the processor unit 400 may comprise a computation module 401 for performing calculations and a retrieving module 402 for retrieving data from the memory unit 500. The modules 400, 401, 402 and the memory 500 may be hardware and/or software modules exclusively dedicated to perform the functions of the first decoder 301,
For example, the computation module 401 may be a microprocessor (Central Processing Unit or CPU) or other kinds of processing units. Further, the memory unit 500 may include a Read Only Memory (ROM).
Preferably, the units of the architecture shown in
Following, an example of the inventive decoding method provided by the PCCC turbo decoder 300 will be described. Particularly, such turbo decoder 300 operates in accordance with the Log-MAP algorithm.
Further details on the mathematical bases of the inventive method can be found in the above mentioned paper “Near Shannon limit error-correcting coding and decoding: Turbo-codes”, Berrou et al., Proc., IEEE Int'l Conf. On communications, Geneva, Switzerland, 1993, pp. 1064-1070.
Particularly, the first decoder 301 computes the A Posteriori Probability in logarithmic domain. The output of such decoder 301, i.e. the first log-likelihood ratio Λ1(dk), can be expressed as:
where m and m′ are the trellis states (known to the skilled person) at time k and k−1, respectively. Moreover, values α, β and γ are usually called Forward metric, Backward metric and branch transition probability, respectively. These three quantities are kind of probabilities computed on the basis of the following equations:
where hα and hβ are normalization constants.
Moreover, Rk corresponds to the soft values xk, y1k at the output of the noisy channel 102 after demodulation. With p( ) is indicated the transition probability of a discrete gaussian memoryless channel. Then, q( ) is the function that select the possible transition in a convolutional encoder finite state machine and π( ) is the transition state probability.
As known by those skilled in the art, the values α and β can be recursively calculated from γ during the trellis Forward and Backward Recursions, respectively.
If we apply the logarithm operator to the branch transition probability γ, we obtain a further branch transition probability {circumflex over (γ)} expressed as:
{circumflex over (γ)}(Rk,m′,m)i=log γ(i(Rk,m′,m))=
log[p(Rk/dk=i,Sk=m,Sk−1=m′)q(dk=i/Sk=m,Sk−1=m′)]+
log(π(Sk=m/Sk−1=m′)) (5)
then, the Forward α and Backward β metrics can be expressed as further Forward {circumflex over (α)} and Backward {circumflex over (β)} metrics
where the terms Hα and Hβ correspond to the above indicated normalization constants hα and hβ and the first log-likelihood ratio Λ1(dk), can be expressed as:
It should be observed that the quantities in (6), (7) and (8) can be computed by solving recursively terms such as the following:
log (exp(a1+a2))=max(a1,a2)+log (1+e−|a
where the operator max*(a1,a2) comprises a max operator between general quantities a1, and a2 and a correcting factor log (1+e−|a−a
As an example, the recursion to compute the further Forward metric {circumflex over (α)} will be performed by a recursive application of equation (9) where the arguments will be terms like:
{circumflex over (γ)}i(Rk,m′,m)+{circumflex over (α)}k−1j(m′)
On the contrary, the computation of ratio Λ1 will be performed by two independent recursive application of equation (9) where the arguments will be terms like:
({circumflex over (γ)}1(Rk,m′,m)+{circumflex over (α)}k−1j(m′)+{circumflex over (β)}k(m))
Advantageously, to approximate the correcting factor log(1+e−|a
Such LUT could be memorized in the memory unit 500 provided in the turbo decoder 300 and, for example, it can be generated by a processor external to the turbo decoder 300 and downloaded into the decoder itself during data processing of the input data. Alternatively, the LUT can be generated within the decoder 301.
It should be observed that the number of elements of the LUT depends on the metric representation precision. Accordingly, we assume that the further branch transition probabilities {circumflex over (γ)} are digitized according to the formula:
{circumflex over (γ)}q=round({circumflex over (γ)}2p) (10)
where p is the number of bits dedicated to the precision or equivalently 2-p is the quantization interval.
Then, assuming that a saturation is applied and considering that a limited number of bits is available in any real implementation:
where d is the number of bit dedicated to the signal dynamic and p+d is equal to the number of ADC bits as known by those skilled in the art.
Consequently, each element in the LUT can be defined as:
LUT(Δ)=round(log(1+exp(−Δ/2p))2p) (12)
where
and the round operator represents the nearest integer. Each value LUT(Δ) represents the correcting factor of the max*(a1,a2) operator expressed by the formula (9).
Generally, the LUT has a limited number of elements given by the maximum integer Δmax that satisfies the following inequality:
log(1+exp(−Δmax/2p))≧2−(p+1) (13)
For example, the LUT obtainable for a level of precision p=1 is shown in Table 1.
The argument Δ of the LUT in table 1 can varies within a limited range having Δmax=4 as maximum limit. Each tabled value is associated to a corresponding value of the argument Δ.
The following Table 2 shows the tabled values of the LUTs for the level of precision p=2 and p=3. The values of the argument Δ are not shown in Table 2 for seek of clarity.
Neglecting the correcting factor log(1+e−|
According to the example described, the computing module 401 computes the further branch transition probabilities {circumflex over (γ)} and, also, it computes the further Forward {circumflex over (α)} and Backward {circumflex over (β)} metrics and the first log-likelihood ratio Λ1(dk) in accordance with equations (6), (7) and (8).
As an example,
In
Another branch of the first architecture 600 shows the operations performed to compute the modulus of argument Δ (ABS) to be compared with Δmax and the use of the LUT in such a way to obtain the correcting factor of the max* algorithm. The correcting factor is then added, by means of a corresponding node, to the max term.
Further, a second selector SEL2 can generate the value of further Forward metric {circumflex over (α)} by selection of max and max*.
The further Backward metrics {circumflex over (β)} are computed in an analogous way.
Moreover,
Compared with the first architecture 600 of
Such term max corresponds to the further Forward metric {circumflex over (α)} at time k of iteration.
Furthermore,
On the basis of the systematic bit x and the first parity bit y1 and on the basis of a value of a log-likelihood ratio Λ1, evaluated on a previous recursion step, a computing module 801 gives the further branch transition probability {circumflex over (γ)} which is stored in a corresponding first memory module 802.
The architecture 800 includes recursion modules 701, 702, 703 and 704 for computing, respectively, the further Forward metrics {circumflex over (α)}, the further Backward metrics {circumflex over (β)} and the first Λ—1 and second Λ—2 terms of the expression (8).
The recursion module 701 has been already described with reference to
In a first embodiment of the inventive method for performing the iterative decoding, the first decoder 301 implements the Log-MAP decoding algorithm using the LUT to calculate the further Forward metric {circumflex over (α)}, i.e. to approximate the correcting factor which appears whether equation (6) is expressed as in (9).
In other words, the first decoder 301 calculates the further Forward metric {circumflex over (α)} approximating the correcting factor with the retrieved value of the LUT corresponding to an argument Δ.
On the contrary, the further Backward metric {circumflex over (β)} in (7) and the log-likelihood ratio Λ1(dk) in (8) are calculated implementing the Max-Log-MAP decoding algorithm, i.e. neglecting the corresponding correcting factors.
In a second embodiment of the inventive method, the first decoder 301 uses the LUT to approximate the correcting factor referred to equation (7) to calculate the further Backward metric {circumflex over (β)}. Instead, the further forward metric {circumflex over (α)} in (6) and the log-likelihood ratio Λ1(dk) in (8) are calculated neglecting the correcting factors.
In a third embodiment of the inventive method, the first decoder 301 uses the LUT to approximate the correcting factors referred to equations (6) and (7) in order to calculate the further Forward {circumflex over (α)} and Backward {circumflex over (β)} metrics. In this case, the log-likelihood ratio Λ1(dk) is calculated neglecting the correcting factor.
In a fourth embodiment of the inventive method, the first decoder 301 uses the LUT to approximate the correcting factor referred to equation (8) to calculate the log-likelihood ratio Λ1(dk). The further Forward {circumflex over (α)} and Backward {circumflex over (β)} metrics in (6) and in (7), respectively, are calculated neglecting the correcting factors.
In a fifth embodiment of the inventive method, the first decoder 301 uses the LUT to approximate the correcting factors referred to equations (6) and (8) to calculate the further Forward {circumflex over (α)} metric and the log-likelihood ratio Λ1(dk), respectively. In this case, the Backward metric {circumflex over (β)} in (7) is calculated neglecting the correcting factor.
In a sixth embodiment of the inventive method, the first decoder 301 uses the LUT to approximate the correcting factors referred to equations (7) and (8) to calculate the further Backward metric {circumflex over (β)} and the log-likelihood ratio Λ1(dk), respectively. In this case, the Forward {circumflex over (α)} metric in (6) is calculated neglecting the correcting factor.
It should be observed that the inventive method performs the known Log-MAP algorithm when the further Forward metric {circumflex over (α)}, the further Backward metric {circumflex over (β)} and the log-likelihood ratio Λ1(dk) are evaluated by decoder 301 approximating the corresponding correcting factors using LUT. On the contrary, neglecting all the correcting factors for evaluating {circumflex over (α)}, {circumflex over (β)} and Λ1(dk), the method of invention performs the Max-Log-MAP algorithm.
Advantageously, the different embodiments of the method of the present invention are decoding algorithms each showing intermediate complexity and performances between the Log-MAP and Max-Log-MAP algorithms.
In fact, using LUTs for computing the further Forward metric {circumflex over (α)}, the further Backward metric {circumflex over (β)} and the log-likelihood ratio Λ1(dk) corresponds to the optimum in terms of performance with highest computational complexity needs (Log-MAP case).
On the opposite, i.e. LUTs are not used at all (Max-Log-MAP case), advantages in reduced computational complexity are paid in terms of decoding very low performances.
For example, in Table 3 a comparison in terms of complexity and performances for each embodiment of the method of invention, the Log-MAP and Max-Log-MAP algorithms are proposed.
Particularly, the applicant has made a profiling text with a proprietary VLIW (Very Long Instruction Word) Digital Signal Processor to obtain the listed results in Table 3, Particularly, the test reports the case of a PCCC turbo code with 8 decoding iterations.
Furthermore, the complexity is measured in terms of clock cycles required by the decoding process and the performances in term of Bit Error Rate (BER).
As can be seen, the decoding algorithms 1 to 6 of the proposed method have complexity greater than Max-Log-MAP algorithm because they make use of LUTs.
In addition, while algorithms from 1 to 3 present less than 100% of complexity increasing, algorithms from 4 up to 7 have different level of complexity an unreasonable increase higher than 400%.
These data are justified basing on the different number of LUT accesses needed to compute the further Forward metric {circumflex over (α)}, the further Backward metric {circumflex over (β)} and the log-likelihood ratio Λ1(dk).
Even if the above description refers to the case in which the teachings of the invention are implemented only into the first decoder 301, it has to be pointed out that such teachings can be applied alternatively, or preferably, in addition also to the second decoder 302.
Advantageously, the second decoder 302 is so as to perform the computing of quantities analogous to the above described metrics α, β and the log-likelihood ratio Λ in accordance with the method above described with reference to the first decoder 301. To this end, a further Look-Up Table, analogous to the LUT employed in the described method, can be created and employed by the second decoder 302 or, alternatively, the second decoder 302 can employ the same Look-Up Table of the first decoder 301,
It should be observed that different decoders each suitable to implement one of the algorithms of the inventive method can be provided. In this way, the complexity analysis can be expressed in terms of hardware implementations, i.e. in terms of silicon area, power consumption and achievable speed.
Advantageously, the algorithms of the inventive method can be used for both Parallel and Serial Turbo decoding.
Furthermore, the method of invention shows flexibility to achieve the most suitable trade-off between performances and computational complexity.
Obviously, to the decoding method and apparatus of the present invention, those skilled in the art, aiming at satisfying contingent and specific requirements, may carry out further modifications and variations, all however being contemplated within the scope of protection of the invention, such as defined in the annexed claims.
Number | Name | Date | Kind |
---|---|---|---|
6298463 | Bingeman et al. | Oct 2001 | B1 |
7246295 | Xu | Jul 2007 | B2 |
20030056167 | Yuan et al. | Mar 2003 | A1 |
20030091129 | Zhang et al. | May 2003 | A1 |
20050052991 | Kadous | Mar 2005 | A1 |
Number | Date | Country | |
---|---|---|---|
20070157063 A1 | Jul 2007 | US |