The present invention relates to a method and apparatus for decoding, according to a Log-MAP algorithm, a bit sequence encoded by a convolutional encoder and received through a noisy channel.
A key demand on any communications system is to ensure that the information received by the system corresponds closely (exactly if possible) with the information originally transmitted to the system. Transmission errors, such as bit errors, are often unavoidably introduced into a communications system by noisy transmission channels etc. and, as a result of this, much effort has been expended on developing forward-error-correction (FEC) schemes. These schemes aim to correct errors in received signals by using information contained within the signal itself.
FEC schemes generally aim to be sufficiently sophisticated to provide acceptably low bit error rates in error-corrected data, yet not be too complex or costly (in terms of reduced data-transmission rates) to implement in practice. A widely used FEC coding scheme is that of “Turbo-Coding” which is regarded by some commentators in the field of data coding as being the most powerful FEC coding scheme presently available.
Turbo-Codes developed from the concepts of “concatenated coding” and “iterative decoding”, and the term “Turbo-Codes” better describes the iterative decoding step rather than the concatenated encoding step. Typically, the encoding step of Turbo-Codes involves the parallel concatenation of two convolutional codes, although serial (or hybrid) concatenation is also possible. In this arrangement, two convolutional encoders are arranged in parallel, both having the same data sequence as input, but with one of the two encoders operating upon that data only after it has been interleaved. The output of these encoders may then be combined with the original (un-coded) data sequence for transmission, thereby providing a “systematic” code sequence in which the data symbols from the input data sequence appear unchanged in the resulting output code sequence.
Thus, the output code sequence comprises the original un-coded data and associated check-bits which can be used by a decoder to correct errors in the received code sequence. By concatenating the encoders in this way, a relatively complex code can be produced using relatively simple constituent encoders.
Convolutional encoding is a well established encoding technique in which input data sequences are split into “blocks” of a predetermined length, each of which are independently encoded to produce a “code block” of check bits for the input data. In particular, convolutional encoding is performed by calculating the modulo-2 sum of a current input data-bit and one or more preceding input data-bits.
Implementation of this encoding method may be by way of shift-registers and exclusive-OR gates as illustrated by the example Recursive Systematic Convolutional (RSC) encoder of
The behaviour of an encoder is conveniently represented by a “code-trellis” as illustrated in
In this way a trellis of all possible transitions of the encoder is provided. The possible transitions depend upon the nature of the code (RSC code in this case), and any one path through the trellis represents one possible input data sequence and its corresponding output code sequence. It is this property which is used in decoding a given code sequence output by an encoder, when received at a decoder.
The Viterbi algorithm is often employed to decode convolutional codes. Its objective is to find the path through the code-trellis that most closely resembles the received code sequence, processing one code block (i.e. time-step) at a time. The principle of the algorithm is to progressively build the most likely complete path by determining at each node (at a given time-step) the path to that node which corresponds with the code sequence which is closest to the received code sequence. Each such “survivor” path determined in this way is then extended by repeating the step for successive adjacent nodes in the trellis until the path can be extended no further. The best of the remaining “survivor” paths is then chosen as being the one most likely to represent the actual state transitions that the encoder made when encoding the data. Accordingly, an estimate of the data sequence input to the encoder can be obtained with knowledge of the encoder properties.
Each branch of the trellis carries with it a “branch metric” representing the probability of the encoder having made the transition represented by that branch, given the received code sequence and channel side information and the encoder properties. Each survivor path also carries with it a “path metric” derived from the metrics of the constituent branches of that path. Thus, when each survivor path is extended from a given node by one branch, there is an associated branch metric calculation for each possible branch from that node, followed by an addition of each branch metric to the path metric of that node, followed by a comparison of the two resulting path metrics and then a final selection of which of the two is to be the survivor path. This add-compare-select (“ACS”) process must be repeated many times throughout the trellis when decoding a code sequence, and results in a very large number of operations having to be performed and extensive data storage requirements. Consequently, a number of “acceleration” schemes have been proposed in the art which aim to perform more efficiently the ACS process in a Viterbi decoding algorithm.
An alternative decoding algorithm is the Logarithmic Maximum A Posteriori (Log-MAP) decoding algorithm. This decoding algorithm performs ACS operations, similar to those of the Viterbi algorithm, for each time-step in the trellis. In doing so the Log-MAP algorithm determines the most likely information bit to have been transmitted given a received code sequence which may be noisy or distorted. This is unlike the Viterbi decoding algorithm which determines the most likely transmitted code sequence (i.e. via the survivor path).
In general, a MAP decoder must ultimately select the most likely transmitted code sequence {right arrow over (x)}, given a received sequence {right arrow over (y)}, in a manner equivalent to maximizing the conditional probability:
This is the “maximum a posteriori” (MAP) criterion. The quantities p({right arrow over (y)}) and p({right arrow over (x)}) are assumed constant, hence the MAP criterion amounts to maximizing p({right arrow over (y)}|{right arrow over (x)}).
In “hard decision” coding, the MAP criterion is satisfied using only the received sequence {right arrow over (y)}, unchanged. In “soft decision” coding, information regarding the reliability or “likelihood” of the bit values in {right arrow over (y)} is generated also.
The Log-MAP algorithm quantifies this soft information in terms of “Log-Likelihood Ratios” (LLR) to represent, in the log domain, the degree of certainty of specific decoded bits at the output of a decoder and is used in iterative decoding of Turbo-codes.
The Log-MAP algorithm operates in the logarithmic domain in order to compress the large range of numerical values encountered and also to turn multiplication into simple additions etc. Thus, the Log-MAP algorithm utilises more of the information available during decoding so as to increase decoding efficiency since unreliable bit decisions can be corrected if required. More importantly, the Log-MAP decoding algorithm inherently provides “soft” decision information which can be effectively used in decoding concatenated codes.
Although the Viterbi decoding algorithm may be adapted to provide soft information for this purpose, such information is widely regarded as being inferior to that provided by the Log-MAP decoding algorithm. This limitation is especially important when employing Turbo-Codes which rely on an iterative decoding scheme which employs soft information. Hence, the higher quality of soft information provided by the Log-MAP decoding algorithm renders it well suited to applications involving Turbo-Codes.
Consequently, it is generally desirable to increase the efficiency of the Log-MAP algorithm in its application to Turbo-Coding. However, in contrast to the Viterbi algorithm, current digital signal processors do not provide any specific acceleration schemes for the Log-MAP algorithm.
Thus, it is an aim of the present invention to overcome this general deficiency in the prior art at least by exploiting the structure of the trellis associated with a Log-MAP encoder. In particular, in the trellis of a binary convolutional encoder, the transitions between neighbouring states can be segregated into disjoint groups of four each originating in a concurrent pair of states and terminating in another concurrent pair. The structure produced by these four states is known as a trellis “butterfly”.
At its most general, the present invention proposes to accelerate the operation of a Log-MAP decoding algorithm by at least performing each step of an Add-Compare-Select (ACS) operation in respect of one state of one concurrent pair of states of a trellis butterfly in parallel with each corresponding step of an Add-Compare-Select operation in respect of the other state of the one concurrent pair of states. The ACS operation may form part of a path metric update or part of a Log-Likelihood Ratio calculation.
In parallel processing in this way, each one of the two ACS operations performed on the two concurrent states of a pair of butterfly states utilize the same data quantities, namely the same set of path metric values and the same two possible transition metric values between the two concurrent pairs of states of the butterfly. Thus, according to the present invention, the operation of the Log-MAP decoding algorithm may be accelerated at least through approximately halving the data retrieval requirements of the operation.
Accordingly, in a first aspect of the present invention there may be provided a method for calculating path metric values of a convolutional encoder for use in decoding according to a Log-MAP algorithm a bit sequence encoded thereby and received through a noisy channel, the method comprising the steps of:
Preferably, the method includes the step of adding the correction term
ln(1+exp(−Δ))
to the selected path metric value associated with a given adjacent encoder state, where Δ is the absolute value of the difference between said first path metric value of said given adjacent encoder state and said second path metric value thereof.
Thus, an accelerated Log-MAP Add-Compare-Select operation (LM_ACS) is provided. The LM_ACS operation requires two path metric values and two branch metric values to process a trellis butterfly. The result of the operation is the simultaneous production of two updated path matrices. The use of the correction term may improve the accuracy of the path metric values obtained according to the LM_ACS operation. This correction factor, when added to the selected maximum value of the two quantities defining Δ, results in the Jacobian logarithm of these two quantities, of which the selected maximum value is only an approximation (i.e. ln(ea+eb)=max(a,b)+ln(1+e−Δ),Δ=|a−b|).
The above steps (i) to (vi) may be repeated for all other concurrent trellis butterflies. Thus, each complete path metric update at a given trellis time-step may be mapped onto an m-fold execution of the LM_ACS operation for a trellis having m butterflies per time-step.
The path metric values for all of said encoder states and said adjacent encoder states may be forward path metrics calculated by forward recursion wherein all of said adjacent encoder states succeed all of said encoder states. Alternatively, the path metric values for all of said encoder states and said adjacent encoder states may be backward path metrics calculated by backward recursion wherein all of said adjacent encoder states precede all of said encoder states.
Therefore, both forward and backward path metric updating may be performed using the LM_ACS operation. This is particularly advantageous in calculating Log-Likelihood Ratios (LLR) which require both.
According to a second aspect of the present invention, there may be provided a method for calculating Log-Likelihood Ratio values for state transitions of a convolutional encoder for use in decoding according to a Log-MAP algorithm a bit sequence encoded thereby and received through a noisy channel, the method comprising the steps (i) to (iii) which are performed in respect of only those transitions corresponding with a parity bit of a first value, and steps (iv) to (vi) which are performed in respect of only those transitions corresponding with a parity bit of a second value:
Thus, it will be appreciated that steps (i) to (vi) of this second aspect of the present invention employ an extension of the LM_ACS operation according to the first aspect of the invention. In particular, the “Add” component of the LM_ACS operation is here extended from being the addition of two quantities (e.g. path metric+transition metric to the same one state) to the addition of three quantities (i.e. forward path metric+transition metric to different states+backward path metric). Accordingly, this extended LM_ACS operation requires two forward path metric values, two transition metric values and two backward path metric values to process a trellis butterfly.
In accordance with the second of its aspects, the invention preferably may provide a method for calculating LLR values wherein the maximum element of said set of second maximum values is subtracted from the maximum element of said set of first maximum values according to the steps of:
Thus, the calculation of the Log-Likelihood Ratio (LLR) for a given encoder transition (time step within the encoder trellis) may be further accelerated by performing in parallel the processing of date produced by the extended LM_ACS operation, thus providing an accelerated LLR (LLR_ACC) operation.
Preferably, in the invention according to its second aspect, the correction term:
ln(1+exp(−Δ))
is added to any quantity selected as being the maximum of two quantities compared for that purpose, where Δ is the absolute value of the difference between said quantities compared.
This correction factor, when added to the selected maximum value of the two quantities defining Δ, results in the Jacobian logarithm of these two quantities, of which the selected maximum value is only an approximate.
In method for calculating LLRs according to the second aspect of the invention, the forward path metric values may be calculated according to a method comprising the steps of:
Thus, it will be appreciated that the forward path metrics used in the calculation of the LLR values may be determined in the LM_ACS operation.
Similarly, in the method for calculating LLRs according to the second aspect of the invention, the backward path metric values may be calculated according to a method comprising the steps of:
Thus, it will be appreciated that the backward path metrics used in the calculation of the LLR values may be determined in the LM_ACS operation.
Preferably, in the method according to the second aspect of the present invention the first value of parity bits output from said encoder is +1 and said second value thereof is −1. However, the first and second values may be other than +1 and −1 respectively, they need only differ.
In accordance with any-aspect of the present invention, there may be provided a method for calculating transition metric values (γ) for use in decoding wherein:
Thus, it will be appreciated that calculation of the four possible transition metric values is substantially simplified in only requiring two terms to be calculated (i.e. s1 and s2). Preferably, s1=4·SNR·ykp and s2=4·SNR·yks+zk., where SNR is the signal-to-noise ratio associated with the received bit sequence, Ykp and Yks are received parity and systematic bits, and Zk is at least an estimate of the a priori probability of the encoder transition at time step k respectively.
The present invention also proposes a digital signal processor (DSP) for accelerating the operation of a Log-MAP decoding algorithm by at least performing each step of an Add-Compare-Select (ACS) operation in respect of one state of one concurrent pair of states of a trellis butterfly in parallel with each corresponding step of an Add-Compare-Select operation in respect of the other state of the one concurrent pair of states, in accordance with any of the methods described above. The ACS operation may form part of a path metric update or part of a Log-Likelihood Ratio calculation.
Thus, the invention may provide a digital signal processor for calculating path metric values of a convolutional encoder for use in decoding, according to a Log-MAP algorithm, a bit sequence encoded thereby and received through a noisy channel, the processor comprising:
Thus, the DSP provides updated path metric values, either forward or backward, for use in decoding a bit sequence. It will be appreciated that the ACS unit of the DPS may be employed not only to update forward or backward path metric values by adding these to successive transition metric values in accordance with the first aspect of the present invention, but the ACS unit may also be employed in calculating updated path metric and LLR values in accordance with the second aspect of the invention.
Preferably, the transition metric calculating unit employs the method of calculating transition metric values (γ) described above. The transition metric calculating unit may output calculated transition metric values to a memory store of the DSP to which the ACS unit is also connected for the purposes of receiving transition metric values therefrom. The ACS unit preferably receives path metric values and transition metric values from the memory store of the DSP, and outputs updated path metric values thereto for storage in that memory store.
A transition metric cache may be provided in the DSP into which the metric calculating unit outputs and stores the calculated transition metrics associated with a given time step of the encoder trellis being decoded, and the ACS unit may receive those transition metric values from the transition metric cache for use in deriving updated path metric values.
Thus, since for each trellis time step, only four transition metric values are required in order to process each of the concurrent butterflies of the time step, by storing the four values in a temporary cache store the need to repeatedly retrieve the values from the main memory store of the DSP is obviated. This helps to increase the efficiency of the signal processor.
Preferably the ACS unit of the DSP comprises:
Thus, the ACS unit may update either forward or backward path metrics in accordance with the first aspect of the invention or with the second aspect when employed in calculating LLRs. The calculation unit may have a first data input port for receiving path metric data values, a second data input port for receiving transition metric data values, a third data input port for receiving path metric data values, and a fourth data input port for receiving transition metric data values.
The ACS unit may have function-selection apparatus which permits the function of the ACS to be that of producing updated forward or backward path metrics in accordance with the first aspect of the invention, or to be that of producing the elements of the first and second sets of maximum values for use in calculating LLRs in accordance with the second aspect of the invention.
The function-selection apparatus preferably comprises:
The input-selection apparatus preferably has two selection states, each of which determine the function of the ACS unit.
Preferably in a first selection state of the function selection apparatus, the first data input gate blocks data output from the calculation unit and causes data input at the two feedback data input ports thereof to have a value of zero, while concurrently the second data input gate causes transition metric data input at the fourth input port of the calculation unit to be simultaneously input at the fifth data input thereof, and concurrently the third data input gate causes transition metric data input at the second input port of the calculation unit to be simultaneously input at the sixth data input port thereof.
Thus, in this first selection state, the input selection apparatus may cause the ACS unit to function so as to update path metric values in accordance with the first aspect of the present invention since for each adder means the two transition metric values simultaneously input thereto represent different transition metrics.
In a second selection state of the function selection apparatus, the first data input gate preferably permits data to pass from the outputs of the calculation unit to the two feedback data input ports thereof, while concurrently the second data input gate causes transition metric data input at the second input port of the calculation unit to be simultaneously input at the fifth data input thereof, and concurrently the third data input gate causes transition metric data input at the fourth input port of the calculation unit to be simultaneously input at the sixth data input port thereof.
Hence, in its second selection state, the input selection apparatus permits the ACS unit to function so as to calculate the elements of the sets of first and second maximum values for use in LLR calculations according to the second aspect of the present invention. That is to say, the two path metric values simultaneously input to any one adder means are made to be the same path metric and are input concurrently with backward path metric values.
The DSP is preferably provided with a Log-Likelihood Ratio unit for calculating LLR values from the set of first maximum values and the set of second maximum values in accordance with the LLR acceleration (LLR_ACC) methods derived above relating to the second aspect of the present invention, and for outputting calculated LLRs to the memory store of the DSP. The Log-Likelihood Ratio unit is preferably connected to the cache apparatus such as to be able to retrieve any data elements of the sets of first maximum values and second maximum values when stored therein, and to be able to store therein elements of successive sets of first and second maximum values produced by the Log-Likelihood Ratio unit.
Preferably, the Log-Likelihood Ratio unit comprises:
Preferably, in the aforementioned Log-Likelihood Ratio unit, the element output from both compare-and-select units is input into said cache apparatus for storage as an element in the respective successive set of maximum values. The data output port of each of the two compare-and-select units is preferably connected to a respective one of a first input port and a second input port of a subtraction unit which is operable to subtract the data at one input port thereof from the data at the other input port thereof, and to output the result.
Preferably, each compare-and-select unit of the Log-Likelihood Ratio unit includes a subtraction unit connected to said first and second input ports of the two compare-and-select means, wherein the subtraction unit is operable to subtract data input at said first input port from data concurrently input at said second input port and to output the sign and the absolute magnitude of the result.
Preferably, each compare-and-select unit includes a selecting unit having a first and second input port respectively connected to said first and second input ports of the compare-and-select unit, wherein the selecting unit is operable to receive as a further input the sign output from said subtracting unit and to output the data input via one of its first and second input ports in dependence upon the value of said sign.
Each compare-and-select unit may include correcting apparatus for adding to the output of said selecting unit a correction factor substantially equal to
ln(1+exp(−Δ))
where Δ is the absolute value of the result output from said subtracting unit.
This correction factor, when added to the selected maximum value of the two quantities defining Δ, results in the Jacobian logarithm of these two quantities, of which the selected maximum value is only one approximation.
Preferably, said correcting apparatus comprises:
The invention may also provide a turbo decoder comprising a plurality of Log-MAP decoders wherein:
The invention may provide a turbo decoder comprising a plurality of Log-MAP decoders wherein:
The present invention may also provide a turbo decoder comprising a plurality of Log-MAP decoders wherein:
There now follows a non-limiting example of the present invention with reference to the following figures:
Referring to
Each constituent RSC encoder produces parity bits for use in forward error correction of the data block input thereto. The first encoder 35 outputs a parity bit sequence {right arrow over (x)}1p, while the second encoder 34 outputs a parity bit sequence {right arrow over (x)}2pint associated with the interleaved data block input thereto. Due to the parallel concatenation of the systematic information ({right arrow over (x)}s) and the parity information ({right arrow over (x)}1p,{right arrow over (x)}2pint), three output bits are generated for each bit dk of the input data sequence {right arrow over (d)}=(dl, . . . ,dN). These three outputs are subsequently input to separate respective inputs 36, 37 and 38 of the multiplexer 39.
Puncturing unit 41 ensures that certain bits are removed from the parity bit-stream input to the multiplexer 39 and are not transmitted. For example, every second bit of the parity information may be punctured, leading to the transmitted data sequence {right arrow over (x)}=(x1s,x11p,x2s,xint,22p, . . . ,xN−1s, xN−11p,xNs,xint,N2p) in which sucessive parity bits (denoted x1p) are taken from the first encoder 35 alternately with parity bits (denoted x2p) are taken from the second encoder 34.
With this information and channel state information 56, the first MAP decoder 41 calculates for each bit dk of the input data sequence {right arrow over (d)}=(d1, . . . ,dN) the MAP Log-Likelihood Ratio value:
where {right arrow over (R)}=(R1, . . . ,Rk, . . . ,RN) and Rk=(yks,yk1p,zk2), which can be written as:
Λhu 1(dk)=zk1+c·yks+zk2
with c=4×SNR, the quantity SNR being the signal-to-noise ratio associated with the received data signal. The systematic term c·yks and the a priori term zk2 are regarded as independent of the parity information for the bit dk.
The newly generated extrinsic information can therefore be computed as:
zk1=Λ1(dk)−c·yks−zk2
which is output at the data output 49 of the first MAP decoder 41 and serves, after interleaving by intermediate interleaver 43, as a priori information zk,int1 for input at one of the data inputs 50 of the second MAP decoder 42. This information is input to the second MAP decoder 42 together with interleaved systematic data {right arrow over (y)}k,ints at another data input 51 thereof, interleaved parity data {right arrow over (y)}int2p (from the second RSC encoder 34) at a further data input 52 thereof, and channel state information 57. The second MAP decoder 42 computes extrinsic information as:
zk,int2=Λ2(dk,int)−c·yk,ints−zk,int1
which is subsequently de-interleaved by the de-interleaver 44 connected between the extrinsic information output of the second MAP decoder 42 and the a priori information input 48 of the first MAP decoder 41. This procedure iterates several times until the MAP estimates {right arrow over (Λ)}1,2 stabilise, whereupon the stabilised MAP estimate {right arrow over (Λ)}int2 is output from the output port 53 of the second MAP decoder 42 to the input of a second de-interleaver 45 for de-interleaving thereby. The de-interleaver MAP estimate of the Log-Likelihood Ratio {right arrow over (Λ)}2 is subsequently output from the second de-interleaver 45 to a decision circuit 55 for use in decoding the received bit sequence {right arrow over (y)}.
It is preferable to implement this MAP decoding in the (natural) logarithmic domain in order to avoid numerical problems without degrading decoding performance. In the logarithmic domain, each of the first and second MAP decoders 41 and 42 operate as Log-MAP decoders which compute log-likelihood ratios as follows:
Λ(dk)=max*(S
−max*(S
where the max* operation is in respect of all states S(k,k−1) involved in transitions between trellis states at time step k−1 and time step k, and
{overscore (γ)}i[(yks,ykp),Sk−1,Sk]=2·SNR·yksxks(i)+2·SNR·ykpxkp(i,Sk,Sk−1)+ln(Pr{Sk|Sk−1})
are the branch metrics γ (with i=0,1) represented in the logarithmic domain, ln(Pr{Sk|Sk−1}) being the a priori information. Branch metrics with i=0 correspond to branches of the encoder trellis associated with the output by the Turbo-Code encoder of a “zero” bit, while those with i=1 correspond to the output of a “one” bit.
It is to be noted that prior to transmission, every data bit output from the Turbo-Code encoder is subject to transformation. Data bits xks and xkp are transformed according to the relation x→2·x−1;y→2·y−1 such that “zero” bits are transmitted as “−1” bits.
Using these branch metrics, the forward path metrics {overscore (α)}k in the log domain, and backward path metrics {overscore (β)}k in the log domain, are calculated recursively using the following relations:
{overscore (α)}k(Sk)=max*(S
and
{overscore (β)}k(Sk)=max*(S
The operator term max* is the Jacobian logarithm of the quantities operated on thereby, of which the selected maximum (max) value is only an approximation (i.e. max*(a,b)=ln(ea+eb)=max(a,b)+ln(1+e−Δ),Δ=|a−b|). The present invention may operate according to the sub-optimal approximation max*(a,b)≈max(a,b) which omits the logarithmic correction term, but the present embodiment does not omit this term.
It will be appreciated that the four main tasks of each one of the two the Log-MAP decoders 41 and 42 of
According to this embodiment, the branch metrics are calculated from the received systematic and parity information bits along with channel state information and a priori information. Assuming the sent symbols xkε{−1,1}, the probabilities for the systematic and parity bits yk received through a channel subject to additive white Gaussian noise are defined by:
{overscore (γ)}i[(yks,ykp),Sk−1,Sk]=2·SNR·yksxks(i)+2·SNR·ykpxkp(i,Sk,Sk−1)+ln(Pr{Sk|Sk−1})
in the logarithmic domain. The a priori information ln(Pr{Sk|Sk−1}) required by any one of the two Log-MAP decoders 41 and 42 of the Turbo-Code decoder is directly deduced from the extrinsic information (z1,2) calculated by the other of the two decoders. If a transition Sk−1m→Skm′ is possible according to the trellis and dk, from trellis node m at time k−1 to trellis node m′ at time step k, then:
ln(Pr{Sk|Sk−1})=zk; for dk=1
ln(Pr{Sk|Sk−1})=0; for dk=0.
Thus, the a priori information ln(Pr{Sk|Sk−1}) required by any one of the two Log-MAP decoders 41 and 42 of the Turbo-Code decoder is directly deduced from the extrinsic information (z1,2) calculated by the other of the two decoders.
A total of four different branch metric values are possible at each trellis time step k, one for each of the four combinations of the two possible received systematic data bits yksε{−1,1}, and parity bits ykpε{−1,1}:
{overscore (γ)}(xks=−1,xkp=−1)=−2·SNR·(yks+ykp)
{overscore (γ)}(xks=−1,xkp=+1)=−2·SNR·(−yks+ykp)
{overscore (γ)}(xks=+1,xkp=−1)=−2·SNR·(yks−ykp)+zk
{overscore (γ)}(xks=+1,xkp=+1)=−2·SNR·(yks+ykp)+zk.
The structure of the Turbo-Code encoder determines which transition metric is assigned to a given transition. Simplification of the calculation of these branch metrics is achieved by adding the term (2·SNR·(yks+ykp)) to each of the above four equations, yielding:
{overscore (γ)}k0,0={overscore (γ)}(xks=−1,xkp=−1)=0
{overscore (γ)}k0,1={overscore (γ)}(xks=−1,xkp=+1)=s1
{overscore (γ)}k1,0={overscore (γ)}(xks=+1,xkp=−1)=s2
{overscore (γ)}k1,1={overscore (γ)}(xks=+1,xkp=+1)=s1+s2
where s1=4·SNR·ykp and s2=4·SNR·yks+zk. Thus, only two terms have to be calculated by either Log-MAP decoder 41 and 42, from the received data bits.
In the portion of the trellis of the binary convolutional Turbo-Code encoder of
Each one of the states of the first pair (m,m+M/2) is joined to each one of the second pair of states (2m,2m+1) by a respective one of a pair of distinct branch metrics γk(I) and γk(II). Each of these two branch metrics takes a value given by one of the four possible values given above. The first branch metric of the pair is associated with the output by the encoder of a parity bit of a first binary value and the second one of the pair is associated with the output by the encoder of a parity bit of a second binary value (e.g. parity bit 0, and parity bit 1 respectively).
The forward branch metrics αk associated with the first pair of states (m,m+M/2) of the butterfly are updated by adding the relevant transition metric thereto so as to extend that path metric to one of the second pair of states (2m,2m+1) according to the following equations:
αk((2m)=max*(αk−1(m)+γk(I),αk−1(m+M/2)+γk(II))
αk((2m+1)=max*(αk−1(m)+γk(II),αk−1(m+M/2)+γk(I))
Similarly, the backward branch metrics βk associated with the second pair of states (2m,2m+1) of the butterfly are updated by adding the relevant transition metric thereto so as to extend that path metric to one of the first pair of states (m,m+M/2) according to the following equations:
βk−1(m)=max*(βk(2m)+γk(I),βk(2m+1)+γk(II))
βk−1(m+M/2)=max*(βk(2m)+γk(II),βk(2m+1)+γk(I))
It will be readily appreciated that in updating any concurrent pair of states of a Log-MAP butterfly according to these equations, the same four data items are used for each state of the pair. That is to say, the same two path metrics to the state being updated, and the same two transition metric values for achieving that update are used for updating each state of a concurrent butterfly pair.
Each state update requires three successive steps: the addition of transition metric values to each of the two branch metrics of a concurrent pair; a comparison of the two resulting updated branch metrics; and a selection of the maximum value of the two. Thus, implementation of each of the above updating equations requires an Add-Compare-Select “ACS” operation.
At its most general, the present invention proposes to accelerate the operation of a Log-MAP decoding algorithm by at least performing each step of an Add-Compare-Select (ACS) operation in respect of one state of one concurrent pair of states of a trellis butterfly in parallel with each corresponding step of an Add-Compare-Select operation in respect of the other state of the one concurrent pair of states. The ACS operation may form part of a path metric update or part of a Log-Likelihood Ratio calculation.
Accordingly, the Log-MAP ACS operation “LM_ACS” is introduced. This operation is performed on a trellis butterfly and comprises the steps of:
In this operation, the sequence of steps (i), (ii) and (iii) are performed substantially simultaneously with the sequence of steps (iv), (v) and (vi) respectively.
The max* operation is defined by:
max*(a,b)=max(a,b)+ln(1+exp(−|a−b|)).
The correction term ln(1+exp(−|a−b|)) being added to the result of the result of the ACS operation associated with the “max” operation to provide the full Jacobian logarithm of which the “max” operator is only an approximation. This is implemented in the digital signal processor of the present embodiment by use of a small look-up table (LUT) as will be explained in more detail below.
Each complete path metric update of this type at a time step k can be mapped onto an m-fold execution of the LM_ACS operation:
(RM1,RM2)=LM_ACS(PM1,PM2,TM1,TM2)
by the digital signal processor as depicted in
In order to update the forward path metrics of the eight states (m=0, . . . ,m=7) of the encoder trellis segment of
(αk(0),αk(1))=LM—ACS(αk−1(0),αk−1(4),γk0,0,γk1,1)
(αk(2),αk(3))=LM—ACS(αk−1(1),αk−1(5),γk0,1,γk1,0)
(αk(4),αk(5))=LM—ACS(αk−1(2),αk−1(6),γk1,0,γk0,1)
(αk(6),αk(7))=LM—ACS(αk−1(3),αk−1(7),γk1,1,γk0,0)
During the updating of these forward path metrics, all updated path metrics (αk(0), . . . ,αk(7)) are stored for every time step k in a memory store of the DSP for later use in Log-Likelihood Ratio calculations. As every two LM_ACS operations use the same pair of branch metrics, either (γk0,0,γk1,1) or (γk0,1,γk1,0) the buffering of these branch metric values in a transition metric cache can reduce the required bandwidth of the main memory store of the DSP and enhance its efficiency.
The updating of backward path metrics is achieved by the DSP according to the following:
(βk−1(0),β−1k(4))=LM—ACS(βk(0),βk(1),γk0,0,γk1,1)
(βk−1(1),βk−1(5))=LM—ACS(βk(2),βk(3),γk0,1,γk1,0)
(βk−1(2),βk−1(6))=LM—ACS(βk(4),βk(5),γk1,0,γk0,1)
(βk−1(3),βk−1(7))=LM—ACS(βk(6),βk(7),γk1,1,γk0,0)
By combining the backward path metric update with Log-Likelihood Ratio (LLR) calculations, the DSP is able to directly use backward path metric values calculated (and stored) in this way at time step k for use in LLR calculations later. Only 2M backward path metric values need to be stored by the DSP, the values being for time-steps k and k−1.
The calculation of LLR values for the eight-state encoder trellis section of
This equation consists of two extended ACS operations and may be performed in three stages beginning with the partitioning of the equation into four butterflies as follows:
Stage 1:
llr1_s1_1=max*(αk(4)+γk0,1+βk(0), αk(0)+γk0,1+βk(1))
llr0_s1_1=max*(αk(0)+βk(0), αk(4)+βk(1))
llr1_s1_2=max*(αk(5)+βk(2), αk(1)+βk(3))
llr0_s1_2=max*(αk(1)+γk0,1+βk(2), αk(5)+γk0,1+βk(3))
llr1_s1_3=max*(αk(2)+βk(4), αk(6)+βk(5))
llr0_s1_3=max*(αk(6)+γk0,1+βk(4), αk(2)+γk0,1+βk(5))
llr1_s1_4=max*(αk(3)+γk0,1+βk(6), αk(7)+γk0,1+βk(7))
llr0_s1_4=max*(αk(7)+βk(6), αk(3)+βk(7))
followed by stage 2:
llr1_s2_1=max*(llr1_s1_1, llr1_s1_2)
llr0_s2_1=max*(llr0_s1_1, llr0_s1_2)
llr1_s2_2=max*(llr1_s1_3, llr1_s1_4)
llr0_s2_2=max*(llr0_s1_3, llr0_s1_4)
then ending in stage 3:
llr1_s3=max*(llr1_s2_1, llr1_s2_2)
llr0_s3=max*(llr0_s2_1, llr0_s2_2)
Λk=llr1_s3−llr0_s3
Thus, it will be appreciated that stage 1 of this process amounts to performing an extended LM_ACS operation four times, one for each of the four butterflies defined by (llr1_s1_1, llr0_s1_1), (llr1_s1_2, llr0_s1_2), (llr1_s1_3, llr0_s1_3), and (llr1_s1_4, llr0_s1_4).
The extended LM_ACS operation performed on each one of these four butterflies comprises the steps (i) to (iii) which are performed in respect of only those trellis transitions corresponding with a parity bit of a first value (e.g. bit 0), and steps (iv) to (vi) which are performed in respect of only those trellis transitions corresponding with a parity bit of a second value (e.g. bit 1):
The sequence of steps (i), (ii) and (iii) is performed substantially simultaneously with the sequence of steps (iv), (v) and (vi) respectively and steps (i) to (vi) are repeated for the encoder states of all of the other three concurrent trellis butterflies to provide a set of first maximum values and a set of second maximum values. In the present example, the set of first maximum values comprises the four elements {llr1_s1_1, llr1_s1_2, llr1_s1_3, llr1_s1_4}, and the set of second maximum values comprises the four elements {llr0_s1_1, llr0_s1_2, llr0_s1_3, llr0_s1_4}.
To determine the LLR value for the trellis segment, the maximum element of the set of second maximum values is subtracted from the maximum element of the set of first maximum values to provide a Log-Likelihood Ratio according to stages 2 and 3 defined above. These two stages define an accelerated LLR operation (LLR_ACC) as explained below.
It will be appreciated that the “Add” component of the LM_ACS operation is here extended from being the addition of two quantities (e.g. path metric+transition metric to the same one state) to the addition of three quantities (i.e. forward path metric+transition metric to different states+backward path metric). Accordingly, this extended LM_ACS operation requires two forward path metric values, two transition metric values and two backward path metric values to process a trellis butterfly.
The extended LM_ACS operation is schematically illustrated in
When used to perform simple metric updating, the LM_ACS operation requires the DSP to call metrics 81 from the DSP memory store, metrics 82 are not called from the aforementioned cache and their value is set to BT1=0 and BT2=0 by the DSP. Similarly, the DSP sets PAR1=TM2 and PAR2=TM1 and performs the LM_ACS operations 83 resulting two updated path metrics RM1 And RM2. These two path metrics 84 are returned to the DSP memory store for later use in LLR calculation using the extended LM_ACS operation.
When used to perform stage 1 above, the extended LM_ACS operation requires the DSP to call metrics 81 and 82 from the DSP memory store. The DSP sets PAR1=TM1 and PAR2=TM2 and performs the LM_ACS operations 83 resulting two elements, RM1 And RM2, of the sets of first and second maximum values respectively. These two elements 84 are returned to the DSP memory store. This process is repeated until all required butterflies are processed and the sets of first and second maximum values are complete. Stages 2 and 3 above may then be performed in respect of these two sets.
To accelerate the calculation of stages 2 and 3 of the LLR calculation, the operation LLR_ACC is introduced. In performing this operation, the DSP obtains all of its data from the cache within which are stored the elements of the first and second sets of maximum values.
The DSP calculates the LLR value by subtracting the maximum element of the set of second maximum values from the maximum element of said set of first maximum values according to the LLR_ACC operation having the steps of:
In this way the calculation of the Log-Likelihood Ratio (LLR) for a given encoder transition (time step within the encoder trellis) is accelerated, by performing in parallel the processing of date produced by the extended LM_ACS operation, thus providing an accelerated LLR (LLR_ACC) operation.
The LLR_ACC operation is schematically illustrated in
Except for the last stage of the LLR_ACC operation (here stage 3) the difference value RM1-RM2 is meaningless. At the last stage, the difference value is the LLR.
These components and connections of the DSP are generic and their function and interaction shall not be further discussed herein.
An extension to the generic portion of the DSP of
The metric calculating unit 153 is able to calculate transition metric values in accordance with the above described methods, and to supply transition metric values for storage in the data memories 101, 102 of the DSP and in the metric cache 156 thereof. The LM_ACS unit 154 is able to the LM_ACS operation in updating path metric values. Transition metric values, path metric values (forward and backward), and other data values are retrievable by the LM_ACS unit 154 from data memories 101 and 102 and cache 156, for this purpose.
Input lines 201 and 203 of
The accelerated LLR operation is performed by the LLR_ACC unit 155. This unit retrieved data values from the dual port cache 157 via dual data busses 160 and 161, and also returns data values thereto. Data values calculated by the LLR_ACC unit are also output to the data memories 101 and 102 of the DSP. These data values include the sets of first and second maximum values, and LLR values.
The LM_ACS unit of the DSP comprises a calculation unit 209. The calculation unit is operable to add path metric values to transition metric data values concurrently received at its data input ports to give updated path metric values, compare pairs of concurrent updated path metrics, and concurrently output at separate ones of each data output port 217 and 217′ the maximum of the two compared updated path metric pairs in accordance with the “max*” component of the LM_ACS operation described above.
The calculation unit has a first data input port 201 for receiving path metric data values PM1, a second data input port 202 for receiving transition metric data values (TM1), a third data input port 203 for receiving path metric data values (PM2), a fourth data input port 204 for receiving transition metric data values (TM2), a fifth input port 205 for receiving transition metric values (PAR1), a sixth input port 206 for receiving transition metric values (PAR2), a first feedback data input port 207 for receiving backward path metric values (BT1), and a second feedback data input port 208 for receiving backward path metric values (BT2).
The calculation unit is operable to perform the LM_ACS operation illustrated in
The transition metric selector gate 232 (232′) has a first input 233 (233′) and a second data input 234 (234′) for concurrently receiving transition metric values TM2 and TM1, and a data output 235 (235′) for outputting one of those two values as the parameter PAR1 (PAR2) to the fifth data input 205 of the calculation unit. The path metric selector gate 227 has two data input ports 228 and 229, connected to respective output ports 217 (217′) of the calculation unit 209 for receiving backward path metric values output therefrom. The first data output port 230 of the path metric selector gate 227 is connected to the second feedback data input port 208 of the calculation unit 209. Similarly, the second data output port 231 of the path metric selector gate 227 is connected to the first feedback data input port 207 of the calculation unit.
Thus, data values BT1 and BT2 may be input to the calculation unit via the second and first feedback data output ports 231 and 230, respectively, of the path metric selector gate 227. The path metric selector gate may also set the values BT1 and BT2 to zero.
In its first selection state, the function section apparatus causes the LM_ACS unit to perform path metric updates by having the path metric selector gate 227 set BT1=0 and BT2=0, while simultaneously having transition metric selector gates 232 and 232′ set PAR1=TM2 and PAR2=TM1 respectively. Updated transition metric values RM1 and RM2 output from the calculation unit 209 are stored in the main memory store 219 of the DSP and, for backward path metric values, are also copied back into a dual port cache store (not shown).
In its second selection state, the function selection apparatus causes the LM_ACS unit to perform the extended LM_ACS operation as part of an LLR calculation (see stage 1 above). This is achieved by having the path metric selector gate 227 input the values BT1 and BT2, from the output ports 217 (217′) of the calculation unit, into the first and second feedback input ports of the calculation unit without setting them to zero. Simultaneously transition metric selector units 232 and 232′ set PAR1=TM1 and PAR2=TM2 respectively. Data elements RM1 and RM2 output from the calculating unit 209 are stored in the main memory store 219 of the DSP and, for later use in LLR_ACC operations, are also copied back into the dual port cache store (not shown).
A sample architecture for the LLR_ACC unit (115 of
The compare-and-select unit 300 of the LLR_ACC unit comprises a first data input port 301 for receiving via data interface 319, data elements from one of the sets of first and second maximum values stored within the dual port cache 321 (cache data addresses [0 . . . 15]), and a second data input port 302 for concurrently receiving via data interface 320, data elements from the other of the sets of first and second maximum values stored within the dual port cache 321 (cache data addresses [16 . . . 31]).
It is to be noted that the cache 321 of
Furthermore, output line 318 maps to either one of data busses 151 or 152 of
Data values concurrently received at these two input ports are simultaneously input to a compare unit 307 via the two data input ports 303 and 305 thereof, and input to the difference unit 308 via its two data input ports 304 and 306. The difference unit 308 determines the absolute value of the arithmetic difference (i.e. |b−a|) between the data values concurrently input at its two data input ports (i.e. value “b” at port 304 and value “a” at port 306), and also determines the sign of this difference. The absolute value of this difference is output to a look-up table (LUT) 310 while the sign of this difference is output to a third data input 309 of the compare unit 307.
On the basis of the sign value input to at port 309, the compare unit 307 selects the maximum of the two data values concurrently input to it at data input ports 303 and 305, and outputs the selected maximum data value on output port 312 to an adder unit 315. Simultaneously with this operation, a correction term is retrieved from the look-up table 310, on the basis of the data input to it from difference unit 308, and is output via output port 311 to adder unit 315. The adder unit then adds the correction value to the selected maximum value and outputs the result at the output port 314 of the compare-and-select unit 300.
The above explanation applies to the parallel compare-and-select unit 300′ with like items given like (primed) reference numerals.
Thus, the compare-and-select units 300 (300′) perform the max* operation as defined by:
max*(a,b)=max(a,b)+ln(1+exp(−|a−b |)).
The correction term ln(1+exp(−|a−b|)) being added to the result of the result of the ACS operation associated with the “max” operation to provide the full Jacobian logarithm of which the “max” operator is only an approximation. This is implemented in the digital signal processor of the present embodiment by use of the look-up table (LUT) 310 (310′) as explained above.
Thus, the compare-and-select unit 300 (300′) performs the “max*” operation upon the concurrently input data elements and outputs the result at output port 314 (314′) to an input port 316 (316′) of a subtracting unit 315 and to the dual port cache 321 (via data bus 317 (317′)) for use in later stages of the LLR_ACC operation (e.g. “stage 3” above).
The subtracting unit 315 subtracts data values input at port 316′ from data values concurrently input at port 316 and outputs the result 318. After the final stage of the LLR_ACC operation has been performed, the output result 318 of the subtracting unit 315 is the LLR value.
It is to be understood that variations and modifications to the above described embodiments of the present invention, as would be readily apparent to the skilled person, may be made without departing from the scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
01404649 | Jun 2001 | EP | regional |
Number | Name | Date | Kind |
---|---|---|---|
4802174 | Hiraiwa et al. | Jan 1989 | A |
5331664 | Desperben et al. | Jul 1994 | A |
5442627 | Viterbi et al. | Aug 1995 | A |
6343103 | Lou et al. | Jan 2002 | B1 |
6452984 | Banister et al. | Sep 2002 | B1 |
Number | Date | Country |
---|---|---|
0409205 | Jan 1991 | EP |
WO 0027085 | May 2000 | WO |
WO 0038366 | Jun 2000 | WO |
Number | Date | Country | |
---|---|---|---|
20030002603 A1 | Jan 2003 | US |