The invention relates generally to signal decoding and, more particularly, to the decoding of signals that are encoded with lattice codes and/or multilevel coset codes.
Maximum likelihood decoding represents an optimal solution for the decoding of signals encoded with multilevel coset codes and/or lattice codes. However, maximum likelihood decoding is not practical in many cases. For example, maximum likelihood decoding may not be practical when a code has a complex trellis. A complex trellis may exist, for example, when the number of constituent codes within a multilevel coset code or lattice code is large or when the minimal trellis of the constituent code(s) is large. Complex trellises are typically difficult to store (and draw) and, when maximum likelihood decoding is used, require a very large number of calculations to implement at the decoding stage. For this reason, decoding solutions are needed for multilevel coset codes and lattice codes that are less computationally complex than maximum likelihood decoding techniques.
In the following detailed description, reference is made to the accompanying drawings that show, by way of illustration, specific embodiments in which the embodiments of the invention may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the embodiments of the invention. It is to be understood that the various embodiments of the invention, although different, are not necessarily mutually exclusive. For example, a particular feature, structure, or characteristic described herein in connection with one embodiment may be implemented within other embodiments without departing from the spirit and scope of the invention. In addition, it is to be understood that the location or arrangement of individual elements within each disclosed embodiment may be modified without departing from the spirit and scope of the invention. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope of the present invention is defined only by the appended claims, appropriately interpreted, along with the full range of equivalents to which the claims are entitled. In the drawings, like numerals refer to the same or similar functionality throughout the several views.
The present invention relates to methods and structures for decoding signals encoded with multilevel coset codes and/or lattice codes using iterative decoding techniques. The inventive techniques are significantly less computationally complex than optimal decoding solutions (e.g., maximum likelihood decoding) yet are capable of performance levels that approach those of the optimal methods. Because of the low complexity, the inventive techniques are especially valuable for use with codes having high dimensionality. In one embodiment, a data word received through a modulo lattice additive noise channel is treated as the sum of a multicode codeword and an interferer. The statistics of the interferer are used to determine prior probabilities associated with the multicode codeword. Iterative decoding techniques are then used to decode the individual constituent codes of the multicode codeword using the prior probabilities. The iterative decoding process may include, for example, the exchange of information (e.g., extrinsic information) between a number of constituent decoders. After the multicode codeword has been decoded, a cell translation is determined and used to map the multicode codeword to an appropriate underlying lattice cell. As used herein, the term “multicode” refers to the set of codewords of a multilevel coset code (or lattice code) that are within the base cell of an underlying lattice and the term “multicode codeword” will refer to a single codeword of the multicode. The inventive principles have application in communication systems and other systems using multilevel coset codes and/or lattice codes.
A lattice must satisfy several rules; namely, it is closed under addition, it contains an inverse element for each lattice element, it contains the zero element (i.e., the origin of the coordinate axes is a point of the lattice), and it is discrete. Therefore, a lattice is a discrete infinite collection of points that is created by the repetition of a base or unit cell throughout space. A lattice code is a finite subset of a lattice. As such, a lattice code may be defined as the intersection of a lattice with a region of bounded support. A decoding rule that ignores the effect of a bounding region is referred to as lattice decoding. Other decoding rules exist that take the bounding region into account (e.g., minimum distance decoding). The inventive principles may be used in connection with both lattice decoding and minimum distance-like decoding techniques.
Lattices can be constructed from one or more individual codes in a number of different ways (see, e.g., “Sphere Packings, Lattices and Groups” by J. H. Conway et al., pages 137–156 and 232–236, Springer-Verlag, New York, 1999). In one approach, for example, a point x=(x1, x2, . . . , xn) is considered a lattice point if it is congruent modulo 2 to a codeword of a binary linear code C. Such a lattice will be referred to herein as a “construction A” lattice. A construction A lattice can be expressed mathematically as follows:
Λn=C+2Zn
where C is a binary code of length n, n is the number of coordinates of each point of the lattice, and 2Zn is a lattice that translates the points of code C to form the lattice Λn (Z being the set of integers). As used herein, the lattice 2Zn will be referred to as the underlying lattice of lattice Λn. The codewords of code C each occur within a base cell (i.e., a cell that includes the origin) of the underlying lattice. A more general lattice construction, referred to herein as construction D, can be expressed as:
Λn=C1+2C2+4C3+ . . . +2k−1Ck+2kZn
where constituent codes {Ci}i=1k are linear binary codes that are nested (i.e., a codeword of Ci is also a codeword of Ci+1), k is the number of constituent codes, and 2kZn is the underlying lattice that translates the points of multicode (C1+2C2+4C3+ . . . +2k−1Ck) to form the lattice Λn. As described previously, the codewords of the multicode (C1+2C2+4C3+ . . . +2k−1Ck) occur within a base cell of the underlying lattice 2kZn. For a single constituent code, the construction D lattice reduces to the construction A lattice described above. The value of the translations in each of the above equations can be the same or different for each of the n coordinates.
A coset code is a code of infinite codebook size that can be expressed as follows:
C=C0+(Λ1)n
where Λ⊃Λ1 are lattices, [Λ/Λ1] is a finite complete set of coset representatives of Λ1 in Λ, and C0⊂ ([Λ/Λ1])n is a code. The lattice (Λ1)n in the above equation is the underlying lattice that translates the points of code C0 to form the coset code C. A multilevel coset code is a code of infinite codebook size that can be written as:
C=C1+C2+C3+ . . . +Ck+(Λk+1)n
where Λ⊃Λ1⊃Λ2⊃ . . . ⊃Λk+1 are lattices and Ck⊂([Λk/Λk+1])n, Ck−1⊂ ([Λk−1/Λk])n, . . . , and C1⊂ ([Λ/Λ1])n. The points of the multicode (C1+C2+C3+ . . . +Ck) occur within the base cell of the underlying lattice (Λk+1)n. The underlying lattice (Λk+1)n translates the points of the multicode (C1+C2+C3+ . . . +Ck) to form the multilevel coset code C. The lattice (Λk+1)n can be replaced by a direct product Λk+1(1)×Λk+1(2)× . . . ×Λk+1(n) to create a time varying multilevel coset code. For Λk+1=2k Z and Ci=2i−1Ĉi (where the Ĉi's are nested linear binary codes), the multilevel coset code C is reduced to a construction D lattice. Thus, the construction D lattice is one form of multilevel coset code. Multilevel coset codes differ from construction D lattices in two primary areas: namely, (1) the nesting property of constituent codes of a construction D lattice is not required for a multilevel coset code, and (2) the constellation used by a multilevel coset code does not need to be one dimensional (i.e., Λk+1 need not be isomorphic to Z). In the discussion that follows, the inventive principles are most often described in general terms in the context of a multilevel coset code. It should be appreciated, however, that the inventive principles are equally applicable to construction A and construction D lattices as well as other lattice types.
{overscore (y)}={overscore (x)}+{overscore (s)}=({overscore (c)}1+{overscore (c)}2+ . . . +{overscore (c)}k+{overscore (λ)})+{overscore (s)}
where {overscore (c)}1∈C1, {overscore (c)}2 ∈C2, . . . , {overscore (c)}k∈Ck, and {overscore (λ)}∈(Λk+1)n. The interference signal {overscore (s)} can include, for example, additive white Gaussian noise (AWGN) having a well known probability density function. By rearranging the terms in the above equation, the input word {overscore (y)} can be expressed as follows:
{overscore (y)}=({overscore (c)}1+{overscore (c)}2+ . . . +{overscore (c)}k)+({overscore (λ)}+{overscore (s)})={overscore (r)}+{overscore (w)}
where {overscore (r)}=({overscore (c)}1+{overscore (c)}2+ . . . +{overscore (c)}k) is a multicode codeword and {overscore (w)}={overscore (λ)}+{overscore (s)} is treated as an interferer. As will be described in greater detail, in one approach, the multicode codeword {overscore (r)} is first decoded using an iterative decoding technique and the translation {overscore (λ)} is then determined and used to map the multicode codeword {overscore (r)} to an appropriate cell of the underlying lattice.
As illustrated in
Using the input word {overscore (y)}, the prior probability generator 12 generates probabilities for each of the constituent codes of the corresponding multilevel coset code. When the statistics of the interferer {overscore (s)} are known (e.g., probability density function), these statistics may be used to determine the prior probabilities. As will be described later, techniques are also provided for determining the prior probabilities when the statistics of the interferer are not known. The iterative decoding unit 14 is operative for decoding the constituent code(s) of the multicode codeword {overscore (r)} using iterative decoding techniques. The iterative decoding unit 14 uses the prior probabilities generated by the prior probability generator 12 to initialize the iterative process. The cell translation determination unit 16 determines the translation {overscore (λ)} of the codeword {overscore (x)} based on the input word {overscore (y)} and the multicode codeword {overscore (r)} determined by the iterative decoding unit 14. The cell mapping unit 18 then maps the multicode codeword {overscore (r)} to an appropriate lattice cell based on the translation {overscore (λ)} determined by the cell translation determination unit 16 to output the codeword {overscore (x)}.
In iterative decoding, there are typically two types of information concerning a bit of a codeword, information that depends on that bit's channel observation and information that depends on the channel observation of other bits that reflects to the subject bit due to the nature of the code. The latter type of information is referred to as “extrinsic information.”
In one embodiment of the present invention, the prior probability generator 12 determines the probabilities that each coordinate of each constituent code has a predetermined binary value (e.g., zero) based on the content of the input word {overscore (y)} and the statistics of the interferer {overscore (s)}. In one such approach, a set S(m) is first defined as the set of all symbols r whose mth coordinate in the binary representation is zero. For example, if r=c1+2c2∈{0,1,2,3} then S(1)={0,2} and S(2)={0,1}. The set Si(m) is then defined as the set S(m) that corresponds to the symbol at time instant i (i.e., r1). The prior probability that the ith time coordinate of the mth constituent code is zero is then given by:
It should be appreciated that the expression P(y+λ|r) refers to a probability function when the alphabet is discrete and a density function when the alphabet is continuous. For lattice decoding, when it is assumed that all translations are legal, the probability P(λi|ri=η) is deemed a constant and the above equation reduces to:
In one implementation, the above summations are truncated to include only the most significant terms. In the discussion above, the prior probabilities are expressed as conventional probabilities. It should be appreciated that the prior probabilities can be expressed in any of a number of alternative maimers in accordance with the embodiments of the invention including, for example, likelihood ratios, log likelihood ratios (LLRs), probability differences, and other similar formats.
In at least one embodiment of the present invention, the iterative decoding unit 14 includes a separate decoder for each of the constituent codes of the multilevel coset code. Each of these constituent decoders will preferably be a soft in, soft out (SISO) decoder that accepts soft information at an input and uses it to generate soft information at an output that is indicative of the reliability of the decoder's decisions.
The iterative decoding unit 28 may include any number of constituent decoders. The individual constituent decoders 20, 22, 24 of the iterative decoding unit 28 may also take many forms. For example, in at least one embodiment, one or more of the constituent decoders 20, 22, 24 is an iterative decoder itself. Some iterative decoder types that can be used include, for example, turbo decoders and low density parity check (LDPC) decoders. For each iteration of the iterative decoding unit 28, an iterative constituent decoder can perform one or more of its own iterations. The iterative constituent decoder can then output, for example, posterior LLR's (or hard decisions) as well as extrinsic information for distribution to the other constituent decoders. Non-iterative constituent decoders can also be used (e.g., maximum a posteriori (MAP) decoders, forward-backward algorithm decoders, soft output Viterbi algorithm decoders, etc.). Non-iterative constituent decoders may also output posterior LLR's (or hard decisions) and extrinsic information for the other constituent decoders. The constituent decoders 20, 22, 24 can all be of the same type or different decoder types can be used. In one embodiment of the present invention, the information delivered to the mth constituent decoder (for the ith time coordinate) prior to a subsequent iteration includes the following probability:
where D(j)(η) is the value of the codeword of the jth constituent code, cj, at the ith time coordinate, if η is the value of the multicode, and {circumflex over (P)}(cj=α) is the posterior information received from the jth decoder. For the sake of brevity, dependence on i has been omitted in the above equation.
Referring back to
The cell translation determination unit 16 uses the multicode codeword estimate {circumflex over ({overscore (r)} output by the iterative decoding unit 14 and the input word {overscore (y)} to determine the translation (e.g., {overscore (λ)}∈(Λk+1)n) associated with the input word {overscore (y)}. In one approach, the translation is found by determining arg max{overscore (λ)}P({overscore (λ)}|{overscore (y)},{circumflex over ({overscore (r)}). On some occasions, this expression reduces to arg min{overscore (λ)}∥{overscore (λ)}−({overscore (y)}−{circumflex over ({overscore (r)})∥2. The cell mapping unit 18 subsequently maps the multicode codeword {circumflex over ({overscore (r)} to the appropriate lattice cell using the estimated translation {circumflex over ({overscore (λ)}. The cell mapping unit 18 may output either hard or soft values.
As an example of the operation of the decoding system 10 of
{overscore (x)}={overscore (c)}1+2{overscore (c)}2+4Zn.
The iterative decoding unit 14 decides that {overscore (c)}1=(1 1 1) and {overscore (c)}2=(0 1 1). Thus, {circumflex over ({overscore (r)}=([1+2(0)][1+2(1)][1+2(1)])=(1 3 3). To determine the estimated translation {circumflex over ({overscore (λ)}, it is first determined that ({overscore (y)}−{circumflex over ({overscore (r)})=([5.6−1][7−3][−1−3])=(4.6 4 −4). The vector is then compared to the possible values of 4Zn and it is determined that the appropriate translation {circumflex over ({overscore (λ)} is (4 4 −4) (i.e., arg min{overscore (λ)}∥{overscore (λ)}−({overscore (y)}−{circumflex over ({overscore (r)})∥2). The output {overscore (x)} is thus determined to be {circumflex over ({overscore (r)}+{circumflex over ({overscore (λ)}=([1+4] [3+4] [3−4])=(5 7 −1).
In at least one embodiment of the present invention, turbo iterations are performed. That is, instead of determining {circumflex over ({overscore (r)} and {circumflex over ({overscore (λ)} once before generating the output word {overscore (x)}, a number of iterations are performed with each iteration further refining the previous one.
In the above description, it was assumed that the statistics of the interferer {overscore (s)} were known. When no such statistical characterization of the interferer exists, however, other means for initializing the iterative process need to be developed. If a minimum Euclidean distance criterion is being used (i.e., the multilevel coset code point that is closest to {overscore (y)} in Euclidean distance is sought), then a number of initialization strategies are possible. In one approach, for example, it is assumed that the interferer has a Gaussian distribution with zero mean and unknown variance σ2. The Gaussian distribution is an explicit function of Euclidean distance. In another approach, it is assumed that the interferer is uniformly distributed within a Voronoi cell of the underlying lattice. In an approximation of this approach, it can be assumed that the Voronoi cell of Λk+1 is a v-dimensional ball of radius R. In a high dimensionality lattice, this will typically be a good approximation. The volume of the ball is denoted Vv(R) and it is known that Vv(R)=Rv×constant. Therefore,
and the probability density function of the interferer squared distance is given by
If this approach is used, the log likelihood ratios are independent of the radius R. Therefore, the iterative algorithm does not need to know the value of R. As the exact value of R may be unknown, this is a very favorable result.
Once the prior probabilities have been determined, they are used to initialize an iterative decoding process for the input signal to estimate a multicode codeword {circumflex over ({overscore (r)} (block 34). If multiple constituent codes are involved, the iterative decoding process may involve the exchange of information among individual constituent decoders during two or more iterative cycles. A translation estimate {circumflex over ({overscore (λ)} is next determined for the input signal based on the result of the iterative decoding process (block 36). Optionally, the translation and estimated multicode codeword can be used to further refine the result of the iterative decoding process in one or more additional iterative cycles (block 38). The translation is eventually used to map the result of the iterative decoding process to an appropriate cell of the underlying lattice (block 40).
It should be appreciated that the principles of the embodiments of the present invention are not limited to use within communication systems. For example, any application that relies on vector quantization (e.g., speech or image compression) may benefit from the use of reduced complexity lattice decoders in accordance with the present invention. Other non-communications based applications are also possible.
Although the embodiments of the present invention have been described in conjunction with certain embodiments, it is to be understood that modifications and variations may be resorted to without departing from the spirit and scope of the invention as those skilled in the art readily understand. Such modifications and variations are considered to be within the purview and scope of the invention and the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
4933956 | Forney, Jr. | Jun 1990 | A |
6145114 | Crozier et al. | Nov 2000 | A |
Number | Date | Country |
---|---|---|
0237186 | Sep 1987 | EP |
Number | Date | Country | |
---|---|---|---|
20030081691 A1 | May 2003 | US |