The present invention relates generally to mobile communications, and, more particularly, to two-stream receivers for multiple-input multiple-output MIMO systems and their extensions.
In order to meet the ambitious spectral efficiency targets set for Evolved-UMTS Terrestrial Radio Access (EUTRA), low-latency and low complexity receivers are necessary. Such receivers are particularly needed at the user equipment (UE) where the complexity constraints are much more stringent. The most important scenario in the multiple antenna downlink system involves UEs with two antennas, where the base-station or the Node-B transmits two encoded streams to a scheduled UE.
A known brute force maximum likelihood ML reception method 10 for two streams, depicted in
The main competing demodulators to the invention are the Deterministic Sequential Monte-Carlo (D-SMC) based receiver (another promising low-complexity receiver), shown in
Complexity reduction is achieved with the D-SMC method by computing the soft output for each coded bit over only a reduced set of hypotheses. The price paid for this complexity reduction is that the D-SMC suffers from a problem, usually referred to as the “missing candidate problem”, in that the hypotheses (or candidates) necessary for computing the soft outputs for some of the bits may not be present in the reduced set. This missing candidate problem can cause significant degradation in the performance particularly if the reduced set is relatively small compared to the set of all hypotheses. Heuristic techniques to alleviate this problem in the D-SMC have also been proposed but such techniques require a lot of system or scenario specific fine tuning and may not work well under across all conditions.
Referring again to
In contrast to the D-SMC reception method, the SIC receiver is a sequential receiver where one stream is first decoded and subtracted from the received signal before decoding the second stream. The soft output for the first stream is obtained after assuming the second stream to be a Gaussian interferer which can lead to performance degradation.
Referring again to the
Accordingly, there is a need for two-stream receivers that are eminently suitable for receivers with low-latency and low complexity necessary to meet the ambitious spectral efficiency targets set for Evolved-UMTS Terrestrial Radio Access (EUTRA).
In accordance with the invention, a method includes the steps of i) listing out all possibilities for a first symbol of a two stream signal; ii) determining a second symbol of the two stream signal for each of the first symbol listed out, iii) evaluating a metric for each of the first symbol and second symbol pair, iv) listing out all possibilities for second symbol, v) determining a first symbol for each choice of the second symbol listed out, vi) evaluating a metric for each of the second symbol and first symbol pair, vii) determining the exact maximum log likelihood ratio for all bits using the metrics, and viii) decoding codeword(s) in the two stream signal using the determined exact maximum log likelihood ratio for all bits.
In another aspect of the invention, a method includes the steps of i)-viii) to decode the two codewords associated with the two streams ix) conducting a CRC check on the two decoded codewords x) In case the CRC of only one codeword is true, re-encoding, modulating and subtracting that codeword from the received signal to obtain a single stream signal, xi) listing out all possibilities for the remaining symbol in the single stream signal, xii) evaluating a metric for each possibility of the remaining symbol, xiii) determining the maximum log likelihood ratio for all bits using the metrics, and xiv) decoding the remaining codeword in the single stream signal using the determined maximum log likelihood ratio for all bits.
In a preferred extension wherein the steps for two signal stream reception are extended to four signal streams received by splitting the four signal stream demodulation into two smaller two-stream signal demodulations. The two smaller two-stream signal demodulations can be solved sequentially as in successive group decoding or in parallel as in parallel group decoding. The parallel group decoding involves a split of the incoming four-streams (labeled {1,2,3,4}) into one of the three unordered partitions of {1,2,3,4} which are {(1,2),(3,4)}, {(1,3),(2,4)}, {(1,4),(2,3)}. The split can be done on a per-tone basis (in an OFDM system with multiple tones) based on the instantaneous channel realizations, taking into account if the four streams are jointly encoded as in the single codeword (SCW) case or if they are independently encoded as in the multiple codeword (MCW) case. The sequential group decoding includes six ways to do the split which correspond to the six ordered partitions of {1,2,3,4} which are {(1,2),(3,4)}, {(3,4),(1,2)}, {(1,3),(2,4)}, {(2,4),(1,3)}, {(1,4),(2,3)}, {(2,3),(1,4)}, and needs the streams to be independently encoded with the split being common or fixed across all tones to allow post-decoding feedback.
These and other advantages of the invention will be apparent to those of ordinary skill in the art by reference to the following detailed description and the accompanying drawings.
In the context of the invention, the joint demodulation of two streams is considered, each stream comprising of symbols from a constellation of size M. The exact max-log outputs for all 2 log(M) bits per symbol interval is obtained with O(M) complexity by evaluating the metrics of 2M hypotheses, instead of the conventional method of O(M2) complexity which evaluates the metrics of all M2 hypotheses. From this basis, the inventive max-log two-stream receiver is presented, which is flow diagramed in
In another aspect of the invention, there is presented a two-stream enhanced max-log receiver where the max-log receiver is first used to decode the two codewords. In case only one codeword is decoded correctly, the correctly decoded codeword is re-encoded, modulated and subtracted from the received signal. Using the signal so obtained, the remaining codeword (which was erroneously decoded in the first attempt) is again decoded. The inventive enhanced max-log receiver is flow charted in
Also, described are methods to extend the inventive two-stream receivers to multiple streams, with particular emphasis on the four stream case which is another important scenario.
Referring again to
We now describe the inventive two stream max-log demodulator. Consider the model
where, H is the N×2 channel matrix (N≧2), v is the additive noise having i.i.d. zero-mean unit variance Gaussian elements. x1 and x2 are symbols from a common M-QAM constellation. Let H=[h1,h2] and H=∥h2∥2 UL be the modified QR decomposition of H with U being a scaled semi-unitary matrix and L being lower triangular with positive diagonal elements. In particular, we obtain U=[u1,u2] with
where <h1,h2>=h2*h1 is the (complex) inner product of the two vectors and
Then we obtain
and note that transformed noise vector {tilde over (v)} remains white. Let {xi,j}j=1M denote the M-QAM symbols and let xiR,xiI denote the real and imaginary parts of xi,1≦i≦2, respectively. For each x1,j we define the metric
Q(x1,j)=|z1−L11x1,j|2+minx
Defining q1j=z2−L21x1,j we can express Q(x1,j) as
Q(x1,j)=|z1−L11x1,j|2+minx
Since both x2R,x21 belong to a common √{square root over (M)}-PAM constellation, the two minimizations for computing Q(x1,j) can be done in parallel using simple slicing (rounding) operations with O(1) complexity each. All {Q(x1,j)}j=1M are efficiently determined using the described method. Also, using the fact that L11 is positive along with the symmetry of the M-QAM constellation, we have that {L11x1,jR}j=1√{square root over (M)}={L11x1,jI}j=1√{square root over (M)}. Then since
|z1−L11x1,j|2=|z1R−L11x1,jR|2+|z1I−L11x1,jI|2
we have that only 3√{square root over (M)} (real) multiplications are required to evaluate all {|z1−L11x1,j|2}j=1M instead of 2M complex ones.
Then we obtain another modified QR decomposition H=∥h1∥2 VR, with V being a scaled semi-unitary matrix and R being upper triangular with positive diagonal elements. In particular, we obtain V=[v1,v2] with
where <h2,h1>=h1*h2 is the complex conjugate of <h1,h2> and
Using V we determine w=V*y which can be expressed as
Next, for each x2,j we define the metric
Q(x2,j)=|w2−R22x2,j|2+minx
Defining q2j=w1−R12x2,j, we can express Q(x2,j) as
Q(x2,j)=|w2−R22x2,j|2+minx
Again since both x1R,x1I belong to a √{square root over (M)}-PAM constellation, the two minimizations for computing Q(x2,j) can also be done as before in parallel using simple slicing operations. All {Q(x2,j)}j=1M are efficiently determined using the described method.
The 2M metrics {Q(x1,j),Q(x2,j)}j=1M can also be efficiently determined even for other regular constellations. To illustrate, we consider the example of PSK constellation. Let x1 and x2 be symbols from a common unit average energy M-PSK constellation: exp(jσM),σMε{0,2π/M, . . . , 2π(M−1)/M}.
Then to efficiently determine {Q(x1,j)}we re-write equation (2b) as
Q(x1,j)=|z1−L11x1,j|2+minx
and obtain q1,j in its polar form as q1,j=r1,j exp(jαi,j) such that r1,j>0,α1,jε[0,2π). Let β1,j=Mα1,j/(2π)−½. Then the minimizing x2 in (3b) can now be determined (with O(1) complexity) in closed form and is given by exp(2π(└β1,j┘+1)/M), where └ ┘ denotes the floor operator. Similarly we can efficiently determine minimizing x1 in (2c) with O(1) complexity. In a similar manner the minimizing x1 in (2c) (and the minimizing x2 in (2b)) can be determined efficiently for other regular constellations by exploiting their decision regions.
Now each size M constellation corresponds to log(M) bits so we need to determine the max-log soft output for 2 log(M) bits. The 2M metrics {Q(x1,j),Q(x2,j)}j=1M which we efficiently determined are exactly those needed to determine the max-log output for each bit. To see this, suppose bits numbered 1 to log(M) correspond to symbol x1. Then letting λi denote the max-log output of the ith bit bi and assuming equal a-priori probabilities, we have that
λi=∥h2∥2(minj:b
and
λi=∥h1∥2(minj:b
Thus we have shown that the complexity of our method to determine the max-log output for each of the 2 log(M) bits is O(M) instead of the O(M2) complexity of the usual method. Note that the described method extends in a straightforward manner to the case when the two symbols belong to different constellations.
Further reduction in complexity can be achieved by avoiding the redundant computation in the two modified QR decompositions H=∥h2∥2UL,H=∥h1∥2 VR. Also considerable reduction in processing delay can be achieved by implementing the computation of {Q(x1,j)}j=1M,{λi}i=1log(M) and {Q(x2,j)}j=1M, {∥i}i=log(M)+12 log(M) in parallel.
The inventive max-log two-stream receiver includes the two-stream demodulator described above along with the outer code (FEC) decoder(s).
Referring again to
Next, we describe our enhanced max-log receiver. Our enhanced max-log receiver works as follows. We use the previously described max-log receiver to decode the two codewords and conduct a cyclic redundancy check (CRC) on the two decoded codewords. In case CRC is true for both or false for both we stop the decoding process. If CRC is true for codeword-1 (and false for codeword-2), for each symbol interval, we compute {circumflex over (z)}2=z2−L21{circumflex over (x)}I, where {{circumflex over (x)}1} correspond to the re-encoded and modulated codeword-1 and the soft-outputs for the second stream (codeword) are obtained as
Q(x2,j)=|{circumflex over (z)}2−x2,j|2,1≦j≦M
λi=∥h2∥2(minj:b
The obtained LLRs are used to decode the second codeword.
In case CRC is true for codeword-2 (and false for codeword-1), for each symbol interval, we compute ŵ1=w1−R12{circumflex over (x)}2 and the soft-outputs for the first stream (codeword) are obtained as
Q(x1,j)=|ŵ1−x1,j|2,1≦j≦M
λi=∥h1∥2(minj:b
The obtained LLRs are used to decode the first codeword.
In order to extend our max-log two-stream receiver to decode larger number of streams, we use the group decoding concept. Although the resulting receivers no longer yield the exact max-log output for each coded bit, nevertheless they provide good performance at low complexity. To illustrate we consider the case of four stream transmission over MIMO-OFDM. Over each of the N tones we have a flat fading MIMO model given by
We can leverage our two-stream demodulators by splitting the four-stream demodulation problem into two smaller two-stream demodulation problems which are then solved by our two-stream demodulators. Moreover the two smaller problems can be solved sequentially (as in successive group decoding) or in parallel (as in parallel group decoding).
In the parallel case we have three ways to do the split corresponding to the three unordered partitions of {1,2,3,4} which are {(1,2),(3,4)}, {(1,3),(2,4)}, {(1,4),(2,3)}. This split can be done on a per-tone basis based on the instantaneous channel realizations taking into account if the four streams are jointly encoded as in the single codeword (SCW) case or if they are independently encoded as in the multiple codeword (MCW) case. Note that in the SCW case only the max-log demodulator can be used in the smaller two-stream problems. To further elaborate, suppose {(1,2),(3,4)} is the chosen split on some tone. Then in parallel group decoding, we obtain the LLRs for streams 1 and 2 by using the two-stream demodulator after suppressing the streams 3 and 4 using MMSE filtering and whitening the suppressed interference plus noise. Similarly, we obtain the LLRs for streams 3 and 4 by using the two-stream demodulator after suppressing the streams 1 and 2 using MMSE filtering and whitening the suppressed interference plus noise.
In the sequential case we have six ways to do the split which correspond to the six ordered partitions of {1,2,3,4} which are {(1,2),(3,4)}, {(3,4),(1,2)}, {(1,3),(2,4)}, {(2,4),(1,3)}, {(1,4),(2,3)}, {(2,3),(1,4)}. However in this case we need the streams to be independently encoded and the split should be common or fixed across all tones to allow post-decoding feedback. We can use either one of our two 2-stream receivers to decode the two codewords in each one of the two smaller two stream problems. To further elaborate, suppose {(1,2),(3,4)} is the chosen split across all tones. Then in successive group decoding, we decode streams 1 and 2 by using the two-stream receivers after suppressing the streams 3 and 4 using MMSE filtering and whitening the suppressed interference plus noise. Then, we subtract the re-constructed streams 1 and 2 from the received signal and decode streams 3 and 4 by using the two-stream receivers after assuming perfect cancellation of streams 1 and 2.
Next, if limited feedback is available, the receiver can pick one out of three unordered partitions per-tone or six ordered partitions (which are fixed across all tones) and inform the transmitter. The transmitter can then employ one codeword within each group and successive group decoding (using the max-log demodulator in each group) can be used at the receiver.
In summary, we considered the two-stream MIMO decoding problem and designed two receivers. The first one is a highly efficient implementation of the maximum likelihood demodulator (MLD) yielding the exact max-log LLR outputs. The second receiver is an enhanced max-log receiver which provides further performance improvements at the expense of higher complexity. Extensions of the inventive two-stream receivers to the general case with multiple streams were also obtained.
The present invention has been shown and described in what are considered to be the most practical and preferred embodiments. It is anticipated, however, that departures may be made therefrom and that obvious modifications will be implemented by those skilled in the art. It will be appreciated that those skilled in the art will be able to devise numerous arrangements and variations which, although not explicitly shown or described herein, embody the principles of the invention and are within their spirit and scope.
This application claims the benefit of U.S. Provisional Application No. 60/826,119, entitled “Novel two Stream Receivers For MIMO systems and their Extensions”, filed on Sep. 19, 2006, the contents of which is incorporated by reference herein.
Number | Name | Date | Kind |
---|---|---|---|
6865712 | Becker | Mar 2005 | B2 |
6993098 | He | Jan 2006 | B2 |
7266168 | Kwak et al. | Sep 2007 | B2 |
7463703 | McElwain | Dec 2008 | B2 |
7567635 | Scheim et al. | Jul 2009 | B2 |
7639760 | Kim | Dec 2009 | B2 |
7724832 | Hosur et al. | May 2010 | B2 |
Number | Date | Country | |
---|---|---|---|
20080225974 A1 | Sep 2008 | US |
Number | Date | Country | |
---|---|---|---|
60826119 | Sep 2006 | US |