The present invention relates to a method and arrangement for processing signals, especially a Maximum Likelihood (ML) demodulation of Quadrature Amplitude Modulation (QAM) signals, in a digital communications system.
The existing wireless mobile communication systems provide several types of services and mostly depend on channel coding to overcome any inferiority of channels. However, due to the increasing demands, for example for a high-quality multimedia services, in which users can communicate with anyone regardless of time and place, the existing services have evolved data-oriented services. Accordingly, there is a high demand for next generation wireless transmission technology for transmitting the larger amount of data at a lower error rate. In particular, it is very important to transmit data at a high rate in a link in which the amount of required data is large.
For the next generation wireless communication, various antenna systems have been proposed. For example, a MIMO system, i.e., a typical antenna system, increases spectrum efficiency through all of transmission antennas without excessive use of a frequency bandwidth. Generally, MIMO is classified into Space-Time Coding (STC), Diversity, Beam Forming (BF), and Spatial Multiplexing (SM) according to the transmission structure and scheme of a transmitter, all of which provide high data rate and reliability.
A MIMO system adopts multiple antennas or array antenna to transmit/receive data in the transmitter and receiver. Multiple antennas are provided in different spatial positions, with different fading features, thus the received signals of adjacent antennas can be approximated as uncorrelated entirely as long as the spacing between adjacent antennas for transmitting/receiving signals in the MIMO system is large enough. The MIMO system takes full advantage of the spatial characteristics of multipath for implementing space diversity transmission and reception.
Existing maximum-likelihood detection methods for MIMO are in general highly computationally complex. Much effort has therefore been focused on finding reduced-complexity MIMO demodulation techniques that still give close to optimal performance.
One example of a low-complexity method is Interference Rejection Combing (IRC), which gives good performance at low SNR, but falls well below an optimal demodulator at higher SNRs, closer to the saturation region of the modulation scheme. Other techniques, like sphere decoding (SD), are exact and very fast at high SNR, but extremely complex at low SNRs, at least if configured to give reasonably good performance, since the complexity is then exponential in the problem size, see for example Joakim Jaldén, “On the Complexity of Sphere Decoding in Digital Communications”, IEEE Transactions on Signal Processing, April 2005. Also, SD in its basic form generates only hard bits, and soft value generation is complex or relies on ad hoc methods.
Another method, so-called m-algorithm has complexity that is not very dependent on SNR, and shows performance close to an optimal demodulator with substantially lower complexity. However, fairly many retained paths are needed for reaching this performance, making its complexity still rather high. Like the SD, the m-algorithm also relies on ad hoc methods for soft value generation. An approach towards low-complexity exact ML has been proposed in Toshiaki Koike, Daisuke Nishikawa, and Susumu Yoshida, “Metric-segmented Low-complexity ML Detection for Spectrum-efficient Multiple-antenna Systems”, VTC 2005 Fall, but also this method only provides hard bits.
EP 1422896 relates to a method of decoding a communications signal in a digital communications system. The communications signal is modulated according to a modulation scheme, such as 16-QAM, including amplitude information. The method comprises the steps of generating a likelihood value for a received communications signal; decoding the communications signal based on at least the generated likelihood value. The method further comprises providing a reliability indication of the amplitude information conveyed by the received communications signal; and generating the likelihood value on the basis of the provided reliability indication of the amplitude information.
One advantage of the present invention is that it provides a novel method comprising a set of instructions to be applied on a computer, and has low computational complexity while still yielding the exact maximum-likelihood solution and exact LOGMAX soft values for QAM signals.
Preferably, the set of instructions has substantially lower complexity than any other prior art true LOGMAX algorithm for MIMO. The complexity is comparable to or even lower than for many other low-complexity but suboptimal algorithms.
These advantages are achieved by a method for processing signals, especially a Maximum Likelihood (ML) demodulation of Quadrature Amplitude Modulation (QAM) signals, in a digital communications system, having a number of transmit antennas transmitting nT symbols. The method comprises: using xm, denoting symbols transmitted, wherein m=1 . . . nT, given a channel, selecting i and j and making pre-calculations facilitating finding a best combination of x1 . . . xi−1, xi+1 . . . xj−1, xj+1. . . xn
Preferably a list of Ln
The method may reduce computation for computing hard output bits and further comprise the steps of repeating steps for other i and j pairs, for each pair generating a list of candidate combinations x1 . . . xn
According to a preferred embodiment a method for processing signals is provided, especially a Maximum Likelihood (ML) demodulation of Quadrature Amplitude Modulation (QAM) signals, in a digital communications system transmitting 4 symbols. The method comprises: using x1, x2, x3, x4 denoting symbols transmitted, given a channel, selecting two i and j, wherein i=1 . . . 4, j=1 . . . 4 and i≠j, and making pre-calculations facilitating finding a best combination of symbols transmitted for any given xi and xj and a received signal, going through every combination of symbols transmitted and projecting in a number of directions for each of said combinations and generating a list of candidate combinations of symbols for each combination of transmitted symbols, and hence generating a list of candidate combinations x1, x2, x3, x4, calculating and comparing metrics of the candidate combinations x1, x2, x3, x4, and providing a maximum-likelihood symbol combination.
The invention also relates to an network node arrangement for processing signals, especially a Maximum Likelihood (ML) demodulation of Quadrature Amplitude Modulation (QAM) signals, connectable to a number of antennas transmitting nT symbols, wherein xm, denotes symbols transmitted, wherein m=1 . . . nT. The arrangement comprises: a computation arrangement for making pre-calculations facilitating finding a best combination x1 . . . xi−1, xi+1 . . . xj−1, xj+1 . . . xn
According to one aspect of the invention a computer program product is provided for processing signals, especially a Maximum Likelihood (ML) demodulation of Quadrature Amplitude Modulation (QAM) signals, having a number of transmit antennas transmitting nT symbols and xm, denotes symbols transmitted, wherein m=1 . . . nT. The product comprises: a set of instructions for executing pre-calculations facilitating finding a best combination of x1 . . . xi−1, xi+1 . . . xj−1, xj+1 . . . xn
According to one aspect of the invention a user equipment is provided comprising means for processing signals, especially a Maximum Likelihood (ML) demodulation of Quadrature Amplitude Modulation (QAM) signals, connectable to a number of antennas (610) for transmitting nT symbols, wherein xm, denotes symbols transmitted, wherein m=1 . . . nT. The means comprises: a computation arrangement for making pre-calculations facilitating finding a best combination of x1 . . . xi−1, xi+1 . . . xj−1, xj+1 . . . xn
In the following, the invention will be exemplified with reference to number of embodiments, as illustrated in the drawings, in which:
a-3c are examples of symbol constellations, which can be efficiently handled by the method of the invention,
The invention relates to a method for exact ML low-complexity in systems comprising a number of transmit antennas and a number of receive antennas for parallel spatially (independent) transmission and reception, and preferably MIMO systems for demodulation of Quadrature Amplitude Modulation (QAM) signals, with exact LOGMAX soft-value generation, at comparatively low computational complexity.
The method of the invention may in principle be used for any number of antenna branches; however, it is particularly suited for 4×4 MIMO because of good balance between relative complexity reduction and memory requirements. Thus, the invention is described with reference to an example comprising four transmitter and receiver antennas. Other dimensions will also be discussed.
In brief, the method is as follows: Let x1, x2, x3, and x4 denote the transmitted symbols on the four respective transmit antennas, and let the symbol alphabet contain L symbols. Given a channel, it is shown that it is possible to make pre-calculations once for their channel, which enable finding extremely quickly the best (lowest-metric) combination x1 and x2 for any given x3 and x4. This is achieved through a method based on projections in different directions. By going through all x3 and x4 (in total L2 combinations), a list of L2 candidate combinations of x1 and x2 (one for each combination of x3 and x4) can thus be collected at low computational complexity. By calculating and comparing the metrics of these L2 candidate combinations of x1, x2, x3, and x4, the maximum-likelihood symbol combination can be found. Further, since for every bit in x3 and x4, the best sequence with that bit equal to 0 and the best sequence with that bit equal to 1 are both in the list, LOGMAX soft values for each bit in x3 and x4 can easily be found. By re-running the entire procedure with x1 and x2 interchanged with x3 and x4, soft values can be found also for all bits in x1 and x2.
Consequently, in total only in the order of 2L2 metrics need to be calculated, which should be compared with L4 metrics for a brute-force method. If only hard bits are required, the method according to the invention can be even further simplified and the number of metrics that need to be computed may further be reduced by orders of magnitude as exemplified in the description.
Usually, a MIMO antenna system with nR receive and nT transmit antennas operating in a frequency non-selective channel is described by the following matrix representation:
y=Hx+w
Wherein y is the nR×1 received complex signal vector, H is the nR×nT MIMO channel response, w is the independent and identically distributed elements Additive White Gaussian Noise (AWGN) at the receiver with individual variance of σ2z and x is the nT×1 transmitted complex signal vector with a certain power constraint. For simplicity we assume that nT=nR=M in the following description.
Then
where according to this example M=4 is the number of transmit and receive antennas (same number for simplicity). The MIMO maximum-likelihood (ML) demodulation problem for hard bits can then be stated as finding
where X is the set of all symbols in the symbol alphabet and XM is the set of all vector symbols. Using Cholesky factorization and a QR decomposition or a second Cholesky factorization, the problem can be transformed into finding
where R is an upper-triangular matrix and {tilde over (x)} is the unconstrained least-squares estimate of x.
It should be appreciated by a person skilled in the art that wide range of MIMO configurations, including those using any linear dispersion code and different number of transmit and receive antennas, can be transformed into the form according to the present invention.
The generation of LOGMAX soft bits will be discussed further below.
By writing
where R11, R12, R21 and R22 are 2×2 matrices (R21 all zeros), and where and x1, x2, {tilde over (x)}1 and {tilde over (x)}2 are 2-element column vectors,
the problem in Equation (1) can be rephrased as
For a given x2, i.e. a given v as defined in Equation (2), the problem now consists in finding
The following is devoted to description of how a function ƒR
{circumflex over (x)}1=ƒR
may be found and evaluated many times with low computational complexity. This approach also enables easy generation of LOGMAX soft decision values, since if the function ƒR
In the following, the principle of finding ƒR
The basic principle is first illustrated in the following before turning to the full description.
The problem to be solved is finding the dot closest to the cross. In the following, it is described how a large number of dots can easily be ruled out. The solid lines 23 divide the plane into four distinct regions. The lines are chosen to pass exactly halfway between the dots 24 marked with solid large circles. By projecting the cross 22 onto the solid line 25 transverse to lines 23, it is possible to determine within which region the cross 22 is situated. Given which region, three out of the four dots marked with solid circles can be ruled out as candidates for closest dot. In the illustrated case, all dots with solid circles except the third 26 from the left can be ruled out. Similarly, based on the position of the cross between the dashed lines 27, all dots with dashed circles 28 except the third from the left 29 can be ruled out. In the same way, overall ¾ of all dots can be ruled out, and only 4×16=64 dots remain as candidates. These dots 20 are marked with a narrower circle. Note that although there are 64 dots, they only mark 16 projections on the axis because of their regular structure.
A similar procedure can then be repeated in a perpendicular direction. This will also result in 64 remaining candidate dots, also identified through 16 projections. Furthermore, the intersection of the former 64-dot set and the latter 64 dot-set contains 16 dots, and the dot closest to the cross must belong to this intersection set. The intersection set is completely characterized by the two sets of 16 projections.
Then, the entire procedure may be repeated a second time, but now with lines parallel and perpendicular to the second transmit antenna dimension, cf.
Using a brute force method, this approach would require approximately 3×4×4×4 projections and comparisons for each of the two directions. However, the projections of all dots can be pre-calculated. The pre-calculated table can contain a sorted list of all projections and for each projection a list of the dots that are not ruled out for a cross being projected in the corresponding interval. This would thus require a table of in total 4×4×3 projections (floating point numbers) and 4×4×3×4×4=768 integers between 0 and 3. A function f can with such pre-calculations be evaluated very quickly. Since f is to be evaluated many times for each channel realization, it turns out that the relative computational complexity from the pre-calculations is rather small.
For determining the function f the following unit vectors may be defined:
where i is an imaginary unit. Also the following is defined
where mj and nj may assume the integer values needed to construct the symbols in the symbols alphabet, e.g. for 16 QAM, nj may assume the values −3, −1, +1, +3 and γ is a normalization factor that makes the average energy of the symbols in modulation alphabet equal to unity. For notational convenience m′j and n′j, are further defined, which assume the midpoints between the values assumed by mj and nj. For example, for 16 QAM, m′j assumes any of the values −2, 0, 2.
The projection of R11x1 on each of the vectors k1r, k1i, k2r, k2i, defined as
k1r=R11e1r/∥R11e1r∥
k1i=R11e1i/∥R11e1i∥
k2r=R11e2r/∥R11e2r∥
k2i=R11e2i/∥R11e2i∥,
is then calculated.
These vectors point in the direction indicated by the antenna real and imaginary axes shown in
p1r(m′1,m2,n2)=Re {(R11x1)+k1r}=Re {x1+R11+k1r}
p1i(n′1,m2,n2)=Re {x1+R11+k1i}
p2r(m1,n1,m′2)=Re {x1+R11+k2r}
p2i(m1,n1,n′2)=Re {x1+R11+k2i}
for all mj, nj, m′j, n′j. Note that p1r does not depend on n′1 since k1r is orthogonal to R11·[n1j 0]T, wherein T denotes vector transpose. Similarly p1i, p2r, p2i do not depend on m′1, n′2 and m′2, respectively.
Now, for a given vector v, for each combination of m2 and n2 possible m1 are found by projecting v on k1r and comparing this (scalar) value with p1r(m′1,m2,n2) for each m′1. Let m′1,k, k=1,2 . . . denote the ordered sequence of all m′1.
A function that finds m1 from m2 and n2 may be denoted g1r. Hence, the evaluation of the function g1r requires a projection and a number of comparison operations (log2 4 comparisons for 16 QAM). Similarly, the functions g1i, g2r, and g2i, are defined for calculating n1, m2, and n2, respectively. Altogether, the following is obtained
m1=g1r(m2,n2),
n1=g1i(m2,n2),
m2=g2r(m1,n1),
n2=g2i(m1,n1). (3)
Evaluating each g function for every possible input argument, a set of L candidates for m1, L candidates for n1, L candidates for m2, and L candidates for n2 are obtained. This set of candidates can be further reduced by requiring all four relations to hold simultaneously. The possible m1 and n1 are thus those that fulfill
m1=g1r(g2r(m1,n1),g2i(m1,n1))
n1=g1i(g2r(m1,n1),g2i(m1,n1))
for some m1. From these m1 and n1 one may calculate candidate m2 and n2 using Eq. (3). This, results in a small set of candidates, for which the respective metrics are calculated explicitly and the overall best candidate selected.
In the following the aforementioned procedure for computing f is described for computationally efficient execution. Efficient calculation of the functions g1r is considered first. Preferably, every possible p1r (i.e. for 16 QAM 3×4×4 values, one for each combination of m′1, m2,and n2) is pre-calculated and sorted. For each value in this sorted list, a list specifying the only possible m1 for all L combinations of m2 and n2 is pre-calculated. The function g1r can then be evaluated through a projection and a lookup in a table of (L−1)L2 sorted items, which requires log2 [(L−1)L2] comparison operations. Similar holds for the other g functions.
Furthermore, the complexity of the projection operation can be greatly reduced by pre-calculating the projections of all terms in v on k1r, k1i, k2r, and k2i. For example, the projection on k1r can be re-written as
and the L different values of the first term (one for each combination of m3 and n3) and the L different values of the second term pre-calculated. The L2 projections can then be accomplished by just one summation per projection.
The entire computation of f then consists in the following operations:
It is noteworthy that although certain simple integer comparison operations are performed more than L2 times, no operation is carried out in the order of L4 times as in a brute-force method.
In the above description, only QAM with fully regular constellations—same spacing between all constellation points, both in real direction and imaginary direction—has been considered. However, the method works equally well for any QAM scheme. Some examples are shown in
When only hard output bits are required, the number of metric calculations can be further reduced. When iterating through all combinations of x3, and x4, the metrics need not be calculated for each candidate x1 and x2. Instead, a small list of possible x1 and x2 is retained for each combination of x3, and x4. Similarly, when iterating through all x1 and x2, lists of possible combinations x3, and x4 are retained. The overall most likely symbol combination x1, x2, x3, and x4 must then be in both sets of lists. The symbol combinations that exist in both sets of the list can be found in a manner similar to the one used with the g functions. Explicit metrics, thus only need to be calculated for the symbol combinations that are in both lists, which in practice typically turns out to be rather few (see Table 2 for examples). Since the metric calculations dominate the complexity in the basic form (soft value) of the method, this should lead to a substantial complexity reduction.
The invention may clearly be applied to other dimensions. For 3×3 MIMO, for example, the problem can be split into 2+1 dimensions, instead of 2+2 dimensions for 4×4 MIMO, and for 5×5 MIMO into 2+3 dimensions. Similarly, the method can be used for higher dimensionalities.
Advantageously, the technique of the invention may be used to greatly speed up 2×2 MIMO demodulations in cases where multiple demodulations with the same channel are to be made and only hard bits are needed. The pre-calculations for the function f can then first be made once and subsequently only the table lookup step is needed for each individual demodulation.
The complexity of the proposed method is in Table 2 compared with the complexity of the normal brute-force LOGMAX algorithm and the m-algorithm. The results for the method of the invention were obtained through simulations and the numbers for the other methods were estimated theoretically.
The aforementioned examples for 4×4 MIMO antennas can be generalized for any number of transmit antennas transmitting nr symbols, as illustrated in the flow diagram of
The invention may be implemented as a hardware or software solution.
The invention may be implemented in a network node which may comprise a computer unit for processing signals. The network is assumed to have a number of transmit antennas transmitting nT symbols, and in which xm, denotes symbols transmitted, wherein m=1 . . . nT. A symbol alphabet is assumed to contain L symbols. The computer unit 500, as illustrated schematically in
The invention is not limited to MIMO systems and may be implemented in any multiple transmit/receive systems such as SIMO, MISO etc.
It should be noted that the word “comprising” does not exclude the presence of other elements or steps than those listed and the words “a” or “an” preceding an element do not exclude the presence of a plurality of such elements. The invention can at least in part be implemented in either software or hardware. It should further be noted that any reference signs do not limit the scope of the claims, and that several “means”, “devices”, and “units” may be represented by the same item of hardware.
The above mentioned and described embodiments are only given as examples and should not be limiting to the present invention. Other solutions, uses, objectives, and functions within the scope of the invention as claimed in the below described patent claims should be apparent for the person skilled in the art.
From an implementation point of view not only computational complexity but also memory requirements are important. Because of the pre-calculations, the method requires a certain amount of memory. If implemented in accordance with the description for efficient computation of f above, the method requires for M=4 a memory of
For 16 QAM, 64 QAM, 256 QAM this corresponds to approximately the memory listed in Table 2.
This implies that the amount of the memory required is reasonable.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/SE2006/000867 | 7/7/2006 | WO | 00 | 2/17/2009 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2008/004921 | 1/10/2008 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6901122 | Nadgauda et al. | May 2005 | B2 |
7236536 | Hochwald et al. | Jun 2007 | B2 |
7505527 | Hwang et al. | Mar 2009 | B2 |
7864897 | Maeda et al. | Jan 2011 | B2 |
Number | Date | Country |
---|---|---|
WO03030414 | Oct 2003 | WO |
Number | Date | Country | |
---|---|---|---|
20100296608 A1 | Nov 2010 | US |