This application claims priority from European patent application No. 05425526.0, filed Jul. 21, 2005, which is incorporated herein by reference.
An embodiment of the present invention relates to signal-decoding techniques.
Wireless Local Area Networks (W-LANs) have become a widespread technology on the telecommunications market for providing a wide-band connectivity between computers and other electronic devices. W-LANs are currently deployed in various environments, for example industrial companies and public and residential premises, in so far as they enable a high-speed data access for users. The most recent standards, IEEE 802.11a and 802.11 g, are able to provide transmission rates of up to 54 Mb/s, exploiting, as technology for the physical layer (PHY), a technique of Orthogonal Frequency Division Multiplexing (OFDM) and a Bit-Interleaved Coded Modulation (BICM) on 5.2 GHz and 2.4 GHz, respectively. However, the demand for high-rate services for data is continuously growing, and a wider range of coverage is much appreciated in these types of applications. These factors are currently leading to the definition of a new W-LAN standard based upon an innovative technology capable of improving the performance of the entire system.
The adoption of OFDM reduces the complexity of equalization at the expense of a partial loss in bandwidth due to cyclic prefix insertion.
Thanks to pseudo-random bit interleaving, in order to de-correlate successive encoded bits the channel encoder and the QAM modulator (QAM—Quadrature Amplitude Modulation) can be selected independently, thus providing the possibility of different system configurations.
Multiple-Input/Multiple-Output (MIMO) radio interfaces have been studied in depth over the last few years, and have been widely considered as a suitable solution for improving the performance of modern wireless communication systems. The joint use of MIMO and OFDM techniques, in combination with a Space-Frequency Bit-Interleaved-Coded-Modulation (SF-BICM) architecture, as described in D. Zuyderhoff, X. Wautelet, A. Dejonghe, and L. Vandendorpe, “MMSE turbo receiver for space-frequency bit-interleaved coded OFDM”, IEEE Vehicular Technology Conference, 2003, which is incorporated by reference has proven able to provide a high data rate and simultaneously a mitigation of the effects of channel fades.
New-generation W-LAN systems, in addition to guaranteeing an ever-faster data rate, should also guarantee higher levels of performance in terms of system reliability. These factors lead to adopting, at the receiver end, innovative, yet at the same time sophisticated, techniques of decoding and detection, such as, for example, ones based upon the “turbo”-MIMO principle.
To concentrate on the aspect of detection, the optimal technique is based upon the MAP algorithm, which maximizes the a-posteriori probability (MAP—Maximum A-posteriori Probability). The MAP algorithm may, however, not be physically implementable, above all when combined with a high order of modulation, as in the case of 64-QAM.
In fact, by adopting more than one antenna at both the transmitter end and the receiver end, the computational complexity is found to grow exponentially with the order of modulation and also with the number of transmitting antennas.
Consequently, in order to cope with questions of complexity and performance, sub-optimal schemes have been proposed, based upon simpler detectors, such as Minimum-Mean-Square-Error (MMSE) detectors to be used in iterative decoding and detection schemes instead of the MAP algorithm, which, as has been said, maximizes the a-posteriori probability. However, even using MMSE detectors, the computational complexity increases proportionally with the number of iterations, with a consequent major impact also on the latency constraints. Finally, also the constraints on the size of the memory can become a significant aspect especially at the user-terminal end.
From the foregoing description of the current situation, there exists a need to define solutions that enable exploitation of the advantages linked to MIMO techniques, without this entailing any increase in the computational, and hence, circuit complexity, which may render impractical the application of these techniques in widespread application contexts.
In accordance with an embodiment of the present invention, a method that meets this need is provided. Embodiments of the present invention also relate to a corresponding system, the corresponding receiver, as well as a computer-program product, loadable into the memory of at least one computer, and comprises software code portions for implementing the aforesaid method. As used herein, the reference to such a computer-program product is understood as being equivalent to the reference to a computer-readable medium containing instructions for controlling a computer system for the purpose of coordinating execution of a method according to an embodiment of the invention. The reference to “at least one computer” is intended to highlight the possibility for an embodiment of the present invention to be implemented in a distributed and/or modular way.
One or more embodiments of the invention will now be described, by way of non-limiting example, with reference to the drawings.
The exemplary embodiments described herein have the purpose of reducing the complexity of a turbo-MIMO-MMSE reference scheme, targeting a significant reduction in the number of computations of the main blocks used in the iterative detection process. In particular, the main blocks are a soft-interference estimator (SIE), a MMSE detector, and the QAM soft de-mapper. The first two blocks are simplified by exploiting, respectively, Gray coding and the properties of matrix algebra, without introducing any approximation with respect to the formulas proposed in the literature. Instead, the last block has been simplified by proposing an appropriate method that approximates the calculation of the log-likelihood ratio (LLR) with respect to the standard computational method.
In particular, consider a scenario with T antennas at the transmitter end and R antennas at the receiver end. Then suppose that a stream of 1000-byte data packets is generated in a W-LAN. The stream is encoded with a 64-state ½-rate convolutional encoder, subjected to interleaving and mapped on an N-QAM Gray constellation (with N=4, 16, 64, . . . ). The physical framing format of the packet is generated in accordance with the specifications of the W-LAN standard 802.11a but imposing that the number S of OFDM symbols for transmission on the air should be a multiple of the number T of antennas at the transmitter end and adopting for the data a number of subcarriers equal to 56.
The block diagram illustrated in
A MIMO frequency-selective channel is represented by the matrix H(f)εCR×T:
where f=1, 2, . . . , F is the frequency index, ht(f) are column vectors, the elements of which hr,t(f) represent the paths between the transmitter t and the receiver r, at the f-th tone. These elements are normalized as follows:
with t=1, 2 . . . , T, and r=1, 2, . . . , R. The channel coefficients hr,t are generated, according to the indoor MIMO channel model, for environments with high levels of scattering.
For reasons of simplicity, the index f is omitted in the remaining part of the description in so far as OFDM enables consideration of each tone independently of the others. Hence, all the equations appearing from this point on will be referred to a single OFDM data tone. In this way, the signal received can be written as follows:
y=Hx+n (3)
where
are the vectors of the symbols transmitted and received respectively, and
is a white-Gaussian-noise vector with components that are independent and identically distributed with a covariance matrix:
Rn=E[nnH]=σN2I (6)
(nH denotes the Hermitian, or conjugate transpose, operator applied to the vector n). The transmitted N-QAM symbols are uncorrelated, with zero mean value and normalized variance σx2=1 for each transmitting antenna, namely:
Rx=E[xxH]=σx2I=I (7)
Therefore, starting from Eqs. (6) and (7), the signal-to-noise ratio (SNR) transmitted is equal to
With reference to
The signals generated by the OFDM modulators 26, 30 are transmitted on a MIMO channel 32.
At the receiver end, two OFDM demodulators 34, 36 receive the signals from the channel 32.
The demodulators 34, 36 forward the signals received to a MMSE detector 38, with which a soft-interference estimator (SIE) 40 is associated.
The QAM demodulators 42, 44 send their signals to a space-frequency bit de-interleaver 18, which generates a plurality of signals OFDM1, OFDM2, . . . , OFDMS/T.
This plurality of signals is sent at input to a convolutional decoder 14, which generates a bit stream at output. This bit stream can be sent at input to a module 12, which executes the bit-to-bit comparison and generates the parameters of bit error rate (BER) and frame error rate (FER).
The decoder 14 returns the decoded bits and, through a feedback loop, supplies a space-frequency bit interleaver 20, which in turn supplies the data to the soft-interference estimator 40.
As appears from
If a perfect channel-state information (CSI) is assumed at the receiver end, the de-mapper, using the estimates {circumflex over (x)}t, calculates the log-likelihood ratios (LLRs) λt(m) of the encoded bits, where m=1, 2, . . . , M, and M=log2(N) is the order of modulation. These soft values are passed on to the de-interleaver and then to a SOVA decoder (SOVA—Soft Output Viterbi Algorithm). The SOVA decoder generates not only decisions on the bits of the data, but moreover generates extrinsic information ξt(m) with m=1, 2, . . . , M corresponding to the encoded bits. This extrinsic information is used as a-priori knowledge at the detector end, after the bit-interleaving process, to carry out a soft-interference cancellation (SIC).
In this way, the MMSE detector 38 supplies more reliable estimates of the symbol xt transmitted, with t=1, 2, . . . , T, drawing advantage from the soft information fed back by the channel decoder to the input of the MMSE detector 38. Furthermore, the process of de-mapping also takes into account the a-priori probability ξt(m) for updating the LLR values λ(m) that will be used in the new decoding process.
For immediate reference, described hereinafter are the classic approach and the low-complexity approach of the system of soft-interference cancellation described herein
We shall define a function bitm[x(n)], m=1, 2, . . . , M, where M is the degree of modulation. This function returns at output a value {0,1} corresponding to the m-th bit of the symbol x(n), n=1, 2, . . . , N of an N-QAM constellation.
By exploiting the extrinsic information of the LLR, as supplied by the SOVA decoder
of the encoded bits, a soft-interference estimator (SIE) calculates soft estimates of the symbols transmitted by the t-th antenna, with t=1, 2, . . . , T
{tilde over (x)}t=E[xt]m=1, 2, . . . , M (10)
Assuming that the bits within a symbol are statistically independent of one another, the probability P[xt =x(n)] can be expressed as
At the start of the iterative process, ξt(m)=0 for every m and t; as a consequence, all the symbols are equally likely, and their probability Pt(n) is equal to 2−M.
Consequently, the soft-estimates
are equal to zero, and no cancellation is performed in the first iteration.
In the subsequent iterations, since the reliability of the bits supplied by the SOVA decoder increases, the soft-estimates become closer to their true values, and for the t-th antenna, the soft-estimates of the symbols coming from the other (T−1) interfering antennas are cancelled to obtain
where {tilde over (x)}, is the column vector with the soft-estimates, except for the t-th element, equal to zero.
In addition to calculating Zt, also the error covariance matrix is calculated as
where
{tilde over (e)}t=xt−{tilde over (x)}
and
It may be seen from Eqs. (12) and (16) that the probability Pt(n) is calculated N times for each transmitting antenna and is multiplied N times by the corresponding QAM symbol and also by its energy.
The solution described in what follows enables a lower complexity for the system of soft-interference cancellation to be obtained.
Basically, the goal is to reduce the operations performed in expressions (12) and (16). This can be done by regarding the Gray mapping as a combination of two {square root over (√N)}-PAMs along the real and imaginary axes.
The 16-QAM constellation of
Therefore, {tilde over (x)}t and σ{tilde over (e)}t2 can be written as
It should be emphasized that this simplification functions not only for a Gray mapping, but also for other types of mappings that envisage a configuration that is different but can be separated into a real part and an imaginary part.
With reference to what has been seen previously, it is possible to define
where n=1, 2, . . . , M/2.
In the normalized-power 16-QAM constellation of
10 it follows that expression (17) becomes
Likewise, a similar equation is obtained considering Eq. (18), as given below:
In the same way, Eqs. (19) and (20) can be written as
Extending the same concept to a 64-QAM Gray mapping, it is possible to interpret this mapping as two 8-PAM modulations, as in
To sum up, the symbol estimated by applying Eq. (12) can be directly calculated, avoiding expansion of expression (11) for each symbol transmitted.
Table 1 summarizes the computational costs, per data bit, of the interference cancellation for a system with T=2 and R=3 in the classic case (A) and in the low-complexity case (B) described herein. It has been assumed that the multiplications that involve powers of two do not involve significant costs and hence have not been considered.
There will now be described, once again in a comparative way for immediate reference, how the classic approach and the low-complexity approach described herein result in the structure of the MMSE detector.
In the classic approach, a MMSE detector minimizes the mean-square error between the symbol transmitted xt and the output of the Wiener filter {circumflex over (x)}t.
This filter is represented by a vector wtεCR such that
{circumflex over (x)}t=wtHzt (37)
where zt is the observation vector expressed in Eq. (13).
It can be shown that the MMSE filter is given by
wt=(σN2I+HR{tilde over (e)}tHH)−1ht (38)
where σN2 is the noise power and R{tilde over (e)}t is defined in Eq. (14).
The vector wt must be re-calculated for each transmitting antenna t and for each iteration, with consequent considerable computational costs.
The low-complexity approach described herein pursues, instead, the goal of reducing the computational cost of the formula that describes the Wiener filter.
This has been obtained by exploiting the Hermitian structure of R{tilde over (e)}t (it is a real diagonal matrix) and a Woodbury formula, as follows:
where
is the t-th unit vector of the T-dimensional space. The major advantages introduced by Eq. (39) are the following:
the matrix R{tilde over (e)}t no longer appears between H and HH, as in the formula (38); instead, the product HH H does not depend upon the variance of the symbols estimated, and can be calculated only once and used for each SIC iteration to detect all of the OFDM symbols belonging to the same W-LAN packet;
H and HH exchange their relative position. Since the number R of receiving antennas is generally equal to or greater than the number T of transmitting antennas, a matrix (σN2I+R{tilde over (e)}tHHH)εCT×T must be inverted instead of a matrix (σN2I+HR{tilde over (e)}tHH)εCR×R, with R≧T;
in expression (38) the complete computation of the inverse matrix (σN2I+HR{tilde over (e)}tHH)−1 is required for each t. Instead, in formula (39), (σN2I+R{tilde over (e)}tHH)−1 is multiplied by the unit vector ut, so that it is sufficient to calculate only one column of the matrix for each t.
Finally, a further reduction of complexity of the detector block is obtained by defining
A=HHH (41)
b=HH y (42)
so that expression (37) can be reformulated as
In this way, the term y−H{tilde over (x)}t is replaced by b−A{tilde over (x)}t, where AεCT×T (instead of HεCR×T) and bεCT×1 (instead of yεCR×1).
Neither of these terms depends upon the symbol estimates {tilde over (x)}t so that they can be calculated at the start of the iterative-detection process and stored in a memory, to be used during the subsequent iterations.
Table 2 summarizes the computational costs of MMSE detection, per data bit, averaged over the transmission of S OFDM blocks, for a system with T=2 and R=3 in the classic case (A) and in the low-complexity case (B).
There now follows a description, once again in a comparative way for direct reference, of the criteria of implementation of the classic approach and the low-complexity approach described herein for the de-mapping block.
The classic way of expressing the a-posteriori LLRs of the bits belonging to a symbol xt is
where i, m=1, 2, . . . , M, σ2 {circumflex over (x)}t is the variance of the estimate {circumflex over (x)}t, ξt (i) is the a-priori information defined in Eq. (9), and Bm(1), Bm(0) are the subsets of the QAM symbols, namely
Bm(1)⊂[1, N]⊂: bεBm(1)bitm[x(b)]=1 (45)
Bm(0)⊂[1, N]⊂: bεBm(0)bitm[x(b)]=0 (46)
The metric representing the probability that a constellation symbol has been transmitted is
Said metric is obtained through the sum of two different contributions. The first contribution depends upon the symbol estimate {circumflex over (x)}t, approximated as a Gaussian variable, supplied by the detector. The second contribution depends upon the a-priori information of expression (9) that comes from the SOVA decoder.
Therefore, the linear local regression φt(m) in expression (44) is obtained from the joint evaluation of both of the terms present in Eq. (47).
In order to avoid computation of N sums of exponential terms and a logarithmic operation, for each LLR it is common practise to approximate expression (44) by introducing the well-known and consolidated Max-Log-Map operator, which is obtained as follows:
Finally, the extrinsic soft values at output from the detector are obtained by subtracting from expression (48) the a-priori information ξt (m) coming from the SOVA decoder
λt(m)=φt(m)−ξt(m) (49)
In the low-complexity approach, the aim is to reduce the number of computations in expression (48) by proposing a solution that considers only a subset of the N metrics that the max operator would have taken into account for each linear local regression. The main idea is that, instead of maximizing the entire expression (47), the solution chooses between the metrics that separately maximize either
The solution described herein defines two aims, namely, how to select the subsets and how to reduce the complexity of calculation of the metrics.
The first aim is achieved by introducing two criteria, referred to as distance criterion and a-priori-probability criterion. The second aim consists in the explicit computation of just one metric per subset, from which the other metrics can be obtained through a less costly differential method. Said method will be described in what follows.
The low-complexity approach for de-mapping the symbol can be formalized by means of the following relations:
where j and q are selected according to the distance criterion:
and l and g according to the a-priori-probability criterion:
For reasons of simplicity, a de-normalized 16-QAM constellation has been used in the examples, even though a considerable reduction of complexity can be appreciated in the case of the 64-QAM constellation.
In both
On the basis of the distance criterion, if x(h) is the point closest to {circumflex over (x)}t, it is also the point closest in the subset x(n), with nεBm(bitm[x(h)]) for those symbols that have the m-th bit equal to bitm[x(h)]. For this reason, φh[{circumflex over (x)}t, σ{circumflex over (x)}
of those symbols having the m-th bit different from bitm[x(h)].
Thanks to the Gray-mapping properties, these points belong to the same row (column) of x(h) if the m-th bit steers the real (imaginary) part of the points of the constellation, as illustrated in
The a-priori-probability criterion is very similar to the preceding one, as illustrated in
and maximum a-priori probability. Said symbols are the ones having the m-th bit complemented and the other bits unchanged.
Finally, described in what follows is the calculation of the metrics based upon the differential method.
If we look at Eq. (47) we shall note that it is made up of two terms: one corresponding to the distance between {circumflex over (x)}t and x(n) and the other expressing the a-priori probability of x(n). This method provides a simple way for computing these two terms separately so as to use them as specified in formula (50).
The first term is calculated as described in what follows. With reference to
The constellation is considered de-normalized in such a way that the distances between the symbols are equal to 2. Finally, we define
ΔI=[{circumflex over (x)}t−x(1)] (55)
ΔQ=ℑ[{circumflex over (x)}t−x(1)] (56)
and assuming that the squared distance a2=|{circumflex over (x)}t−x(1)|2=Δ21I+Δ2Q is known, it follows that b2=|{circumflex over (x)}t−x(2)|2 can be obtained by applying the Pythagorean theorem as
with [x(2)]>[x(1)].
If [x(2)]<[x(1)], then it follows that
b2=a2+4+4ΔI (58)
Hence, given the squared distance a2, it is possible to obtain the squared distance b2 of an adjacent point by adding the differential term 4±4ΔI.
Similar equations can be deduced also in the case of two vertically aligned symbols ([x(1)]=[x(2)]). The expression for b2 generalized to two non-adjacent QAM symbols is the following:
where pεZ.
In order to simplify the a-priori terms of the constellation of symbols x(n), as emerges from Eq. (47), it is possible to exploit once again the Gray-mapping properties.
The above term can be obtained by summing the a-priori value of the symbol adjacent to the symbol x(n) to the LLR (with appropriate sign) corresponding to the only bit position for which they differ (or via the sum of a number of LLRs if they are not adjacent).
In conclusion, all the metrics required by the distance criterion and by the a-priori-probability criterion can be calculated starting from a known metric.
The other metrics are obtained simply by adding two terms, which correct the two parts of expression (47). Table 3 compares the computational costs per data bit of the classic de-mapping method (A) and the low-cost de-mapping mapping method (B) in a system with T=2 and R=3.
Finally, presented hereinafter are some numeric results, obtained from simulations.
A comparison between the classic and low-complexity iterative decoding and detection schemes appears in Table 4 and in
Whereas the initial stage for the MMSE calculation has approximately the same computational cost for both the classic scheme and the low-complexity scheme, the subsequent ones can benefit from the reduction of complexity introduced by the block described previously. Two turbo iterations and one SOVA decoding have been considered with a sliding-window approach so as to avoid having to wait for the entire W-LAN-encoded packet to arrive before starting the channel-decoding operation. The size of the window was selected equal to the length of five constraint-length bits.
In the last iteration, the computational cost for the Viterbi method was considered instead of the computational cost of the SOVA method because, in the case of the Viterbi method, only a hard decision on the bits is made. From an examination of the data regarding sums (SUM) and comparisons (CMP), it may be noted that the number of these operations is not so notably reduced by the low-complexity approach as compared to the classic approach, as occurs, instead, in the case of multiplications (MUL) and divisions (DIV). This occurs because the sums (SUM) and comparisons (CMP) are involved a large number of times also in the SOVA decoding operation, which is common to both of the schemes under comparison.
Furthermore, more than three iterations do not seem to be convenient given that the incremental gain for the iterations subsequent to the second is not significant. However, one may use more than three iterations.
A processor or other circuit may execute software that causes the processor to implement one or more of the above-described embodiments, and such a processor may be included in an electronic system such as a computer with wireless capability or wireless router.
Consequently, without prejudice to the principle of the invention, the details of implementation and the embodiments may vary, even significantly, with respect to what is described and illustrated herein purely by way of non-limiting example, without thereby departing from the scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
05425526.0 | Jul 2005 | EP | regional |