The present invention relates to communication systems, and more particularly to multimode detectors and detection methods.
Wireless communication systems typically use band-limited channels with time-varying (unknown) distortion and may have multi-users (such as multiple clients in a wireless LAN). This leads to intersymbol interference and multi-user interference, and requires interference-resistant detectors and detection methods for interference-limited systems. Interference-limited systems include multi-antenna systems with multi-stream or space-time coding which have spatial interference, multi-tone systems, TDMA systems having frequency selective channels with long impulse responses leading to intersymbol interference, CDMA systems with multi-user interference arising from loss of orthogonality of spreading codes, high data rate CDMA which in addition to multi-user interference also has intersymbol interference.
Interference-resistant detectors commonly invoke one of three approaches to combat the interference: maximum likelihood sequence estimation, (adaptive) linear filtering, and decision-feedback detection. However, maximum likelihood sequence estimation has problems including impractically large computation complexity for systems with multiple transmit antennas because the decoding requires a search over the topological product of multiple copies of the symbol constellation. Linear filtering detection, such as linear zero-forcing (ZF) and linear minimum squared error (MMSE), has lower computational complexity (search over a single symbol constellation) but has relatively poor performance due to excessive noise enhancement. And decision-feedback (iterative) detectors, such as iterative ZF and iterative MMSE, decode the transmitted symbols successively with cancellation of previously-detected symbols in each detection; the order of detection may depend upon SINR or other measures of symbol reliability. Such methods have moderate computational complexity but only moderate performance.
However, these methods have problems including trade-offs of performance and complexity which are inappropriate for many conditions.
The present invention provides multimode detectors and detection methods for adapting detection method to conditions. Preferred embodiments apply to multiple-input multiple-output (MIMO) systems.
This has advantages including performance approaching maximum likelihood detection but with complexity comparable to that of iterative interference cancellation detection.
a-3f illustrate functional blocks of detectors, receivers, and transmitters.
a-4d and 5a-5d illustrate simulation results.
1. Overview
Preferred embodiment multimode detectors and detection methods have detection mode selection dependent upon channel estimation;
Preferred embodiment communication systems, such as cellular phones and wireless local area networks, include preferred embodiment receivers employing preferred embodiment detection. The detection computations can be performed with digital signal processors (DSPs) or general-purpose programmable processors or application specific circuitry (ASICs) or systems on a chip such as both a DSP, ASIC, and RISC processor on the same chip with the RISC processor controlling. Analog-to-digital converters and digital-to-analog converters provide coupling to the real world, and modulators and demodulators (plus antennas for air interfaces) provide coupling for transmission waveforms.
2. Interference-Limited Receivers Generally
For any interference-limited system the baseband received signal can be written in Q-dimensional vector format:
r=Hs+w
where r is the Q-vector of samples of the received baseband signal (complex numbers) corresponding to a transmission time (interval/block) n:
s is the P-vector of transmitted data symbols (sets of complex numbers of symbol constellations) for time n:
H is the Q×P channel matrix of attenuations and phase shifts; and w is a Q-vector of samples of received additive (white) noise with variance per element of σ2. In particular, H=[h1 h2 . . . hP] where hp is the channel (maybe including multipath combining and equalization) from the pth data source to the receiver, and so:
r=Σ1≦p≦Phpsp+w
The channel matrix H may be estimated using known (pilot) symbols.
The foregoing relation applies generally to various systems with various interference problems and in which n, r, s, P, and Q have corresponding interpretations. For example:
(ii) Broadband wireless systems employing OFDM (orthogonal frequency division multiplex) signaling and MIMO techniques for each tone or across tones; see
(iii) TDMA (time division multiple access) systems having frequency-selective channels with long impulse response which causes severe ISI (intersymbol interference). Use equalizers to mitigate ISI.
(iv) CDMA (code division multiple access) systems having frequency-selective channels which cause MUI (multi-user interference) as a result of the loss of orthogonality between spreading codes. For high data rate CDMA systems such as HSDPA and 1×EV-DV, this problem is more severe due to the presence of ISI. Equalizers and/or interference cancellation may be used to mitigate these impairments;
(v) Combinations of foregoing.
P is essentially the number of symbols that are jointly detected as they interfere with one another, and Q is simply the number of collected (spatial and/or temporal) samples at the receiver. Presume that different symbols that are transmitted via P different antennas are uncorrelated and may utilize different modulation schemes; thus with λk denoting kth symbol expected energy E[|sk|2]:
where H denotes Hermitian conjugate. Because there are P independent sources, Q must be at least as large as P to separate the P symbols. A detector in a receiver as in
For linear filtering equalization detectors, such as linear zero-forcing (LZF) or linear minimum mean square error (LMMSE), the soft estimates, denoted by P-vector z, derive from the received signal by linear filtering with P×Q filter matrix F; namely, z=F r.
LZF detection essentially takes F to be the pseudoinverse of H; namely,
FLZF=[HHH]−1HH
For an invertible matrix H, F reduces to the inverse, H−1.
In contrast, LMMSE detection finds the filter matrix F by minimizing the mean squared error, E[∥z−s∥2]. With perfect estimation of the channel H, the minimizing matrix F is given by:
where IQ is the Q×Q identity matrix. Note that F has the form of a product of an equalization matrix with the matched filter matrix, HH, for the channel. Also, note that this LMMSE detector is biased; however, a normalization can correct this bias as indicated in
Essentially, LZF detection nulls out the interference at the expense of excessive noise enhancement, while LMMSE detection minimizes the sum of interference and noise energy to provide a balanced trade-off between bias (due to residual interference) and variance (due to noise enhancement). But then each component of z is processed separately to obtain a corresponding hard symbol estimate. That is, a hard estimate, ŝi, for the ith symbol, si, comes from application of a hard decision operator to the soft estimate: ŝi=D{xi} for i=1, 2, . . . , P, where D{ } is a non-linear decision device, usually a single-input, single-output (SISO) maximum-likelihood (ML) detector.
A typical iterative (decision-feedback) detector (IZF or IMMSE) for blocks of P symbols has a series of P linear detectors (P iterations) with each linear detector followed by a (hard) decision device and interference subtraction (cancellation). Each of the P linear detectors (iterations) generates both a hard and a soft estimate for one of the P symbols. The hard estimate is used to regenerate the interference arising from the already-estimated symbols which is then subtracted from the received signal, and the difference used for the next linear symbol estimation. Hence, the hard symbol estimates, ŝi, are also computed in a disjoint fashion.
On the other hand, the optimal joint maximum likelihood (ML) detector generates the hard symbol estimates as:
ŝML=arg minsεC|r−Hs|2
where C=C1×C2× . . . ×CP is the P-dimensional constellation of P-vectors of symbols and has size Π1≦j≦P |Cj| where |Cj| denotes the number of elements in constellation Cj. Thus the hard symbol estimates are jointly computed, and the computational complexity of this optimal detection is proportional to the constellation size which is exponential in P. Alternatively, a soft-decision ML detector that generates bit/symbol probabilities (or log likelihood ratios) can be used.
When an error correcting code is employed, soft symbol or soft bit log likelihood ratio (LLR) estimates are often required for soft decoding operations. While soft symbol estimates can be obtained in many ways, the conditional expectation estimator is optimal. The conditional expectation estimator is:
The performance loss of iterative detectors comes from the disjoint processing of the soft symbol estimates, zi, to obtain the corresponding hard symbol estimates, ŝi. For linear and iterative detectors, bit LLRs associated with different symbols from different transmit antennas are computed disjointly. For the optimal joint ML detector, however, bit LLRs associated with different symbols from different transmit antennas are jointly computed. The fundamental operations for each detector are identical regardless whether a forward error correcting code is used or not.
3. Preferred Embodiment Multimode Detectors
The performance difference between two MIMO detectors depends upon the channel conditions. In general, detectors with higher complexity outperform detectors with lower complexity. As an example, the following detectors are ranked according to their performance and complexity:
When the channels associated with different transmit antennas are orthogonal (i.e., hj|hk=0 for j≠k where | denotes the inner (scalar) product), all of the preceding detectors have the same performance. In fact, they are mathematically equivalent due to the absence of interference. In general, {h1, h2, . . . hP} are not orthogonal. The degree to which {h1, h2, . . . hP} are correlated determines the performance differences among different MIMO detectors. As {h1, h2, . . . hP} become more correlated, the performance difference between the ML detector and a linear detector becomes large. This is mainly due to the excessive noise enhancement from the linear detector as linear detectors attempt to invert the correlation effect among different channels associated with different transmit antennas. While iterative detectors result in significantly reduced noise enhancement compared to linear detectors, iterative detectors still fall short in performance compared to the optimal ML detector.
The N detectors could be hardwired or stored programs (e.g., ROM) and run on a programmable processor or a combination of such. Also, the channel estimation and metric computation can be updated, either periodically or adaptively such as in response to detection error correction rates. And the detector selection using the metric value may include hysteresis when switching from one detector to another detector.
The advantages of the preferred embodiment include the use of high complexity detectors only when the channel is sufficiently ill-conditioned to require them. This results in a reduction of average computational complexity which translates to (battery-operated) receiver power savings while maintaining overall receiver performance. And the tradeoff between performance and complexity can be adjusted by redefinition of the partitioning of the f(H) image space.
Two examples of practical f(H)s follow; one uses a condition number for the channel, and the other relies upon the distribution of the channel correlations. In particular, given a channel estimate H, the first example computes HHH and finds the eigenvalues of HHH; then the ratio of largest eigenvalue to smallest eigenvalue defines the condition number, CN(HHH). Note that HHH is always Hermitian, so the eigenvalues are real. Now a large value of f(H)=CN(HHH) indicates the smallest eigenvalue is close to 0 and that H is close to being singular (has less than maximum rank); thus a high performance detector should be used. Indeed, for a MMSE-type detector, f(H)=CN(HHH+σ2IP) may be a more appropriate metric. The condition number measures eigenvalue spread but does not always reflect the correlations of the channels associated with different transmit antennas. For instance, in the case P=2 and with orthogonal channels (h1|h2=0), it is still possible that one channel has much greater attenuation than the other channel, so that ∥h1∥>>∥h2∥ and consequently a large CN(HHH) value, although the orthogonal channels implies use of a low complexity detector.
For the second example, f(H) derived from the distribution of channel correlations, first define the normalized correlations for j,k=1, 2, . . . , P:
Cjk(H)=|hj|hk|2/∥hj∥2∥hk∥2
Of course, the hj|hk are just the matrix elements of HHH. Then the mean cross-correlation metric is defined as:
MCC(H)=Σ1≦j≠k≦PCjk(H)/P(P−1)
And the variance of cross-correlations is:
VCC(H)=Σ1≦j≠k≦P(Cjk(H)−MCC(H))2/P(P−1)
Similarly, other statistical metrics can be derived from combinations of the normalized correlations. Indeed, the use of f(H)=MCC(H) treats all channel correlations roughly equally, whereas f(H)=VCC(H) emphasizes outliers from the mean, either larger or smaller. Note that different metrics may be better for different scenarios; and in this case, one may use a combination of several metrics to achieve the most desirable result. For instance, one may use a combination of both MCC(H) and VCC(H) for f(H) and detector selection.
After a metric f(H) has been chosen, the partition the image space of f(H) into N regions to define selection of the N detectors available. For example, when f(H) has real values, the partitioning of the image space amounts to selecting N−1 thresholds, γ1, γ2, . . . , γN−1, to define the N regions. Then with the implicit understanding γ0=−∞ and γN=∞, when γn−1≦f(H)<γn, the selection is detector n.
As a simple example, consider N=2 with the joint ML and the iterative MMSE (IMMSE) detectors available. First, choose the metric f(H)=MCC(H); note that the normalization of Cjk(H) implies MCC(H) lies in the range [0,1]. Next, pick the threshold γ1. Then the detector selection is IMMSE when MCC(H)<γ1 and ML when γ1≦MCC(H). The threshold parameter γ1 is chosen to facilitate a trade-off between performance and complexity. A smaller γ1 results in more frequent usage of the ML detector and higher performance but consequent higher power consumption. Simulation results quantify this trade-off; see the following section.
Further with the foregoing example, the alternative choice of metric f(H)=CN(HHH+σ2I2) similarly requires a threshold γ1, but in this case the values of CN(HHH+σ2I2) lie in the range [1,∞).
4. Simulations
a-4d and 5a-5d illustrate the performance and detector utilization of a 2-mode preferred embodiment detector (ML and IMMSE modes) with QPSK and 16 QAM modulation, respectively.
In particular, for QPSK modulation
a-5d are analogous to
5. Modifications
The preferred embodiments can be modified in various ways while retaining one or more of the features of a multimode detector with channel analysis metric selection of detection mode.
For example, the set of detectors available for selection could include any convenient types and could be updated periodically or adaptively. Also, the switching between detectors (detection modes) could be subject to hysteresis in either the sense of the metric must cross a threshold and maintain the threshold crossing for a minimal time interval prior to switching or in the sense of the metric must cross the threshold by a minimum amount prior to selection switching or a combination of both senses. For example, in the And the detection methods could have various enhancements.
Another possible variation is the use of long-term channel condition instead of short-term (instantaneous) channel. That is, the detector selection is done periodically on a long-term basis based on long-term channel condition(s) such as the second order statistics of the channel. For instance, a metric based on the following statistics can be used (analogous to the cross-correlation metric):
ACjk(H)=E[|hj|hk|2]/E[∥hj∥2] E[∥hk∥2]
where E[.] denotes statistical expectation (i.e., average over a set of samples). Since the above statistics change at a much slower rate compared to the short-term counterpart, less frequent update is needed at the expense of power saving.
Also, ordered detection based on the symbol post-detection signal-to-interference-plus-noise ratio (SINR) is often used to reduce the effect of decision feedback error. Let the detection order be π(1), π(2), . . . , π(P) where π( ) is a permutation of the P integers {1, 2, . . . ,P}; that is, the first estimated symbol (hard estimate output) will be ŝπ(1). The maximum SINR of the components of the first soft estimate z(1), which estimates all P symbols, determines π(1). Similarly, the SINRs of the components of z(2), which estimates all of the symbols except the cancelled sπ(1), determines π(2), and so forth. That is, the ith iteration estimates symbol sπ(i).
Further, MMSE detectors are known to be biased estimators in the sense that E[zk|sk]−sk≠0. However, the bias of the MMSE detectors can be removed by applying a scaling factor to the soft outputs. This scaling factor does not affect post-detection SINR, yet results in increased mean squared error compared to the regular biased MMSE estimate. While this unbiasing operation does not affect the performance of LMMSE detectors, it improves the performance of IMMSE detectors because the decision device generating decision feedback assumes unbiased soft output. The unbiasing operation for IMMSE detectors rescales the soft estimates zk to soft estimates {hacek over (z)}k as follows:
Ak is the Q×(P−k+1) matrix [hk, hk+1, . . . , hP], and the subscript 1,1 denotes the (1,1) matrix element. For unbiased IMMSE, variance-based and mean-squared-error-based normalizations are equivalent.
For a channel encoder (e.g.,
For example, an AWGN channel where the residual interference (interference which is not cancelled) is also a zero-mean, normally-distributed, independent random variable, gives:
p(zk|sk=c)˜exp(−|zk−c|2/γk)
where c is a symbol in the symbol constellation and γk is a scaling (normalization) typically derived from the channel characteristics and the detector type. Of course, γk is just twice the variance of the estimation error random variable.
For LZF type detectors the natural choice of γp is the variance of the noise term associated with the soft estimate; that is, γp=var(np) where zp=sp+np. This relates to the AWGN noise power of the channel (σ2) and the corresponding (p,p) diagonal term of the matrix which amplifies the channel noise:
γp=σ2[HHH]−1p,p
And for the iterative ZF detector (with numerical ordering) the analog applies:
γp=σ2[ApHAp]−11,1
where the (1,1) element corresponds to the channel from the pth symbol source due to the definition of Ap with first column equal hp.
For MMSE-type detectors a natural choice is to take γp as the mean squared error:
γp=E[|zp−sp|2]
For LMMSE this translates to
γp=σ2[HHH+σ2Λp−1]−1p,p
and for the unordered iterative MMSE the analog obtains:
γp=σ2[ApHAp+σ2Λp−1]−11,1
MMSE-type detectors can also use variance type scaling.
Because MMSE detectors are biased in the sense that the mean of the estimation error is not zero, the variance scaling becomes
γp=E[|zp−sp|2]−E[zp−sp]|2.
For linear MMSE this is
γp=σ2([HHH+σ2Λ−1]−1HHH[HHH+σ2Λ−1]−1)p,p
and for unordered iterative MMSE this is:
γp=σ2([ApHAp+σ2Λp−1]−1ApHAp[ApHAp+σ2ΛP−1]−1)1,1
The bias of the MMSE detectors can be removed as previously described, and for unbiased detectors the mean-squared-error scaling and the variance scaling are thus equivalent.
An alternative scaling simply takes
γp=σ2/(λp∥hp∥2) for p=1, 2, . . . , P
where ∥hp∥2 is the square of the norm of the channel Q-vector hp; that is, the sum of the squared magnitudes of the Q components of the pth column of channel matrix H.
In terms of the bits bkj which define the symbol sk in its constellation (e.g., two bits for a QPSK symbol, four bits for a 16 QAM symbol, et cetera), take as a practical approximation P(zk|bkj=1)=p(zk|sk=ckj=1) where ckj=1 is the symbol in the sub-constellation of symbols with jth bit equal 1 and which is the closest to zk; that is, ckj=1 minimizes |zk−cj=1|2 for cj=1 a symbol in the sub-constellation with jth bit equal to 1. The analog applies for p(zk|bkj=0) using the sub-constellation of symbols with jth bit equal 0.
Similarly, the decoder for a binary trellis may use log likelihood ratios (LLRs) which are defined as
where the first log term includes the probability distribution of the demodulated symbol zk which can be computed using the channel model. The second log term is the log of the ratio of a priori probabilities of the bit values and typically equals 0. Then again using the approximation P(zk|bkj=1)=p(zk|sk=ckj=1) where ckj=1 is the symbol in the sub-constellation of symbols with jth bit equal 1 and which is the closest to zk together with equal a priori probabilities yields:
Thus the LLR computation just searches over the two symbol sub-constellations for the minima. The magnitude of LLR(bkj) indicates the reliability of the hard decision bkj=0 when LLR(bkj)<0 and bkj=1 when LLR(bkj)≧0.
The LLRs are used in decoders for error correcting codes such as Turbo codes (e.g., iterative interleaved MAP decoders with BCJR or SOVA algorithm using LLRs for each MAP) and convolutional codes (e.g. Viterbi decoders). Such decoders require soft bit statistics (in terms of LLR) from the detector to achieve their maximum performance (hard bit statistics with Hamming instead of Euclidean metrics can also be used but result in approximately 3 dB loss). Alternatively, direct symbol decoding with LLRs as the conditional probability minus the a priori probability could be used.
The following copending application discloses related subject matter and have a common assignee with the present application: application Ser. No. 10/618,970, filed Jul. 14, 2003.