This invention relates generally to wireless communication, and more particularly to detecting received symbols using probabilistic data association.
Machine-to-machine (M2M) communication enables machines to exchange short command and control messages. The main design goal is to achieve fast and highly reliable transmission of short messages over wireless channels with relatively simple devices having a small number of antennas, preferably one or two. One of the major application of M2M is in factories wherein automated production processes benefit when machines can communicate directly with each other wirelessly.
A single-input single-output (SISO) communication system with block transmission and detection is a candidate for M2M. The type of messages should use small block sizes, e.g., less than 100 symbols.
One M2M communication system uses pseudo-random phase precoding (PRPP) and likelihood ascent search (LAS) detection. That scheme performs extremely well in practice when the block size is at least 400 symbols. However, the goal is to greatly reduce the block size without degrading performance.
Maximum likelihood detection (MLD) is an optimal symbol detection scheme. However, the complexity of MLD increases exponentially with the block size. Therefore, a variety of suboptimal detection methods with polynomial complexity have been developed. One of the best known is probabilistic data association (PDA).
PDA was originally developed for radar target tracking and has gained interest in wireless communication applications. As such, PDA has been applied for multiuser detection in code division multiple access (CDMA) systems, turbo equalization, multiple input multiple output (MIMO) systems, and decision feedback equalization (DFE) for detecting symbols transmitted in MIMO systems over a frequency selective channel. A bit-by-bit detection of a higher order quadrature amplitude modulation (QAM) using the PDA method is also known.
The embodiments of the invention provide a method for detecting symbols in a wireless communication system subject to additive Gaussian noise. Specifically, the method uses probabilistic data association (PDA). PDA uses iterative block detection on a symbol-by-symbol basis, where contributions from other symbols are treated as interference that is approximated as a sum of the interference and the noise with the Gaussian distribution. Thus, the received symbol can be viewed as the desired symbol corrupted with Gaussian noise, which can be inferred using Bayes' rule. However, conventional PDA can converge to an overconfident soft estimate of the symbol after one or two iterations.
The invention is based on the realization that when the overconfident soft estimate implies an erroneous hard estimate for a symbol, as in the prior art, the PDA loses its ability to correct such a symbol in following iterations, and the PDA gets stuck in a local minimum. This characteristic is the driving force for an error floor on the performance at a relatively high signal to noise ratio (SNR), and limits the applicability of PDA to systems that require high reliability, such in machine-to-machine (M2M) communication systems.
Therefore, the embodiments modify the way the symbols are treated by the PDA. The conventional PDA models a symbol as a discrete sample selected from a constellation. In contrast, the PDA with symbol uncertainty (SU-PDA) according to the embodiments treats the symbol as a sum of a deterministic symbol and random noise. The SU-PDA can outperform the conventional PDA by a few dBs on a channel with block fading, and up to and exceeding 10 dB on a channel with fast fading.
Specifically, the symbol is modeled as a sum of a deterministic, unknown variables selected from a finite constellation (alphabet), and a zero mean random variable having a variance equal to a currently estimated variance of the symbol. The random variable expresses the uncertainty in the desired symbol, and the contribution from other symbols is considered as interference. The sum of the random part of the symbol and the system noise is the equivalent additive noise. The distribution of the sum of the interference and the equivalent additive noise is approximated with the Gaussian distribution whose mean and covariance match the mean and covariance of the quantity being approximated. Although the mean of the approximating Gaussian in the method is the same as the mean in the conventional PDA, the covariance, in contrast, depends on the currently estimated variance of the desired symbol.
The PDA with Gaussian approximation method provides relaxation or regularization to the PDA by modeling the symbol with a smooth and continuous statistical distribution, as opposed to a sum of Dirac pulses as in the conventional PDA. This lowers the possibility of getting stuck at a local minimum, and therefore mitigates the error floor problem in the prior art.
The SU-PDA can outperform the conventional PDA for short block sizes, e.g., less than 100 symbols, at lower bit error rates (BERs) in M2M communication systems. Given the equivalence between the SISO system with block transmission and large multiple input multiple output (MIMO) systems the embodiments of the method are equally applicable for symbol detection in large multiple input multiple output (MIMO) systems.
In the following description, boldface uppercase letters denote matrices and boldface lowercase letters denote vectors. An operator ( )T denotes transpose, and ( )H denotes a complex-conjugation along with transposition, i.e., Hermitian transpose. Unless otherwise specified, all vectors are assumed to be column vectors.
The notation CN(a,b) used herein denotes a complex Gaussian distribution with the first argument a being the mean, and the second argument b being the variance. In addition, we assume that the Gaussian distributions described herein are circularly symmetric. Methods developed in the real domain, as well as the complex domain with non-circular symmetry, are analogous.
System Model
The data block x is processed by a precoder P 120. The precoder essentially spreads the energy of each symbol xn into N signaling intervals, giving rise to time diversity in the case of a fast fading channel.
Formally, the precoder is described by a precoding matrix PεCN×N and its output is given by
p=Px. (1)
The output is passed to a transmitter 130 for front end radio frequency (RF) processing and transmission over a wireless channel 150.
If the wireless channel 150 experiences very slow or block fading, the precoding does not provide sufficient diversity. Formally, in this case, we use the identity precoder, i.e., P=1.
The precoded block p is modulated onto a carrier and transmitted over the wireless channel 150, subject to distortion and noise 160. The signal received by a receiver 170 is filtered, demodulated, converted into the baseband and sampled during front-end RF processing, followed by symbol detection 180 to obtain an estimate of the received symbol {circumflex over (x)}(n) 185. The detector receives an estimated channel matrix H from the channel estimator 190. The focus of the invention is on the detector 180.
The received baseband signal in discrete time is yεCN×1 and is related with the precoded signal p through a linear model,
y=√{square root over (SNR)}Hp+v, (2)
where HεCN×N models distortions caused by the wireless channel, and SNR is the signal-to-noise ratio. The additive noise is a circularly symmetric zero mean uncorrelated Gaussian process, i.e.,
v˜(0,I),
where I is the identity matrix.
The relation between the received signal y and the transmitted block x is obtained by substituting equation (1) into equation (2) such that
y=Gx+v, (3)
where
G=√{square root over (SNR)}HP (4)
is the effective channel matrix. We assume the receiver perfectly knows the effective channel matrix G. Therefore, we focus on the symbol detection.
Although derived for the SISO communication system with block detection, equation (3) models the input-output relationship in other communication systems as well. Because the methods described herein do not in particular rely on the additional specifics of the SISO system, they are equally applicable for symbol detection in other communication schemes driven by the model of equation (3), specifically, multiple input multiple output (MIMO) systems.
Probabilistic Data Association
First, we describe the conventional PDA. For simplicity but without loss of generality, the model of equation (3), after multiplication with G−1, is expressed as
z=x+w, (5)
where z=G−1y. Note that the noise w is a circularly symmetric Gaussian distribution, i.e.,
w˜(0,R)
with the covariance R=(GHG)−1.
In essence, the conventional PDA method iteratively calculates the posterior probability of each symbol xn, conditioned on the received signal z, as
p
n(a)=[xn=a|z|],aεX,n=1,2, . . . ,N. (6)
The method operates as follows. Suppose the method is at iteration i and xn is the desired current symbol to be detected. The received signal z in equation (5) is expressed as
where {ei}i=1n is the conventional basis, i.e., ei contains 1 in entry i and zeros elsewhere. The main idea behind the PDA method is to treat the contribution from all other symbols, except the current symbol xn, to the received signal as interference and approximate the sum of the interference and noise with a Gaussian distribution. That is, when detecting xn, the interference plus noise in equation (7)
is modeled as approximately
(μn,Σn)
The mean μn and a covariance matrix Σn are matched to the mean and covariance of {tilde over (w)}, i.e.,
In addition, the conventional PDA, assumes that the posterior probabilities of all symbols, except the current symbol xn, determined thus far, are the true posterior probabilities. Thus, the mean μn and covariance Σn of the approximating Gaussian distribution can be obtained by evaluating the mean E[xi|z] and variance var(xi|z) of a discrete random variable with a known distribution.
Having approximated the statistics of the interference plus noise in equation (7), the distribution of the received signal z conditioned on xn=a is a complex Gaussian distribution
p
z
|x
n(z|a)=(z;aen+μn,Σn),aεx (11)
Therefore, the posterior probability pn of the symbol xn can be updated via Bayes' rule
where the summation in the denominator is over the constellation X. Note that in equation (12), we exploit the assumption that symbols have uniform prior probabilities.
The conventional PDA method then continues and updates the posterior probability of the next symbol following some prespecified or adaptive ordering scheme. After a termination condition is reached, e.g., a predetermined number of iterations or convergence, the method outputs the estimates of the posterior probabilitiess pn. Hard estimates {circumflex over (x)}n of the corresponding symbols are obtained using the maximum a posteriori probability (MAP) rule,
PDA with Symbol Uncertainty (SU-PDA) Method
Now we describe our PDA method with symbol uncertainty (SU-PDA). The common feature of the conventional PDA and our SU-PDA is that both use the same model equation (5) (or equation (3)), and infer the posterior distribution pn of transmitted symbol xn, conditioned on received signal z (or y).
Suppose we are detecting the current symbol xn. The main idea behind the SU-PDA method is to model the symbol xn as
x
n
=
n
+{tilde over (x)}
n, (14)
where
var({tilde over (x)}n)=var(xn|z). (15)
Intuitively, {tilde over (x)}n represents the uncertainty in the current knowledge about xn. Substituting equation (14) into equation (7) yields
The contribution to the received signal z from the symbols, other than xn, is treated as interference, while {tilde over (x)}nen is viewed as an additional noise term. Overall, the distribution of the sum of the interference and noise
is approximated with a complex Gaussian distribution with a mean μn and covariance Σn.
As in the conventional PDA, the current estimates of posterior probabilities {pi}i=1N are used to evaluate the mean and covariance of the approximating Gaussian distribution such that
Note that the mean μn is the same as in equation (9) for the conventional PDA. However, the SU-PDA the summation in the expression for covariance matrix equation (19) includes contributions from all symbols, as opposed to the conventional PDA equation (10), which does not. Given that interference plus noise {tilde over (w)} is Gaussian distributed, the probability distribution of the received signal z, parameterized by
p
z(z;{tilde over (x)}n=a)=(z;aen+μn,Σn),aεX. (20)
The SU-PDA updates the posterior probability pn of xn by normalizing the parameterized distributions with respect to x. That is,
where the summation in the denominator is over the finite constellation X.
Note that the conditional mean in equation (18) and variance in equation (19) of symbol xn taking values from X={+1,−1} are, respectively, 2pn(i)−1 and 4pn(i)(1−pn(i)), where pn(i) is the estimate of P{xn=1|z} at iteration i. This can be written compactly, since we are assuming BPSK transmissions in
where, (.), denotes the multivariate circularly symmetric complex Gaussian probability density function.
SU-PDA Method Summary
In the basic SU-PDA, the current symbol being detected is modeled as a sum of deterministic, unknown variable taking values from a finite constellation and a zero mean random variable whose variance is equal to a currently estimated variance of the symbol. The random variable represents the uncertainty in our knowledge about the symbol.
The contribution from other symbols to the received signal is considered as interference. The sum of the random part of the symbol being detected and the system noise is the equivalent additive noise. The distribution of the sum of the interference and the equivalent additive noise is approximated with a Gaussian distribution whose mean and covariance match the mean and covariance of the symbol being approximated.
The mean of the approximating Gaussian distribution in the SU-PDA method is the same as the mean in the conventional PDA method. However, the covariance is different from the covariance in the conventional PDA method. In the SU-PDA the covariance depends on the currently estimated variance of the symbol.
The received signal is then modeled as the sum of the desired symbol and the Gaussian distributed random variable with the known mean and covariance. The symbol is inferred from this model using Bayes' rule, and the method iteratively updates the posterior probabilities of the symbols as each symbol is detected.
Advantage of SU-PDA Over Conventional PDA
The conventional PDA method estimates the desired symbol without taking into consideration what the method has learned about that symbol in previous iterations. In contrast, our SU-PDA method models the desired symbol as a deterministic symbol corrupted with noise whose variance is equal to the uncertainty in our knowledge about that symbol.
PDA with Gaussian Approximation (G-PDA)
In the first step of the G-PDA method, the expected value of the interference is subtracted from the received signal. Because the noise w has zero mean, the expected value of the interference is μn and the interference-free signal is
{tilde over (z)}=z−μ
n. (22)
The interference-free signal {tilde over (z)}, using equation (5), is modeled as
{tilde over (z)}=x
n
e
n
+q, (23)
where q is the equivalent noise, given by
Note that q˜(0, Σn), where Σn is evaluated using equation (10).
In the second step of our G-PDA method, the signal {tilde over (z)} in equation (23) is processed with the minimum mean square error (MMSE) filter to estimate the symbol xn. The impulse response of the MMSE filter is given in vector form by
W
MMSE=(enenT+Σa)−1en. (25)
The output from the MMSE filter is scaled and a sufficient statistic {tilde over (z)}o is given by
Substituting equation (25) into equation (26) and using the matrix inversion lemma for the inverse of the rank one update of a matrix yields
Substituting equation (23) into equation (27), yields a simple model for {tilde over (z)}o
{tilde over (z)}
o
=x
n
+{tilde over (q)}, (28)
where {tilde over (q)}˜(0, σ2). The variance σ2 is evaluated as
In the final step, the symbol xn is detected from {tilde over (z)}o using the model in equation (28). It can be confirmed that if xn is modeled as a uniform discrete random variable over the constellation X, the conventional PDA method is obtained. Depending on how the distribution of the symbol xn is modeled, we provide two embodiments of G-PDA.
Modeling xn with a Single Gaussian Random Variable
in this embodiment, we model xn as a single Gaussian random variable whose mean and variance match the mean and variance of xn. Because xn is a point from a finite constellation, without loss of generality,
{tilde over (p)}
x
(x)=(x;0,1) (30)
Hence, given that both xn and {tilde over (q)} in equation (28) are Gaussian distributed scalars, the posterior probability of xn conditioned on {tilde over (z)}o (and therefore on z because {tilde over (z)}o is a sufficient statistics) is also Gaussian distributed, i.e.,
{tilde over (p)}
x
|{tilde over (z)}
o(x|{tilde over (z)}o)=(x;μ′n,σ′n2), (31)
where the mean and variance are, respectively, given by
After substituting equation (27) and equation (29) into equation (32), the mean and variance of the a posteriori Gaussian distribution are given by
Thus, we arrive at the update equations for the mean μ′n and variance σ′n of the current symbol being detected, xn, in our “G-PDA with single Gaussian” method. These are shown in
p
n(a)=∫z(a){tilde over (p)}x
where
Z(a)={z|∥z−a∥<∥z−a′∥,a′εx}. (35)
Thus, for BPSK modulated symbols where the modulation alphabet is X={+1,−1},
z(1)={zε|z>0},
and thus, the posterior probability pn(1) is updated with the probability that a random variable distributed according to equation (31) is positive. Hence,
where the error function (erf) is defined as
as shown in the final update equation in
Modeling xn with a Gaussian Mixture
In this embodiment of the G-PDA, we assume the prior probability on xn is a mixture of Gaussian distributions whose components have means taking values from the transmitter's constellation X, the same variance σ02 and equal weights. Formally,
where the variance σ02 is a parameter that can be adjusted. The posterior probability of xn is, using the Bayes' rule, a Gaussian mixture.
As an example, we consider BPSK modulation in
Following the aforementioned updates, the decision statisic for the current symbol is computed according to EQUATION (27) and for BPSK we have
The distribution of xn conditioned on {tilde over (z)}o, and thus on z, is, using the Bayes' rule, given by
while the weights are
Thus the means μ1,2, variances σ′2 of the Gaussian mixture probabilities in equation (39) are compute by equation (40) and the mixture weights can be computed according to (41). This embodiment of the G-PDA with Gaussian mixture, which does not update the weights w1 and w2 i.e., it keeps the weights equal, can be referred to as G-PDA with partial update.
The posterior probability pn(a) is updated by integrating the Gaussian mixture px
After the posterior probability pn of a symbol xn is updated, the SU-PDA and G-PDA update the posterior probability of the next symbol from a selected ordering, and when a termination condition is reached, e.g., a certain number of iterations or convergence, the method outputs the final estimates of the posterior probabilities and hard estimates of the transmitted symbols obtained using the MAP rule equation (13).
Method for Detecting Symbols
In block 520, we determine if we are done, i.e., we have reached the maximum number of PDA iterations set in block 510, and if true, then we proceed to BLOCK 550 to estimate the symbol {circumflex over (x)} using the maximum a posteriori probability (MAP) rule, and output 560 the detected symbols along with their posterior probabilities.
If not done, then in block 530, we update, for each symbol in the block, the mean and covariance matrix of the Gaussian approximation of the interference. Recall that these are computed according to equations (9) and (19) respectively. If we are using a G-PDA detector, then we cancel in block 531 the expected interference from other symbols in the block according to equation (22), process the result with the scaled MMSE filter to obtain equation (28). The obtained result is used to update the mean and variance of the current symbol in 532. This is done via equation (32) when the current symbol is modeled as a single Gaussian, distribution or through equation (40) if we are modeling the current symbol as a Gaussian mixture. In step 540, after the update of the mean and variance of the current symbol the posterior probabilities are recomputed according to equation (21) for SU-PDA or equations (36) and (42) for G-PDA detectors. The detector increments n in block 542 and repeats the steps in blocks 530, 531, 532, 540 for the next symbol. If all symbols (n=1, 2, . . . , N) have been processed, then the detector increments in 546 the iteration counter i, and reprocesses the block with the updated set of probabilities, pn(a). We note that for the SU-PDA variants the blocks 531 and 532 are not required and after the interference mean and covariance matrix update 530, the posterior probabilities can be updated directly.
G-PDA Method Summaries
In the G-PDA methods, the contribution of the symbols, other than the desired symbol, to the received signal is viewed as interference and modeled as a random variable whose distribution is approximated with a Gaussian distribution. The mean and covariance of the approximating Gaussian distribution are equal to the mean and covariance of the value being approximated.
The mean of the interference is subtracted from the received signal. The interference-free signal is processed through a MMSE filter to estimate the symbol. The MMSE filter output is scaled. The sufficient statistics of the symbol obtained this way are then treated as the sum of the desired symbol and Gaussian distributed noise whose variance is evaluated from the covariance of the interference and the impulse response of the MMSE filter. The sufficient statistics are then used to estimate the desired symbol. The desired symbol is modeled as a sample from a continuous distribution. Depending on a particular distribution, the following embodiments are described.
Single Gaussian
The distribution of the desired symbol is approximated with a Gaussian distribution having a mean and a variance equal to the mean and variance of a corresponding transmitted symbol. Because the additive noise in the model for sufficient statistics is Gaussian distributed, the posterior probability is also Gaussian distributed. Because the desired symbol is discrete, the posterior probability of the desired symbol at a particular constellation point instantiation is updated by integrating the Gaussian posterior probabilities over a region in a complex plane that corresponds to the constellation point.
Gaussian Mixture
The distribution of the desired symbol is approximated with a Gaussian mixture, which contains a Gaussian component around each point in the constellation. The variances of all components are equal and set to some pre-specified value. All Gaussian components are weighted equally.
Given that the noise in the model for sufficient statistics is Gaussian distributed, the posterior probability is also a mixture of Gaussian distributions with the updated weights, means and variances. In general, the updated variances remain equal, while the updated weights and means are different.
The posterior probability of the desired symbol at a particular constellation point is obtained by integrating the posterior probability of the Gaussian mixture over the region corresponding to that constellation point.
Gaussian Mixture with Updates
This method is similar to the above method. The only difference is in that the weights in the posterior Gaussian mixture are not updated, and are thus kept equal.
Advantage Over the Conventional PDA
The symbol to be estimated is, in general, uniformly distributed over a finite constellation and the conventional PDA method exploits this fact.
In contrast, our methods model the desired symbol as a continuous random variable. The approximating continuous distribution either matches the mean and variance, in the single Gaussian method, or resembles the shape of the mixture of the true distribution. This turns out to be sufficient for our methods to perform at relatively low SNR, at most slightly worse than the conventional PDA method.
More importantly, our methods inherently bring relaxation (also known as regularization) to the conventional PDA method. By modeling the desired symbol with smooth and continuous statistical distributions, as opposed to modeling with the sum of Dirac pulses, soft information of the desired symbol tends not to be overconfident as in the conventional PDA method. This lowers the possibility of getting the detector stuck at local minimum, and therefore mitigates the error floor problem.
The selected distributions (a single Gaussian or a Gaussian mixture) enable tractable and simple inference on the desired symbol.
The PDA method can provide good performance in symbol detection and interference cancellation in a variety of communication techniques. One such application is symbol detection in a SISO system with block transmission and detection. Thai technique can provide fast and highly reliable transmission of short messages over wireless channels. These requirements constitute design goal for M2M communication system.
The embodiments of the invention provide PDA methods that outperform the conventional PDA method for short blocks (<100) symbols) and at SNR's corresponding to BER's required for M2M communications.
Our PDA methods differ from the prior art PDA in the way a symbol is estimated. As such, the SU-PDA models such a symbol as a sum of a discrete deterministic variable and zero mean noise, which represents the uncertainty in our knowledge about that symbol. The G-PDA method models the symbol as a random variable distributed, either as a single Gaussian distribution, or a mixture of Gaussian distributions.
Although the invention has been described by way of examples of preferred embodiments, it is to be understood that various other adaptations and modifications can be made within the spirit and scope of the invention. Therefore, it is the object of the appended claims to cover all such variations and modifications as come within the true spirit and scope of the invention.