The present invention concerns wireless telecommunication systems in general, and specifically methods and arrangements for improved model order selection for joint synchronization, channel estimation and noise covariance estimation in such systems.
The growing popularity of mobile services has resulted in ever-increasing interference levels caused by the closer proximity of users, and in the case of time division multiple access (TDMA) based systems by a tighter frequency reuse. As a result, mutual interference among users occupying the same radio channel has become a major source of signal disturbance. The ability to suppress co-channel interference has become increasingly important for mobile receivers in cellular systems with tight reuse. This has led to the development of several techniques for interference suppression in the receiver units of the base transceiver stations (BTS) or mobile stations (MS).
Multi-branch diversity or array processing is a class of commonly used techniques for suppressing interference, in which multiple versions of the same transmitted signal are produced and processed jointly in the receiver in order to cancel one or more interfering signals. The different signal versions may be obtained by using multiple receiving antennas, by sampling the received signal over the baud rate of transmission (i.e. over sampling), by separating in-phase (I) and quadrature-phase of the signal, or by combinations of these. The method of separating in-phase and quadrature-phase of the signal is commonly referred to as single-antenna-interference cancellation (SAIC) and has recently received much attention in the so called GERAN standardization.
In conventional array processing, the interference is typically modeled as temporally (across time) and/or spatially (across different signal versions) colored noise. By performing proper spatial and/or temporal noise whitening, the interference can be suppressed substantially. Such whitening operation may be performed before or during demodulation/equalization.
In order to suppress the noise or interference through spatial-temporal whitening, the receiver typically requires an estimate of a certain spectral property of the noise, such as the noise covariance matrix. From such spectral property, a whitening filter can then be derived to whiten, and therefore suppress, the noise. If the statistics of interference can be assumed to be approximately stationary over the data burst, the estimation of the noise spectral property may be performed over a sequence of training symbols in each data burst that is known to the receiver.
In addition, the demodulator or equalizer of the receiver must also be able to synchronize to the beginning of a data burst in order to begin demodulation. The synchronization process is typically done jointly with channel estimation over the training sequence. When spatial/temporal whitening is performed on the received signal to suppress noise or interference, the operating carrier-to-interference power ratio (C/I) can be changed so drastically that the ordinary method of synchronization and channel estimation, such as the least squares (LS) method, can no longer produce an accurate synchronization position. As a result, the reliability of synchronization and channel estimation becomes a bottleneck of the overall receiver performance.
One known way of improving synchronization and quality of channel estimation in a multi-branch receiver is to first perform a certain initial synchronization and channel estimation, such as the LS channel estimation, and then estimate the noise covariance matrix or function based on the residual signal after channel estimation. From the estimated noise covariance matrix, a whitening filter can be computed using the well-known Whittle-Wiggins-Robinson Algorithm (WWRA). The problem with this approach is that the initial synchronization and channel estimation (before whitening) may not produce an accurate estimate of the synchronization position and the channel estimate. As a result, the statistics of the residual signal obtained from the initial synchronization and channel estimation may not be representative of the statistics of the actual noise or interference.
To overcome this, one known technique [1] is the so called Indirect Spatio-Temporal Interference Rejection Combining (Indirect ST IRC), which is a joint synchronization, channel estimation and noise covariance estimation technique. The use of this technique in the receiver algorithms for BTS or MS results in substantial interference suppression.
The technique described in [1] gives a method to jointly estimate the synchronization position, channel, and noise covariance matrix, given a baseband model for a received signal containing a known training sequence. However, the length of the channel and the dimension of the noise covariance matrix are assumed to be known. The choice of the channel length and the dimension of the noise covariance matrix will be referred to as the model order selection problem in the following detailed description.
Existing solutions to the model order selection problem can be divided into two groups. In the first group, the order of the model is fixed, and can be guessed or deduced from field measurements and subsequently hard coded into the algorithms. In the second group there are the ad hoc methods based on simulations. In this methodology, a statistical regression is used to produce a table. The regression is made from simulation-generated data.
Neither of these two groups is satisfactory. The main disadvantage of choosing a fixed model order is that it lacks the flexibility needed to cope with the diverse deployment scenarios found in mobile networks. The main disadvantage of the ad hoc methods is that the mobile system may be put to work in environments that do not necessarily fit the simulation conditions or the test cases chosen by the system designers.
Therefore, there is a need for improved methods and arrangements for model order selection to enable improved ST IRC.
A general object of the present invention is to enable an improved telecommunication system.
A further object of the present invention is to provide an improved method of model order selection.
Another object of the present invention is to enable joint determination of a best synchronization position, channel length and model order for a signal model.
These and other objects are achieved by the attached set of claims.
According to a basic embodiment, the present invention comprises generating S0 a spatially and temporally stacked signal model by stacking successive samples of temporally adjacent received signal vectors and corresponding training vectors, computing S1 a noise variance matrix for each hypothesized synchronization position, channel length and stacking order, based on the stacked training symbols: determining S2 a best synchronization position for the received signal, based on the stacked training vectors, by jointly determining the best synchronization position for the received signal and estimating a channel length and a stacking order for said signal model based on the stacked training vectors.
An arrangement according to the invention enables the execution of the method steps.
This is then utilized in subsequent known steps of computing a noise covariance estimate, an estimate of the fictitious channel and an estimate of the whitened channel, to enable an improved interference cancellation in a telecommunication system.
The invention, together with further objects and advantages thereof, may best be understood by making reference to the following description taken together with the accompanying drawings, in which:
The present invention will be described with reference to the attached drawings, and in the context of a GSM/EDGE-based telecommunication system as illustrated by
The system in
One of the criteria of the present invention is the nature of the stacked signal vector to be evaluated. This has been described in WO 2006/136875[1]. In short, the patent document discloses system and a method in a radio receiver for joint synchronization and noise covariance estimation of a received signal. A spatially and temporally stacked signal model, whereby successive samples of temporally adjacent received signal vectors and corresponding training vectors are stacked, is used in the derivation of the estimation problem. The Toeplitz structure of the channel response matrix is neglected in the formulation of the estimation problem. The resulting estimator jointly estimates a synchronization position, a channel response matrix, and a noise covariance matrix. An estimate of a whitened channel is then computed based on the noise covariance matrix and an estimate of the channel response matrix.
As stated previously, the above described method assumes that the channel length and the stacking order or model order are known parameters. The following description of the present invention aims to describe an improved method that jointly provides an estimate of the best synchronization position, the channel length, and the stacking order.
The basic model order selection problem is the following. A data set and an identifiable mathematical model for the data are given. The goal is to obtain the best estimate of the model parameters. If the number of parameters in the model is unknown, then the models with more parameters will always yield a better fit to the known data set than the models with fewer parameters, regardless of the number of parameters in the true model. The order selection problem is to estimate the true number of parameters.
The present invention provides a solution to the model order selection problem for the indirect ST IRC as described in [1], by giving a closed form mathematical expression for an estimate of the model or stacking order. Specifically, a formula for the length of the channel and the dimension of the noise covariance matrix is given.
Model order selection problems arise frequently in many fields of science and engineering. In many cases of practical interest, they can be solved by means of a well-known methodology known as the Akaike Information Criterion [2]. The original work of Akaike has been extended by a number of authors in order to handle multidimensional signals, but always under the assumption that the signal has a known dimension. The ST IRC methodology developed in [1] has the particular feature that the dimension of the received signal is a model parameter. The original multidimensional signal is stacked to form a new, artificial, received signal of even higher dimension. A derivation and explanation as to why Akaike [2] is not applicable in a straight forward manner to signals of possibly varying dimension is shown in Appendix A for the interested reader. The present invention therefore consists of a mathematical expression to estimate the model order for indirect ST IRC [1]. It has been derived though a non-trivial extension of the methodology of Akaike to suit the indirect ST IRC framework. The interested reader is referred to Appendix B for the complete derivation. The main points of the method will be described below.
Embodiments of the present invention will be described with reference to
Basically, with reference to
Subsequently, with reference to
To further understand the framework of the present invention, an in depth discussion of the mathematics and the derivation of the expressions used is included below.
We begin by stating a version of the Akaike Information Criterion [2] which is the starting point for the present invention. Let Y denote the matrix of received, complex-valued data samples of dimension Nbranches×Nsamples. Nbranches can be thought of as a spatial dimension, whereas Nsamples is the temporal dimension. For example, Nbranches could be the number of branches in an antenna array, while Nsamples could be the number of samples received
(
is asymptotically distributed as χ2q2. (Denote the Chi-squared with n degrees of freedom by χn2.) The factor 2 in front of the q stems from the fact that Θ is a vector of complex quantities. Let dim({circumflex over (Θ)})=2q.
The model selection methodology can be used successfully when different choices of the unknown parameters Θ yield different numbers of samples Nsamples. i.e. when Nsamples=Nsamples(Θ).
The ST IRC methodology is rather special since there is a dependence between the spatial dimension and the parameter choice. In other words, Nbranches=Nbranches(Θ). However, in this case, Equation (1) does not always result in reasonable parameter choices. The reason for this will be explained in more detail later on. Thus, a straightforward application of the Akaike methodology will not solve the model selection problem for ST IRC.
For the further description, it is necessary with a few preliminary definitions from the disclosure in [1].
Consider the following typical, dispersive multiple-input-multiple output (MIMO) signal model with additive noise:
for n=L−1, L, Λ, N−1, where N denotes the length of the training sequence (N=26 for GSM/EDGE), r[n] denotes a Nr-dimensional received (column) vector, n0 denotes the synchronization position, which is the time index of in each branch. Assume that p is the pdf (probability distribution function) of Y and that it belongs to a known family parameterized by Θ.
An estimate {circumflex over (Θ)} of Θ may be chosen according to the following equation.
where dim(Θ) is the number of degrees of freedom associated with the particular parameter vector Θ.
It will be illuminating to review some aspects of the derivation of Equation (1). The following assumptions will be valid throughout the description. Assume that Y=└Y1, . . . , YN
The main ingredients in the derivation of Equation (I), starting from the Kullback-Leibler divergence, are the law of large numbers and the asymptotic consistency and normality of the maximum likelihood estimator for vector valued random variables. These are guaranteed by the previously stated assumptions. (The Gaussian pdf and the linear dependence of the mean upon the unknown parameter vector ensure that the smoothness requirements are fulfilled). Denote the true parameter vector by
Akaike discovered [2] that the log-likelihood is a biased estimator of the Kullback-Leibler divergence. He also showed that the bias can be approximated by the dimension of {circumflex over (Θ)}. Let I(
A stacked vector notation will be employed in the following description. Let rM−1[n] a vec([r[n], r[n−1], Λ r[n−M]]) be a vector formed by stacking {r[k]}k=n−Mn in columns, where M denotes the model order or “stacking order”, and for any matrix A, vec(A) is the vector formed by stacking columns of A one by one into a single vector, i.e. using typical Matlab notation, vec(A) a A(:). Similarly, let vM+1[n]≡vec([v[n],v(n−1,], Λ v[n−M]]) denote the corresponding stacked noise vector, and sL[n]=vec([s[n],s[n−1,],Λ,s [n−L+1]]) denote the corresponding stacked training vector.
Rewriting the signal model in Equation (3) by stacking (M+1) temporally adjacent received vectors, provides the following stacked signal model:
is an (M+1)×(L+M) block Toeplitz matrix of block size Nr×Nt. A key model assumption is that the (expanded) noise vector process {vM+1[n]} is independent and identically distributed (IID), and let Λ≡E[vM+1[n](vM+1[n])H] be the covariance matrix of vM+1[n].
According to [1], given L, M, n0, the covariance matrix {circumflex over (Λ)} can be estimated by
The model order selection problem for Indirect ST IRC [1] solved by the present invention consists of estimators {circumflex over (L)}, {circumflex over (M)} for L, M. As usual, the symbol ̂ on top of a quantity denotes an estimator of said quantity.
Without loss of generality assume that the stacking order M is limited to the values M0=0<M1<Λ<Mm=Mmax and that Mmax+1 is divisible by Mk+1 for all 0<k<m. (Given any desired set of stacking orders M0=0<M1<Λ<Mn, it is always possible to define
and the hypothesis is fulfilled.) Assume also that the channel length L is one of the positive integers L0<L1<Λ<Lp.
Next, define the integers
The channel length and the stacking order are chosen according to the following expression, which also gives the synchronization position.
where p(•,•) is a cost function and the parallel bars ∥ denote the determinant of a matrix. The use of the methodology of Akaike [2] yields the cost function
p(L,M)=2L+2M. (9)
In order to obtain more flexibility in the model choice it is possible to change the cost function. For example p(L,M)=C1L+C2M, where C1, C2 are constants. Other choices of the penalty function are possible, based on simulations or heuristics.
Note that the expression in brackets in Equation (8) can be replaced by any monotone increasing function or mapping of it, and the same results will be obtained. This could be advantageous for numerical reasons. For example. Equation (8) is equivalent to the expression below:
Note that Equation (10) is independent of Mmax. The complete derivation of Equation (8) is given in Appendix B attached to this disclosure.
To further support and illustrate the impact of the present invention, a series of simulations have been performed.
Similarly,
An arrangement according to the invention will be described with reference to
The methods and arrangement according to the invention can be implemented in the receivers of base transceiver stations or in the mobile stations, or in some other part of the system where a receiver algorithm is applied. Further, the MS or BTS may be provided with one or more antennas.
The known method of joint whitening/synchronization disclosed in [1] have great potential for example in receiver algorithms for upcoming GSM/EDGE dual antenna terminals or in the BTS receiver algorithms for Evolved EDGE. However, the order of the model is unknown in practical applications in wireless communications. The present invention provides a statistic for the model order selection problem based on accepted statistical methodology. The statistic has low computational complexity and is easy to implement in digital signal processors or other electronic circuitry.
It will be understood by those skilled in the art that various modifications and changes may be made to the present invention without departure from the scope thereof, which is defined by the appended claims.
Consider a fixed stacking order M and derive a statistic to choose the length of the channel impulse response, e.g. channel length.
The model selection problem for ST IRC is about finding a suitable estimate of H which is the (fictious) channel impulse response. To be specific, assume that the stacking order is M≧0, the channel has L+1 taps, there are Ntr training symbols s, the synchronization position is no, and the received signal y has Nr branches. Writing
the ST IRC signal model is the following
where the residuals νM(n) are i.i.d. and have an Nr(M+1)×Nr(M+1) covariance matrix Λ
Q
M
=E[ν
M(n)νM(n)*] (A5)
This model can also be written in matrix form
Y=HS+V (A6)
As usual S denotes a suitable Toeplitz matrix of training symbols and H is the fictious stacked channel matrix. The dimension of Y is Nr(M+1)×(Ntr−L−M). The dimension of H is (M+1)Nr×(L+1+M). The matrix V has the same dimension as Y. The columns of V are complex Gaussian, i.i.d., zero mean with (unknown) covariance ΛM.
The assumptions imply that the columns of Y are i.i.d. complex Gaussian with covariance ΛM and non-zero mean. The mean is given by the columns of HS. Therefore
p(Y|H,ΛM)=(π−N
where
N
samples
=N
tr
−L−M (A8)
and
Note that {circumflex over (Λ)}M is dependent on H. We can now write the log-likelihood
−log p(Y|H,ΛM)=[Nr(M−1)log π−log|ΛM|−tr({circumflex over (Λ)}MΛM−1)]Nsamples (A10)
For any given H the expression on the left hand side of (A10) is minimized by taking ΛM={circumflex over (Λ)}M. Moreover, using the indirect ST IRC algorithm [1] we find an estimate ĤD of H that maximizes the log-likelihood, or equivalently −log|ΛM|. Let's call {circumflex over (Λ)}MD the sample covariance matrix associated with ĤD. We then compute
min{−log p(Y|H,ΛM)}=└Nr(M+1)log π+log|{circumflex over (Λ)}MD|+tr(IN
Using Equation (1) and Equation (A11) and eliminating the terms independent of L we arrive at the expression
In order to avoid the calculation of logarithms we take exponential in Equation (A12).
Next we calculate dim(ĤD). This is straightforward, with the warning that one complex parameter is counted as 2 (real parameters). We obtain dim(ĤD)=2(M+1)Nr(L+1+M). This gives
we obtain
Although the mathematical calculations of the previous subsection can be carried out for variable stacking order M, the result is not useful because the methodology breaks down. To see why, let's look at Equations (1) and (A11). As usual, given any positive integer p, we will call Ip the identity matrix of dimension pxp. In sensitivity scenarios {circumflex over (Λ)}MD≈σ2IN
This difficulty can be solved as follows. Let us say that only the stacking orders M0=0<M1<Λ<Mm=Mmax are allowed. Without loss of generality we assume that Mm+1 is divisible by Mk+1 for all 0<k<m. (Given any desired set of stacking orders M0=0, . . . , Mk we can always add Mk+1=Mmax=┌p=1k(Mp+1)−1). The stacked model of order Mk can be embedded in the stacked model of order Mmax. For convenience, define the integers γk
and the fictious or hypothesized stacked channel matrices Hk
where k=0, . . . , m of dimension (Mmax+1)Nr×(L+1+Mmax). The expression 01×(M
Using the notation introduced in the previous section, the stacked model for order Mk embedded in the higher dimension Mmax can be written in the form
y
k
M
(n)=HksM
with the assumption that the covariance matrix
Λk=E└vM
Here is the Kronecker product and ΛM
Notice that this embedded version is not identical to the original formulation and that we have not imposed any structural constraints on Hk. In fact. Hk is composed of γk (possibly) different sub-matrices, all of which are convergent to the channel of stacking order Mk as Nsamples→∞. Because of this we say that the two formulations are asymptotically equivalent. The difference between the original and the embedded formulations is due to border effects, since the stacking reduces the number of used samples in the temporal dimension.
Now, we can apply the methodology developed in the previous subsection to the embedded models, since all the model candidates have the same spatial dimension (Mmax+1)Nr. Exactly the same argument leading to Equation (A13) yields
Recall that {circumflex over (Λ)}kD is block diagonal. Hence
|{circumflex over (Λ)}kD|=|{circumflex over (Λ)}M
Moreover,
dim(ĤkD)=2(Mmax+1)Nr)(L+1+Mk) (B7)
Hence Equation (B5) can be rewritten as
Finally, defining the factor
we arrive at the statistic
In practice it is desirable to have some flexibility in the choice of the channel lengths or stacking orders, in order to favor/punish some particular parameter sets. Thus Equation (B9) can be modified as follows.
where penalty(•,•) is some cost function. A simple choice in is the linear function
penalty(L,M)=LAIC·(L+1)+MAIC·M (B12)
for suitable constants LAIC and MAIC. The values LAIC=2. MAIC=2 give back Equation (B9). These constants can also be tuned through simulations.
Observe that the term |{circumflex over (Λ)}M
it can be seen that Equation (B11) is equivalent to
Note that Equation (B13) is independent of Mmax.
Since Equation (1) is itself based on an asymptotic approximation, a model selection criterion for the embedded models should also be valid for the original (i.e. not embedded) stacked models.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/SE07/50360 | 5/28/2007 | WO | 00 | 11/24/2009 |