This invention relates generally to multiple-input, multiple output network, and more particularly to Grassmann codes for non-coherent channels in MIMO-OFDM networks using generalized likelihood ratio test receivers.
MIMO Networks
The use of multiple antennas in multi-input, multi-output (MIMO) networks can dramatically increase data throughput. In rich-scattering channel environments, the channel capacity increases linearly according to min(M, N), where M and N denote the number of transmit antennas and receive antennas, respectively. To achieve such capacity gains, accurate channel state information (CSI) is necessary for coherent communications.
CSI
Without CSI, there is non-coherent communications. For non-coherent channels, the capacity becomes a function of M′(1−M′/L) in high signal-to-noise ratio (SNR) environments, where M′=min(M,N, └L/2┘), and L denotes the coherence time (or, the length of a non-coherent codeword), where └.┘ is the floor function.
Non-coherent codes include unitary space-time constellations, exponential mapping Grassmann codes, non-parametric Grassmann codes, and differential space-time modulations. Unitary space-time codes asymptotically achieve the non-coherent channel capacity for high SNRs. For such codes, optimal performance of maximum-likelihood decoding can be attained by using a generalized likelihood ratio test (GLRT) receiver, without having the CSI.
GLRT
The GLRT receiver uses implicit channel state estimation for each codeword of the non-coherent codes at the time of decoding. However, the performance of the conventional GLRT receiver degrades seriously when the channel coherence time is much shorter than the lengths of the non-coherent codes L. This constrains the code length to be reasonably short in practice. Shorter space-time codes in turn decrease the capacity gains for M′(1−M′/L).
The embodiments of the invention provide a method for signal processing in a non-coherent multiple input, multiple output (MIMO) network, which does not require channel state information (CSI) in either the transmitter or the receiver. With non-coherent codes on a Grassmann manifold, a receiver uses a generalized likelihood ratio test (GLRT) receiver for maximum-likelihood performance, even without having the CSI. However, the conventional GLRT receiver suffers from severe performance degradation when the channel state changes during a duration of a codeword, also known as a coding block, or simply block.
Therefore, we improve the conventional GLRT receiver using a high order and superblock, which is a concatenation of multiple adjacent codewords. The high order superblock makes effective use of correlated channels for adjacent codewords in slow fading channels, and can overcome changes in the channel while transmitting a codeword in fast fading channels.
Embodiments of the invention provide a method for decoding codewords received at a receiver over non-coherent channels in a multi-input, multiple output (MIMO) network using orthogonal frequency demultiplexing (OFDM), and wherein the codewords are encoded using Grassmann codes.
In this description, matrices and vectors are indicated by bold-face italic letters in capital cases and small cases, respectively. A complex-valued matrix is X∈m×n, where denotes a complex field. The notations
X*, XT, X†, X31 1, tr[X] and X∥
represent a complex conjugate (*), a transpose (T), a Hermite transpose (†), an inverse (−1), a trace (tr), and a Frobenius norm (∥.∥) of X, respectively. A vector-operation is denoted vec[.] aligns all columns of a matrix into a single column vector in a left-to-right manner, and is the Kronecker product of two matrices. The set of real numbers is .
Non-Coherent MIMO-OFDM Networks and Signal Processing
In M×N multiple input, multiple output (MIMO) orthogonal frequency division multiplexing (OFDM) network 100, as shown in
We focus on non-coherent communications wherein both the transmitter and the receiver do not have channel state information (CSI). The use of non-coherent codes enables us to communicate efficiently even when pilot signals or training sequences are not used during channel acquisition.
A signal transmitted from the M antennas 111 at the nth subcarrier can be expressed as a vector xnεM×1 115. A codeword X is transmitted as a block of L symbols xL, which can be represented by the following matrix,
X=┌x1, x2, . . . , xL┌ε⊂M×L,
where
={x1, x2, . . . , xQ}
denotes the non-coherent space-frequency codebook 111 with Q distinct codewords. A mean energy of each codeword X is Es, i.e.,
Typically the length of the block is less than or equal to a channel coherence time. The coherence time is a measure of the minimum time required for the magnitude change of the channel to become uncorrelated from its previous value.
A signal yn 215 received at the receiver over the MIMO channels 101 is
yn=Hnxn+wn, (1)
where at the nth subcarrier
ynεN×1, HnεN×M and wnεN×1
denote the received signal vector, the frequency-domain MIMO channel matrix and additive noise, respectively.
A conventional GLRT receiver assumes that the MIMO channel matrix remains constant during a single codeword for n=1, . . . , L, such that Hn=H. This assumption can be relaxed for a high order codebook to deal with changes in the channel over the length of the codeword. This assumption of fading channels simplifies the expression of the received signal into a matrix form as follows:
Y=HX+W, (2)
where Y and W respectively denote the received signals and the additive noise signals over the codeword, and
Y=[y1, y2, . . . , yL]εN×L, (3)
W=[w1, w2, . . . , wL]εN×L. (4)
Noise WL is expressed as white Gaussian random variables with a variance of σ2
[vec[W]vec[W]\]=σ2INL.
A conditional probability of the received signal Y, given the codeword X and the channel matrix H is known as the likelihood, which is expressed as
where π is a constant.
Without having the CSI, the GLRT receiver 120 determines an optimal estimate {circumflex over (X)} for the codeword X from the codebook 111 in favor of maximizing the likelihood, or equivalently minimizing a squared distance metric as
where the function min returns a minimum, and the function inf is the infimum or greatest lower bound function.
Because the receiver does not have the channel state matrix H, the GLRT receiver uses the optimal channel matrix over all the possible realizations for each codeword. Because we have
a candidate channel estimate
Ĥ=YX\(XX\)−1
can maximize the likelihood, where X X\ is invertible. This is equivalent to the well-known least-squares (LS) channel estimation given a candidate codeword X. Substituting Ĥ for H in Equation (5) yields
where I is the identity matrix.
Here, a matrix Pε⊂L×I denotes an idempotent projector onto the orthogonal complement of the codeword X, i.e., XP=0 and PP=P. The set
={P1, P2, . . . , PQ}
is a projector bank, whose qth member is defined as
IL−xq†(xqxq†)−1xq,
for the codebook . The minimal size of the possible projector matrix P, such that XP=0, can be L×(L−M) because the orthogonal complement of the codeword X is also of size L×(L−M).
If every codeword X in the codebook is unitary, such that
xqxq†=(Es/M)IM
for any q=1, 2, . . . , Q, then the GLRT distance metric can be simplified to max ∥YX†∥2.
Non-Coherent Grassmann Codes
A number of non-coherent codes are known, e.g., unitary space-time codes, Grassmann codes with exponential mapping, Grassmann packing codes with numerical optimization, and differential modulations. We use a non-coherent Grassmann code based on an exponential mapping. The Grassmann code parameterized by an exponential mapping is
The matrix BεM×(T−M) denotes a full-rate, full-diversity coherent space-time block code with a thin singular value decomposition (SVD) of B=UAV†. A cosine-sine decomposition yields
where α is a parameter that controls the codeword distance. Such a codeword always satisfies a unitary condition of XX†=(Es/M)IM for any arbitrary α and B. For M=2 and L=4, one choice of the coherent coding matrix B is
where
θ2=φ=exp(jπ/4).
An optimal parameter α is approximately 0.566. Each si is drawn from quadrature phase shift keying (QPSK) constellations for a spectral efficiency of 2 bps per channel use. Grassmann codes offer the maximal degree of freedoms for non-coherent communications. However, it is not obvious that the parameter settings for θ and φ provide the optimal performance in sphere packing over the Grassmann manifold. We describe better Grassmann codes with optimized parameters using a gradient method.
High Order Superblock GLRT
In principle, the length L of the non-coherent codes should be less than or equal to the channel coherence time. However, shorter space-frequency codes have suboptimal performance with the conventional GLRT receiver because the accuracy of the LS channel regressions decreases linearly with the length L of the codeword (block).
Even for highly selective fading channels in the frequency-domain for space-frequency block coded (SFBC), the channel matrix has a high correlation for adjacent codewords, in general.
Y′=[Y1,Y2, . . . , YK]εN×LK, (10)
X′=[X1,X2, . . . , XK]εM×LK, (11)
wherein each element in the received signal Y′ is an N×L matrix, and each element in the estimate X′ of the transmitted signal is an M×L matrix.
We use the GLRT decoder 300 at the receiver 120 while the channel remains coherent over the K adjacent codewords in the superblock. Here, the signal X′ is a codeword in a virtual codebook generated from the original codebook Xkε.
A corresponding projector matrix P′ 341 can be predetermined 340 from the projector bank 339, such that X′P′=0. In this case, P′ can be determined in a similar manner as shown in Equation (7). The computational complexity increases exponentially with the number K of codewords because the cardinality of a superblock codebook becomes Q′=QK.
As shown in
Sequential Decision for Superblock GLRT
Because the superblock GLRT decoder 300 processes K codewords at the same time, some different decision criteria arise as follows.
The distance metric of the superblock GLRT decoder for K consecutive codewords from Xj+1 to Xj+K can also be expressed as
μj=∥[Yj+1, Yj+2, . . . , Yj+K]P′∥2.
One-Time Decision
To decode the kth SFBC, only the metric μ└k/K┘−1 is used.
Selective Decision
To decode the kth SFBC, we select the optimal metric out of the adjacent metrics from μk+K+1 to μk+K−1.
Combined Decision
To decode the kth SFBC, we use a combined metric, which is summed over all the metrics from μk+K+1 to μk+K−1.
Sequential Decision
Here, we exploit the channel correlation across the codewords. This is done by Viterbi decoder 404 to select the optimal estimated codeword sequence over a trellis 405, of QK−1 states 406 as shown in
Sequential decision has a highest complexity for the Viterbi algorithm but achieves the optimal performance.
In principle, the GLRT receiver assumes that the channel remains coherent during the superblock, or consecutive LK symbols. Hence, changes in the channel while transmitting the superblock can incur a performance degradation.
We describe an improved GLRT procedure, which uses high order LS channel estimation to overcome any changes in the cannel fluctuation the superblock. We use Dth order polynomial curves to fit the channel fluctuation for high order LS regressions. Then, the channel matrix at the nth subcarrier is modeled as
where
=[H[0], H[1], . . . , H[D]]εN×M(D+1), (13)
Dn=[n0IM, n1IM, . . . , nDIM]TεM(D+1)×M. (14)
The matrix H[d] denotes the channel matrix at the dth order term of the polynomial. This model enables us to adopt the GLRT receiver even when the channel matrix Hn changes in the frequency domain because the expanded channel matrix H remains static.
The received signal can be rewritten as
where D is the deterministic order expansion matrix of size M(D+1)×MLK and Λ is the diagonally aligned version of the transmitted signal matrix X, each of which is respectively defined as
By considering X′=Λ as a new virtual codeword, the associated projector matrix becomes
P′=ILK−Λ††(ΛΛ†)−1ΛεLK×LK, (18)
which can be determined in advance for any D and for all codewords. We note that the GLRT structure is similar to that of
Codebook Optimization of Non-Coherent Grassmann Codes
We optimize non-coherent codes by sphere packing on the Grassmann manifold. For numerical Grassmann packing, we adapt a gradient method to minimize the pairwise error probability between two codewords in high SNR regimes.
Pairwise Error Probability
The pairwise error probability between the correct codeword Xi and the wrong codeword Xj, given a channel matrix H is
in the high SNR regimes, where erfc(.) is a complementary error function, and λ min(.) denotes the minimal singular-value of a matrix. Note that
λmin2(ij)=λmin(iji†).
Our goal is to construct a codebook , which maximizes λmin(iji†) or any possible pair i≠j. Here, we have
A semi-definite programming (SDP) relaxation method can be used to maximize the parameter t with an energy constraint. The codebook obtained by the SDP is further refined by a linear programming (LP) method. We optimize the codebook for the high order superblock GLRT by the gradient method as a lower-complexity approach.
Gradient Method
For a given Ωi,j=iji†, the eigenvector uij, which associated with the minimal eigenvalue λi,j=λmin(Ωi,j) can yield the gradient in terms of Xm as
Here,
δi,j=1 if i=j, otherwise δi,j=0.
The steps of the method for constructing the codebook using the gradient method are:
7: Constrain the energy such that ∥Xm∥2=Es; and
8: Repeat from step 2, until λi,j converges substantially.
Using multiple initial codewords or small perturbations of an optimized codebook, the gradient method yields a well-constructed codebook. The construction method for the high order superblock GLRT can be adapted because we have ∂xm′=∂Λm.
Optimization for Exponential Mapping Grassmann Codes
A numerical optimization method that was designed for non-parametric codes can now also be applied for some parametric non-coherent codes. As an example for our optimization method, an improved version of the exponential mapping Grassmann codes is described. The conventional Grassmann codes with exponential mapping use the fixed parameters θ and φ in Equation (9). We directly optimize these parameters by the gradient method with a slight modification as
where γε{α,θ,φ} is the parameter to be optimized.
The optimized parameters and its gain in the minimal eigenvalue are shown in Table I.
Although the invention has been described with reference to certain preferred embodiments, it is to be understood that various other adaptations and modifications can be made within the spirit and scope of the invention. Therefore, it is the object of the append claims to cover all such variations and modifications as come within the true spirit and scope of the invention.
Number | Name | Date | Kind |
---|---|---|---|
20040125883 | Chang et al. | Jul 2004 | A1 |
20070041463 | Wang et al. | Feb 2007 | A1 |
20100226454 | Bliss et al. | Sep 2010 | A1 |
20110064167 | Rezk et al. | Mar 2011 | A1 |