Method for decoding codewords transmitted over non-coherent channels in MIMO-OFDM networks using Grassmann codes and superblocks

Information

  • Patent Grant
  • 8059747
  • Patent Number
    8,059,747
  • Date Filed
    Wednesday, June 30, 2010
    14 years ago
  • Date Issued
    Tuesday, November 15, 2011
    13 years ago
Abstract
Codewords encoded using non-coherent codes and received at a receiver via non-coherent channels in a multi-input, multiple output (MIMO) network using orthogonal frequency demultiplexing (OFDM) are decode by concatenating multiple adjacent codewords of a received signal into a superblock at the receiver. A projector matrix based on a codebook is predetermining. Each codeword in the superblock is projected onto an orthogonal complement of a correspond transmitted codeword using the projector matrix to obtain a corresponding distance metric of a generalized likelihood ratio test (GLRT) codeword. A minimal distance metric is selected to obtain an estimate of the transmitted codeword corresponding to a transmitted signal.
Description
FIELD OF THE INVENTION

This invention relates generally to multiple-input, multiple output network, and more particularly to Grassmann codes for non-coherent channels in MIMO-OFDM networks using generalized likelihood ratio test receivers.


BACKGROUND OF THE INVENTION

MIMO Networks


The use of multiple antennas in multi-input, multi-output (MIMO) networks can dramatically increase data throughput. In rich-scattering channel environments, the channel capacity increases linearly according to min(M, N), where M and N denote the number of transmit antennas and receive antennas, respectively. To achieve such capacity gains, accurate channel state information (CSI) is necessary for coherent communications.


CSI


Without CSI, there is non-coherent communications. For non-coherent channels, the capacity becomes a function of M′(1−M′/L) in high signal-to-noise ratio (SNR) environments, where M′=min(M,N, └L/2┘), and L denotes the coherence time (or, the length of a non-coherent codeword), where └.┘ is the floor function.


Non-coherent codes include unitary space-time constellations, exponential mapping Grassmann codes, non-parametric Grassmann codes, and differential space-time modulations. Unitary space-time codes asymptotically achieve the non-coherent channel capacity for high SNRs. For such codes, optimal performance of maximum-likelihood decoding can be attained by using a generalized likelihood ratio test (GLRT) receiver, without having the CSI.


GLRT


The GLRT receiver uses implicit channel state estimation for each codeword of the non-coherent codes at the time of decoding. However, the performance of the conventional GLRT receiver degrades seriously when the channel coherence time is much shorter than the lengths of the non-coherent codes L. This constrains the code length to be reasonably short in practice. Shorter space-time codes in turn decrease the capacity gains for M′(1−M′/L).


SUMMARY OF THE INVENTION

The embodiments of the invention provide a method for signal processing in a non-coherent multiple input, multiple output (MIMO) network, which does not require channel state information (CSI) in either the transmitter or the receiver. With non-coherent codes on a Grassmann manifold, a receiver uses a generalized likelihood ratio test (GLRT) receiver for maximum-likelihood performance, even without having the CSI. However, the conventional GLRT receiver suffers from severe performance degradation when the channel state changes during a duration of a codeword, also known as a coding block, or simply block.


Therefore, we improve the conventional GLRT receiver using a high order and superblock, which is a concatenation of multiple adjacent codewords. The high order superblock makes effective use of correlated channels for adjacent codewords in slow fading channels, and can overcome changes in the channel while transmitting a codeword in fast fading channels.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a block diagram of a MIMO-OFDM network including a transmitter and a receiver, wherein the receiver includes a generalized likelihood ratio test decoder according to embodiments of the invention;



FIG. 2 is a block diagram of a receiver including a generalized likelihood ratio test (GLRT) decoder according to embodiments of the invention;



FIGS. 3A-3B are a block diagram of a structure and method of a receiver including a superblock generalized likelihood ratio test decoder according to embodiments of the invention;



FIG. 4 is a block diagram of the receiver including a Viterbi decoder; and



FIG. 5 is a graph comparing a conventional GLRT decoder to higher order GLRT decoders.





DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Embodiments of the invention provide a method for decoding codewords received at a receiver over non-coherent channels in a multi-input, multiple output (MIMO) network using orthogonal frequency demultiplexing (OFDM), and wherein the codewords are encoded using Grassmann codes.


In this description, matrices and vectors are indicated by bold-face italic letters in capital cases and small cases, respectively. A complex-valued matrix is X∈custom characterm×n, where custom character denotes a complex field. The notations

X*, XT, X, X31 1, tr[X] and custom characterX∥

represent a complex conjugate (*), a transpose (T), a Hermite transpose (†), an inverse (−1), a trace (tr), and a Frobenius norm (∥.∥) of X, respectively. A vector-operation is denoted vec[.] aligns all columns of a matrix into a single column vector in a left-to-right manner, and custom character is the Kronecker product of two matrices. The set of real numbers is custom character.


Non-Coherent MIMO-OFDM Networks and Signal Processing


In M×N multiple input, multiple output (MIMO) orthogonal frequency division multiplexing (OFDM) network 100, as shown in FIG. 1, a transmitter 110 has M transmit antennas 111, and a receiver 120 has N receive antennas 121. The transmitter and the receiver both store a space-frequency codebook 111. In one embodiment, the receiver includes a generalized likelihood ratio test (GLRT) decoder 200.


We focus on non-coherent communications wherein both the transmitter and the receiver do not have channel state information (CSI). The use of non-coherent codes enables us to communicate efficiently even when pilot signals or training sequences are not used during channel acquisition.


A signal transmitted from the M antennas 111 at the nth subcarrier can be expressed as a vector xnεcustom characterM×1 115. A codeword X is transmitted as a block of L symbols xL, which can be represented by the following matrix,

X=┌x1, x2, . . . , xL┌εcustom charactercustom characterM×L,

where

custom character={x1, x2, . . . , xQ}

denotes the non-coherent space-frequency codebook 111 with Q distinct codewords. A mean energy of each codeword X is Es, i.e.,








Q

-
1







Q


q
=
1







X
q



2



=


E
s

.





Typically the length of the block is less than or equal to a channel coherence time. The coherence time is a measure of the minimum time required for the magnitude change of the channel to become uncorrelated from its previous value.


A signal yn 215 received at the receiver over the MIMO channels 101 is

yn=Hnxn+wn,  (1)

where at the nth subcarrier

ynεcustom characterN×1, Hnεcustom characterN×M and wnεcustom characterN×1

denote the received signal vector, the frequency-domain MIMO channel matrix and additive noise, respectively.


A conventional GLRT receiver assumes that the MIMO channel matrix remains constant during a single codeword for n=1, . . . , L, such that Hn=H. This assumption can be relaxed for a high order codebook to deal with changes in the channel over the length of the codeword. This assumption of fading channels simplifies the expression of the received signal into a matrix form as follows:

Y=HX+W,  (2)

where Y and W respectively denote the received signals and the additive noise signals over the codeword, and

Y=[y1, y2, . . . , yLcustom characterN×L,  (3)
W=[w1, w2, . . . , wLcustom characterN×L.  (4)


Noise WL is expressed as white Gaussian random variables with a variance of σ2
custom character[vec[W]vec[W]\]=σ2INL.


Generalized Likelihood Ratio Test (GLRT) Receiver

A conditional probability of the received signal Y, given the codeword X and the channel matrix H is known as the likelihood, which is expressed as







Pr


(


Y

X

,
H

)


=


1


(

πσ
2

)

NL




exp


(


-

1

σ
2








Y
-
HX



2


)








where π is a constant.


Without having the CSI, the GLRT receiver 120 determines an optimal estimate {circumflex over (X)} for the codeword X from the codebook custom character111 in favor of maximizing the likelihood, or equivalently minimizing a squared distance metric as










X
^

=



arg





min


X









inf
H






Y
-
HX



2






(
5
)








where the function min returns a minimum, and the function inf is the infimum or greatest lower bound function.


Because the receiver does not have the channel state matrix H, the GLRT receiver uses the optimal channel matrix over all the possible realizations for each codeword. Because we have
















H
*








Y
-
HX



2


=


-

(

Y
-
HX

)




X




,




(
6
)








a candidate channel estimate

Ĥ=YX\(XX\)−1

can maximize the likelihood, where X X\ is invertible. This is equivalent to the well-known least-squares (LS) channel estimation given a candidate codeword X. Substituting Ĥ for H in Equation (5) yields










X
^

=



arg





min


X








Y



(


I
L

-




X




(

XX


)



-
1



X


)



P





2






(
7
)








where I is the identity matrix.


Here, a matrix Pεcustom charactercustom characterL×I denotes an idempotent projector onto the orthogonal complement of the codeword X, i.e., XP=0 and PP=P. The set

custom character={P1, P2, . . . , PQ}

is a projector bank, whose qth member is defined as

IL−xq(xqxq)−1xq,

for the codebook custom character. The minimal size of the possible projector matrix P, such that XP=0, can be L×(L−M) because the orthogonal complement of the codeword X is also of size L×(L−M).



FIG. 2 shows an operation of our GLRT decoder 200 using Equation 7. The GLRT decoder 200 includes a set of projector matrices Q that project 202 the received signal 203 onto an orthogonal complement of each codeword in the codebook 111 to generate a distance metric 210. After projection 202, a minimum is selected 204 to obtain the estimate {circumflex over (X)} of the transmitted codeword.


If every codeword X in the codebook custom character is unitary, such that

xqxq=(Es/M)IM

for any q=1, 2, . . . , Q, then the GLRT distance metric can be simplified to max ∥YX2.


Non-Coherent Grassmann Codes


A number of non-coherent codes are known, e.g., unitary space-time codes, Grassmann codes with exponential mapping, Grassmann packing codes with numerical optimization, and differential modulations. We use a non-coherent Grassmann code based on an exponential mapping. The Grassmann code parameterized by an exponential mapping is






X
=





E
s

M




[


I
M



0

M
×

(

L
-
M

)




]





exp


(

[




0
M



B





-

B






0

T
-
M





]

)


.






The matrix Bεcustom characterM×(T−M) denotes a full-rate, full-diversity coherent space-time block code with a thin singular value decomposition (SVD) of B=UAV. A cosine-sine decomposition yields










X
=




E
s

M




[

U






cos


(

α





A

)




U







U






sin


(

α





A

)




V



]



,




(
8
)








where α is a parameter that controls the codeword distance. Such a codeword always satisfies a unitary condition of XX=(Es/M)IM for any arbitrary α and B. For M=2 and L=4, one choice of the coherent coding matrix B is










B
=


1

2




[





s
1

+

ϕ






s
2






ϑ


(


s
3

+

ϕ






s
4



)







ϑ


(


s
3

-

ϕ






s
4



)






s
1

-

ϕ






s
2






]



,




(
9
)








where

θ2=φ=exp(jπ/4).


An optimal parameter α is approximately 0.566. Each si is drawn from quadrature phase shift keying (QPSK) constellations for a spectral efficiency of 2 bps per channel use. Grassmann codes offer the maximal degree of freedoms for non-coherent communications. However, it is not obvious that the parameter settings for θ and φ provide the optimal performance in sphere packing over the Grassmann manifold. We describe better Grassmann codes with optimized parameters using a gradient method.


High Order Superblock GLRT


In principle, the length L of the non-coherent codes should be less than or equal to the channel coherence time. However, shorter space-frequency codes have suboptimal performance with the conventional GLRT receiver because the accuracy of the LS channel regressions decreases linearly with the length L of the codeword (block).


Even for highly selective fading channels in the frequency-domain for space-frequency block coded (SFBC), the channel matrix has a high correlation for adjacent codewords, in general.



FIG. 3A and 3B respectively show the structure and operation of our superblock GRLT decoder 300. Because of the high correlation between adjacent codewords 303, we can improve performance by using channel correlations if we increase an effective length of the block by concatenating 320 multiple adjacent codewords into a superblock Y′ 321 at the receiver. The superblock GLRT decoder 300 jointly estimates the K adjacent codewords 303 in the superblock Y′. At the kth codeword, the transmitted codeword is Xk, and the received codeword is Yk. We respectively express the received signal and transmitted signal as

Y′=[Y1,Y2, . . . , YKcustom characterN×LK,  (10)
X′=[X1,X2, . . . , XKcustom characterM×LK,  (11)

wherein each element in the received signal Y′ is an N×L matrix, and each element in the estimate X′ of the transmitted signal is an M×L matrix.


We use the GLRT decoder 300 at the receiver 120 while the channel remains coherent over the K adjacent codewords in the superblock. Here, the signal X′ is a codeword in a virtual codebook generated from the original codebook Xkεcustom character.


A corresponding projector matrix P′ 341 can be predetermined 340 from the projector bank 339, such that X′P′=0. In this case, P′ can be determined in a similar manner as shown in Equation (7). The computational complexity increases exponentially with the number K of codewords because the cardinality of a superblock codebook becomes Q′=QK.


As shown in FIGS. 3A-3B, the input 303 is a concatenation of K received codewords YK represented as Y′ 321. The received signal is used to generate a set of distance metric 310, where the signal is projected 302 onto the orthogonal complement of each corresponding transmitted codeword in X. If we use unitary codes, then the GLRT metric reduces to ∥ΣYkXk2. After the projection 330, a minimum is selected 350 from the metric to obtain an estimate X′ 209 of the transmitted codeword X.


Sequential Decision for Superblock GLRT


Because the superblock GLRT decoder 300 processes K codewords at the same time, some different decision criteria arise as follows.


The distance metric of the superblock GLRT decoder for K consecutive codewords from Xj+1 to Xj+K can also be expressed as

μj=∥[Yj+1, Yj+2, . . . , Yj+K]P′∥2.


One-Time Decision


To decode the kth SFBC, only the metric μ└k/K┘−1 is used.


Selective Decision


To decode the kth SFBC, we select the optimal metric out of the adjacent metrics from μk+K+1 to μk+K−1.


Combined Decision


To decode the kth SFBC, we use a combined metric, which is summed over all the metrics from μk+K+1 to μk+K−1.


Sequential Decision


Here, we exploit the channel correlation across the codewords. This is done by Viterbi decoder 404 to select the optimal estimated codeword sequence over a trellis 405, of QK−1 states 406 as shown in



FIG. 4. The total number of branch metrics 407 is QK, where the previous K−1 codewords are interpreted as trellis states. Along the trellis-state diagram, optimal decision can be obtained by the well known Viterbi algorithm.


Sequential decision has a highest complexity for the Viterbi algorithm but achieves the optimal performance.


In principle, the GLRT receiver assumes that the channel remains coherent during the superblock, or consecutive LK symbols. Hence, changes in the channel while transmitting the superblock can incur a performance degradation.


We describe an improved GLRT procedure, which uses high order LS channel estimation to overcome any changes in the cannel fluctuation the superblock. We use Dth order polynomial curves to fit the channel fluctuation for high order LS regressions. Then, the channel matrix at the nth subcarrier is modeled as











H
n

=





d
=
0

D








H

[
d
]




n
d



=



D
n




,




(
12
)








where

custom character=[H[0], H[1], . . . , H[D]custom characterN×M(D+1),  (13)
Dn=[n0IM, n1IM, . . . , nDIM]Tεcustom characterM(D+1)×M.  (14)


The matrix H[d] denotes the channel matrix at the dth order term of the polynomial. This model enables us to adopt the GLRT receiver even when the channel matrix Hn changes in the frequency domain because the expanded channel matrix H remains static.


The received signal can be rewritten as











Y


=










Λ




X





+
W


,




(
15
)








where D is the deterministic order expansion matrix of size M(D+1)×MLK and Λ is the diagonally aligned version of the transmitted signal matrix X, each of which is respectively defined as
















=


[


D
1

,

D
2

,





,

D
LK


]





M


(

D
+
1

)


×
MLK




,




(
16
)






Λ
=


[




x
1


























x
2



















































x
LK




]





MLK
×
LK


.






(
17
)







By considering X′=custom characterΛ as a new virtual codeword, the associated projector matrix becomes

P′=ILK−Λcustom character(custom characterΛΛ)−1custom characterΛεcustom characterLK×LK,  (18)

which can be determined in advance for any D and for all codewords. We note that the GLRT structure is similar to that of FIG. 3, however, due the polynomial model for H, the elements of the set of the projector matrices are now of dimension LK×LK.



FIG. 5 graphically compares conventional GLRT decoding performance 501 and high order GLRT decoding performances 502-504, as a function of signal to estimation error ratio (SER) and signal-to-noise ratio, according to embodiments of the invention for transmissions over a highly frequency selective channel. We also show performance for the case when the higher order GLRT method is combined with the superblock method. The results show the performance over a highly frequency selective channel with 16 multipaths. As we see from the performance curves, the combination of the high order GLRT receiver with superblock processing has much better performance than conventional GLRT receivers.


Codebook Optimization of Non-Coherent Grassmann Codes


We optimize non-coherent codes by sphere packing on the Grassmann manifold. For numerical Grassmann packing, we adapt a gradient method to minimize the pairwise error probability between two codewords in high SNR regimes.


Pairwise Error Probability


The pairwise error probability between the correct codeword Xi and the wrong codeword Xj, given a channel matrix H is








Pr


(



i



j



H

)


=



Pr


(






Y


j




2

<




Y


i




2



H

)




Pr


(







H






i



j




2

+

2




[

tr


[

H


i



j



W



]


]




<
0


H

)



=



1
2




erf

c



(


1

2

σ






H






i



j





)






1
2



erfc


(


1

2

σ





H





λ
min



(


i



j


)



)






,





in the high SNR regimes, where erfc(.) is a complementary error function, and λ min(.) denotes the minimal singular-value of a matrix. Note that

λmin2(custom charactericustom characterj)=λmin(custom charactericustom characterjcustom characteri).


Our goal is to construct a codebook custom character, which maximizes λmin(custom charactericustom characterjcustom characteri) or any possible pair i≠j. Here, we have








λ
min



(


i



j



i



)




t




[






i



i



-

tI
M






i



j









j



i







j



j






]




0.




A semi-definite programming (SDP) relaxation method can be used to maximize the parameter t with an energy constraint. The codebook obtained by the SDP is further refined by a linear programming (LP) method. We optimize the codebook for the high order superblock GLRT by the gradient method as a lower-complexity approach.


Gradient Method


For a given Ωi,j=custom charactericustom characterjcustom characteri, the eigenvector uij, which associated with the minimal eigenvalue λi,jmini,j) can yield the gradient in terms of Xm as















λ

i
,
j



[
m
]




=







λ

i
,
j






m
*



=






m
*





u

i
,
j





Ω

i
,
j




u

i
,
j











=




u

i
,
j




u

i
,
j





i



j



δ

i
,
m










-




(


j



j


)


-
1





j



i




u

i
,
j




u

i
,
j





i



j




δ

j
,
m


.








(
19
)







Here,

δi,j=1 if i=j, otherwise δi,j=0.


The steps of the method for constructing the codebook using the gradient method are:

    • 1: Generate random codewords Xm such that ∥Xm2=Es;
    • 2: Determine Ωi,j for all pairs i≠j;
    • 3: Select the pair that has the minimum λi,j over all i≠j;
    • 4: Determine the eigenvalue μi,j for the selected pair (i,j);
    • 5: Determine the gradient ∇λi,j[m] for all mε{1, 2, . . . , Q};
    • 6: Update the codewords as xm←xm+β∇λi,j[m], where βεz,42 is a stepsize factor, which is optimized by line searching to maximize the corresponding λi,j;


7: Constrain the energy such that ∥Xm2=Es; and


8: Repeat from step 2, until λi,j converges substantially.


Using multiple initial codewords or small perturbations of an optimized codebook, the gradient method yields a well-constructed codebook. The construction method for the high order superblock GLRT can be adapted because we have ∂xm′=custom character∂Λm.


Optimization for Exponential Mapping Grassmann Codes


A numerical optimization method that was designed for non-parametric codes can now also be applied for some parametric non-coherent codes. As an example for our optimization method, an improved version of the exponential mapping Grassmann codes is described. The conventional Grassmann codes with exponential mapping use the fixed parameters θ and φ in Equation (9). We directly optimize these parameters by the gradient method with a slight modification as














λ

i
,
j





γ


=




m
=
1

Q







tr


[



(



λ

i
,
j



[
m
]


)

T






m
*




γ



]




,




(
20
)








where γε{α,θ,φ} is the parameter to be optimized.


The optimized parameters and its gain in the minimal eigenvalue are shown in Table I.













TABLE I






α
θ
φ
min λi,j



















Conventional
0.57
exp(jπ/8)
exp(jπ/4)
0.00023


Optimized
0.23
0.24 + j0.83
0.19 + j0.33
0.034









Although the invention has been described with reference to certain preferred embodiments, it is to be understood that various other adaptations and modifications can be made within the spirit and scope of the invention. Therefore, it is the object of the append claims to cover all such variations and modifications as come within the true spirit and scope of the invention.

Claims
  • 1. A method for decoding codewords received at a receiver via non-coherent channels in a multi-input, multiple output (MIMO) network using orthogonal frequency demultiplexing (OFDM), and wherein the codewords are encoded using non-coherent codes, comprising the steps of: concatenating multiple adjacent codewords of a received signal into a superblock at the receiver;predetermining a projector matrix P′ based on a codebook;projecting each codeword in the superblock onto an orthogonal complement of a correspond transmitted codeword using the projector matrix to obtain a corresponding distance metric of a generalized likelihood ratio test (GLRT) codeword; andselecting a minimal distance metric to obtain an estimate of the transmitted codeword corresponding to a transmitted signal.
  • 2. The method of claim 1, wherein a length L of the superblock is less than or equal to a channel coherence time.
  • 3. The method of claim 1, wherein the non-coherent codes are Grassmann codes and based on an exponential mapping.
  • 4. The method of claim 2, wherein the received signal is Y′=[Y1, Y2, . . . , YK], and
  • 5. The method of claim 4, wherein the projector matrix P′ is predetermined from a projector bank IL−xq†(xqxq554)−1xq, such that X′P′=0, and wherein IL is an identity matrix, † is a conjugate transpose operator, and Q is a number of codewords X in the codebook .
  • 6. The method of claim 5, wherein the distance metric is
  • 7. The method of claim 6, further comprising: reducing the distance metric to ∥ΣYkXk†∥2.
  • 8. The method of claim 6, further comprising, expressing the distance metric as μj=∥[Yj+1, Yj+2, . . . , Yj+K]P′∥2.
  • 9. The method of claim 1, wherein an optimal sequence of estimated codewords is selected using a Viterbi trellis.
US Referenced Citations (4)
Number Name Date Kind
20040125883 Chang et al. Jul 2004 A1
20070041463 Wang et al. Feb 2007 A1
20100226454 Bliss et al. Sep 2010 A1
20110064167 Rezk et al. Mar 2011 A1