Iterative eigenvector computation for a MIMO communication system

Information

  • Patent Application
  • 20050237920
  • Publication Number
    20050237920
  • Date Filed
    April 22, 2004
    20 years ago
  • Date Published
    October 27, 2005
    19 years ago
Abstract
A matrix {circumflex over (V)} of eigenvectors is derived using an iterative procedure. For the procedure, an eigenmode matrix Vi is first initialized, e.g., to an identity matrix. The eigenmode matrix Vi is then updated based on a channel response matrix {circumflex over (H)} for a MIMO channel to obtain an updated eigenmode matrix Vi+1. The eigenmode matrix may be updated for a fixed or variable number of iterations. The columns of the updated eigenmode matrix may be orthogonalized periodically to improve performance and ensure stability of the iterative procedure. In one embodiment, after completion of all iterations, the updated eigenmode matrix for the last iteration is provided as the matrix {circumflex over (V)}.
Description
BACKGROUND

I. Field


The present invention relates generally to data communication, and more specifically to techniques for deriving eigenvectors used for spatial processing in a multiple-input multiple-output (MIMO) communication system.


II. Background


A MIMO system employs multiple (NT) transmit antennas and multiple (NR) receive antennas for data transmission. A MIMO channel formed by the NT transmit antennas and NR receive antennas may be decomposed into NS spatial channels, where NS≦min {NT, NR}. The NS spatial channels may be used to transmit data in parallel to achieve higher overall throughput or redundantly to achieve greater reliability.


In general, up to NS data streams may be transmitted simultaneously from the NT transmit antennas in the MIMO system. However, these data streams interfere with each other at the receive antennas. Improved performance may be achieved by transmitting data on NS eigenmodes of the MIMO channel, where the eigenmodes may be viewed as orthogonal spatial channels. To transmit data on the NS eigenmodes, it is necessary to perform spatial processing at both a transmitter and a receiver. The spatial processing attempts to orthogonalize the data streams so that they can be individually recovered with minimal degradation at the receiver.


For data transmission on the NS eigenmodes, the transmitter performs spatial processing with a matrix of NS eigenvectors, one eigenvector for each eigenmode used for data transmission. Each eigenvector contains NT complex values used to scale a data symbol prior to transmission from the NT transmit antennas and on the associated eigenmode. For data reception, the receiver performs receiver spatial processing (or spatial matched filtering) with another matrix of NS eigenvectors. The eigenvectors for the transmitter and the eigenvectors for the receiver may be derived based on a channel response estimate for the MIMO channel between the transmitter and receiver. The derivation of the eigenvectors is computationally intensive. Furthermore, the accuracy of the eigenvectors may have a large impact on performance.


There is therefore a need in the art for techniques to efficiently and accurately derive eigenvectors used for data transmission and reception via the eigenmodes of a MIMO channel.


SUMMARY

Techniques for deriving a matrix {circumflex over (V)} of eigenvectors using an iterative procedure are described herein. For the iterative procedure, an eigenmode matrix Vi is first initialized to, for example, an identity matrix I or a matrix of eigenvectors derived for a prior transmission interval. The eigenmode matrix Vi is then updated based on a channel response matrix {circumflex over (H)} for a MIMO channel to obtain an updated eigenmode matrix Vi+1, as described below.


The eigenmode matrix may be updated for (1) a fixed number of iterations (e.g., 10 iterations) or (2) a variable number of iterations until a termination condition is reached. The columns of the updated eigenmode matrix may be orthogonalized periodically or as necessary to improve performance and to ensure stability of the iterative procedure. In one embodiment, after all of the iterations have been completed, the updated eigenmode matrix (or the orthogonalized updated eigenmode matrix) for the last iteration is provided as the matrix {circumflex over (V)} of eigenvectors. The matrix {circumflex over (V)} may be used for spatial processing for data transmission on the eigenmodes of the MIMO channel. The matrix {circumflex over (V)} may also be used to derive a spatial filter matrix used for spatial matched filtering of a data transmission received via the eigenmodes of the MIMO channel.


Various aspects and embodiments of the invention are described in further detail below.




BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows a transmitting entity and a receiving entity in a MIMO system;



FIG. 2 shows an iterative procedure for deriving a matrix of eigenvectors;



FIG. 3 shows an access point and a user terminal in the MIMO system;



FIG. 4 shows a processor for channel estimation and eigenvector computation; and



FIG. 5 shows an example TDD frame structure for the MIMO system.




DETAILED DESCRIPTION

The word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any embodiment described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments.


The eigenvector computation techniques described herein may be used for a single-carrier MIMO system as well as a multi-carrier MIMO system. For clarity, these techniques are described in detail for a single-carrier MIMO system.


A. Single-Carrier MIMO System



FIG. 1 shows a simple block diagram of a transmitting entity 110 and a receiving entity 150 in a single-carrier MIMO system 100. At transmitting entity 110, a transmit (TX) spatial processor 120 performs spatial processing on data symbols (denoted by a vector s) with a matrix {circumflex over (V)} of eigenvectors to generate transmit symbols (denoted by a vector x). As used herein, a “data symbol” is a modulation symbol for data, a “pilot symbol” is a modulation symbol for pilot (which is known a priori by both the transmitting and receiving entities), a “transmit symbol” is a symbol to be sent from a transmit antenna, and a modulation symbol is a complex value for a point in a signal constellation used for a particular modulation scheme (e.g., M-PSK, M-QAM, and so on). The transmit symbols are further conditioned by a transmitter unit (TMTR) 122 to generate NT modulated signals, which are transmitted from NT transmit antennas 124 and via a MIMO channel.


At receiving entity 150, the transmitted modulated signals are received by NR receive antennas 152, and the NR received signals are conditioned by a receiver unit (RCVR) 154 to obtain received symbols (denoted by a vector r). A receive (RX) spatial processor 160 then performs receiver spatial processing (or spatial matched filtering) on the received symbols with a spatial filter matrix {circumflex over (M)} to obtain detected symbols (denoted by a vector {circumflex over (s)}). The detected symbols are estimates of the data symbols sent by transmitting entity 110. The spatial processing at the transmitting and receiving entities are described below.


For the single-carrier MIMO system, the MIMO channel formed by the NT transmit antennas at the transmitting entity and the NR receive antennas at the receiving entity may be characterized by an NR×NT channel response matrix H, which may be expressed as:
H_=[h1,1h1,2h1,NTh2,1h2,2h2,NThNR,1hNR,2hNR,NT],Eq(1)

where element hi,j, for i=1 . . . NR and j=1 . . . NT, denotes the coupling or complex gain between transmit antenna j and receive antenna i. For simplicity, the MIMO channel is assumed to be full rank with NS=NT≧NR.


The channel response matrix H may be “diagonalized” to obtain NS eigenmodes of H. This diagonalization may be achieved by performing either singular value decomposition of the channel response matrix H or eigenvalue decomposition of a correlation matrix of H, which is C=HHH, where “H”, denotes the conjugate transpose.


The singular value decomposition of the channel response matrix H may be expressed as:

H=UΣVH,   Eq (2)

where

    • U is an NR×NR unitary matrix of left eigenvectors of H;
    • Σ is an NR×NT diagonal matrix of singular values of H; and
    • V is an NT×NT unitary matrix of right eigenvectors of H.


      A unitary matrix M is characterized by the property MHM=I, where I is the identity matrix containing ones along the diagonal and zeros elsewhere. The columns of a unitary matrix are orthogonal to one another.


The eigenvalue decomposition of the correlation matrix of H may be expressed as:

C=HHH=VΛVH,   Eq (3)

where Λ is an NT×NT diagonal matrix of eigenvalues of C. As shown in equations (2) and (3), the columns of V are right eigenvectors of H as well as eigenvectors of C. Singular value decomposition and eigenvalue decomposition are described by Gilbert Strang in a book entitled “Linear Algebra and Its Applications,” Second Edition, Academic Press, 1980.


The right eigenvectors of H (which are the columns of V) may be used for spatial processing by the transmitting entity to transmit data on the NS eigenmodes of H. The left eigenvectors of H (which are the columns of U) may be used for spatial matched filtering by the receiving entity to recover the data transmitted on the NS eigenmodes. The eigenmodes may be viewed as orthogonal spatial channels obtained through decomposition.


A diagonal matrix contains non-negative real values along the diagonal and zeros elsewhere. The diagonal elements of Σ are referred to as the singular values of H and represent the channel gains for the NS eigenmodes of H. The diagonal elements of Λ are referred to as the eigenvalues of C and represent the power gains for the NS eigenmodes of H. The singular values of H, which are denoted as {σ1, σ2 . . . σNS}, are thus the square roots of the eigenvalues of C, which are denoted as {λ1 λ2 . . . λNS}, or σi={square root}{right arrow over (λi)} for i=1 . . . NS.


B. Iterative Eigenvector Computation


The matrices H, V, and U represent “true” quantities that are not available in a practical system. Instead, estimates of the matrices H, V, and U may be obtained and denoted as {circumflex over (H)}, {circumflex over (V)}, and {circumflex over (U)}, respectively. The matrix {circumflex over (H)} may be obtained, for example, based on a MIMO pilot sent via the MIMO channel. A MIMO pilot is a pilot comprised of NT pilot transmissions sent from NT transmit antennas, where the pilot transmission from each transmit antenna is identifiable by the receiving entity. This can be achieved, for example, by using a different orthogonal sequence for the pilot transmission from each transmit antenna. The matrices {circumflex over (V)} and {circumflex over (U)} may be efficiently and accurately derived using an iterative procedure.


For the iterative procedure, an NT×NT estimated correlation matrix A is initially computed as:

A={circumflex over (H)}H{circumflex over (H)}.   Eq (4)

The eigenvalue decomposition of A would yield A=VaΛaVHa, where Va is an estimate of V and Λa is an estimate of Λ due to the fact that A is an estimate of C. An eigenmode matrix Vi, which is an estimate of Va, may be iteratively computed as follows:
V_i+1H=V_iH+μ·(Tri_up(V_iHAV_i)-Tri_low(V_iHAV_i))V_iH,Eq(5)

where

    • Vi is the eigenmode matrix for the i-th iteration;
    • Tri_up (M) is a matrix containing elements above the diagonal of M;
    • Tri_low (M) is a matrix containing elements below the diagonal of M;
    • μ is a step size for the iterative procedure; and
    • Vi+1 is the eigenmode matrix for the (i+1)-th iteration.


The eigenmode matrix Vi may be initialized to the identity matrix I, or V0=I, if no other information is available for V. The step size μ determines the rate of convergence for the iterative procedure. A larger step size speeds up convergence but also increases the granularity of the elements of Vi. Conversely, a smaller step size results in a slower convergence rate but improves the accuracy of the elements of Vi. The step size may be set to μ=0.05, for example, or to some other value.


The computation shown in equation (5) may be decomposed into four steps. In one embodiment, for the first step, a matrix X is computed as: X=AVi. In the second step, a matrix Y is computed as: Y=VHiX. In the third step, an update matrix Z is computed as: Z=(Tri_up (Y)−Tri_low (Y))Vi. In the fourth step, the eigenmode matrix is updated as: Vi+1=Vi+μ·Z.


Equation (5) may also be expressed as:
V_i+1H=V_iH+μ·(Tri_up(V_iHH_^HH^V_i)-Tri_low(V_iHH_^HH^V_i))V_iH.Eq(6)

In another embodiment, the matrix X is computed as X=ĤVi, the matrix Y is computed as Y=XHX, and the matrices Z and Vi+1 are computed as described above.


The number of multiply and add operations required for each iteration of equation (5) is dependent on the dimension of the channel response matrix {circumflex over (H)}, which in turn is dependent on the number of transmit antennas and the number of receive antennas. Since all of the matrices defined above, except for the diagonal matrices, contain complex-valued elements, complex multiplies are performed on or for the elements of these matrices. For NR=4 and NT=4, three 4×4 complex matrix multiplies are performed to obtain the three matrices X, Y, and Z. The 4×4 complex matrix multiply for each matrix normally requires four complex multiplies for each element of the matrix, or a total of 64 complex multiplies for the 16 elements of the matrix, which can be performed with 256 real multiplies. A total of 768 real multiplies (where 768=256 3 ) would then be required to compute the three matrices X, Y, and Z.


Some computational savings may be realized by recognizing that (1) the matrix Z contains zeros along the diagonal and (2) the elements below the diagonal of Z are the negative of the elements above the diagonal of Z (the lower triangle of Z is the negative of the upper triangle of Z). Thus, only 6 out of 16 elements of Z need to be computed. This reduces the total number of real multiplies to 608, or 256+256+6·16=608. The multiplies may also be performed in a manner to utilize the full available range. For example, a 4×4 complex matrix multiply for a given matrix may be performed with 256 16×16-bit real multiplies, where the two input operands for each real multiply have 16 bits of resolution and the result of the real multiply has a range that is greater than 16 bits. In this case, the resultant elements of the matrix may be divided by the magnitude of the largest element in the matrix and then multiplied by a scaling factor, which may be selected such that the largest element is represented with a 16-bit value that is as large as possible and/or convenient for processing.


The eigenmode matrix may be iteratively computed as shown in equation (5) for a number of iterations until a sufficiently good estimate of V is obtained. It has been found through computer simulation that ten iterations are typically sufficient to obtain a good estimate of V. In one embodiment, equation (5) is iteratively computed for a fixed number of iterations (e.g., ten iterations) to obtain an eigenmode matrix Vƒ, which is the final estimate of V. In another embodiment, equation (5) is iteratively computed for a variable number of iterations until a termination condition is encountered, and the eigenmode matrix Vƒ for the last iteration is provided as the final estimate of V provided by the iterative procedure. Since the matrix Y should resemble a diagonal matrix, the termination condition may be defined by how closely Y resembles a diagonal matrix, as described below.


The iterative procedure updates the eigenmode matrix Vi such that VHiAVi approaches a diagonal matrix. The final estimate of V may be expressed as:

VHƒAVƒ=D,   Eq (7)

which may be rewritten as:

A=VƒDVHƒ.   Eq (8)

If the iterative procedure is successful, then D is approximately equal to Λa (which is the diagonal matrix of eigenvalues of A) and Vƒ is approximately equal to Va (which is the matrix of eigenvectors of A). The eigenmode matrix Vƒ is also a good estimate of the matrix V of eigenvectors of C, if A is a good estimate of C.


There are typically some residual errors from the iterative procedure, so that Vƒ is not a true unitary matrix, VHƒVƒ is not exactly equal to the identity matrix, and the off-diagonal elements of D may be non-zero values. Equation (5) may thus be iteratively computed until the off-diagonal elements of D are sufficiently small. For example, equation (5) may be iteratively computed until the sum of the squared magnitude of the off-diagonal elements of Y is less than a first predetermined threshold. As another example, the computation may continue until the ratio of the sum of the squared magnitude of the diagonal elements of Y over the sum of the squared magnitude of the off-diagonal elements of Y is greater than a second predetermined threshold. Other termination conditions may also be defined. The columns of Vƒ may also be forced to be orthogonal to one another by performing eigenvector orthogonalization, as described below.


After all of the iterations have been completed, the matrix {circumflex over (V)} of eigenvectors may be defined as {circumflex over (V)}=Vƒ. The matrix {circumflex over (V)} is an estimate of the matrix V, where the estimate is obtained based on {circumflex over (H)} and using the iterative procedure. The matrix {circumflex over (V)} may be used for spatial processing, as described below.


Computer simulations have shown that the computation for Vi in equation (5) converges for most cases. However, as the number of iterations increases for a given channel response matrix {circumflex over (H)}, residual errors begin accumulating and the solution for Vi starts to diverge. Convergence may be assured by performing eigenvector orthogonalization (described below) on the eigenmode matrix Vi periodically, e.g., after every North iterations, where North may be equal to 50 or some other value.


A matrix {circumflex over (U)}, which is an estimate of U, may also be derived using the iterative procedure, similar to that described above for the matrix {circumflex over (V)}. To derive {circumflex over (U)}, the matrix A is defined as A={circumflex over (HH)}H, the matrix Vi is replaced with a matrix Ui in equation (5), and the matrix Uƒ for the last iteration is used as the final estimate of U, or {circumflex over (U)}=Uƒ. Alternatively, from equation (2), the matrix U may be expressed as: U=HVΣ−1. The matrix {circumflex over (U)} may thus be computed as {circumflex over (U)}={circumflex over (HVΣ)}−1, where {circumflex over (H)} and {circumflex over (Σ)} may be obtained from the MIMO pilot and {circumflex over (V)} may be derived using the iterative procedure.


C. Eipenvector Orthoponalization


As noted above, the columns of Vƒ may not be orthogonal to one another if D is not a diagonal matrix. This may be due to various parameters such as, for example, the step size μ, the number of iterations computed for Vi, finite processor precision, and so on. The columns of Vƒ may be forced to be orthogonal to one another using various techniques such as QR factorization, minimum square error computation, and polar decomposition. QR factorization is described in detail below. The orthogonal eigenvectors from the QR factorization are normalized, and the orthonormal eigenvectors are used for spatial processing.


QR factorization decomposes the matrix Vƒ into an orthogonal matrix Q and an upper triangle matrix R. The matrix Q forms an orthogonal basis for the columns of Vƒ, and the diagonal elements of R give the length of the components of the columns of Vƒ in the directions of the respective columns of Q. The matrices Q and R may be used to derive an enhanced matrix {tilde over (V)} having orthogonal columns.


The QR factorization may be performed using various methods, including a Gram-Schmidt procedure, a householder transformation, and so on. The Gram-Schmidt procedure is recursive and may be numerically unstable. Various variants of the Gram-Schmidt procedure have been devised and are known in the art. The “classical” Gram-Schmidt procedure for orthogonalizing the matrix Vƒ is described below.


For QR factorization, the matrix Vƒ may be expressed as:

Vƒ=QR,   Eq (9)

where

    • Q is an NT×NT orthogonal matrix; and
    • R is an NT×NT upper triangle matrix with possible non-zero values along and above the diagonal and zeros below the diagonal.


The Gram-Schmidt procedure generates the matrices Q and R column by column. The following notations are used for the description below:


i is an index for the rows of a matrix;


j is an index for the columns of a matrix;



Q=[q1 q2 . . . qNT], where qj is the j-th column of Q;


qi,j is the element in the i-th row and j-th column of Q;


{tilde over (Q)}=[{tilde over (q)}1 {tilde over (q)}2 . . . {tilde over (q)}NT], where {tilde over (q)}j is the j-th column of {tilde over (Q)};


ri,j is the element in the i-th row and j-th column of R;



V
ƒ=[v1 v2 . . . vNT], where vj is the j-th column of Vƒ; and


νi,j is the element in the i-th row and j-th column of Vƒ.


The first column of Q and R may be obtained as follows:
r1,1=v_1=[i=1NTvi,12]1/2,andEq(10)q_1=1r1,1v_1.Eq(11)

The first column of R includes one non-zero value for the element r1,1 in the first row and zeros elsewhere, where r1,1 is equal to the 2-norm of v1. The first column of Q is a normalized version of the first column of Vƒ, where the normalization is achieved by scaling each element of v1 with the inverse of r1,1.


Each of the remaining columns of Q and R may be obtained as follows:

FOR j = 2, 3, . . . NT {Eq (12)  FOR i = 1, 2, . . . j − 1 {  ri,j = qiH vj }q~_j=v_j-i=1j-1ri,j·q_iEq (13)rj,j=||q~_j||Eq (14)q_j=1rj,j·q~_j}Eq (15)


The Gram-Schmidt procedure generates one column at a time for Q. Each new column of Q is forced to be orthogonal to all prior-generated columns to the left of the new column. This is achieved by equations (13) and (15), where the j-th column of Q (or qj) is generated based on {tilde over (q)}j, which in turn is generated based on the j-th column of Vƒ (or vj) and subtracting out any components in vj pointing in the direction of the other j−1 columns to the left of vj. The diagonal elements of R are computed as the 2-norm of the columns of {tilde over (Q)} (where {tilde over (q)}1=v1), as shown in equation (14).


Improved performance may be obtained by ordering the columns of Vƒ based on the diagonal elements of D prior to performing the QR factorization. The diagonal elements of D may be ordered such that {d1≧d2≧ . . . ≧dNT}, where d1 is the largest and dNT is the smallest diagonal element of D. When the diagonal elements of D are ordered, the columns of Vƒ are also ordered correspondingly. The first or left-most column of the ordered Vƒ would then be associated with the largest diagonal element of D, and the last or right-most column of the ordered Vƒ would be associated with the smallest diagonal element of D.


If the columns of Vƒ are ordered based on decreasing values of their associated diagonal elements, then the columns/eigenvectors of Q are forced to be orthogonal to the first column/eigenvector, which is associated with the largest diagonal element and has the largest gain. The ordering thus has a beneficial effect of rejecting certain noise components of each of the remaining eigenvectors of Q. In particular, the j-th column of Q (or qj) is generated based on the j-th column of Vƒ (or vj), and noise components in vj that point in the direction of the j−1 eigenvectors to the left of qj (which are associated with higher gains) are subtracted from vj to obtain qj. The ordering also has another beneficial effect of improving the estimates of eigenvectors associated with smaller diagonal elements. The overall result is improved performance, especially if the orthogonalized eigenvectors of Q are used for spatial processing.


The enhanced matrix {tilde over (V)} from the QR factorization may be expressed as:

{tilde over (V)}=Q{tilde over (R)},   Eq (16)

where {tilde over (R)} is a diagonal matrix that contains only the diagonal elements of R. The matrix {circumflex over (V)} may be set equal to the enhanced matrix {tilde over (V)}, or {circumflex over (V)}={tilde over (V)}.



FIG. 2 shows a flow diagram of an iterative process 200 for deriving the matrix {circumflex over (V)} of eigenvectors. The eigenmode matrix Vi is first initialized, e.g., to the identity matrix or to a matrix of eigenvectors derived for another transmission interval or another subband (block 212). The matrix A is computed based on the channel response matrix {circumflex over (H)}, e.g., as shown in equation (4) (block 214).


The eigenmode matrix Vi is then iteratively computed for a number of iterations. For each iteration, the matrix Y is first computed based on the matrices Vi and A, e.g., Y=VHiAVi (block 216). The update matrix Z is then computed based on the matrices Vi and Y, e.g., Z=(Tri_up (Y)−Tri_low (Y))Vi (block 218). Blocks 216 and 218 represent one way to obtain the update matrix Z. The eigenmode matrix is then updated based on the matrix Z, e.g., Vi+1=Vi+μ·Z (block 220).


A determination is next made whether to orthogonalize the eigenvectors in the eigenmode matrix (block 222). Orthogonalization may be performed, for example, if North iterations have been completed since the start of the iterative procedure or since the last eigenvector orthogonalization. If the answer is ‘yes’ for block 222, then eigenvector orthogonalization is performed to obtain orthogonal columns for the eigenmode matrix (block 224). Otherwise, if the answer is ‘no’, then block 224 is skipped. In any case, a determination is next made whether or not to terminate the iterative procedure (block 226). The procedure may be terminated, e.g., after a fixed number of iterations or if a termination condition is encountered. If the answer is ‘no’ for block 226, then the process returns to block 216 to perform another iteration. Otherwise, the eigenmode matrix Vƒ for the last iteration is provided as the matrix {circumflex over (V)} of eigenvectors (block 228), and the process terminates. Although not shown in FIG. 2, eigenvector orthogonalization may be performed on Vƒ, and the enhanced matrix {tilde over (V)} with orthogonal columns may be provided as the matrix {tilde over (V)}.


D. Spatial Processing


The channel estimation and eigenvector computation may be performed in various manners. For example, referring to FIG. 1, the receiving entity can receive a MIMO pilot from the transmitting entity, obtain the channel response matrix {tilde over (H)} based on the MIMO pilot, derive the matrix {tilde over (V)} of eigenvectors based on {tilde over (H)} and using the iterative procedure, and derive a spatial filter matrix {circumflex over (M)} based on {circumflex over (V)} and {circumflex over (H)}. The receiving entity can send {tilde over (V)} back to the transmitting entity.


The transmitting entity performs spatial processing for data transmission on the NS eigenmodes, as follows:

x={circumflex over (V)}s,   Eq (17)

where

    • s is an NT×1 vector with up to NS data symbols to be sent on the NS eigenmodes in one symbol period; and
    • x is an NT×1 vector with NT transmit symbols to be sent from the NT transmit antennas in one symbol period.


      The eigenvectors in {circumflex over (V)} are also referred to as transmit vectors or steering vectors.


The received symbols at the receiving entity may be expressed as:

r=Hx+n,   Eq (18)

where r is an NR×1 vector with NR received symbols obtained via the NR receive antennas, and n is a noise vector.


The receiving entity may perform spatial matched filtering, as follows:

{circumflex over (s)}={circumflex over (Λ)}−1{circumflex over (Mr)}={circumflex over (Λ)}−1{circumflex over (V)}H{circumflex over (H)}Hr={circumflex over (Λ)}−1{circumflex over (V)}H{circumflex over (H)}H(H{circumflex over (V)}s+n)≅s+n′,   Eq (19)

where

    • {circumflex over (s)} is an NT×1 vector with NT detected symbols;
    • {circumflex over (M)} is an NT×NR spatial filter matrix, which is {circumflex over (M)}={circumflex over (V)}H{circumflex over (H)}H; and
    • n′ is a post-detection noise vector.


      As shown in equation (19), the receiving entity can recover the transmitted data symbols by performing spatial matched filtering with {circumflex over (M)} and then scaling with {circumflex over (Λ)}−1 to obtain the detected symbols. If {circumflex over (H)} is a good estimate of H and {circumflex over (V)} is a good estimate of V, then {circumflex over (s)} is a good estimate of s in the absence of noise.


For simplicity, the above description assumes a full rank MIMO channel with NS=NT≦NR. The MIMO channel may be rank deficient so that NS<NT≦NR, or the number of receive antennas may be less than the number of transmit antennas so that NS≦NR<NT. For both of these cases, the NT×1 vector s would contain non-zero values for the first NS elements and zeros for the remaining NT-NS elements, and the NT×1 vector {circumflex over (s)} would contain NS detected symbols for the first NS elements. Alternatively, the vector s may have a dimension of NS×1, the matrix {circumflex over (V)} may have a dimension of NT×NS, and the spatial filter matrix {circumflex over (M)} may have a dimension of NS×NT.


E. Channel Estimation


Data may be transmitted in various manners in the MIMO system. For a burst mode, data is transmitted in a small number of frames (e.g., one frame). A frame may be defined as a transmission interval of a predetermined time duration (e.g., 2 msec). The channel estimation is then performed based on the pilot received in a limited number of frames. For a continuous mode, data is transmitted in a larger number of frames, either continuously or with small gaps in transmission. The channel estimation may then be performed based on the pilot received in multiple frames.


The receiving entity can estimate the MIMO channel response for each frame n based on the pilot received in that frame and obtain a channel response matrix {circumflex over (H)}(n) for that frame. To improve the quality of the channel estimate, the receiving entity can filter the channel response matrices obtained for the current and prior frames using a finite impulse response (FIR) filter, an infinite impulse response (IIR) filter, or some other type of filter. The filtering is performed on each of the elements in the channel response matrix and provides a filtered channel response matrix {tilde over (H)}(n). For example, the filtering of the channel response matrices for multiple frames with a single-tap IIR filter may be expressed as:

{tilde over (h)}i,j(n)=α·{tilde over (h)}i,j(n−1)+(1−α)·ĥi,j(n), for i=1 . . . NR and j=1 . . . NT,   Eq (20)

where

    • ĥi,j(n) is the element in the i-th row and j-th column of {circumflex over (H)}(n) for frame n;
    • {tilde over (h)}i,j(n−1) is the (i,j)-th element of {tilde over (H)}(n−1) for frame n−1;
    • {tilde over (h)}i,j(n) is the (i,j)-th element of {tilde over (H)}(n) for frame n; and
    • α is a coefficient for the IIR filter.


      A larger value for α gives greater weight to the prior channel estimates, and a smaller value for α gives greater weight to the current channel estimate. The coefficient α may be set as α=0.75, for example, or to some other value.


The eigenvector computation may be performed for each frame n based on either the unfiltered channel response matrix {circumflex over (H)}(n) or the filtered channel response matrix {circumflex over (H)}(n) obtained for that frame. For each frame, the eigenmode matrix Vi may be initialized to either the identity matrix or the matrix {circumflex over (V)}(n−1) of eigenvectors obtained for a prior frame. The eigenvector computation for each frame provides the matrix {circumflex over (V)}(n) of eigenvectors that can be used for spatial processing in that frame.


The eigenvector orthogonalization may be performed in various manners, which may be dependent on whether data is transmitted using the burst mode or the continuous mode. In one embodiment for both modes, eigenvector orthogonalization is performed on the eigenmode matrix Vƒ for the last iteration, and the matrix {circumflex over (V)}(n) is set to the enhanced matrix {tilde over (V)} having orthogonal columns. In another embodiment for the burst mode, eigenvector orthogonalization is not performed, and the matrix {circumflex over (V)}(n) is simply set to the eigenmode matrix Vƒ. In another embodiment for the continuous mode, eigenvector orthogonalization is performed after every Nfr frames, where Nfr may be set to Nfr=5, for example, or to some other value. If filtering is performed on the channel response matrices, then eigenvector orthogonalization may be performed periodically to ensure stability of the iterative procedure. In general, eigenvector orthogonalization may improve performance and stability but require computation. Eigenvector orthogonalization may thus be performed sparingly or as necessary to reduce the amount of computation required to derive {circumflex over (V)}(n).


F. Multi-Carrier MIMO System


The eigenvector computation techniques described herein may also be used for a multi-carrier MIMO system. Multiple carriers may be obtained with orthogonal frequency division multiplexing (OFDM), some other multi-carrier modulation techniques, or some other construct. OFDM effectively partitions the overall system bandwidth into multiple (NF) orthogonal subbands, which are also referred to as tones, subcarriers, bins, and frequency channels. With OFDM, each subband is associated with a respective subcarrier that may be modulated with data.


For a multi-carrier MIMO system, the eigenvector computation may be performed for each subband used for data transmission (or each “data” subband). A channel response matrix {circumflex over (H)}(k) may be obtained for each data subband k based on, e.g., a MIMO pilot received on that subband. Eigenvector computation may be performed on {circumflex over (H)}(k) for each data subband k to obtain a matrix {circumflex over (V)}(k) of eigenvectors that may be used for spatial processing for that subband. A high degree of correlation may exist in the channel response matrices for nearby subbands, especially for a flat fading MIMO channel. The eigenvector computation may be performed in a manner to take advantage of this correlation. For example, the eigenmode matrix Vi(k) for each data subband may be initialized to the matrix of eigenvectors obtained for an adjacent subband, e.g., Vi(k)={circumflex over (V)}(k−1) or Vi(k)={circumflex over (V)}(k+1). As another example, a first set of matrices of eigenvectors may be iteratively derived for a first set of subbands, and a second set of matrices of eigenvectors for a second set of subbands may be derived by interpolating the matrices in the first set.


G. System



FIG. 3 shows a block diagram of an embodiment of an access point 310 and a user terminal 350 in a MIMO system 300. Access point 310 is equipped with Nap antennas that may be used for data transmission and reception, and user terminal 350 is equipped with Nut antennas, where Nap>1 and Nut>1.


On the downlink, at access point 310, a TX data processor 314 receives traffic data from a data source 312 and signaling and other data from a controller 330. TX data processor 314 formats, codes, interleaves, and modulates (or symbol maps) the different types of data and provides data symbols. A TX spatial processor 320 receives the data symbols from TX data processor 314, performs spatial processing on the data symbols with one or more matrices of eigenvectors for the downlink (e.g., as shown in equation (17)), multiplexes in pilot symbols as appropriate, and provides Nap streams of transmit symbols to Nap transmitter units 322a through 322ap. Each transmitter unit 322 receives and processes a respective transmit symbol stream and provides a corresponding downlink modulated signal. Nap downlink modulated signals from transmitter units 322a through 322ap are then transmitted from Nap antennas 324a through 324ap, respectively.


At user terminal 350, Nut antennas 352a through 352ut receive the transmitted downlink modulated signals, and each antenna provides a received signal to a respective receiver unit 354. Each receiver unit 354 performs processing complementary to that performed by receiver unit 322 and provides received symbols. An RX spatial processor 360 then performs spatial matched filtering on the received symbols from all Nut receiver units 354a through 354ut (e.g., as shown in equation (20)) to obtain detected symbols. An RX data processor 370 processes (e.g., symbol demaps, deinterleaves, and decodes) the detected symbols and provides decoded data to a data sink 372 for storage and/or a controller 380 for further processing.


The processing for the uplink may be the same or different from the processing for the downlink. Data and signaling are processed (e.g., coded, interleaved, and modulated) by a TX data processor 388, spatially processed by a TX spatial processor 390 with one or more matrices of eigenvectors for the uplink, and multiplexed with pilot symbols to generate Nut transmit symbol streams. Nut transmitter units 354a through 354ut further condition the Nut transmit symbol streams to generate Nut uplink modulated signals, which are then transmitted via Nut antennas 352a through 352ut.


At access point 310, the uplink modulated signals are received by Nap antennas 324a through 324ap and processed by Nap receiver units 322a through 322ap to obtain received symbols for the uplink. An RX spatial processor 340 performs spatial matched filtering on the received symbols and provides detected symbols, which are further processed by an RX data processor 342 to obtain decoded data for the uplink.


Digital signal processors (DSPs) 328 and 378 perform channel estimation and eigenvector computation for the access point and user terminal, respectively. Controllers 330 and 380 control the operation of various processing units at the access point and user terminal, respectively. Memory units 332 and 382 store data and program codes used by controllers 330 and 380, respectively.



FIG. 4 shows an embodiment of DSP 378, which performs channel estimation and eigenvector computation for the user terminal. A channel estimator 412 obtains received pilot symbols (which are received symbols for a MIMO pilot sent on the downlink) and derives a channel response matrix {circumflex over (H)}(n) for the current frame n. A filter 414 performs time-domain filtering of the channel response matrices for the current and prior frames, e.g., as shown in equation (21), and provides a filtered channel response matrix {tilde over (H)}(n) for the current frame. An eigenvector computation unit 416 derives an eigenmode matrix Vƒ(n) for the current frame based on either the unfiltered matrix {circumflex over (H)}(n) or the filtered matrix {tilde over (H)}(n) and using the iterative procedure, as described above. A unit 418 performs eigenvector orthogonalization on Vƒ(n), if enabled, and provides the matrix {circumflex over (V)}(n) of eigenvectors for the current frame. Unit 418 may skip the eigenvector orthogonalization and provide the matrix {circumflex over (V)}ƒ(n) from unit 416 directly as the matrix {circumflex over (V)}(n). This may be the case, for example, for the burst mode, or if less than Nfr frames have elapsed since the last orthogonalization for the continuous mode. Unit 418 may also perform the eigenvector orthogonalization and provide an enhance matrix {tilde over (V)}(n) with orthogonal columns as the matrix {circumflex over (V)}(n). Units 416 and 418 may also pass their results back and forth multiple times for the current frame. In any case, the matrix {circumflex over (V)}(n) may be sent back to the access point and used for spatial processing for downlink data transmission. A unit 420 derives a spatial matched filter {circumflex over (M)}ut(n) for the user terminal based on the matrix {circumflex over (V)}(n) and either the unfiltered matrix {circumflex over (H)}(n) or the filtered matrix {tilde over (H)}(n), e.g., {circumflex over (M)}ut(n)={circumflex over (V)}H(n){circumflex over (H)}H(n) or {circumflex over (M)}ut(n)={circumflex over (V)}H(n){tilde over (H)}H(n).



FIG. 4 shows a representation of the processing units for channel estimation and eigenvector computation. The processing by the various units in FIG. 4 may be performed, for example, in a time division multiplex (TDM) manner by shared multipliers and adders within DSP 378.


DSP 328 performs channel estimation and eigenvector computation for the access point. The processing by DSP 328 may be the same or different from the processing by DSP 378, depending on the channel structure and pilot transmission scheme used for the MIMO system.


System 300 may utilize a frequency division duplex (FDD) or a time division duplex (TDD) channel structure. For the FDD structure, the downlink and uplink are allocated separate frequency bands, and the channel response matrix for one link may not correlate well with the channel response matrix for the other link. In this case, the channel estimation and eigenvector computation may be performed separately for each link. For the TDD structure, the downlink and uplink share the same frequency band, with the downlink being allocated a portion of the time and the uplink being allocated the remaining portion of the time. The channel response matrix for one link may be highly correlated with the channel response matrix for the other link. In this case, the channel estimation and eigenvector computation may be performed in a manner to take advantage of this correlation, as described below.



FIG. 5 shows an example frame structure 500 that may be used for a TDD MIMO system. Data transmission occurs in units of TDD frames, with each TDD frame covering a predetermined time duration (e.g., 2 msec). Each TDD frame is partitioned into a downlink phase 510a and an uplink phase 510b. Each phase 510 includes a pilot portion 520 and a data and signaling portion 530. Pilot portion 520 for each link is used to transmit one or more types of pilot, which may be used to estimate the MIMO channel response or the eigenvectors for that link. Data and signaling portion 530 for each link is used to transmit data and signaling. Each phase 510 may include multiple pilot portions 520 and/or multiple data and signaling portions 530, but this is not shown in FIG. 5 for simplicity.


For a TDD MIMO system, the downlink and uplink channel responses may be assumed to be reciprocal of one another. That is, if H represents the channel response matrix from antenna array A to antenna array B, then a reciprocal channel implies that the coupling from array B to array A is given by HT, where “T” denotes the transpose. Typically, the responses of the transmit and receive chains at the access point are not equal to the responses of the transmit and receive chains at the user terminal. Calibration may be performed to determine and account for the differences in the transmit/receive responses at the two entities. For simplicity, the following description assumes that the transmit and receive chains at the access point and user terminal are flat, H is the channel response matrix for the downlink, and HT is the channel response matrix for the uplink. The channel estimation and eigenvector computation may be simplified for a reciprocal channel.


For a reciprocal MIMO channel, the singular value decomposition for the downlink and uplink may be expressed as:

HT=UapΣVHut (uplink), and   Eq (21)
H=V*utΣTUTap (downlink),   Eq (22)

where

    • Uap is an Nap×Nap unitary matrix of left eigenvectors of HT,
    • Σ is an Nap×Nut diagonal matrix of singular values of HT,
    • Vut is an Nut×Nut unitary matrix of right eigenvectors of HT, and
    • “*” denotes the complex conjugate.


      The matrices V*ut and U*ap are also matrices of left and right eigenvectors, respectively, of H. The matrices Uap and Vut may be used by the access point and user terminal, respectively, for spatial processing for both data transmission and reception and are denoted as such by their subscripts.


The channel estimation and eigenvector computation may be performed in various manners for the TDD MIMO system. In one embodiment, the access point transmits a MIMO pilot on the downlink. The user terminal estimates the MIMO channel response based on the downlink MIMO pilot and obtains the channel response matrix {circumflex over (H)} for the downlink. The user terminal then performs decomposition of {circumflex over (H)}T using the iterative procedure described above and obtains {circumflex over (V)}ut, which is an estimate of the matrix Vut of right eigenvectors of HT. The user terminal uses the matrix {circumflex over (V)}ut for both (1) spatial processing for an uplink data transmission to the access point and (2) spatial matched filtering of a downlink data transmission from the access point. The user terminal transmits a steered reference on the uplink using {circumflex over (V)}ut. A steered reference (or steered pilot) is a pilot that is transmitted from all antennas and on the eigenmodes of the MIMO channel. The access point derives {circumflex over (U)}ap, which is an estimate of Uap, based on the uplink steered reference sent by the user terminal. The access point then uses the matrix {circumflex over (U)}ap for both (1) spatial processing for the downlink data transmission to the user terminal and (2) spatial matched filtering of the uplink data transmission from the user terminal. With a reciprocal channel, the MIMO pilot may be sent on only one link (e.g., the downlink), and the eigenvector computation may be performed by only one entity (e.g., the user terminal) to derive matrices of eigenvectors used by both entities.


System 300 may or may not utilize OFDM for data transmission. If system 300 utilizes OFDM, then NF total subbands are available for transmission. Of the NF total subbands, ND subbands may be used for data transmission and are referred to as data subbands, NP subbands may be used for a carrier pilot and are referred to as pilot subbands, and NG subbands may be used as guard subbands (no transmission), where NF=ND+NP+NG. In each OFDM symbol period, up to ND data symbols may be sent on the ND data subbands, and up to NP pilot symbols may be sent on the NP pilot subbands. For OFDM modulation, NF frequency-domain values (for ND data symbols, NP pilot symbols, and NG zeros) are transformed to the time domain with an NF-point inverse fast Fourier transform (IFFT) to obtain a “transformed” symbol that contains NF time-domain chips. To combat intersymbol interference (ISI), which is caused by frequency selective fading, a portion of each transformed symbol is repeated to form a corresponding OFDM symbol. The repeated portion is often referred to as a cyclic prefix or guard interval. An OFDM symbol period (which is also referred to as simply a “symbol period”) is the duration of one OFDM symbol. In FIG. 3, the OFDM modulation for each transmit antenna may be performed by the transmitter unit for that antenna. The complementary OFDM demodulation for each receive antenna may be performed by the receiver unit for that antenna.


The eigenvector computation techniques described herein may be implemented by various means. For example, these techniques may be implemented in hardware, software, or a combination thereof. For a hardware implementation, the processing units used to perform the eigenvector computation may be implemented within one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), processors, controllers, micro-controllers, microprocessors, other electronic units designed to perform the functions described herein, or a combination thereof.


For a software implementation, the eigenvector computation techniques may be implemented with modules (e.g., procedures, functions, and so on) that perform the functions described herein. The software codes may be stored in a memory unit (e.g., memory unit 332 or 382 in FIG. 3) and executed by a processor (e.g., DSP 328 or 378, or controller 330 or 380). The memory unit may be implemented within the processor or external to the processor, in which case it can be communicatively coupled to the processor via various means as is known in the art.


Headings are included herein for reference and to aid in locating certain sections. These headings are not intended to limit the scope of the concepts described therein under, and these concepts may have applicability in other sections throughout the entire specification.


The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims
  • 1. A method of deriving eigenvectors for a multiple-input multiple-output (MIMO) communication system, comprising: initializing a first matrix of eigenvectors; and updating the first matrix based on a channel response matrix for a MIMO channel, wherein the first matrix is updated for a plurality of iterations, and wherein eigenvectors in the updated first matrix are used for spatial processing to transmit data via the MIMO channel.
  • 2. The method of claim 1, wherein the updating of the first matrix is based on the following equation:
  • 3. The method of claim 1, wherein the updating of the first matrix comprises: computing a second matrix based on the first matrix and the channel response matrix, computing an update matrix based on the first and second matrices, and updating the first matrix with the update matrix.
  • 4. The method of claim 3, wherein the second matrix is computed based on the following equation:
  • 5. The method of claim 1, further comprising: orthogonalizing the eigenvectors in the updated first matrix.
  • 6. The method of claim 5, wherein the orthogonalization of the eigenvectors in the updated first matrix is performed using QR factorization.
  • 7. The method of claim 3, further comprising: ordering the eigenvectors in the updated first matrix based on diagonal elements of the second matrix; and orthogonalizing the ordered eigenvectors in the updated first matrix.
  • 8. The method of claim 1, wherein the first matrix is initialized to an identity matrix.
  • 9. The method of claim 1, wherein the first matrix is initialized with eigenvectors obtained for a prior transmission interval.
  • 10. The method of claim 1, wherein the MIMO system utilizes orthogonal frequency division multiplexing (OFDM), and wherein a different first matrix of eigenvectors is derived for each of a plurality of subbands based on a channel response matrix obtained for the subband.
  • 11. The method of claim 10, wherein a first matrix of eigenvectors for a first subband is initialized to a first matrix of eigenvectors derived for a second subband.
  • 12. The method of claim 1, wherein the first matrix is updated for a predetermined number of iterations.
  • 13. The method of claim 1, wherein the first matrix is updated for a variable number of iterations until a termination condition is encountered.
  • 14. The method of claim 3, wherein the first matrix is updated for a variable number of iterations until a sum of squared magnitude of off-diagonal elements of the second matrix is less than a threshold.
  • 15. The method of claim 1, further comprising: estimating a channel response for a first link of the MIMO channel to obtain the channel response matrix, and wherein the updated first matrix of eigenvectors is used for spatial processing for data transmission on a second link of the MIMO channel.
  • 16. The method of claim 1, further comprising: estimating a channel response for a downlink of the MIMO channel to obtain the channel response matrix, and wherein the updated first matrix of eigenvectors is used for spatial processing for data transmission on an uplink of the MIMO channel.
  • 17. The method of claim 1, further comprising: filtering a plurality of channel response matrices for a plurality of transmission intervals to obtain a filtered channel response matrix, and wherein the first matrix is updated based on the filtered channel response matrix.
  • 18. The method of claim 1, further comprising: deriving a spatial filter matrix based on the updated first matrix of eigenvectors and the channel response matrix, wherein the spatial filter matrix is used for spatial matched filtering of a data transmission received via the MIMO channel.
  • 19. The method of claim 1, wherein the first matrix and the channel response matrix contain complex-valued elements.
  • 20. An apparatus in a multiple-input multiple-output (MIMO) communication system, comprising: a channel estimator operative to obtain a channel response matrix for a MIMO channel; and a first unit operative to initialize a first matrix of eigenvectors and to update the first matrix based on the channel response matrix, wherein the first matrix is updated for a plurality of iterations, and wherein eigenvectors in the updated first matrix are used for spatial processing to transmit data via the MIMO channel.
  • 21. The apparatus of claim 20, wherein the first unit is operable to update the first matrix based on the following equation:
  • 22. The apparatus of claim 20, further comprising: a second unit operative to orthogonalize the eigenvectors in the updated first matrix.
  • 23. The apparatus of claim 22, wherein the first and second units are implemented by a digital signal processor.
  • 24. The apparatus of claim 20, further comprising: a third unit operative to derive a spatial filter matrix based on the updated first matrix and the channel response matrix.
  • 25. The apparatus of claim 20, further comprising: a filter operative to filter a plurality of channel response matrices for a plurality of transmission intervals and provide a filtered channel response matrix, and wherein the first matrix is updated based on the filtered channel response matrix.
  • 26. The apparatus of claim 20, wherein the MIMO system utilizes orthogonal frequency division multiplexing (OFDM), and wherein a different first matrix of eigenvectors is derived for each of a plurality of subbands based on a channel response matrix obtained for the subband.
  • 27. An apparatus in a multiple-input multiple-output (MIMO) communication system, comprising: means for initializing a first matrix of eigenvectors; and means for updating the first matrix based on a channel response matrix for a MIMO channel, wherein the first matrix is updated for a plurality of iterations, and wherein eigenvectors in the updated first matrix are used for spatial processing to transmit data via the MIMO channel.
  • 28. The apparatus of claim 27, wherein the first matrix is updated based on the following equation:
  • 29. The apparatus of claim 27, further comprising: means for orthogonalizing the eigenvectors in the updated first matrix.
  • 30. The apparatus of claim 27, further comprising: means for deriving a spatial filter matrix based on the updated first matrix and the channel response matrix.
  • 31. The apparatus of claim 27, further comprising: means for filtering a plurality of channel response matrices for a plurality of transmission intervals to obtain a filtered channel response matrix, and wherein the first matrix is updated based on the filtered channel response matrix.