I. Field
The present disclosure relates generally to communication, and more specifically to techniques for computing filter weights in a communication system.
II. Background
A multiple-input multiple-output (MIMO) communication system employs multiple (T) transmit antennas at a transmitting station and multiple (R) receive antennas at a receiving station for data transmission. A MIMO channel formed by the T transmit antennas and the R receive antennas may be decomposed into S spatial channels, where S≦min {T, R}. The S spatial channels may be used to transmit data in a manner to achieve higher overall throughput and/or greater reliability.
The transmitting station may simultaneously transmit T data streams from the T transmit antennas. These data streams are distorted by the MIMO channel response and further degraded by noise and interference. The receiving station receives the transmitted data streams via the R receive antennas. The received signal from each receive antenna contains scaled versions of the T data streams sent by the transmitting station. The transmitted data streams are thus dispersed among the R received signals from the R receive antennas. The receiving station would then perform receiver spatial processing on the R received signals with a spatial filter matrix in order to recover the transmitted data streams.
The derivation of the weights for the spatial filter matrix is computationally intensive. This is because the spatial filter matrix is typically derived based on a function that contains a matrix inversion, and direct calculation of the matrix inversion is computationally intensive.
There is therefore a need in the art for techniques to efficiently compute the filter weights.
Techniques for efficiently computing the weights for a spatial filter matrix are described herein. These techniques avoid direct computation of matrix inversion.
In a first embodiment for deriving a spatial filter matrix M, a Hermitian matrix P is iteratively derived based on a channel response matrix H, and a matrix inversion is indirectly calculated by deriving the Hermitian matrix iteratively. The Hermitian matrix may be initialized to an identity matrix. One iteration is then performed for each row of the channel response matrix, and an efficient sequence of calculations is performed for each iteration. For the i-th iteration, an intermediate row vector ai is derived based on a channel response row vector hi, which is the i-th row of the channel response matrix. A scalar ri is derived based on the intermediate row vector and the channel response row vector. An intermediate matrix Ci is also derived based on the intermediate row vector. The Hermitian matrix is then updated based on the scalar and the intermediate matrix. After all of the iterations are completed, the spatial filter matrix is derived based on the Hermitian matrix and the channel response matrix.
In a second embodiment, multiple rotations are performed to iteratively obtain a first matrix P1/2 and a second matrix B for a pseudo-inverse matrix of the channel response matrix. One iteration is performed for each row of the channel response matrix. For each iteration, a matrix Y containing the first and second matrices from the prior iteration is formed. Multiple Givens rotations are then performed on matrix Y to zero out elements in the first row of the matrix to obtain updated first and second matrices for the next iteration. After all of the iterations are completed, the spatial filter matrix is derived based on the first and second matrices.
In a third embodiment, a matrix X is formed based on the channel response matrix and decomposed (e.g., using eigenvalue decomposition) to obtain a unitary matrix V and a diagonal matrix Λ. The decomposition may be achieved by iteratively performing Jacobi rotations on matrix X. The spatial filter matrix is then derived based on the unitary matrix, the diagonal matrix, and the channel response matrix.
Various aspects and embodiments of the invention are described in further detail below.
The features and nature of the present invention will become more apparent from the detailed description set forth below when taken in conjunction with the drawings in which like reference characters identify correspondingly throughout.
The word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any embodiment or design described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments or designs.
The filter weight computation techniques described herein may be used for a single-carrier MIMO system and a multi-carrier MIMO system. Multiple carriers may be obtained with orthogonal frequency division multiplexing (OFDM), interleaved frequency division multiple access (IFDMA), localized frequency division multiple access (LFDMA), or some other modulation technique. OFDM, IFDMA, and LFDMA effectively partition the overall system bandwidth into multiple (K) orthogonal frequency subbands, which are also called tones, subcarriers, bins, and frequency channels. Each subband is associated with a respective subcarrier that may be modulated with data. OFDM transmits modulation symbols in the frequency domain on all or a subset of the K subbands. IFDMA transmits modulation symbols in the time domain on subbands that are uniformly spaced across the K subbands. LFDMA transmits modulation symbols in the time domain and typically on adjacent subbands. For clarity, much of the following description is for a single-carrier MIMO system with a single subband.
A MIMO channel formed by multiple (T) transmit antennas at a transmitting station and multiple (R) receive antennas at a receiving station may be characterized by an R×T channel response matrix H, which may be given as:
where
The transmitting station may transmit T modulation symbols simultaneously from the T transmit antennas in each symbol period. The transmitting station may or may not perform spatial processing on the modulation symbols prior to transmission. For simplicity, the following description assumes that each modulation symbol is sent from one transmit antenna without any spatial processing.
The receiving station obtains R received symbols from the R receive antennas in each symbol period. The received symbols may be expressed as:
r=H·s+n, Eq (2)
where
The receiving station may use various receiver spatial processing techniques to recover the modulation symbols sent by the transmitting station. For example, the receiving station may perform minimum mean square error (MMSE) receiver spatial processing, as follows:
ŝ=(σn2·I+HH·H)−1·HH·r=P·HH·r=M·r, Eq (3)
where
As shown in equation (3), the MMSE spatial filter matrix M has a matrix inverse calculation. Direct calculation of the matrix inversion is computationally intensive. The MMSE spatial filter matrix may be more efficiently derived based on the embodiments described below, which indirectly calculate the matrix inversion with an iterative process instead of directly calculating the matrix inversion.
In a first embodiment of computing the MMSE spatial filter matrix M, the Hermitian matrix P is computed based on the Riccati equation. Hermitian matrix P may be expressed as:
A T×T Hermitian matrix Pi may be defined as:
The matrix inversion lemma may be applied to equation (5) to obtain the following:
where ri is a real-valued scalar. Equation (6) is referred to as the Riccati equation. Matrix Pi may be initialized as
After performing R iterations of equation (6), for i=1, . . . , R, matrix PR is provided as matrix P, or P=PR.
Equation (6) may be factored to obtain the following:
where matrix Pi is initialized as P0=I and matrix P is derived as
Equations (6) and (7) are different forms of a solution to equation (5). For simplicity, the same variables Pi and ri are used for both equations (6) and (7) even though these variables have different values in the two equations. The final results from equations (6) and (7), i.e., PR for equation (6) and
for equation (7), are equivalent. However, the calculations for the first iteration of equation (7) are simplified because P0 is an identity matrix.
Each iteration of equation (7) may be performed as follows:
ai=hi·Pi−1, Eq (8a)
ri=σn2+ai·hiH, Eq (8b)
Ci=aiH·ai, and Eq (8c)
Pi=Pi−1−ri−1·Ci, Eq (8d)
where
In equation set (8), the sequence of operations is structured for efficient computation by hardware. Scalar ri is computed before matrix Ci. The division by ri in equation (7) is achieved with an inversion and a multiply. The inversion of ri may be performed in parallel with the computation of Ci. The inversion of ri may be achieved with a shifter to normalize ri and a look-up table to produce an inverted ri value. The normalization of ri may be compensated for in the multiplication with Ci.
Matrix Pi is initialized as a Hermitian matrix, or P0=I, and remains Hermitian through all of the iterations. Hence, only the upper (or lower) diagonal matrix needs to be calculated for each iteration. After R iterations are completed, matrix P is obtained as
The MMSE spatial filter matrix may then be computed as follows:
Each iteration of the Riccati equation is performed by block 120. For the i-th iteration, the intermediate row vector ai is computed based on the channel response row vector hi and the Hermitian matrix Pi−1 from the prior iteration, as shown in equation (8a) (block 122). The scalar ri is computed based on the noise variance σn2, the intermediate row vector ai, and the channel response row vector hi, as shown in equation (8b) (block 124). Scalar ri is then inverted (block 126). Intermediate matrix Ci is computed based on the intermediate row vector ai, as shown in equation (8c) (block 128). Matrix Pi is then updated based on the inverted scalar ri and the intermediate matrix Ci, as shown in equation (8d) (block 130).
A determination is then made whether all R iterations have been performed (block 132). If the answer is ‘No’, then index i is incremented (block 134), and the process returns to block 122 to perform another iteration. Otherwise, if all R iterations have been performed, then the MMSE spatial filter matrix M is computed based on the Hermitian matrix PR for the last iteration, the channel response matrix H, and the noise variance σn2, as shown in equation (9) (block 136). Matrix M may then be used for receiver spatial processing as shown in equation (3).
In a second embodiment of computing the MMSE spatial filter matrix M, the Hermitian matrix P is determined by deriving the square root of P, which is P1/2, based on an iterative procedure. The receiver spatial processing in equation (3) may be expressed as:
where
QR decomposition may be performed on the augmented channel matrix, as follows:
where
The QR decomposition in equation (11) decomposes the augmented channel matrix into an orthonormal matrix Q and a non-singular matrix R. An orthonormal matrix Q has the following property: QH·Q=I, which means that the columns of the orthonormal matrix are orthogonal to one another and each column has unit power. A non-singular matrix is a matrix that an inverse can be computed for.
The Hermitian matrix P may then be expressed as:
R is the Cholesky factorization or matrix square root of P−1. Hence, P1/2 is equal to R−1 and is called the square-root of P.
The pseudo-inverse matrix in equation (10) may then be expressed as:
Sub-matrix Hσ
Hσ
Equation (10) may then be expressed as:
ŝ=Hσ
Matrices P1/2 and B may be computed iteratively as follows:
Yi·Θi=Zi, or Eq (16)
where
The transformation in equation (17) may be performed iteratively, as described below. For clarity, each iteration of equation (17) is called an outer iteration. R outer iterations of equation (17) are performed for the R channel response row vectors hi, for i=1, . . . , R. For each outer iteration, the unitary transformation matrix Θi in equation (17) results in the transformed matrix Zi containing all zeros in the first row except for the first element. The first column of the transformed matrix Zi contains ri1/2, ki, and li. The last T columns of Zi contain updated Pi1/2 and Bi. The first column of Zi does not need to be calculated since only Pi1/2 and Bi are used in the next iteration. Pi1/2 is an upper triangular matrix. After R outer iterations are completed, PR1/2 is provided as P1/2, and BR is provided as B. The MMSE spatial filter matrix M may then be computed as based on P1/2 and B, as shown in equation (14).
For each outer iteration i, the transformation in equation (17) may be performed by successively zeroing out one element in the first row of Yi at a time with a 2×2 Givens rotation. T inner iterations of the Givens rotation may be performed to zero out the last T elements in the first row of Yi.
For each outer iteration i, a matrix Yi,j may be initialized as Yi,1=Yi. For each inner iteration j, for j=1, . . . , T, of outer iteration i, a (T+R+1)×2 sub-matrix Y′i,j containing the first and (j+1)-th columns of Yi,j is initially formed. The Givens rotation is then performed on sub-matrix Y′i,j to generate a (T+R+1)×2 sub-matrix Y″i,j containing a zero for the second element in the first row. The Givens rotation may be expressed as:
Y″i,j=Y′i,j·Gi,j, Eq (18)
where Gi,j is a 2×2 Givens rotation matrix for the j-th inner iteration of the i-th outer iteration and is described below. Matrix Yi,j+1 is then formed by first setting Yi,j+1=Yi,j, then replacing the first column of Yi,j+1 with the first column of Y″i,j, and then replacing the (j+1)-th column of Yi,j+1 with the second column of Y″i,j. The Givens rotation thus modifies only two columns of Yi,j in the j-th inner iteration to produce Yi,j+1 for the next inner iteration. The Givens rotation may be performed in-place on two columns of Yi for each inner iteration, so that intermediate matrices Yi,j, Y′i,j, Y″i,j and Yi,j+1 are not needed and are described above for clarity.
For the j-th inner iteration of the i-th outer iteration, the Givens rotation matrix Gi,j is determined based on the first element (which is always a real value) and the (j+1)-th element in the first row of Yi,j. The first element may be denoted as a, and the (j+1)-th element may be denoted as b·ejθ. The Givens rotation matrix Gi,j may then be derived as follows:
where
for equation (19).
and matrix Bi is initialized as B0=0 (block 212). Index i used to denote the outer iteration number is initialized as i=1, and index j used to denote the inner iteration number is initialized as j=1 (block 214). R outer iterations of the unitary transformation in equation (17) are then performed (block 220).
For the i-th outer iteration, matrix Yi is initially formed with the channel response row vector hi and matrices Pi−11/2 and Bi−1, as shown in equation (17) (block 222). Matrix Yi is then referred to as matrix Yi,j for the inner iterations (block 224). T inner iterations of the Givens rotation are then performed on matrix Yi,j (block 230).
For the j-th inner iteration, the Givens rotation matrix Gi,j is derived based on the first and (j+1)-th elements in the first row of Yi,j, as shown in equation (19) (block 232). The Givens rotation matrix Gi,j is then applied to the first and (j+1)-th columns of Yi,j to obtain Yi,j+1, as shown in equation (18) (block 234). A determination is then made whether all T inner iterations have been performed (block 236). If the answer is ‘No’, then index j is incremented (block 238), and the process returns to block 232 to perform another inner iteration.
If all T inner iterations have been performed for the current outer iteration and the answer is ‘Yes’ for block 236, then the latest Yi,j+1 is equal to Zi in equation (17). Updated matrices Pi1/2 and Bi are obtained from the latest Yi,j+1 (block 240). A determination is then made whether all R outer iterations have been performed (block 242). If the answer is ‘No’, then index i is incremented, and index j is reinitialized as j=1 (block 244). The process then returns to block 222 to perform another outer iteration with Pi1/2 and Bi. Otherwise, if all R outer iterations have been performed and the answer is ‘Yes’ for block 242, then the MMSE spatial filter matrix M is computed based on Pi1/2 and Bi, as shown in equation (14) (block 246). Matrix M may then be used for receiver spatial processing as shown in equation (15).
In a third embodiment of computing the MMSE spatial filter matrix M, eigenvalue decomposition of P−1 is performed as follows:
P−1=σn2·I+HH·H=V·Λ·VH, Eq (20)
where
Eigenvalue decomposition of a 2×2 Hermitian matrix X2×2 may be achieved using various techniques. In an embodiment, eigenvalue decomposition of X2×2 is achieved by performing a complex Jacobi rotation on X2×2 to obtain a 2×2 matrix V2×2 of eigenvectors of X2×2. The elements of X2×2 and V2×2 may be given as:
The elements of V2×2 may be computed directly from the elements of X2×2, as follows:
Eigenvalue decomposition of a T×T Hermitian matrix X that is larger than 2×2 may be performed with an iterative process. This iterative process uses the Jacobi rotation repeatedly to zero out off-diagonal elements in X. For the iterative process, index i denotes the iteration number and is initialized as i=1. X is a T×T Hermitian matrix to be decomposed and is set as X=P−1. Matrix Di is an approximation of diagonal matrix Λ in equation (20) and is initialized as D0=X. Matrix Vi is an approximation of unitary matrix V in equation (20) and is initialized as V0=I.
A single iteration of the Jacobi rotation to update matrices Di and Vi may be performed as follows. First, a 2×2 Hermitian matrix Dpq is formed based on the current matrix Di, as follows:
where dp,q is the element at location (p,q) in Di, pε{1, . . . , T}, qε{1, . . . , T}, and p≠q. Dpq is a 2×2 submatrix of Di, and the four elements of Dpq are four elements at locations (p,p), (p,q), (q,p) and (q,q) in Di. Indices p and q may be selected as described below.
Eigenvalue decomposition of Dpq is then performed as shown in equation set (22) to obtain a 2×2 unitary matrix Vpq of eigenvectors of Dpq. For the eigenvalue decomposition of Dpq, X2×2 in equation (21) is replaced with Dpq, and V2×2 from equation (22j) or (22k) is provided as Vpq.
A T×T complex Jacobi rotation matrix Tpq is then formed with Vpq. Tpq is an identity matrix with four elements at locations (p,p), (p,q), (q,p) and (q,q) replaced with elements v1,1, v1,2, v2,1, and v2,2, respectively, in Vpq.
Matrix Di is then updated as follows:
Di+1=TpqH·Di·Tpq. Eq (24)
Equation (24) zeros out two off-diagonal elements at locations (p,q) and (q,p) in Di. The computation may alter the values of other off-diagonal elements in Di.
Matrix Vi is also updated as follows:
Vi+1=Vi·Tpq. Eq (25)
Vi may be viewed as a cumulative transformation matrix that contains all of the Jacobi rotation matrices Tpq used on Di.
Each iteration of the Jacobi rotation zeros out two off-diagonal elements of Di. Multiple iterations of the Jacobi rotation may be performed for different values of indices p and q to zero out all of the off-diagonal elements of Di. A single sweep across all possible values of indices p and q may be performed as follows. Index p is stepped from 1 through T−1 in increments of one. For each value of p, index q is stepped from p+1 through T in increments of one. The Jacobi rotation is performed for each different combination of values for p and q. Multiple sweeps may be performed until Di and Vi are sufficiently accurate estimates of Λ and V, respectively.
Equation (20) may be rewritten as follows:
P=(σn2·I+HH·H)−1=V·Λ−1·VH, Eq (26)
where Λ−1 is a diagonal matrix whose elements are the inverse of the corresponding elements in Λ. The eigenvalue decomposition of X=P−1 provides estimates of Λ and V. Λ may be inverted to obtain Λ−1.
The MMSE spatial filter matrix may then be computed as follows:
M=P·HH=V·Λ−1·VH·HH. Eq (27)
The MMSE spatial filter matrix M derived based on each of the embodiments described above is a biased MMSE solution. The biased spatial filter matrix M may be scaled by a diagonal matrix Dmmse to obtain an unbiased MMSE spatial filter matrix Mmmse. Matrix Dmmse may be derived as Dmmse=[diag[M·H]]−1, where diag[M·H] is a diagonal matrix containing the diagonal elements of M·H.
The computation described above may also be used to derive spatial filter matrices for a zero-forcing (ZF) technique (which is also called a channel correlation matrix inversion (CCMI) technique), a maximal ratio combining (MRC) technique, and so on. For example, the receiving station may perform zero-forcing and MRC receiver spatial processing, as follows:
ŝzf=(HH·H)−1·HH·r=Pzf·HH·r=Mzf·r, Eq (28)
ŝmrc=[diag(HH·H)−1]HH·r=[diag(Pzf)]·HH·r=Mmrc·r, Eq (29)
where
The description above assumes that T modulation symbols are sent simultaneously from T transmit antennas without any spatial processing. The transmitting station may perform spatial processing prior to transmission, as follows:
x=W·s, Eq (30)
where
For clarity, the description above is for a single-carrier MIMO system with a single subband. For a multi-carrier MIMO system, a channel response matrix H(k) may be obtained for each subband k of interest. A spatial filter matrix M(k) may then be derived for each subband k based on the channel response matrix H(k) for that subband.
The computation described above for the spatial filter matrix may be performed using various types of processors such as a floating-point processor, a fixed-point processor, a Coordinate Rotational Digital Computer (CORDIC) processor, a look-up table, and so on, or a combination thereof. A CORDIC processor implements an iterative algorithm that allows for fast hardware calculation of trigonometric functions such as sine, cosine, magnitude, and phase using simple shift and add/subtract hardware. A CORDIC processor may iteratively compute each of variables r, c1 and s1 in equation set (22), with more iterations producing higher accuracy for the variable.
At user terminal 450, Nut antennas 452a through 452ut receive the transmitted downlink modulated signals, and each antenna provides a received signal to a respective receiver unit (RCVR) 454. Each receiver unit 454 performs processing complementary to the processing performed by transmitter units 422 and provides received pilot symbols and received data symbols. A channel estimator/processor 478 processes the received pilot symbols and provides an estimate of the downlink channel response Hdn. A processor 480 derives a downlink spatial filter matrix Mdn based on Hdn and using any of the embodiments described above. A receive (RX) spatial processor 460 performs receiver spatial processing (or spatial matched filtering) on the received data symbols from all Nut receiver units 454a through 454ut with the downlink spatial filter matrix Mdn and provides detected data symbols, which are estimates of the data symbols transmitted by access point 410. An RX data processor 470 processes (e.g., symbol demaps, deinterleaves, and decodes) the detected data symbols and provides decoded data to a data sink 472 and/or controller 480.
The processing for the uplink may be the same or different from the processing for the downlink. Data from a data source 486 and signaling from controller 480 are processed (e.g., encoded, interleaved, and modulated) by a TX data processor 488, multiplexed with pilot symbols, and possibly spatially processed by TX spatial processor 490. The transmit symbols from TX spatial processor 490 are further processed by transmitter units 454a through 454ut to generate Nut uplink modulated signals, which are transmitted via antennas 452a through 452ut.
At access point 410, the uplink modulated signals are received by antennas 424a through 424ap and processed by receiver units 422a through 422ap to generate received pilot symbols and received data symbols for the uplink transmission. A channel estimator/processor 428 processes the received pilot symbols and provides an estimate of the uplink channel response Hup. Processor 430 derives an uplink spatial filter matrix Mup based on Hup and using any of the embodiments described above. An RX spatial processor 440 performs receiver spatial processing on the received data symbols with the uplink spatial filter matrix Mup and provides detected data symbols. An RX data processor 442 further processes the detected data symbols and provides decoded data to a data sink 444 and/or controller 430.
Controllers 430 and 480 control the operation at access point 410 and user terminal 450, respectively. Memory units 432 and 482 store data and program codes used by controllers 430 and 480, respectively.
The blocks in
For a firmware or software implementation, the filter weight computation techniques may be implemented with modules (e.g., procedures, functions, and so on) that perform the functions described herein. The software codes may be stored in a memory unit (e.g., memory unit 432 or 482 in
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Number | Date | Country | |
---|---|---|---|
60691756 | Jun 2005 | US |