The combination of maximum likelihood sequence estimation (MLSE) with receiver diversity is an effective technique for achieving high performance over noisy, frequency-selective, fading channels impaired by co-channel interference. With the addition of transmitter diversity, the resulting multi-input multi-output (MIMO) frequency-selective channel has a significantly higher capacity than its single-input multi-output (SIMO) or single-input single-output (SISO) counterparts. The use of maximum likelihood multi-user detection techniques on these frequency-selective MIMO channels significantly outperforms single-user detection techniques that treat signals from other users as colored noise. However, MLSE complexity increases exponentially with the number of inputs (or transmit antennas) and with the memory of the MIMO channel, making its implementation over sever inter-symbol interference (ISI) channels very costly.
The MIMO channel can be modeled as a collection of FIR filters (i.e., an FIR filter between each input point (e.g., transmitting antenna) and each receiving point (e.g. receiving antenna), and the “memory of the channel” corresponds to the number of taps in the FIR filters.
The Discrete Matrix Multitone (DMMT) was shown to be a practical transceiver structure that asymptotically achieves the MIMO channel capacity when combined with powerful codes. It uses the Discrete Fourier Transform (DFT) to partition the frequency responses of the underlying frequency-selective channels of the MIMO systems into a large number of parallel, independent, and (approximately) memoryless frequency subchannels. To eliminate inter-block and intra-block interference, a cyclic prefix whose length is equal to the MIMO channel memory is inserted in every block. On severe-ISI MIMO channels, the cyclic prefix overhead reduces the achievable DMMT throughput significantly, unless a large FFT size is used which, in turn, increases the computational complexity, processing delay, and memory requirements in the receiver.
In short, the computation complexity increases exponentially with the number of taps in the FIR filters that may be used to model the channel.
N. Al-Dhahir and J. M. Cioffi, in “Efficiently-Computed Reduced-Parameter Input-Aided MMSE Equalizers for ML Detection: A Unified Approach,” IEEE Trans. Information Theory, pp. 903-915, May 1996, disclose use of a time-domain pre-filter in the receiver to shorten the effective channel memory and hence reduce the cyclic prefix overhead and/or the number of MLSE states. The disclosed approach, however, is for SISO systems, and not for MIMO systems.
An advance in the art is realized with a MIMO pre-filter that is constructed from FIR filters with coefficients that are computed based on environment parameters that are designer-chosen. Given a transmission channel that is modeled as a set of FIR filters with memory ν, a matrix W is computed for a pre-filter that results in an effective transmission channel B with memory Nb, where Nb<ν, where B is optimized so that Bopt=argminB trace(Ree) subject to selected constraints; Ree being the error autocorrelation function. The coefficients of W, which are sensitive to a variety of designer constraints, are computed by a processor within pre-filter at the front end of a receiver and loaded into an array of FIR filters that form the pre-filter.
where yk(j) is the signal at time k at the jth receiving antenna, hm(i,j) is the mth coefficient (tap) in the channel impulse response between the ith transmitting antenna and the jth receiving antenna, and n(j) is the noise vector at the jth receiving antenna. The memory of this path (i.e., the largest value of m for which hm(i,j) is not zero) is denoted by ν(i,j). It not unreasonable to assume, however, that the memory of the transmission channel is the same for all i,j links (ni×no such links), in which case ν(i,j)=ν. Alternatively, the ν(i,j) limit in equation (1) can be set to that ν which corresponds to maximum length of all of the ni×no channel input responses, i.e., ν=maxi,jν(i,j). All of these variables in equation (1) are actually I×1 column vectors, corresponding to the I time samples per symbol in the oversampled
where Hm is the MIMO channel coefficients matrix of size noI×ni, xk−m is a size ni×1 input (column) vector, and nk is a size noI×1 vector.
Over a block of Nf symbol periods, equation (2) can be expressed in matrix notation as follows:
or, more compactly,
yk+N
The subscripts in equation (4) indicate a range. For example k+Nf−1:k indicates the range k+Nf−1 and k, inclusive.
It is useful to define the following correlation matrices:
Rxy≡E[xk+N
Ryy≡E[yk+N
Rxx≡E[xk+N
Rnn≡E[nk+N
It is assumed that these correlation matrices do not change significantly in time or, at least, do not change significantly over a time interval that corresponds to a TDMA burst (assumed to be much shorter than the channel coherence time), which is much longer than the length of the pre-filter, in symbol periods denoted by Nf. Accordingly, a re-computation of the above matrices, and the other parameters disclosed herein, leading to the computation of pre-filter coefficients, need not take place more often than once every TDMA burst.
Once H, Rxx and Rnn are known, Rxy and Ryy are computed by RxxH* and HRxxH*+Rnn, respectively.
Given the MIMO channel matrix H with ν+1 members (H0, H1, . . . Hν), the objective is to create a MIMO pre-filter W (element 30 in
The matrix B can be expressed as B≡[B0 B1 . . . BN
The MIMO channel-shortening pre-filter W (element 30) is conditioned, or adjusted, to minimize the equalization Mean Squared Error (MSE), defined by MSE≡trace(Ree), where Ree is the autocorrelation matrix of the error vector Ek that is given by
Ek={tilde over (B)}*xk+N
where the augmented MIMO matrix, {tilde over (B)}*, is
{tilde over (B)}*≡[0n
Δ is the decision delay that lies in the range 0≦(Nf+ν−Nb−1), and s≡Nf+ν−Nb−Δ−1. The ni×ni error autocorrelation function Ree can be expressed by the following:
Ree≡E[EkE*k]
={tilde over (B)}*(Rxx−RxyRyy−1Ryx){tilde over (B)}
={tilde over (B)}*R⊥{tilde over (B)}
={tilde over (B)}*
where
Using the orthogonality principle, which states that E[Eky*k+N
W*opt={tilde over (B)}*optRxyRyy−1
={tilde over (B)}*optRxxH*(HRxxH*+Rnn)−1
={tilde over (B)}*opt(Rxx−1+H*Rnn−1H)−1H*Rnn−1. (12)
The last line shows explicitly that the MIMO channel-shortening pre-filter consists of a noise whitener Rnn−1, a MIMO matched filter H*, and a bank of FIR channel-shortening pre-filter elements.
It remains to optimize {tilde over (B)} such that the MSE is minimized, which may be obtained by computing the parameters of B that, responsive to specified conditions, minimizes the trace (or determinant) of Ree. The following discloses two approaches to such optimization.
Under one optimization approach the coefficients of B are constrained so that some coefficient of B is equal to the identity matrix, I. A solution subject to this Identity Tap Constraint (ITC) can be expressed by
BoptITC=argminBtrace(Ree) subject to B*φ=In
where φ*≡[0n
BoptITC=
Ree,minITC=(φ*
As indicated above,
Under a second optimization approach the imposed constraint is B*B=In
BoptONC=argminBtrace(Ree) subject to B*B=In
Defining the eigen-decomposition
where σ0≧σ1 . . . σn
BoptONC=U[en
where ei is unit vector with a 1 at position i, and 0's elsewhere, and
Ree,minONC=diag(σn
Illustratively, if ni=3 and Nb=3, BoptONC=U[e9,e10,e11], meaning that BoptONC is a three column matrix comprising the 9th through the 11th columns of matrix U. Stated in words, the optimum MIMO target impulse response matrix is given by the ni eigenvectors of
With the above analysis in mind, a design of a prefilter 30 can proceed for any given set of system parameters, which includes:
The structure of filters 210 is shown in
Filter section 210 in the
Processor 220 is responsive to the no signals received by antennas 21 and sampled by circuit 20, and it computes the coefficients of W, as disclosed above. W0 is a matrix that defines the coefficients in the 0th tap of the j×i FIR filters, W1 is a matrix that defines the coefficients in the 1st tap of the j×i FIR filters, etc.
The method of developing the parameters of pre-filter 30, carried out in processor 220, is shown in
Following step 100, step 110 determines the matrices, Rnn, Rxx, Rxy, and Ryy. The matrix Rnn is computed by first computing n=y−Hx and then computing the expected value E[n*n]—see equation (8) above. The matrix Rxx is computed from the known training sequences—see equation (7) above—(or is pre-computed and installed in processor 220). In may be noted that for uncorrelated inputs, Rxx=I. The matrices Rxy and Ryy are computed from the known training sequences and the received signal or directly from H and Rnn—see equations (5) and (6) above.
Following step 110, step 120 computes R⊥=Rxx−RxyRyy−1Ryx, and the sub-matrix
In accordance with the ITC approach, selecting some value of 0≦m≦Nb allows completion of the design process. Accordingly, following step 120, step 130 chooses a value for m, develops φ-≡[0n
In accordance with the ONC approach, step 130 computes the matrix U in a conventional manner, identifies the unit vectors ei, and thus obtains the matrix B. Step As with the ITC approach, step 140 develops the coefficients of matrix W in accordance with equation (12), and installs the developed coefficients within filter 210.
It should be understood that a number of aspects of the above disclosure, for example, those related to the ITC constraint and to the ONC constraint, are merely illustrative, and that persons skilled in the art may make various modifications that, nevertheless, are within the spirit and scope of this invention. For example, the pre-filter described above generates a multi-output signal, with the number of outputs being ni, that being also the number of transmitting antennas 11. This, however, is not a limitation of the principles disclosed herein. The number of pre-filter outputs can, for example, be larger than ni, for example as high as ni(Nb+1). The performance of the receiver will be better with more filter outputs, but more outputs require more FIR filters, more FIR filter coefficients, and correspondingly, a greater processing power requirement placed on processor 220.
This application is a Continuation application of Ser. No. 09/668,199, now U.S. Pat. No. 7,027,536, filed Sep. 22, 2000, which claims the benefit of Provisional application No. 60/158,713 filed on Oct. 8, 1999. This application is also related to a Provisional application No. 60/158,714, also filed on Oct. 8, 1999.
Number | Name | Date | Kind |
---|---|---|---|
4561012 | Acampora | Dec 1985 | A |
5034752 | Pourailly et al. | Jul 1991 | A |
5465396 | Hunsinger et al. | Nov 1995 | A |
5499272 | Bottomley | Mar 1996 | A |
5539832 | Weinstein et al. | Jul 1996 | A |
5715282 | Mansouri et al. | Feb 1998 | A |
5717619 | Spurbeck et al. | Feb 1998 | A |
6127971 | Calderbank et al. | Oct 2000 | A |
6219561 | Raleigh | Apr 2001 | B1 |
6314147 | Liang et al. | Nov 2001 | B1 |
6353630 | Schreib | Mar 2002 | B1 |
6470192 | Karlsson et al. | Oct 2002 | B1 |
6870882 | Al-Dhahir et al. | Mar 2005 | B1 |
Number | Date | Country |
---|---|---|
538218 | Apr 1993 | EP |
Entry |
---|
Emmanuel C. Ifeachor, Barrie W. Jervis, Digital Signal Processing A Practical Approach, Addison-Wesley Publishing, 1993. |
Number | Date | Country | |
---|---|---|---|
60158713 | Oct 1999 | US | |
60158714 | Oct 1999 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 09668199 | Sep 2000 | US |
Child | 11227626 | US |