This invention relates to methods of wireless communication using multiple antennas.
It has long been known that, at least in theory, the capacity of a wireless communication system can be increased by employing more than one antenna at the transmitter, at the receiver, or at both the antenna and the receiver.
Various multiple-antenna systems have been considered. Among them is a class of systems referred to as “Multiple Input-Multiple Output” (MIMO) systems. In typical MIMO systems, the transmission of a block of message information can be distributed across an array of two or more antennas, and across two or more discrete time intervals, which are sometimes referred to as “transmit symbol intervals” or “channel uses”.
The above-described distribution across multiple antennas may be thought of as distributing the transmitted signal across “space,” and the distribution over multiple time intervals may be thought of as distributing the transmitted signal across “time.” The signal which is to be transmitted is described by a matrix which, accordingly, is referred to as a “spacetime” signal matrix.
In one common form of notation, a typical spacetime signal matrix is T×M, wherein each of the T rows represents a distinct transmit symbol interval and each of the M columns represents a distinct transmit antenna. Within each row, the entry at each of the M column positions is a complex number representing a baseband-level signal value that is to be modulated onto a carrier and transmitted from its respective antenna during the transmit signal interval corresponding to that row.
Each row of the spacetime signal matrix is referred to as a “transmit symbol vector.” Each transmit symbol vector may be though of as a row vector containing complex-valued entries. Depending on the specific coding scheme, each of these complex-valued entries may, e.g., be a scalar symbol from a symbol constellation, or a sum of such symbols.
The Alamouti spacetime code is a particular MIMO coding scheme that has received much favorable attention. The Alamouti code works best when the signals from the respective transmit antennas are uncorrelated.
Another way to take advantage of multiple-antenna transmission is by the well-known method of beamforming. Beamforming works best when there is strong correlation among the signals from the respective transmit antennas.
There remain opportunities to improve system performance in the intermediate regime in which there is a moderate amount of correlation among the transmission antennas of a MIMO system.
We have devised a new method which provides some advantages of the Alamouti code but which can be adapted so as to provide enhanced performance for a given amount of antenna correlation.
In one broad aspect, our method involves mapping a block of complex scalar input symbols to a block-code matrix S. The matrix S is a weighted sum of dispersion matrices, in which the dispersion matrices consist of an A-matrix and a B-matrix for each input symbol, and the weights for each A and B matrix are respectively derived from the real and imaginary parts of the corresponding input symbol. The method further involves transmitting the matrix S as a spacetime signal matrix from an array of two or more antennas during two or more transmit symbol intervals. Each A matrix and each B matrix is determined, in part, by an adjustable parameter which relates to a degree of beam-forming behavior by the transmission antenna array.
In another broad aspect, our method involves decoding a block of complex scalar input symbols from a spacetime signal received on an array of two or more receive antennas from an array of two or more transmit antennas. The method includes obtaining a value α of an adjustable parameter which relates to a degree of beam-forming behavior by the transmission antenna array, and using the value α to recover individual complex scalar symbols that belong to the block.
Our method involves a type of MIMO coding scheme known as a linear dispersion code. When linear dispersion codes are used, the spacetime signal matrix S is referred to as a “block-code matrix.” The block-code matrix S is constructed, for example, from Q≧1 complex scalar input symbols s1, . . . , sQ as a weighted sum of dispersion matrices Aq and Bq, q=1, . . . , Q, according to the formula,
In the above formula, {x} and ℑ{x} respectively designate the real and imaginary parts of x, and j2=−1. The code is defined by specifying the A and B matrices, i.e., the dispersion matrices Aq and Bq.
In an exemplary embodiment of our new method, there are two transmit antennas, Q=2, the dispersion matrices include a variable phase shift ejφ of the second transmit antenna relative to the first, and the dispersion matrices further include a code adaptation parameter α. As will be seen below, the parameter α expresses a measure of how much beamforming behavior will be exhibited by the transmit antenna array.
Our exemplary dispersion matrices are defined by:
The range of possible values of α is 0≦α≦1. In the extreme cases, the code reduces to the well-known Alamouti code (α=0), or to a beamforming scheme (α=1).
That is, let x* represent the complex conjugate of x, define z1={s1}+{s2}, and define z2=−jℑ{s1}+jℑ{s2}. Then for the extreme values of α, S reduces to the following:
It is well known from information theory that wireless transmission is limited by a tradeoff between spectrum efficiency and power efficiency. Roughly speaking, the spectrum efficiency is the maximum supportable data rate per available bandwidth and the power efficiency is the energy per bit that needs to be expended in transmission. A transmission scheme can be improved by increasing the data rate at a given power efficiency, by reducing the energy expended in transmission at a given data rate, or by some combination of the two.
Let R represent the code rate of the outer code, and let Eb/N0 represent the energy per bit, per noise-power density. The pair (R, Eb/N0) controls the tradeoff mentioned above. That is, each pair (R, Eb/N0) will result in a certain block error rate in the outer code. By adjusting various parameters of the transmission scheme, it might be possible to maximize R for a given block error rate, or to minimize Eb/N0 for a given value of R, or to achieve some optimal value for the pair.
In the instant case, the parameters that may be adjusted in this manner are α and ejφ.
The phase shift φ is very important for the beamforming component of our transmission scheme, because it affects the direction and the shape of the beam. In a single-user environment, i.e. without interference, a “good” choice of φ is that which maximizes the received energy at the receive antennas.
More specifically, the received signal will be a superposition of the two or more transmit signals. In the optimum case for beamforming, absent the imposed phase shift φ, the signals received from the respective transmit antennas will differ at the receive antenna only by a phase shift caused by the difference in propagation delay. The optimum φ will compensate this phase shift and lead to coherent combining of the two transmit signals at the receive antenna, thereby maximizing the received energy.
The optimal value for φ will depend on the angular position of the user as viewed from the boresight of the transmit antenna array. For a mobile user, it is desirable to adapt the value of φ as the angular position of the user changes.
It should be noted in this regard that our transmission scheme is not limited in its applications to the downlink only, but instead can also be applied in the uplink. It should be noted further that the receiver may have a single antenna, or a multiple-antenna array.
When our transmission scheme is applied, e.g., to downlink transmissions, there will be an optimum value of α which depends on the signal to interference and noise ratio (SINR) and on the covariance matrix of the received signals in the uplink. If the antennas that receive on the uplink are identical or equivalent to the antennas that transmit on the downlink, then the correlation between the antenna signals can be measured in the uplink and applied to the transmit scheme in the downlink.
Various methods are available for obtaining a value for φ. For example, any of various well-known tracking algorithms may be used to identify an angular position of the user, or, for the uplink, to identify an angular position of the base station relative to the user.
The code adaptation parameter α may be initially estimated as a value in the range (0,1), for example as 0.5. The initial estimate may be refined using an iterative estimation procedure that may, for example, use feedback information provided by the receiver. The parameter α will typically change relatively slowly, and therefore obtaining a useful estimate will generally be unlikely to involve an excessive amount of network overhead. In general, it will be preferable to obtain at least an initial value of φ before attempting to optimize α.
Thus, with reference to
To receive signals that were transmitted as described above, it is useful to employ minimum mean square estimation (MMSE) in a manner which incorporates knowledge of the channel coefficients for signal propagation between each transmit antenna and each receive antenna. Methods for estimating and communicating the channel coefficients between the respective transmit and receive antennas are well known and need not be described here in detail. An estimate of ejφ may be made inherently, because this value can be treated as subsumed into the channel coefficients. A explicit value for α is advantageously obtained by the receiver. This may be done, e.g., through explicit signaling from the transmitter, or through a blind iterative estimation procedure.
The type of MMSE known as Widely Linear Minimum Mean Square Estimation (WL-MMSE) is especially useful in this regard. WL-MMSE is designed to improve the estimation of complex signals by using complementary information available when the transmitted signal is not circular, i.e. when the signal is rotationally variant. As will be understood from the above discussion, such complementary information may be available in our transmission scheme, at least for relatively small values of α.
The WL-MMSE detector is well known. Very briefly, a scalar random variable y is to be estimated in terms of an observation represented by a random vector x. The estimate of y is designated ŷ. The WL-MMSE problem is to find the vectors u and v such that the estimate
ŷ=U
H
x+V
H
x*
gives the least mean-square error. In the above formula, UH denotes the Hermitian conjugate of U and VH denotes the Hermitian conjugate of V.
The solution is given by:
U=[Γ−CΓ
−1
*C*]
−1
[P−CΓ
−1
*S*]
V=[Γ*−C*Γ
−1
C]
−1
[S*−C*Γ
−1
P]
γ=E[xxH]
C=E[xxT]
P=E[xyT]
S=E[xyT]
In the preceding expressions, E[•] denotes expectation value, and xT is the matrix transpose of x.
In the WL-MMSE receiver as applied to our transmission scheme, the channel coefficients and the parameters α and φ will be subsumed into the definition of the correlation matrices Γ, C, P, and S in a manner that will be well understood by those skilled in the art. When received signals are processed in the WL-MMSE receiver, the original block of data is inherently recovered from the block-code matrix via the cross-correlation matrices P and S.
Thus, with reference to
We conducted numerical simulations to examine the performance of our transmission method. In our model, there were 2 transmit and 2 receive antennas, Q=2, and modulation was QPSK. We made the following assumptions: The wireless channel was non-dispersive, i.e. we assumed a flat fading channel, and the noise component was modeled as additive white Gaussian noise (AWGN). One well-known measure of performance is the Gallager Error-Exponent Er(R), where R is the code rate and r is the correlation coefficient of the negative exponential correlation model introduced by V. A. Aalo in 1995. This exponent is known to be a measure for the reliability of the transmission without limiting the analysis to a specific forward error correction code. A special case of the Gallager Error-Exponent is the so-called cutoff rate E0(1). The cutoff rate can be used as a lower bound for the Gallager Error-Exponent in the sense that Er(R)≧E0(1)−R.