The present invention relates to a system and method for wireless communication and in particular embodiments relates to a system and method for multiple-input multiple-output communication.
Multiple-input multiple-output (MIMO) communication systems provide a spectrally-efficient means for communicating over wireless channels. Such systems can be classified into coherent systems in which the receiver has access to reliable channel state information (CSI) and non-coherent systems in which this information is not available at either the transmitter or the receiver. Coherent systems are more straightforward to design than their non-coherent counterparts. However, the analysis of coherent systems usually does not account for the cost of the resources that have to be expended to acquire reliable CSI. Ignoring this cost in static and slow fading scenarios is generally tolerable. Ignoring it in fast fading scenarios, which arise in wireless systems with high mobility, can be rather misleading. A mobile device that is traveling at 60 miles per hour has a fading coherence of about 3 ms. If the mobile system is operated at 1.9 GHz and has a symbol rate of 30K symbols per second, 3 ms allows for 50-100 symbol periods. If several training symbols per antenna are needed, only a few antennas may be trained during a fading coherence period.
In order to alleviate the cost of acquiring reliable CSI in fast fading scenarios, it is often believed to be desirable to use non-coherent signaling strategies, that is, strategies that do not require the receiver to have access to CSI. It has been shown that, for spatially-white MIMO channels with independent identically distributed (i.i.d.) block Rayleigh fading coefficients, input matrices that achieve the ergodic capacity can be expressed in the form of an isotropically distributed unitary component and a diagonal component with non-negative entries (Hochwald and Marzetta, “Unitary space-time modulation multiple-antenna communications in Rayleigh flat fading,” IEEE Trans. Inf. Theory, vol. 46, pp. 543-564, March 2000, which is hereby incorporated herein in its entirety by reference). A closed form expression for the asymptotically high signal-to-noise ratio (SNR) ergodic capacity of this type of channel has been determined for the cases where the coherence time of the channel exceeds and is below a certain threshold. When the coherence time of the channel exceeds the threshold, it was shown that, for sufficiently high SNRs, it is optimal, from a capacity perspective, for the diagonal component to be a scaled identity matrix and for the unitary matrix to be isotropically distributed on the so-called Grassmann manifold and to have the number of antennas transmitting less than or equal to half the number of symbol durations spanned by the coherence interval of the channel. Various techniques for designing rate-efficient Grassmannian constellations have been shown (L. Zheng and D. N. C. Tse, “Communication on the Grassmann manifold: A geometric approach to the noncoherent multiple-antenna channel,” IEEE Trans. Inf. Theory, vol. 48, pp. 359-383, February 2002, which is hereby incorporated herein in its entirety by reference). For the case where the SNR is below the threshold, the unitary component was shown to have the same distribution as when above the threshold, but the optimal distribution for the diagonal component was shown to be that of the square root of the eigenvalues of a beta-distributed random matrix. In contrast with the high SNR cases, for low SNRs, it has been shown that the optimal diagonal component has at most one non-zero entry at any given time.
The proximity of antennas to each other in practical wireless systems renders the realization of spatially-white channels generally difficult, even when the distance between antennas exceeds multiple wavelengths. This proximity induces correlation between the random entries of the channel matrix at the transmitter and receiver sides. A convenient means for characterizing this correlation mathematically is the so-called Kronecker model, whereby the spatially-correlated channel matrix is represented by left and right multiplication of a spatially-white channel matrix with transmitter and receiver covariance matrices, respectively. Fortunately, these matrices are dominated by the locations, geometries and beam patterns of the antennas at the receiver and the transmitter. Being characteristic of the wireless devices on which the antennas are mounted, these matrices can be accurately estimated and made available to the transmitter and the receiver. The impact of these matrices on the proper signaling methodology and the achievable rate can be quite significant. For the case in which the correlation between the channel coefficients is described by the Kronecker model, it has been shown that for spatially white channels, increasing the number of transmit antennas is almost surely beneficial. It has also been shown that the optimal input covariance can be expressed as the product of an isotropically distributed unitary component, a diagonal component with non-negative entries and a deterministic component comprising the eigenvectors of the transmitter covariance matrix.
An embodiment of the disclosure is a communication system including a plurality of antennas. A precoder is included to combine data to be transmitted with a coding matrix to form transmission symbols. The coding matrix is formed by combining a Grassmannian matrix comprising a distribution on a Grassmannian manifold with a function of a transmitter covariance matrix. A transmitter is connected to the plurality of antennas and is configured to transmit the transmission symbols.
The coding matrix may be formed according to the formula
X=QXDUA†
where, QX is an isotropically distributed unitary matrix; D is a random diagonal matrix with non-negative entries; and UA is a matrix containing the eigenvectors of the transmitter covariance matrix, in this case the † indicates that it is the conjugate transpose of that matrix. The random diagonal matrix may be determined according to the formula
where, T is the coherence time, P is the transmit power, Tr is the trace function; and ΛA is a matrix composed of the eigenvalues of the transmitter covariance matrix.
Another embodiment is a communication system including a base station with a plurality of antennas configured to transmit to a plurality of mobile receivers. The base station includes a precoder that combines data to be transmitted with a coding matrix to produce transmission symbols. The coding matrix is formed by combining a Grassmannian matrix comprising a distribution on a Grassmannian manifold with a function of a transmitter covariance matrix. A transmitter connected to the plurality of antennas and configured to transmit the transmission symbols.
The coding matrix may be formed according to the formula
X=QXDUA†
where, QX is an isotropically distributed unitary matrix; D is a random diagonal matrix with non-negative entries; and UA is a matrix containing the eigenvectors of the transmitter covariance matrix, in this case the † indicates that it is the conjugate transpose of that matrix. The random diagonal matrix may be determined according to the formula
where, T is the coherence time, P is the transmit power, Tr is the trace function; and ΛA is the eigenvalues of the transmitter covariance matrix.
Another embodiment is a communication method including combining data to be transmitted wirelessly using a plurality of antennas with a coding matrix formed by combining a Grassmannian matrix comprising a distribution on a Grassmannian manifold with a transmitter covariance matrix to produce transmission symbols, and then transmitting the transmission symbols.
The coding matrix may be formed according to the formula
X=QXDUA†
where, QX is an isotropically distributed unitary matrix; D is a random diagonal matrix with non-negative entries; and UA is a matrix containing the eigenvectors of the transmitter covariance matrix, in this case the † indicates that it is the conjugate transpose of that matrix. The random diagonal matrix may be determined according to the formula
where, T is the coherence time, P is the transmit power, Tr is the trace function; and ΛA is the eigenvalues of the transmitter covariance matrix.
For a more complete understanding of the present invention, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawing, in which:
The structure, manufacture and use of the presently preferred embodiments are discussed in detail below. It should be appreciated, however, that the present invention provides many applicable inventive concepts that can be embodied in a wide variety of specific contexts. The specific embodiments discussed are merely illustrative of specific ways to make and use the invention, and do not limit the scope of the invention.
In the following discussion, random and deterministic matrices will be denoted by boldface and regular face upper case letters, respectively. The m×m identity matrix will be denoted by Im, and the standard O(●) notation will be used to imply that g1(x)=O(g2(x)) if g1(x)/g2(x)→a as x→∞, a is a constant.
A frequency-flat block Rayleigh fading channel in which the signals emitted from the transmit antennas are correlated and the signals impinging on the receiver antennas are also correlated can be used as a preliminary model. This situation arises in practice when the antenna elements are not sufficiently spaced for their signals to be statistically independent. Denoting the number of transmit and receive antennas by M and N, respectively, the channel matrix can be expressed as H=XA1/2HwB1/2, where AεCM×M and BεCN×N are the Hermitian positive semi-definite transmit and receive covariance matrices, respectively, and HwεCM×N is a random matrix with zero-mean unit-variance i.i.d. circularly-symmetric complex Gaussian entries. To avoid degenerate scenarios, it can be assumed that both A and B are full rank, and other cases will be alluded to as necessary.
In the block fading model, the channel assumes a statistically independent realization over each block of T channel uses and remains essentially constant within each block. This model is realistic in perfectly interleaved frequency-hopping systems and is useful in characterizing high mobility scenarios in current Long Term Evolution (LTE) cellular systems in which time-frequency resource blocks are assigned to each user during a coherence interval. In this case, the received signal matrix can be expressed as
Y=XA1/2HwB1/2+V (1)
where XεCT×M represented the transmitted signal matrix, and YεCT×N represents the additive noise matrix; the entries of V are i.i.d. zero mean unit variance circularly-symmetric complex Gaussian random variables. The matrices A and B are normalized so that Tr(A)=Tr(B)=1, and, assuming that the power budget of the transmitter P, there is the following power constraint:
E{Tr(XX†)}≦TP (2)
The matrices A and B can be estimated from the physical properties of the propagation environment and the antenna pattern, and can hence be assumed to be known. The communication scenario that is considered is non-coherent, thereby the matrix Hw in (1) is not known to either the transmitter or the receiver.
Conditioned on X=X, the matrix Y in (1) is Gaussian distributed with
vec(Y)=(B1/2XA1/2)vec(Hw)+vec(V) (2.5)
where vec(●) is the operator that stacks the columns of the matrix argument on top of each other. Now the covariance matrix
E{vec(Y)vec†(Y)|X}=BXAX†+INT (3)
Hence, conditioned on X=X, the probability density function (PDF) of Y can be expressed as
Using this expression it is straightforward to verify that for any deterministic T×T unitary matrix Φ, p(ΦY|ΦX=ΦX). This along with Lemma 1 in Hochwald et al. (B. M. Hochwald and T. L. Marzetta, “Unitary space-time modulation multiple-antenna communications in Rayleigh flat fading,” IEEE Trans. Inf. Theory, vol. 46, pp. 543-564, March 2000, which is hereby incorporated herein in its entirety by reference) can be combined to show that the structure of the matrix X that achieves the ergodic non-coherent capacity of the channel in (1) can be expressed as
X=QXDUA† (5)
Where QX is an isotropically distributed unitary matrix fulfilling the model of a Grassmannian manifold (See Absil et al. Riemannian Geometry of Grassmann Manifolds with a View on Algorithmic Computation. Acta Applicandae Mathematicae, 80(2):199-220, 2004, http://www.cis.upenn.edu/˜cis515/Diffgeom-Gras smann.Absil.pdf, which is hereby incorporated herein in its entirety by reference), D is a random diagonal matrix with non-negative entries and UA is the matrix containing the eigenvectors of A, i.e. A=UAΛAUA†.
An implementation of equation (5) can be used in an embodiment to provide ergodic capacity (or near ergodic capacity) in a non-coherent transmission system.
In the current discussion, result of Equation (5) is used as a basis upon which an asymptotically tight expression for the ergodic high SNR non-coherent capacity of the channel model in (1), i.e. the capacity as P goes to infinity, is derived.
For a given power budget P in (2), let C(P) denote the ergodic non-coherent capacity of the channel in (1). It can then be written:
C(P) can be evaluated as P goes to infinity.
Evaluating h(Y|X)
To evaluate the second term in (6) equation (3) can be used along with the fact that, condition on X, Y, is Gaussian distributed to write:
where in writing (7) invokes (5) and the eigen properties of the Kronecker product and used λA
As the transmit power goes to infinity, it can be seen that (7) can be expressed as
This approximation is valid when D is full rank and its entries scale with P. This condition will be shown to hold for the matrix D to achieve the lower bound on the high SNR ergodic capacity derived below when the number of transmit antennas is less than or equal to the number of receive antennas, i.e., M≦N and the transmit power, P, is sufficiently large. For simplicity, the remainder of this discussion will focus on the case of M=N.
Asymptotic Bound on h(Y)
A high SNR approximation of the first term in (6) is now used. Additionally, because the focus is on the case of M=N, the results in section IIIB of Zheng et al. (L. Zheng and D. N. C. Tse, “Communication on the Grassmann manifold: A geometric approach to the noncoherent multiple-antenna channel,” IEEE Trans. Inf. Theory, vol. 48, pp. 359-383, February 2002, which is hereby incorporated herein in its entirety by reference) are also used.
As P goes to infinity, the differential entropy, h(Y), is dominated by the differential entropy of X
h(Y)=h(XA1/2HwB1/2)+O(1/P) (8.5)
Expressing the matrix A in terms of its eigen decomposition, and noting that Hw is isotropically distributed, it can be written that HwUAtHw where “” denotes equality in distribution. Using this fact and invoking (5) yields
h(XA1/2HwB1/2)=h(QXDΛA1/2HwB1/2)=h((B1/2IT)vec(QXDΛA1/2Hw)) (8.75)
Using corollary 9.6.4 from Cover et al. (T. M. Cover and J. A., Thomas, Elements of Information Theory, New York: Wiley, 1991, which is hereby incorporated herein in its entirety by reference), allows
h((B1/2IT)vec(QXDΛA1/2Hw))=h(QXDΛA1/2Hw)+T log detB (9)
To evaluate h(QXDΛA1/2Hw) one can follow the methodology in Zheng et al. for the case of M=N to express this entropy in the coordinate system corresponding to the QR-decomposition. (As used herein, the QR decomposition (also called a QR factorization) is a linear algebra operation that is a decomposition of a matrix A into a product A=QR of an orthogonal matrix Q and an upper triangular matrix R.) In particular, the matrix QXDΛA1/2Hw can be expressed as QXR, where R is a random upper triangular matrix. Since QX is isotropically distributed, the matrix QXDΛA1/2Hw QXΨDΛA1/2Hw where Ψ is an isotropically distributed M×M unitary matrix. Now, the matrix ΨDΛA1/2Hw can be expressed as ΨR. Note that if the matrix DΛA1/2 is full rank, then so DΛA1/2Hw. It will later be shown that this assumption holds and use the QR decompositions to write
h(QXDΛA1/2Hw)=h(QX)+h(R)+E{log J1} (10)
where J1 is the Jacobian of the transformation from Cartesian to QR coordinates and is given by Zheng et al., J1=Πi=1MRii2(T−i)+1. Similarly,
h(ΨDΛA1/2Hw)=h(Ψ)+h(R)+E{log J2} (11)
where J=Πi=1MRii2(M−i)+i.
Substituting for h(R) from (11) into (10) yields:
where in writing the second equality the fact that QX and Ψ are isotropically distributed is used, whence h(QX)−h(Ψ) is equal to the logarithm of the volume of the Grassmann manifold,
To obtain an upper bound on h(ΨDΛA1/2Hw), it is noted that the covariance
E{(IMΨDΛA1/2)vec(Hw)vec†(Hw)(IMΛA1/2DΨhu †)}=IME{ΨD2ΛAΨ†} (13)
To compute the last expectation the following result is used from Hiai et al. (F. Hiai and D. Petz, “Asymptotic freeness almost everywhere for random matrices,” Acta. Sci. Math. (Szeged), vol. 66, pp. 809-834, 2000) and Tulino et al. (A. M. Tulino and S. Verd'u, “Random matrix theory and wireless communications,” Found. Trend. Commun. Inf. Theory, vol. 1, no. 1, pp. 1-182, 2004, which is hereby incorporated herein in its entirety by reference).
Lemma 1:
For any isotropically distributed unitary matrix
where δik is the Kronecker delta.
Using this result yields,
Whence an upper bound on h(ΨDΛA1/2Hw) can be readily obtained by applying Theorem 9.6.5 in Cover et al., which yields:
and equality holds if and only if ΨDΛA1/2Hw is Gaussian and distributed with covariance
In other words, equality holds if and only if the entries of the matrix are ΨDΛA1/2Hw i.i.d. zero mean Gaussian random variables with variance
Upper and Lower Bounds on Capacity
Upper Bound:
To obtain an upper bound on capacity, (2) is used to bound the right hand side of (15). In particular, let λA
Notice that this bound is not achievable unless A and E{D2} are scaled identities.
Using (16) with (12) and (9) yields
Using this bound with (8) and the fact that M=N, yields
When T≧2M, C(P) can be further bounded by using Jensen's inequality and the fact that log det(●) is a concave function. Doing so, yields E{log detD2}≦log det E{D2}, with equality if and only if D is deterministic. Subject to the power constraint in (2), the upper bound of this inequality can be further bounded by
Hence,
Lower Bound:
To obtain a lower bound on capacity, recalling that, for an arbitrary distribution of D, the right hand side of (14) is achievable if and only if the entries of the matrix ΨDΛA1/2Hw are i.i.d. Gaussian distributed. This condition is satisfied if the matrix D is deterministic and given by
c.f. (2). Substituting this value of D in (15) yields
Thereby, at setting of D, yields
Since this setting of D is not necessarily optimal, it can be written
Bounds on Gap to Capacity
Comparing (18) to (19), it can be seen that setting
yields a gap to capacity of:
It should be noted that when
Δ=0. Otherwise Δ is strictly greater than zero, and an upper bound on it can be derived as follows. Let κ be the condition number of A; i.e. κ=λA
To obtain a bound on Δ, note that, because λA
Hence, it has been shown that the upper and lower bounds obtained on capacity are within a gap proportional to the logarithm of the condition number of the transmitter covariance matrix, and that that gap does not depend on the SNR or the receiver covariance matrix.
Theorem 1
For the communication scenario described in (1) and (2), when T≧2M and M=N, the ergodic high SNR capacity C(P) satisfies:
where
Furthermore, the lower bound on C(P) is achieved by input signal matrices of the from in (5) with
Effect of Transmitter Covariance Condition Number on Achievable Rate
The rate on the right hand side of (19) is achievable using the strategy outlined in Theorem 1. That theorem shows that this rate is within
of the channel capacity. To investigate the effect of increasing κ on this rate, derive upper and lower bounds on that rate and it will shown that both bounds decrease with κ, indicating that high correlation at the transmitter can be deleterious to the achievable rate if the signaling strategy of Theorem 1 were to be used.
The transmitter covariance affects the right hand side of (19) through log TrΛA−1. Hence, upper and lower bounds on the achievable rate can be derived by obtaining lower and upper bounds on log TrΛA−1, respectively.
Lemma 2
Any positive definite matrix A, satisfies
Proof: To prove the first inequality, write:
To prove the second inequality write
This inequality holds with equality when M=2 and with strict inequality when M>2 and the eigenvalues of A are distinct.
It can be verified that both inequalities are monotonically increasing in κ. Hence, using the result of this lemma in the right hand side of (19), it can be seen that both the lower and upper bounds on the rates achievable by the strategy outlined in Theorem 1 are monotonically decreasing with the condition number of the transmitter covariance matrix, which indicates that the more ill-conditioned this matrix is, the less the rate that can be achieved with this strategy. This result has an intuitive explanation. The signaling strategy in Theorem 1 allocates more power to the weak eigenmodes of the channel. Hence, when the channel matrix is ill-conditioned, most of the power is allocated to that mode, thereby affecting the power available for transmission over the strong eigenmodes of the channel.
In the above discussion the high-SNR ergodic capacity of spatially-correlated MIMO channels was discussed. Closed form expressions for a lower and an upper bound on this capacity were obtained and it was shown that difference between these bounds is monotonically decreasing in the condition number of the transmitter covariance matrix. In particular, an expression for the high-SNR ergodic capacity of the considered channel that is tight with a constant that depends solely on the signaling dimension and the condition number of the transmitter covariance matrix is obtained.
The foregoing embodiments are merely intended for describing the technical solutions of the present invention other than limiting the present invention. Although the present invention is described in detail with reference to the foregoing embodiments, persons of ordinary skill in the art should understand that they may still make modifications to the technical solutions described in the foregoing embodiments or make equivalent replacements to some technical features thereof, without departing from the scope of the technical solutions of the embodiments of the present invention.
This application claims the benefit of U.S. Provisional Application No. 61/991,399, filed on May 9, 2014, entitled “The Ergodic High SNR Capacity of the Kronecker Spatially-Correlated Non-Coherent MIMO Channel Within an SNR-Independent Gap,” which application is hereby incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
6016328 | Caire | Jan 2000 | A |
20060161432 | Zhang | Jul 2006 | A1 |
20110019755 | Lee | Jan 2011 | A1 |
20110150113 | Oyman | Jun 2011 | A1 |
20130156075 | Shirani-Mehr | Jun 2013 | A1 |
Number | Date | Country |
---|---|---|
101207464 | Jun 2008 | CN |
101277172 | Oct 2008 | CN |
102075304 | May 2011 | CN |
102130866 | Jul 2011 | CN |
2627050 | Aug 2013 | EP |
Entry |
---|
Absil, et al., “Riemannian Geometry of Grassmann Manifolds with a View on Algorithmic Computation,” http://www.cis.upenn.edu/˜cis515/Diffgeom-Grassmann.Absil.pdf; Acta Applicandae Mathematicae, 80(2):199-220, Jan. 2004. |
Gohary, R.H., et al., “Non-Coherent MIMO Communication: Grassmannian Constellations and Efficient Detection,” IEEE Transactions on Information Theory, vol. 55, No. 3, pp. 1176-1205, Mar. 2009. |
Hiai, F., et al., “Asymptotic Freeness Almost Everywhere for Random Matrices,” Acta. Sci. Math. (Szeged), vol. 66, pp. 809-834, 2000. |
Hochwald, B.M., et al., “Unitary Space-Time Modulation Multiple-Antenna Communications in Rayleigh Flat Fading,” IEEE Trans. Inf. Theory, vol. 46, pp. 543-564, Mar. 2000. |
Jafar, S.A., et al., “Multiple-Antenna Capacity in Correlated Rayleigh Fading with Channel Covariance Information,” IEEE Transactions on Wireless Communications, vol. 4, No. 3, pp. 990-997, May 2005. |
Tulino, A.M., et al. “Random Matrix Theory and Wireless Communications,” Found. Trend. Commun. Inf. Theory, vol. 1, No. 1, pp. 1-182, 2004. |
Yu, K., et al., “Modeling of Wide-Band MIMO Radio Channels Based on NLoS Indoor Measurements,” IEEE Transactions on Vehicular Technology, vol. 53, No. 3, pp. 655-665, May 2004. |
Zheng, L., et al., “Communication on the Grassmann Manifold: A Geometric Approach to the Noncoherent Multiple-Antenna Channel,” IEEE Trans. Inf. Theory, vol. 48, pp. 359-383, Feb. 2002. |
Huawei, “Adaptive Feedback: A New Perspective of the Adaptive Codebook,” 3GPP TSG RAN WG1 Meeting #58, R1-093059, Aug. 24-28, 2009, 5 pages. |
International Search Report, Application No. PCT/CN2015/078505, mailed Aug. 7, 2015, 11 pages. |
Number | Date | Country | |
---|---|---|---|
20150326284 A1 | Nov 2015 | US |
Number | Date | Country | |
---|---|---|---|
61991399 | May 2014 | US |