MULTIPLE ANTENNA TRANSMISSION WITH PER-ANTENNA POWER CONSTRAINTS

Abstract
A method, apparatus, and system to transmit signals using multiple antennas with per-antenna power constraints. The method includes initializing a precoding algorithm to a complex matrix. The precoding algorithm is for precoding signals transmitted by a plurality of antennas. The method includes iteratively processing the precoding algorithm on a per-antenna basis by, at each iteration, sequentially updating a precoder for each of the plurality of antennas. The method includes, after each iteration, determining whether the precoding algorithm has converged based on a change in a rate of mutual information across iterations. Additionally, the method includes, in response to determining that the precoding algorithm has converged, transmitting the signals using the precoding algorithm.
Description
TECHNICAL FIELD

The present application relates generally to multiple antenna wireless communication systems and, more specifically, to multiple antenna transmission schemes with per-antenna power constraints.


BACKGROUND

As multiple-input multiple-output (MIMO) technology has developed over the years, the number of antennas in the deployed MIMO systems has been steadily increasing. The concept of using a large number (hundreds or thousands) of antennas has also been brought up from an information theory perspective. However, the challenges of implementing hundreds or thousands of antennas in conventional cellular systems have been prohibitive. For example, to accommodate 1024 half-wavelength antennas in 2G Hz band, the dimension of the antenna array will be around 2.4 m×2.4 m.


Proposals for a mobile broadband system using millimeter-wave bands (MMB) have opened up opportunities to bring large MIMO antenna arrays with hundreds or thousands of antennas. For example, using millimeter-wave frequencies around 30 GHz, the dimension of an antenna array with 1024 antennas is about 16 cm×16 cm, smaller than a single sector antenna for a typical cellular base station.


The transmitter and receiver beamforming in MMB systems are different from the MIMO operations in cellular systems. With possibly thousands of antennas at a base station and hundreds of antennas at a mobile station, the spatial degree of freedom of MMB systems is much larger than that of cellular systems. To drive hundreds or thousands of antennas, a large number of power amplifiers are needed, each having its own power constraint. The design of efficient beamforming schemes to fully utilize the power of all these power amplifiers is an interesting problem with practical significance.


Therefore, there is a need in the art for improved transmission strategies in multiple antenna wireless communication systems. In particular, there is a need for methods and apparatuses that are capable of multiple antenna transmission schemes with per-antenna power constraints.


SUMMARY

A method, apparatus and system to transmit signals using multiple antennas with per-antenna power constraints.


In various embodiments, a method includes initializing a precoding algorithm to a complex matrix. The precoding algorithm is for precoding signals transmitted by a plurality of antennas. The method includes iteratively processing the precoding algorithm on a per-antenna basis by, at each iteration, sequentially updating a precoder for each of the plurality of antennas. The method includes, after each iteration, determining whether the precoding algorithm has converged based on a change in a rate of mutual information across iterations. Additionally, the method includes, in response to determining that the precoding algorithm converged, transmitting the signals using the precoding algorithm.


In various embodiments, an apparatus includes a controller, a precoding unit and a plurality of antennas. The controller is configured to initialize a precoding algorithm to a complex matrix. The precoding algorithm is for precoding signals transmitted by a plurality of antennas. The controller is configured to iteratively process the precoding algorithm on a per-antenna basis by, at each iteration, sequentially updating a precoder for each of the plurality of antennas. Additionally, the controller is configured to, after each iteration, determine whether the precoding algorithm has converged based on a change in a rate of mutual information across iterations. The precoding unit is configured to, in response to a determination that the precoding algorithm converged, precode the signals using the precoding algorithm. The plurality of antennas is configured to transmit the precoded signals.


Before undertaking the DETAILED DESCRIPTION OF THE INVENTION below, it may be advantageous to set forth definitions of certain words and phrases used throughout this patent document: the terms “include” and “comprise,” as well as derivatives thereof, mean inclusion without limitation; the term “or,” is inclusive, meaning and/or; the phrases “associated with” and “associated therewith,” as well as derivatives thereof, may mean to include, be included within, interconnect with, contain, be contained within, connect to or with, couple to or with, be communicable with, cooperate with, interleave, juxtapose, be proximate to, be bound to or with, have, have a property of, or the like; and the term “controller” means any device, system or part thereof that controls at least one operation, such a device may be implemented in hardware, firmware or software, or some combination of at least two of the same. It should be noted that the functionality associated with any particular controller may be centralized or distributed, whether locally or remotely. Definitions for certain words and phrases are provided throughout this patent document, those of ordinary skill in the art should understand that in many, if not most instances, such definitions apply to prior, as well as future uses of such defined words and phrases.





BRIEF DESCRIPTION OF THE DRAWINGS

For a more complete understanding of the present disclosure and its advantages, reference is now made to the following description taken in conjunction with the accompanying drawings, in which like reference numerals represent like parts:



FIG. 1 illustrates an exemplary wireless system which transmits messages in accordance with an illustrative embodiment of the present disclosure;



FIG. 2 illustrates a high-level diagram of an orthogonal frequency division multiple access transmit path in accordance with an illustrative embodiment of the present disclosure;



FIG. 3 illustrates a high-level diagram of an orthogonal frequency division multiple access receive path in accordance with an illustrative embodiment of the present disclosure;



FIG. 4 illustrates a block diagram of a wireless communication system in accordance with the present disclosure;



FIG. 5 illustrates an example of a channel condition model for multiple antenna transmission in a wireless communication system in accordance with various embodiments of the present disclosure;



FIG. 6 illustrates a plot of a graph illustrating calculation of a solution for multiple antenna transmission with per-antenna power constraints; and



FIG. 7 illustrates a process for transmitting signals using multiple antennas that takes into account per-antenna power constraints in accordance with various embodiments of the present disclosure.





DETAILED DESCRIPTION


FIGS. 1 through 7, discussed below, and the various embodiments used to describe the principles of the present disclosure in this patent document are by way of illustration only and should not be construed in any way to limit the scope of the disclosure. Those skilled in the art will understand that the principles of the present disclosure may be implemented in any suitably arranged system or device.



FIGS. 1-3 below describe various embodiments implemented in wireless communications systems and with the use of OFDM or OFDMA communication techniques. The description of FIGS. 1-3 is not meant to imply physical or architectural limitations to the manner in which different embodiments may be implemented. Different embodiments of the present disclosure may be implemented in any suitably arranged communications system.



FIG. 1 illustrates exemplary wireless system 100, which transmits messages according to the principles of the present disclosure. In the illustrated embodiment, wireless system 100 includes base station (BS) 101, base station (BS) 102, base station (BS) 103, and other similar base stations or relay stations (not shown). Base station 101 is in communication with base station 102 and base station 103. Base station 101 is also in communication with Internet 130 or a similar IP-based system (not shown).


Base station 102 provides wireless broadband access (via base station 101) to Internet 130 to a first plurality of subscriber stations (or user equipment (UE)) within coverage area 120 of base station 102. The first plurality of subscriber stations includes subscriber station 111, which may be located in a small business (SB); subscriber station 112, which may be located in an enterprise (E); subscriber station 113, which may be located in a WiFi hotspot (HS); subscriber station 114, which may be located in a first residence (R); subscriber station 115, which may be located in a second residence (R); and subscriber station 116, which may be a mobile device (M), such as a cell phone, a wireless laptop, a wireless PDA, or the like.


Base station 103 provides wireless broadband access (via base station 101) to Internet 130 to a second plurality of subscriber stations within coverage area 125 of base station 103. The second plurality of subscriber stations includes subscriber station 115 and subscriber station 116. In an exemplary embodiment, base stations 101-103 may communicate with each other and with subscriber stations 111-116 using OFDM or OFDMA techniques.


While only six subscriber stations are depicted in FIG. 1, it is understood that wireless system 100 may provide wireless broadband access to additional subscriber stations. It is noted that subscriber station 115 and subscriber station 116 are located on the edges of both coverage area 120 and coverage area 125. Subscriber station 115 and subscriber station 116 each communicate with both base station 102 and base station 103 and may be said to be operating in handoff mode, as known to those of skill in the art.


Subscriber stations 111-116 may access voice, data, video, video conferencing, and/or other broadband services via Internet 130. In an exemplary embodiment, one or more of subscriber stations 111-116 may be associated with an access point (AP) of a WiFi WLAN. Subscriber station 116 may be any of a number of mobile devices, including a wireless-enabled laptop computer, personal data assistant, notebook, handheld device, or other wireless-enabled device. Subscriber stations 114 and 115 may be, for example, a wireless-enabled personal computer (PC), a laptop computer, a gateway, or another device.



FIG. 2 is a high-level diagram of transmit path circuitry 200. For example, the transmit path circuitry 200 may be used for an orthogonal frequency division multiple access (OFDMA) communication. FIG. 3 is a high-level diagram of receive path circuitry 300. For example, the receive path circuitry 300 may be used for an orthogonal frequency division multiple access (OFDMA) communication. In FIGS. 2 and 3, for downlink communication, the transmit path circuitry 200 may be implemented in base station (BS) 102 or a relay station, and the receive path circuitry 300 may be implemented in a subscriber station (e.g. subscriber station 116 of FIG. 1). In other examples, for uplink communication, the receive path circuitry 300 may be implemented in a base station (e.g. base station 102 of FIG. 1) or a relay station, and the transmit path circuitry 200 may be implemented in a subscriber station (e.g. subscriber station 116 of FIG. 1).


Transmit path circuitry 200 comprises channel coding and modulation block 205, serial-to-parallel (S-to-P) block 210, Size N Inverse Fast Fourier Transform (IFFT) block 215, parallel-to-serial (P-to-S) block 220, add cyclic prefix block 225, and up-converter (UC) 230. Receive path circuitry 300 comprises down-converter (DC) 255, remove cyclic prefix block 260, serial-to-parallel (S-to-P) block 265, Size N Fast Fourier Transform (FFT) block 270, parallel-to-serial (P-to-S) block 275, and channel decoding and demodulation block 280.


At least some of the components in FIGS. 2 and 3 may be implemented in software, while other components may be implemented by configurable hardware or a mixture of software and configurable hardware. In particular, it is noted that the FFT blocks and the IFFT blocks described in this disclosure document may be implemented as configurable software algorithms, where the value of Size N may be modified according to the implementation.


Furthermore, although this disclosure is directed to an embodiment that implements the Fast Fourier Transform and the Inverse Fast Fourier Transform, this is by way of illustration only and should not be construed to limit the scope of the disclosure. It will be appreciated that in an alternate embodiment of the disclosure, the Fast Fourier Transform functions and the Inverse Fast Fourier Transform functions may easily be replaced by Discrete Fourier Transform (DFT) functions and Inverse Discrete Fourier Transform (IDFT) functions, respectively. It will be appreciated that for DFT and IDFT functions, the value of the N variable may be any integer number (i.e., 1, 2, 3, 4, etc.), while for FFT and IFFT functions, the value of the N variable may be any integer number that is a power of two (i.e., 1, 2, 4, 8, 16, etc.).


In transmit path circuitry 200, channel coding and modulation block 205 receives a set of information bits, applies coding (e.g., LDPC coding) and modulates (e.g., Quadrature Phase Shift Keying (QPSK) or Quadrature Amplitude Modulation (QAM)) the input bits to produce a sequence of frequency-domain modulation symbols. Serial-to-parallel block 210 converts (i.e., de-multiplexes) the serial modulated symbols to parallel data to produce N parallel symbol streams where N is the IFFT/FFT size used in BS 102 and SS 116. Size N IFFT block 215 then performs an IFFT operation on the N parallel symbol streams to produce time-domain output signals. Parallel-to-serial block 220 converts (i.e., multiplexes) the parallel time-domain output symbols from Size N IFFT block 215 to produce a serial time-domain signal. Add cyclic prefix block 225 then inserts a cyclic prefix to the time-domain signal. Finally, up-converter 230 modulates (i.e., up-converts) the output of add cyclic prefix block 225 to RF frequency for transmission via a wireless channel. The signal may also be filtered at baseband before conversion to RF frequency.


The transmitted RF signal arrives at SS 116 after passing through the wireless channel, and reverse operations to those at BS 102 are performed. Down-converter 255 down-converts the received signal to baseband frequency, and remove cyclic prefix block 260 removes the cyclic prefix to produce the serial time-domain baseband signal. Serial-to-parallel block 265 converts the time-domain baseband signal to parallel time-domain signals. Size N FFT block 270 then performs an FFT algorithm to produce N parallel frequency-domain signals. Parallel-to-serial block 275 converts the parallel frequency-domain signals to a sequence of modulated data symbols. Channel decoding and demodulation block 280 demodulates and then decodes the modulated symbols to recover the original input data stream.


Each of base stations 101-103 may implement a transmit path that is analogous to transmitting in the downlink to subscriber stations 111-116 and may implement a receive path that is analogous to receiving in the uplink from subscriber stations 111-116. Similarly, each one of subscriber stations 111-116 may implement a transmit path corresponding to the architecture for transmitting in the uplink to base stations 101-103 and may implement a receive path corresponding to the architecture for receiving in the downlink from base stations 101-103.


Various embodiments of the present disclosure recognize that beamforming using arrays with a large number of antennas is a fundamental enabling technology in millimeter-wave mobile communication. Transmitter and receiver beamforming can extend the range of millimeter-wave links, improve signal reception, and suppress interference among beams and neighboring cells. In order to effectively communicate over millimeter waves links, it is crucial for the transmitters and receivers to identify the strongest reflections and form transmitter and receiver beams along these paths. This can be achieved by beamforming with large antenna arrays.


The optimal transmission strategy, given the knowledge of channel station information at the transmitter and with total power constraint, is a “water-filling” strategy along the transmitter side singular vectors of the channel matrix. The optimal transmission strategy with per-antenna power constraints, however, has been a difficult problem. There is a general lack of understanding about the optimal transmission schemes and their performance for MIMO systems with per-antenna power constraints.


The present disclosure provides optimal transmission strategies with per-antenna power constraints as convex programming problems on convex regions and derives the necessary and sufficient conditions for optimality based on the Karush-Khun-Tucker (KKT) conditions. The present disclosure provides iterative algorithms that converge to the optimal solutions.


Although the present disclosure describes solutions to problems in millimeter-wave mobile communication, the embodiments of the present disclosure are applicable to MIMO and beamforming in conventional 3G and 4 G communication systems as well.



FIG. 4 illustrates a block diagram of a wireless communication system 400 in accordance with the present disclosure. Wireless communication system 400 includes transmitter 402 and receiver 404. Transmitter 402 transmits signals at a transmit end in wireless communication system 400. For example, the transmitter 402 may be a transmitter in a base station (e.g. base station 102 of FIG. 1) or relay station for downlink communication. In other embodiments, the transmitter 402 may be a transmitter in a subscriber station (e.g. subscriber station 116 of FIG. 1) for uplink communications.


The transmitter 402 includes a precoding unit 406 to precode data streams 408 to be transmitted by antennas 410. The controller 412 estimates channel properties and receives information about the properties of the receiver 404. The controller 412 calculates precoding algorithms for the precoding unit 406 to use in precoding the signals transmitted by the antennas 410. For example, in some embodiments, the transmitter 402 may implement a transmit path as shown in FIG. 2. The transmitter 402 also includes a plurality of power amplifiers 414. The power amplifiers 414 amplify the transmit power of the signals transmitted by the antennas 410. While FIG. 4 shows power amplifiers 414 connected in series with the antennas 410, any type of connections between the power amplifiers 414 and the antennas 410 may exist in the transmitter 402. For example, in various embodiments, one power amplifier 414 may be connected to a single antenna or multiple antennas in the antenna array 410. In other examples, each of the power amplifiers 414 may be connected to each of the antennas 410.


The receiver 404 receives signals at a receive end in wireless communication system 400. For example, the receiver 404 may be a receiver in a subscriber station (e.g. subscriber station 116 of FIG. 1) for downlink communication. In other embodiments, the transmitter 402 may be a transmitter for a base station (e.g. base station 102 of FIG. 1) or a relay station for uplink communications. The receiver 404 includes antennas 416 to receive signals transmitted in the wireless communication system 400. The receiver processing circuitry 418 processes the received signals to identify the received data streams 420. For example, in some embodiments, the receiver 404 may implement a receive path as shown in FIG. 3.


In these illustrative embodiments, the transmitter 402 transmits a single stream or multiple streams of data using multiple antennas 410 with element-wise power constraints. For example, the number of transmit antennas 410 is denoted as Nt, and the number of receiver antennas 416 is denoted as Nr. In this example, the general signal model for transmitter beamforming can be represented according to equation 1 below:






r=HVs+n  (Equation 1)


where s, V, r, n, and H are the transmitted signal vector, the transmitter beamformer, the received signal vector, the noise vector, and the channel matrix, respectively. The number of signal streams 408 is denoted by Ns. Without loss of generality, the present disclosure assumes that Ns≦min(Nr, Nt) and the signal streams are independent with unity total transmission power, (i.e., E{si*sj}=0 if i≠j, and E{ss}=1).


Without per-antenna power constraints, the capacity-achieving transmission and reception strategy can be easily obtained. The present disclosure provides transmitter beamforming strategies that maximize or increase mutual information with per-antenna power constraints.


In MMB systems, the transmission and reception of millimeter waves are often highly directional, which limits the number of paths the signal can travel from the transmitter 402 to the receiver 404. In addition, the number of rays that can reach the receiver and are strong enough to be detected by the receiver 404 is likely to be small due to the scarcity of scattering and large propagation loss. Thus, the present disclosure uses a simplified ray tracing model to characterize the MMB channel.



FIG. 5 illustrates an example of a channel condition model for multiple antenna transmission in a wireless communication system 500 in accordance with various embodiments of the present disclosure. For example, wireless communication system 500 in FIG. 5 is one example of wireless communication system 400 in FIG. 4. In the wireless communication system 400, the base station 502 transmits multiple wireless signals to the mobile station 504. Some of these signals may have a direct line of sight from the base station 502 to the mobile station 504. Other signals may reflect or refract off obstacles (e.g., buildings 506 or trees 508) along the path between the base station 502 to the mobile station 504.



FIG. 5 illustrates a simplified MMB channel model. The number of rays from the base station 502 to the mobile station 504 is denoted by C. The angle of arrival (AoA) and angle of departure (AoD) for the c-th ray are denoted as θc and φc, respectively. Assuming an exemplary linear array, the receiver array response vector for AoA θ can be represented according to equation 2 below:





α(θ)=[1er . . . ei(L-1)Δr]T  (Equation 2)


where Δr=2πdr sin θ/λ and dr is the antenna spacing for the receiver antenna array 416. Assuming an exemplary linear array, the transmitter array response vector for AoD θ can be represented according to equation 3 below:





β(φ)=[1et . . . ei(K-1)Δt]T  (Equation 3)


where Δt=2πdt sin φ/λ and dt is the antenna spacing for the transmitter antenna array. The use of a linear array is for illustration purposes only and other antenna configurations can also be used. The channel can be represented in matrix form according to equation 4 below:






H=Σ
c=1
C√{square root over (ρc)}·α(θc)·βc)·γc=AΓB  (Equation 4)


where A is the receiver array response of the channel, B is the transmitter array response of the channel, and Γ is the diagonal matrix representing the channel coefficients of the ray-tracing channel model. In other words, A=[α(θ1) α(θ2) . . . α(θC)];







Γ
=

[






ρ
1




γ
1










0




0





ρ
2




γ
2







0


















0


0








ρ
C




γ
C





]


;
and






B
=

[




β


(

φ
1

)





β


(

φ
2

)








β


(

φ
C

)





]





where ρc is the power of the c-th ray with Σc=1Cρc=1, and γc is the normalized complex channel coefficient for the c-th ray, (i.e., E∥γc2=1).


Referring back to FIG. 4, single-stream beamforming (i.e., Ns=1) is of practical interest as single-stream transmission happens often in MIMO systems. Transmissions in low signal to noise ratio (SNR) conditions are often single-stream. Also transmissions to or from a mobile station with a single antenna is single-stream. When the channel is highly correlated, transmission is also likely to be single-stream.


With per-antenna power constraints, the optimal transmitter beamformer is the solution to the following optimization problem and can be represented according to equation 5 below:











v
opt

=


arg







max
v




log


(

1
+


1

σ
2




v



Fv


)








s
.
t
.





v
i
*




v
i




=

p
i



,

i
=
1

,
2
,





,

N
t





(

Equation





5

)







where F=HH, v=[v1 v2 . . . vNt]T is a single-stream beamformer, and pi>0 is the power constraint on an i-th antenna. Without loss of generality, the present disclosure assumes that ƒiik=1Nr∥hki2>0, i=1, 2, . . . , Nt. Otherwise, the transmitter antennas 410 that do not contribute to the mutual information can be simply removed. Note that in these examples, optimal receiver 404 is assumed so that the achievable mutual information is only a function of transmitter beamforming. Although the objective function of problem is convex, the region defined by the per-antenna power constraints is not. In order to avoid the difficulty of optimizing on a non-convex region, the present disclosure addresses a slightly different problem by allowing the per-antenna power constraints to slack. The alternative problem can be represented according to equation 6 below:











v
opt

=


arg







max
v




log


(

1
+


1

σ
2




v



Fv


)








s
.
t
.






g
i



(
v
)






=




v
i
*



v
i


-

p
i



0



,

i
=
1

,
2
,





,


N
t

.





(

Equation





6

)







It is easy to verify that, in this example, the region defined by the relaxed per-antenna power constraints is convex. This convex region defined over custom-characterN, however, is not friendly for mathematical manipulation as any non-constant real valued function ƒ:custom-characterNcustom-character does not satisfy the Cauchy-Riemann criteria, and thus, it is not analytic. To avoid the problem of undefined derivative of the real objective function on complex variables, the present disclosure reformulates the optimization problem into a real function {tilde over (ƒ)}:custom-character2Ncustom-character. For example, by denoting ƒ(v)=






log


(

1
+


1

σ
2




v



Fv


)





and letting v=x+jy, the objective function can be equivalently defined as a real function on custom-character2N represented according to equation 7 below:











f
~



(

x
,
y

)


=


log


[

1
+


1

σ
2




(


x
T

-

jy
T


)



F


(

x
+
jy

)




]


=


log


(

1
+


1

σ
2




v



Fv


)


=


f


(
v
)


.







(

Equation





7

)







The constraints can also be represented as functions of real variables represented according to equation 8 below:






{tilde over (g)}
i(x,y)=xi2+yi2−pi≦0, i=1,2, . . . ,Nt  (Equation 8).


The present disclosure then constructs an equivalent optimization problem with real variables and a real objective function for the problem represented by equation 6 as can be represented according to equation 9 below:











(


x
opt

,

y
opt


)

=


arg







max

x
,
y






f
~



(

x
,
y

)








s
.
t
.







g
~

i



(

x
,
y

)







0


,

i
=
1

,
2
,





,


N
t

.





(

Equation





9

)







As a result, {tilde over (f)}(x,y) is convex and the region on custom-characteras defined by the per-antenna power constraints is also convex. Also, the optimal beamformer of equation 9 is at the boundary of the convex region with all per-antenna power constraints binding. This can be proven by contradiction as follows: for any vector v=x+jy, assuming the power constraint on the k-th antenna is not binding, i.e., gk(x,y)=xk2+yk2−pk<0, let δk=√{square root over (pk−(xk2+yk2))}, and ε=ψ(Σi=1Ntƒikvi)·δkek, where ek is a column vector with eki=1 if i=k and eki=0 otherwise, and ψ(w) represents the phase of a complex variable w, (i.e.,







ψ


(
w
)


=


{




1
,





if





w

=
0







w


w



,



otherwise



)

.





Then v+ε also satisfies the power constraints on all the antennas. However, (v+ε)F(v+ε)=vFv+vFε+εFV+εFε=vFv+εFε+2δk∥Σi=1Ntƒikvi∥>vFv. This contradicts the assumption that v=x+jy is the optimal solution to the problem represented by equation 9. Therefore, any optimal solution to the problem represented by equation 9 must meet all per-antenna power constraints with equality. Basically, this illustrates that the solution for the alternative optimization problem represented by equation 9 is also optimal for the original problem represented by equation 8.


For example, a single-stream beamformer vopt is an optimal solution for the problem represented by equation 8 if, and only if, the single-stream beamformer satisfies the following conditions represented according to equation 10 below:






v
i
opt=ψ(Σk≠iƒikvkopt)·√{square root over (pi)}, i=1, . . . ,Nt  (Equation 10)


where ƒik is the element of F at the i-th row and the k-th column.


The present disclosure verifies that the optimization problem represented by equation 9 satisfies the Slater's condition if pi>0,∀i. Thus, the KKT conditions are both necessary and sufficient for optimality. The present disclosure verifies that the KKT conditions can be represented according to equation 11 below:











μ
i
opt

=


1


σ
2

+



(

v
opt

)





Fv
opt




·

(


f
ii

+







k

i












f
ik



v
k
opt







p
i




)










x
i
opt

=

Re



{

ψ


(




k

i












f
ik



v
k
opt



)


}

·


p
i












y
i
opt

=

Im



{

ψ


(




k

i












f
ik



v
k
opt



)


}

·


p
i













for





k

=
1

,





,

N
t






(

Equation





11

)







where μiopt≧0, i=1, . . . , Nt are the Lagrange multipliers for the optimal solution, and vopt=xopt+jyopt. Note that μi, i=1, . . . , Nt are the auxiliary variables introduced, and the existence of μiopt≧0, i=1, . . . , Nt is shown by equation 11. Also the solutions for xiopt yiopt can be consolidated into one single equation of viopt. Finally, the necessary and sufficient conditions for the optimal solution can be consolidated as represented according to equation 12 below:






v
i
opt=ψ(Σk≠iƒikvkopt)·√{square root over (pi)}, for k=1, . . . ,Nt  (Equation 12),


which is equivalent to the optimality conditions shown in equation 10. This solution is also the optimal solution for the problem represented by equation 5.


Therefore, as long as the solution satisfies equation 12, the solution will be the optimal solution of the problem represented by equation 5. Table 1 provides an iterative algorithm to find such an optimal solution of single-stream beamforming as follows:










TABLE 1






Optimal Single-Stream Tx Beamforming

















1. Initialize ν to a complex vector such that ||νi||2 = pi, i = 1, . . . ,Nt.



2. In the n-th iteration, update νi sequentially for i = 1 . . . , Nt, νi =



ψ(Σk≠ifikνk) · {square root over (pi)}



3. Check convergence. If yes, stop; if not, go to Step 2.









The iterative algorithm outlined in Table 1 converges to an optimal solution of the problem represented by equation 5. For example, through the iterations of the algorithm, the complex beamforming weights on all antennas are updated, one at a time. Before an update for the i-th antenna, the value of the objective function of the problem represented by equation 5 is







log


(

1
+


1

σ
2




v



Fv


)


=


log


[




1
+


1

σ
2







j

i










k

i








v
j
*



f
jk



v
k





+








(


1

σ
2







j

i








v
j
*



f
ij




)



v
i


+


(


1

σ
2







k

i








f
ik



v
k




)



v
i
*


+


1

σ
2




f
ii



v
i
*



v
i






]


.





It can be verified that the update viψ(Σk≠iƒikvk). √{square root over (pi)} maximizes the objective function given the power constraint on the i-th antenna, assuming that all other vk, k≠i, are unchanged. As a result, the objective function is non-decreasing in every step of each iteration.


The present disclosure recognizes that the objective function is bounded (e.g., by the channel capacity without the per-antenna power constraints but with total power constraint that equals the sum of all the per-antenna power constraints). As a result, the objective function will converge to a certain limit. When the objective function converges, the condition vi=ψ(Σk≠iƒikvk)·√{square root over (pi)} is met for all i=1, . . . , Nt. Based on the conditions associated with equation 12, the solution is an optimal solution for the problem represented by equation 5.


For practical purposes, since maximizing the achievable mutual information is desirable, the change rate of the mutual information across iterations can be used as an indication of convergence. For example, with the achieved mutual information after the n-th iteration denoted as







f


(
n
)


=

log


[

1
+


1

σ
2





v




(
n
)




Fv


(
n
)




]






convergence is determined to occur if









f


(
n
)


-

f


(

n
-
1

)




f


(
n
)



<

ɛ
.





In these embodiments, the convergence criteria should only be applied between iterations, not between the updates for different antennas within iteration, as it is possible that convergence has not been reached although the update for a single antenna does not increase the objective function. To evaluate how fast the algorithm converges, the mutual information gap for the n-th iteration may be calculated according to equation 13 below:










τ


(
n
)


=



f


(

)


-

f


(
n
)




f


(

)







(

Equation





13

)







where ƒ(∞) denotes the maximum achievable mutual information given the per-antenna power constraints.


Various embodiments provide precoding algorithms for multi-stream beamforming (i.e., Ns>1) with element-wise power constraints. Without element-wise power constraints, the capacity achieving transmission and reception strategy is to diagonalize the channel via transmitter and receiver beamforming using the left and right singular vectors of the channel and apply “water filling” along the diagonalized channel. The present disclosure provides a transmitter beamforming strategy that maximizes mutual information with element-wise power constraints. For example, in various embodiments, the present disclosure applies similar techniques as in single-stream beamforming to analyze multi-stream beamforming per-antenna power constraints.


The per-antenna power constraints are assumed to be applied at each MIMO stream. This is often the case when power allocation among the multiple streams is not supported. For example, in single-user MIMO in LTE, the power between the multiple streams is assumed to be equal to save signaling overhead and simplify the transceiver modem implementation. In this example, the optimization problem with per-antenna power constraints can be formulated according to equation 14 below:











V
opt

=


arg







max
V



log





I

N
s


+


1

σ
2




V



FV










s
.
t
.





v
ik




v
ik
*




=

p
ik



,


for





i

=
1

,





,

N
t

,

k
=
1

,








N
s






(

Equation





14

)







where V=[v1 v2 . . . vNs] is the multi-stream beam-former, and pik>0, for i=1, . . . , Nt, k=1, . . . , Ns are the per-antenna power constraints for all streams. Again, without loss of generality, the present disclosure assumes ƒii>0, i=1, 2, . . . , Nt. Similarly, as in the single-stream beamforming case, the present disclosure illustrates that the optimal solution to the problem represented by equation 14 can be found by solving the following problem represented according to equation 15 below:











V
opt

=


arg







max
V



log





I

N
s


+


1

σ
2




V



FV










s
.
t
.





v
ik




v
ik
*






p
ik



,


for





i

=
1

,





,

N
t

,

k
=
1

,









N
s

.






(

Equation





15

)







With defining Vk as the sub-matrix of V with vk removed and denoting







C
k

=

I
+





V
_

k



F



V
_

k



σ
2







and










G
k

=

F
-


1

σ
2



F



V
_

k



C
k

-
1





V
_

k



F



,




the necessary and sufficient conditions for the optimal solution to the problem represented by equation 14 can be expressed as: a multi-stream beamformer V is an optimal solution for the problem represented by equation 14 if, and only if, the solution satisfies the following conditions represented according to equation 16 below:






v
ik=ψ(Σj≠igijkvjk)·√{square root over (pik)}  (Equation 16)


for i=1, . . . , Nt, and k=1, . . . , Ns, where gijk is the (i,j)-th entry of matrix Gk denoted above.


To illustrate this result, let X=Re{V}, Y=Im{V}, and









f
~



(

X
,
Y

)


=


log





I

N
s


+


1

σ
2




(


X
T

-

j






Y
T



)



F


(

X
+





j





Y


)







=


log





I

N
s


+


1

σ
2




V



FV





=

f


(
V
)





,




then an equivalent optimization problem defined on real variables for the problem represented by equation 15 can be represented according to equation 17 below:





(Xopt,Yopt)=arg maxX,Y{tilde over (ƒ)}(X,Y),s.t.






g
ik(X,Y)=xik2+yik2−pik≦0





for i=1, . . . ,Nt,k=1, . . . , Ns  (Equation 17).


Similar to the single-stream beamforming case, the present disclosure illustrates that the optimal solution of the convex optimization problem represented by equation 17 is also the optimal solution of the original problem represented by equation 14 by showing that all the per-antenna power constraints are binding (i.e., gik(X,Y)=0, for i=1, . . . Nt, and k=1, . . . , Ns) for the optimal solution of the problem represented by equation 17.


With some manipulation, the contribution of the k-th stream to the mutual information in the presence of the interference from all other streams can be isolated according to equation 18 below:













f


(
V
)


=

log







1
+



v
k




Fv
k



σ
2








v
k



F



V
_

k



σ
2










V
_

k




Fv
k



σ
2





I
+




V
_

k



F



V
_

k



σ
2















=


log




C
k




+

log


(

1
+


1

σ
2




v
k




G
k



v
k



)










(

Equation





18

)







where Ck is defined as denoted above. Using equation 18, the multi-stream beamforming optimization problem can be decomposed into a multiple single-stream beamforming optimization problem. Since log|Ck| is independent of vk, it becomes clear from equation 18 that in order to maximize ƒ(V), vk should be chosen such that






log


(

1
+


1

σ
2




v
k




G
k



v
k



)





is maximized. Next, this objective function can be shown to also be convex by proving that Gk is positive semi-definite.


Note that if F=HH is invertible (i.e., positive definite), Gk can be easily shown to be positive semi-definite because







G
k

=


F
-


1

σ
2



F



V
_

k



C
k

-
1





V
_

k



F


=



(


F

-
1


+


1

σ
2





V
_

k




V
_

k




)


-
1


.






In the more general case when F is positive semi-definite, let H Vk=WΛZ be the singular-value decomposition (SVD) of H Vk, where W is a Nr×(Ns−1) matrix, Λ is a (Ns−1)×(Ns−1) diagonal matrix, and Z is a (Ns−1)×(Ns−1) unitary matrix. W can be extended to a unitary matrix by adding orthogonal vectors with unit norm, (i.e., denote [W {tilde over (W)}] as the unitary matrix extended from W). Then Gk can be shown to be positive semi-definite as follows,







G
k

=




H



H

-


1

σ
2




H



W





Λ








Z




(

I
+


Z






Λ
2



Z




σ
2



)



-
1



Z





Λ






W



H


=




H




W
~




W
~




H

+


H




W


[

I
-


Λ

σ
2





(

I
+


Λ
2


σ
2



)


-
1



Λ


]




W



H


=



H




W
~




W
~




H

+


H





W


(

I
+


1

σ
2



Λ


)



-
2




W




H
.









For any vector x, xGkx=∥{tilde over (W)}Hx∥2+∥(I+1σΛ−1W†Hx2≧0, thus, proving Gk is positive semi-definite.


With Gk being positive semi-definite for all streams, it can be shown that the KKT conditions for the problem represented by equation 17 can be represented according to equation 19 below:











μ
ik
opt

=


1


σ
2

+



(

v
k
opt

)





G
k



v
k
opt




·

(


g
ii
k

+







j

i








g
ij
k



v
jk
opt







p
ik




)










x
ik
opt

=

Re



{

ψ


(




j

i








g
ij
k



v
jk
opt



)


}

·


p
ik












y
ik
opt

=

Im



{

ψ


(




j

i








g
ij
k



v
jk
opt



)


}

·


p
ik













for





i

=
1

,





,

N
t

,

k
=
1

,





,

N
s






(

Equation





19

)







where μikopt≧0, i=1, . . . , Nt, k=1, . . . , Ns, are the Lagrange multipliers for the optimal solution.


Since the optimization problem represented by equation satisfies the Slater's condition if pik>0, for i=1, . . . , Nt, k=1, . . . , Ns, the KKT conditions are both necessary and sufficient conditions for optimality. Therefore, the solution in equation 19 is optimal for the problem represented by equation 17. This result is thus proven by noting the equivalence between the solution in equation 19 and equation 16 and by noting that an optimal solution to the problem represented by equation 17 is also an optimal solution to the problem represented by equation 14.


An iterative algorithm to find an optimal solution for multi-stream beamforming is illustrated in Table 2 below.









TABLE 2





Optimal Multi-Stream Tx Beamforming















1. Initialize V to a complex matrix such that ∥vik2 = pik, i = 1, . . . , Nt, k =


1, . . . , Ns.


2. In the n-th iteration, update vik sequentially as follows:


 For (k = 1, . . . , Ns,






CalculateGk=F-1σ2FV_kCk-1V_kF.






  For (i = 1 . . . , Nt, vik = ψ(Σj≠i gijkvjk) · {square root over (pik)}


  ) End


 ) End


3. Check convergence. If yes, stop; if not, return to Step 2.









The iterative algorithm illustrated in Table 2 above converges to the optimal solution for the problem represented by equation 14. To show this, similar to the proof of the iterative algorithm outlined in Table 1, each update of vik in the iteration maximizes the objective function given the power constraint on the i-th antenna for the k-th stream, assuming that the beamforming weights for all other antennas or other streams remain unchanged. As a result, the objective function is non-decreasing over the iterations. Since the objective function is bounded, the objective function converges to a certain limit.


When the mutual information converges, the condition vik=ψ(Σj≠igijkvjk)·√{square root over (pik)} from equation 16 is satisfied for all k=1, . . . , Ns, and i=1, . . . , Nt. Based on the conditions associated with equation 16, the solution is optimal for the problem represented by equation 14.


Similar to the single-stream beamforming case, the formula for declaring convergence discussed above can be used as the stopping criteria for the iterations, and the mutual information gap discussed above can be used as a measure of convergence rate over iterations.


Various embodiments of the present disclosure provide optimal MIMO transmission schemes with per-antenna power constraints. The capacity-achieving MIMO transmission scheme for a single-user MIMO system with a total power constraint can be easily identified. The preset disclosure provides optimal MIMO transmission schemes in the single-user MIMO system with per-antenna power constraints. The optimal MIMO transmission scheme that maximizes the mutual information while satisfying the per-antenna power constraints can be formulated according to equation 20 below:











V
opt

=



max
V



log





I

N
s


+


1

σ
2




V




H



HV










s
.
t
.








k
=
1


N
s









v
ik



v
ik
*






=

p
i



,

i
=
1

,





,

N
t





(

Equation





20

)







where V=[v1 v2 . . . vNs] is the MIMO precoder with vik being the precoding coefficient on the i-th antenna for the k-th MIMO layer, and pi>0, i=1, . . . , Nt are the per-antenna power constraints on the Nt transmitter antennas 410. Although power allocation across antennas is prohibited, the power on one antenna can still be allocated among different MIMO layers on that antenna, as long as the per-antenna power constraint on that antenna is not violated.


Looking to the optimization on a per-antenna basis, the precoder on the i-th antenna can be defined according to equation 21 below:






u
i
=[v
i1
v
i2
. . . v
iN

s
]  (Equation 21)


where ui represents the transmission from the i-th antenna by all the MIMO layers. The MIMO precoder can therefore be alternatively represented according to equation 22 below:






U=V

=[u
1
u
2
. . . u
N

t
]  (Equation 22).


With F=HH, without loss of generality, assuming ƒiik=1Nr∥hki2>0, i=1, 2, . . . , Nt, the optimization problem represented by equation 20 can be alternatively represented as another non-linear programming problem according to equation 23 below:











U
opt

=



max
U



log





I

N
s


+


1

σ
2




UFU












s
.
t
.






g
i



(
U
)





=




u
i




u
i


-

p
i


=
0



,

i
=
1

,





,


N
t

.





(

Equation





23

)







In the next steps, the contribution of the transmission from the i-th antenna to the mutual information is identified. For the i-th antenna, a permutation of F is according to equation 24 below:










F
i

=

[




f
ii




q
i







q
i




Q
i




]





(

Equation





24

)







where Qi is the (Nt−1)×(Nt−1) matrix obtained by removing the i-th row and the i-th column from F, and qi is the i-th column of F without the diagonal item ƒii. Accordingly, defining Ūi as the sub-matrix of U with ui removed, defining wi according to equation 25 below:






w
i

i
q
iii  (Equation 25)


and defining Di according to equation 26 below:











D
i

=

I
+


1

σ
2





U
_

i



Q
i




U
_

i



-



f
ii


σ
2




w
i



w
i





,




(

Equation





26

)







the contribution of the transmission from the i-th antenna to the mutual information can be separated from the contribution of other antennas according to equation 27 below:










f


(
U
)


=


log




I
+


1

σ
2




UFU







=



log






I
+




1

σ
2




[




u
i





U
_

i




]




[




f
ii




q
i







q
i




Q
i




]




[




u
i








U
_

i





]





=
log




I

+

UiQiUi







+

fiiuiwi







+

wiui







+

uiui





†σ





2


=



log





Di

+

fii





σ





2

ui

+
wiui
+

wi








=


log





Di

+

log





1

+

fii





σ





2

ui

+

wi











Di

-

1

ui

+

wi
.









(

Equation





27

)







From equation 27, Di is invertible by setting ui=−wi and by the fact that mutual information is always non-negative. Also from equation 27, the contribution of the i-th antenna to the mutual information given the interference from all other antennas can be represented according to equation 28 below:










f


(



u
i




U
_

i


,
F

)


=


log


[

1
+



f
ii


σ
2





(


u
i

+

w
i


)






D
i

-
1




(


u
i

+

w
i


)




]


.





(

Equation





28

)







The objective function of the problem represented by equation 23 is real, but the variables are complex. KKT conditions do not exist for the problem represented by equation 23, because the real objective function, when defined on complex variables, does not satisfy Cauchy-Riemann Equations and is thus not differentiable. In order to apply KKT conditions, the problem represented by equation 23 is converted into an equivalent optimization on real variables. For example, letting X=Re{U}, Y=Im{U}, and









f
~



(

X
,
Y

)


=


log





I

N
s


+



(

X
+

j





Y


)



F


(


X
T

-

j






Y
T



)




σ
2






=


log




I
+


1

σ
2




UFU







=

f


(
U
)





,




then the problem represented by equation 23 is equivalent to the following problem defined on real variables X and Y according to equation 29 below:





(Xopt,Yopt)=arg maxX,Y{tilde over (ƒ)}(X,Y)






{tilde over (g)}
i(X,Y)=Σk=1Ns(xik2+yik2)−pi=0, for i=1, . . . ,Nt  (Equation 29).


A convex optimization problem is further defined by relaxing the equality power constraints to inequality according to equation 30 below:





(Xopt,Yopt)=arg maxX,Y{tilde over (ƒ)}(X,Y)






{tilde over (g)}
i(X,Y)=Σk=1Ns(xik2+yik2)−pi≦0, for i=1, . . . ,Nt  (Equation 30).


It is easy to verify that both the objective function and the region of the problem represented by equation 30 are convex. The optimal solution of the convex programming problem represented by equation 30 is also the optimal solution of the original problem represented by equation 20, (i.e., the solution to the convex programming problem represented by equation 30 is also a solution to the problem represented by equation 20 with the mapping U=X+jY and U=V). To show this, it is easy to see that the problem represented by equations 20, 23 and 29 are equivalent with the mapping U=V and U=X+jY. This leaves the need to prove the solution to the problem represented by equation 30 is also a solution to the problem represented by equation 29. The problem represented by equation 30 becomes equivalent to the problem represented by equation 29 if all the per-antenna power constraints in the problem represented by equation 30 are binding (i.e., satisfied with equality).


By contradiction, it can be proven that a solution to the problem represented by equation 30 must satisfy all the per-antenna power constraints with equality. For example, assume X and Y are the solution to the problem represented by equation 30 and the per-antenna power constraint on the i-th antenna is not binding (i.e., Σk=1Ns(xik2+yik2)−pi=uiui−pi<0). Therefore, there exists ε>0, such that [ui+ε(ui+wi)][ui+ε(ui+wi]<pi. If the i-th column in U is replaced by ui+ε(ui+wi) and the corresponding real variables are denoted as {tilde over (X)} and {tilde over (Y)}, all the per-antenna element power constraints are still satisfied by {tilde over (X)} and {tilde over (Y)}, and the value of the objective function will be








f
~



(


X
~

,

Y
~


)


=




log




D
i




+

log


[

1
+





(

1
+
ɛ

)

2



f
ii



σ
2





(


u
i

+

w
i


)






D
i

-
1




(


u
i

+

w
i


)




]



>


log




D
i




+

log


[

1
+



f
ii


σ
2





(


u
i

+

w
i


)






D
i

-
1




(


u
i

+

w
i


)




]




=



f
~



(

X
,
Y

)


.






This contradicts with the assumption that X and V are a solution to the problem represented by equation 30. Therefore, any solution to the problem represented by equation 30 must satisfy all per-antenna power constraints with equality. In this example, this solution is also a solution to the problem represented by equation 29, which is equivalent to the problem represented by equation 20.


As a result, the problem represented by equation 30 can be solved instead of the problem represented by equation 20 to find the optimal solution for the problem represented by equation 20. The problem represented by equation 30 is a convex programming problem defined on real variables, and thus, KKT conditions are applicable.


Before finding the optimal solution of the problems represented by equations 20 and 30, the present disclosure first addresses the optimal solution for ƒ(uii,F). From equation 28, ui should be chosen to maximize ƒ(uii,F) in order to maximize ƒ(U) according to equation 31 below:






u
i
opt=arg maxui(ui+wi)Di−1(ui+wi)s.t.






u
i

u
i
=p
i  (Equation 31).


Similar to showing that the solution to the convex programming problem represented by equation 30 is also a solution to the problem represented by equation 20 with the mapping U=X+jY and U=V, the solution to the problem represented by equation 31 coincides with the solution to an alternative problem with the equality constraint relaxed to inequality. In other words, the precoder on the i-th antenna should be chosen according to the following optimization problem according to equation 32 below:






u
i
opt=arg maxui(ui+wi)Di−1(ui+wi)s.t.






u
i

u
i
≦p
i  (Equation 32).


The problem represented by equation 32 is a convex programming problem. Intuitively, the objective is to maximize a quadratic function (ui+wi)Di−1(ui+wi) within a sphere defined by uiui≦pi. The center (global minimum) of the quadratic function is at −wi, while the center of the sphere is at origin. By denoting the SVD of Di according to equation 33 below:






D
i
=Z
iΓi2Zi  (Equation 33)


with ZiZi=ZiZi=I, ũi=Ziui and {tilde over (w)}i=Ziwi, the optimization problem represented by equation 32 can be transformed according to equation 34 below:






ũ
i
opt=arg maxũi(ũi+{tilde over (w)}i)Γi−2(ũi+{tilde over (w)}i)s.t.






ũ
i

ũ
i
≦p
i  (Equation 34).


In order to maximize the objective function, it is apparent that the two complex numbers, ũkiopt and {tilde over (w)}ki should be in-phase for any MIMO layer k=1, . . . , Ns. With that, the problem degenerates to power allocation among the Ns. MIMO layers at the i-th antenna. Denoting the amplitude of ũki by αki, the amplitude of {tilde over (w)}ki by βki and letting αi=[α1i α2i . . . αNsi], and βi=[β1i β2i . . . βNsi], then the optimization problem represented by equation 34 can be further simplified according to equation 35 below:










α
i
opt

=


arg







max

α
i







k
=
1


N
s











(


α
ki

+

β
ki


)

2


γ
ki
2








s
.
t
.








k
=
1


N
s




α
ki
2










p
i

.






(

Equation





35

)







The contours of the objective function in the problem represented by equation 35 are Ns-dimension ellipsoids centered at −βk. The constraints in the problem represented by equation define an Ns-dimension sphere centered at origin. The optimization problem is equivalent to finding the ellipsoidal contour with the highest objective function value that is tangent to the Ns-dimension sphere. The tangent point is exactly the optimal solution to the problem represented by equation 35.



FIG. 6 illustrates a plot of a graph 600 illustrating calculation of a solution for multiple antenna transmission with per-antenna power constraints. FIG. 6 illustrates an example when Ns=2. This optimal solution can be pictorially illustrated as drawing an ellipse 602 centered at (−β1k, −β2k) that is tangent to a circle 604 centered at origin as in FIG. 6. The optimal solution is the tangential point 606 between the elliptical contour 602 that corresponds to the highest possible objective function value and the circle 604 that represents the per-antenna power constraint.


This optimization problem can be easily solved by the method of Lagrange multiplier. It can be shown that the optimal solution to the problem defined by equation 35 should satisfy the following KKT condition according to equation 36 below:





ioptΓi2−Iiopti  (Equation 36)


where ρiopt>0 is the Lagrange multiplier chosen such that the power constraint on the i-th antenna is binding, (i.e., αiαi=pi). The value of ρiopt can be found by solving the following equation derived from equation 36 and the binding power constraint according to equation 37 below:













k
=
1


N
s





β
ki
2



(



ρ
i
opt



γ
ki
2


-
1

)

2



=


p
i

.





(

Equation





37

)







Letting q(ρi) be a function of ρi






(


i
.
e
.

,


q


(

p
i

)


=





k
=
1


N
s





β
ki
2



(



p
i



γ
ki
2


-
1

)

2



-

p
i




)




then, ρiopt is a root to the equation q(ρi)=0. From equation 37, can be seen that:













β
ki
2



(



ρ
i
opt



γ
ki
2


-
1

)

2


-

p
i



0

,






for





k

=
1

,





,


N
s

.





(

Equation





38

)







A lower bound of ρiopt can, therefore, be obtained according to equation 39 below:











ρ
i
opt




max
k




1

γ
ki
2




(

1
+



β
ki
2


p
i




)




=

ρ
i
min





(

Equation





39

)







In addition, ρiopt is unique, since the q(ρi) is a monotonically decreasing function in [ρimin,+∞). As a result, a simple Newton's method function can be used to find the value of ρiopt. The iterative update of the Newton's method is according to equation 40 below:











ρ
i



(

n
+
1

)


=



ρ
i



(
n
)


-


q


(


ρ
i



(
n
)


)




q




(


ρ
i



(
n
)


)








(

Equation





40

)







where the derivative q′(ρi) is given according to equation 41 below:











q




(

ρ
i

)


=




k
=
1


N
s







-
2



β
ki
2



γ
ki
2




(



p
i



γ
ki
2


-
1

)

3


.






(

Equation





41

)







The iterative algorithm can be initialized by ρi(0)=ρimin. To reduce the complexity of numerical solutions for piopt, an upper bound can also be given as











k
=
1


N
s





β
ki
2



(



ρ
i
opt



γ
li
2


-
1

)

3








k
=
1


N
s





β
ki
2



(



ρ
i
opt



γ
ki
2


-
1

)

2




=

p
i





where the index l corresponds to the smallest γki2 (i.e., l=arg mink γki2(Equation 42)). This results in the following solution according to equation 43 below:











ρ
i
opt




γ
li
2

(

1
+






k
=
1


N
s




β
ki
2



p
i




)


=


ρ
i
max

.





(

Equation





43

)







In other words, the numerical solution can be limited in the region of ρimin≦ρiopt≦ρimax (Equation 44) where ρimin and ρimax are defined in equations 39 and 43, respectively. This allows a simple bisection method (i.e., a binary search) to find ρiopt if so preferred over the Newton's method. The iterative update of the bisection method is given according to equation 45 below:












ρ
i



(

n
+
1

)


=


1
2



[



ρ
i
LB



(
n
)


+


ρ
i
UB



(
n
)



]











ρ
i
LB



(

n
+
1

)


=

{








ρ
i
LB



(
n
)


,





if







ρ
i



(

n
+
1

)



<
0








ρ
i



(

n
+
1

)


,



otherwise










ρ
i
UB



(

n
+
1

)



=

{






ρ
i
UB



(
n
)


,





if







ρ
i



(

n
+
1

)



>
0








ρ
i



(

n
+
1

)


,




otherwise
.












(

Equation





45

)







The algorithm can be initialized by setting ρiLB(0)=ρimin, and ρiUB(0)=ρimax. The convergence of these numerical methods is also guaranteed. The second order derivative of q(ρi) is always positive on [ρimin, +∞) (i.e.,












q




(

ρ
i

)


=





k
=
1


N
s





6


β
ki
2



γ
ki
4




(



ρ
i



γ
ki
2


-
1

)

4



>
0


,






for






ρ
i





ρ
i
min

.







(

Equation





46

)

)







Therefore, q(ρi) is convex on [ρimin,+∞). Additionally, q(ρi) is monotonically decreasing) on [ρimin,+∞), q(ρimin)≧0, and q(ρimax)≦0. It is then straightforward to prove that the equation q(ρi)=0 has a unique root on [ρimin, +∞), and both the Newton's method and the bisection method are guaranteed to converge to the unique solution.


With the solution for the optimization problem represented by equation 35 found numerically, the optimal solution for the problems represented by equations 34, 32 and 31 is also found. From equation 36, and the fact that ũkiopt and {tilde over (w)}ki should be in-phase for any MIMO layer k=1, . . . , Ns, the optimal solution for the problem represented by equation 34 is also found according to equation 46 below:






ũ
i
opt=(ρioptΓi2−I)−1{tilde over (w)}i  (Equation 46).


Because ũi=Ziui, and {tilde over (w)}i=Ziwi, the optimal solution for the problems represented by equations 32 and 32 are readily obtained according to equation 47 below:






u
i
=Z
iioptΓi2−I)−1Ziwi  (Equation 47).


In other words, given the transmission signals from all other antennas, the optimal transmission scheme from the i-th antenna can be identified. With this knowledge, the optimal solution for the problem represented by equation 30 can be solved. Using equations 27 and 47, the KKT conditions of the problem represented by equation 30 can be represented according to equation 48 below:






u
i
=Z
iioptΓi2−I)−1Ziwi, i=1, . . . ,Nt  (Equation 48)


where wi and Di are defined in equations 25 and 26, respectively, Zi and Γi are defined in equation 33, and ρiopt is the Lagrange multiplier for the problem represented by equation 35 that can be found numerically. Although the optimization variables in the problem represented by equation 30 are xki and yki, the KKT conditions in equation 48 are described in terms of ui for simplicity because of the mapping U=X+jY. Because the problem represented by equation 30 is convex and satisfies Slater condition, the KKT conditions in equation 48 are both necessary and sufficient for optimality.


Based on equation 48, the present disclosure provides an iterative algorithm to find the optimal solution. The iterative algorithm is described in Table 3 below.









TABLE 3







Optimal MIMO transmission with per-antenna power constraints


1. Initialize U to a complex matrix such that Σk=1Nsuki2 = pi, i = 1, ... , Nt.


2. In the n-th iteration, update ui sequentially as follows:


 For (i = 1 ... , Nt,


  Calculate wi, Di, Zi, and Γi as in equations 25, 26, and 33.


  Solve for the problem represented by equation 35 numerically


using either the Newton's method as described in equation 40 or the


bisection method as described in equation 45.


  Update ui using equation 47 with ρiopt being the Lagrange


multiplier for the problem represented by equation 35.


 ) End


3. Check convergence. If yes, stop; if not, return to Step 2.









The iterative algorithm illustrated in Table 3 converges to an optimal solution for the problem represented by equation 20. For example, in the update for the i-th antenna in each iteration, ui is updated such that the contribution to mutual information from the i-th antenna is maximized, without reducing the mutual information contribution from all other antennas. As a result, the mutual information does not decrease through the iterations. In addition, the mutual information is bounded. As a result, the objective function will converge to a certain limit. When the objective function stops increasing, the condition ui=ZiioptΓi2−I)−1Ziwi is met by all antennas i=1, . . . , Nt. Therefore, the KKT conditions are met, and the solution is optimal.


For practical purposes, since maximizing the achievable mutual information is desirable, the change rate of the mutual information across iterations can be used as an indication of convergence. For example, denote the achieved mutual information after the n-th iterations as







f


(
n
)


=

log






I

N
s


+


1

σ
2




U


(
n
)





FU




(
n
)






.






Convergence of the algorithm can be determined according to equation 49 if












f


(
n
)


-

f


(

n
-
1

)




f


(
n
)



<

ɛ
.





(

Equation





49

)







The convergence criteria should only be applied between iterations, not between the updates for different antenna elements within an iteration, as it is possible that convergence has not been reached although the update for a single antenna element does not increase the objective function. To evaluate how fast the algorithm converges, the Mutual Information Gap for the n-th iteration can be defined according to equation 50 below:










τ


(
n
)


=




f


(

)


-

f


(
n
)




f


(

)



.





(

Equation





50

)







where ƒ(∞) denotes the maximum achievable mutual information given the element-wise power constraints.



FIG. 7 illustrates a process for transmitting signals using multiple antennas that takes into account per-antenna power constraints in accordance with various embodiments of the present disclosure. For example, the process depicted in FIG. 7 may be performed by the controller 412 and the transmitter 402 in FIG. 4. The process may be used in transmitting in uplink or downlink communication. For example, the transmitter 402 may be located in a base station, relay station or user equipment.


The process begins by identifying a transmission scheme to use in transmitting the signals (step 705). For example, in step 705, the process may determine whether a single or multiple streams of data need to be transmitted. For single stream transmission, the process may select the algorithm described in Table 1. For multiple stream transmission, the process may select the algorithm described in Table 2. For optimal MIMO transmission with per-antenna power constraints, the process may select the algorithm described in Table 3.


The process then initializes the precoding algorithm to a complex matrix (step 710). Thereafter, the process iteratively processes the precoding algorithm on a per-antenna basis (step 715). The process then sequentially updates a precoder for each of the plurality of antennas (step 720). For example, in step 720, the process updates the precoder for each antenna in one iteration.


Thereafter, the process determines whether the precoding algorithm has converged (step 725). For example, in step 725, the process may determine whether the precoding algorithm converged based on a change in a rate of mutual information across iterations as described in equation 49 above.


If the process determines that the precoding algorithm has not converged, the process returns to step 715 and continues to iteratively process the precoding algorithm on the per-antenna basis. For example, the process proceeds to a next iteration of sequentially updating the precoder for each antenna. If the process determines that the precoding algorithm has converged, the process then precodes and transmits the signals (step 730), with the process terminating thereafter.


The present disclosure provides the necessary and sufficient conditions for optimal single-stream transmitter beamforming, multi-stream transmitter beamforming, and optimal MIMO transmission with per-antenna power constraints. The present disclosure also provides iterative algorithms to achieve the optimal single-stream beamforming, multi-stream beamforming, and MIMO transmission solutions with per-antenna power constraints. The present disclosure shows that these algorithms converge to the optimal solutions. These algorithms are generally applicable to MIMO and beamforming with any per-antenna power constraints and make no assumption on the channel. Simulation studies show that the optimal beamforming and transmission schemes with per-antenna power constraints achieve mutual information close to the channel capacity without the per-antenna power constraints. On average, the iterative algorithm achieves more than 99% of the maximum achievable mutual information after 3 iterations.


Although the present disclosure has been described with an exemplary embodiment, various changes and modifications may be suggested to one skilled in the art. It is intended that the present disclosure encompass such changes and modifications as fall within the scope of the appended claims.

Claims
  • 1. A method for transmitting signals in a wireless communication system, the method comprising: initializing a precoding algorithm to a complex matrix, the precoding algorithm for precoding signals transmitted by a plurality of antennas;iteratively processing the precoding algorithm on a per-antenna basis by, at each iteration, sequentially updating a precoder for each of the plurality of antennas;after each iteration, determining whether the precoding algorithm has converged based on a change in a rate of mutual information across iterations; andin response to determining that the precoding algorithm converged, transmitting the signals using the precoding algorithm.
  • 2. The method of claim 1 further comprising: in response to determining that the precoding algorithm has not converged, continuing to iteratively process the precoding algorithm on the per-antenna basis at a next iteration.
  • 3. The method of claim 1, wherein sequentially updating the precoder of each of the plurality of antennas comprises: sequentially updating the precoder (ui) of an i-th antenna according to: ui=Zi(ρioptΓi2−I)−1Zi†wi where Zi is a unitary matrix with columns being singular vectors of Di, where ρiopt is a scalar, where Γi2 is a diagonal matrix with diagonal entries being singular values of Di, where I is an identity matrix, where Zi† is a conjugate transpose of Zi, where wi is a complex vector for the i-th antenna, and where Di is a positive semi-definite matrix derived based on the channel matrix and a current value of the precoder.
  • 4. The method of claim 3, wherein initializing the precoding algorithm comprises initializing the precoding algorithm to the complex matrix such that Σk=1Ns∥uki∥2=ρi, where Ns is a number of streams to be transmitted, uki is a precoder for a k-th stream and the i-th antenna, and pi is a power constraint on the i-th antenna.
  • 5. The method of claim 1 further comprising: identifying a number of streams to be transmitted;in response to identifying a single stream to be transmitted, sequentially updating the precoder (vi) of an i-th antenna according to: vi=ψ(Σk≠iƒikvk)·√{square root over (pi)}where ψ(w) represents a phase of a complex variable w, where ƒki is an element of F at a k-th row and an i-th column, F is a matrix obtained by multiplying the complex matrix by a conjugate transpose of the complex matrix on the left, vk is a precoder at the k-th antenna, and pi is a power constraint on the i-th antenna.
  • 6. The method of claim 5, wherein initializing the precoding algorithm comprises initializing the precoding algorithm to the complex matrix such that ∥vi∥2=pi.
  • 7. The method of claim 1 further comprising: identifying a number of streams to be transmitted;in response to identifying multiple streams to be transmitted, sequentially updating the precoder (vik) of an i-th antenna and a k-th stream according to: vikψ(Σj≠igijkvjk)·√{square root over (pik)}where ψ(w) represents a phase of a complex variable w, gijk is the (i,j)-th entry of Gk with Gk being a matrix derived from the channel matrix and a current value of the precoder, vjk is a current value of the precoder at a j-th antenna and a k-th stream, and pik is a power constraint on the k-th stream and the i-th antenna.
  • 8. The method of claim 7, wherein initializing the precoding algorithm comprises initializing the precoding algorithm to the complex matrix such that ∥vik∥2=pik.
  • 9. An apparatus configured to transmit signals in a wireless communication system, the apparatus comprising: a controller configured to initialize a precoding algorithm to a complex matrix, the precoding algorithm for precoding signals transmitted by a plurality of antennas; iteratively process the precoding algorithm on a per-antenna basis by, at each iteration, sequentially updating a precoder for each of the plurality of antennas; and after each iteration, determine whether the precoding algorithm has converged based on a change in a rate of mutual information across iterations;a precoding unit configured to, in response to a determination that the precoding algorithm converged, precode the signals using the precoding algorithm; andthe plurality of antennas configured to transmit the precoded signals.
  • 10. The apparatus of claim 9, wherein the controller is configured to, in response to determining that the precoding algorithm has not converged, continue to iteratively process the precoding algorithm on the per-antenna basis at a next iteration.
  • 11. The apparatus of claim 9, wherein to sequentially update the precoder of each of the plurality of antennas, the controller is further configured to sequentially update the precoder (ui) of an i-th antenna according to: ui=Zi(ρioptΓi2−I)−1Zi†wi where Zi is a unitary matrix with columns being singular vectors of Di, where ρiopt is a scalar, where Γi2 is a diagonal matrix with diagonal entries being singular values of Di, where I is an identity matrix, where Zi† is a conjugate transpose of Zi, where wi is a complex vector for the i-th antenna, and where Di is a positive semi-definite matrix derived based on the channel matrix and a current value of the precoder.
  • 12. The apparatus of claim 9, wherein to initialize the precoding algorithm, the controller is further configured to initialize the precoding algorithm to the complex matrix such that Σk=1Ns∥uki∥2=pi, where Ns is a number of streams to be transmitted, uki is a precoder for a k-th stream and the i-th antenna, and pi is a power constraint on the i-th antenna.
  • 13. The apparatus of claim 9, wherein the controller is configured to identify a number of streams to be transmitted; and in response to identifying a single stream to be transmitted, sequentially update the precoder (vi) of an i-th antenna according to: vi=ψ(Σk≠iƒikvk)·√{square root over (pi)}where ψ(w) represents a phase of a complex variable w, where ƒki is an element of F at a k-th row and an i-th column, F is a matrix obtained by multiplying the complex matrix by a conjugate transpose of the complex matrix on the left, vk is a precoder at the k-th antenna, and pi is a power constraint on the i-th antenna.
  • 14. The apparatus of claim 13, wherein to initialize the precoding algorithm, the controller is further configured to initialize the precoding algorithm to the complex matrix such that ∥vi∥2=pi.
  • 15. The apparatus of claim 9, wherein the controller is configured to identify a number of streams to be transmitted; and in response to identifying multiple streams to be transmitted, sequentially update the precoder (vik) of an i-th antenna and a k-th stream according to: vik=ψ(Σj≠igijkvjk)·√{square root over (pik)}where ψ(w) represents a phase of a complex variable w, gijk is the (i,j)-th entry of Gk with Gk being a matrix derived from the channel matrix and a current value of the precoder, vjk is a current value of the precoder at a j-th antenna and a k-th stream, and pik is a power constraint on the k-th stream and the i-th antenna.
  • 16. The apparatus of claim 15, wherein to initialize the precoding algorithm, the controller is further configured to initialize the precoding algorithm to the complex matrix such that ∥vik∥2=pik.
  • 17. A system comprising: a transmitter configured to: initialize a precoding algorithm to a complex matrix, the precoding algorithm for precoding signals transmitted by a plurality of antennas;iteratively process the precoding algorithm on a per-antenna basis by, at each iteration, sequentially updating a precoder for each of the plurality of antennas;after each iteration, determine whether the precoding algorithm has converged based on a change in a rate of mutual information across iterations; andin response to a determination that the precoding algorithm converged, transmit the signals using the precoding algorithm; anda receiver configured to receive the transmitted signals.
  • 18. The system of claim 17, wherein the transmitter is configured to, in response to determining that the precoding algorithm has not converged, continue to iteratively process the precoding algorithm on the per-antenna basis at a next iteration.
  • 19. The system of claim 17, wherein to sequentially update the precoder of each of the plurality of antennas, the transmitter is further configured to sequentially update the precoder (ui) of an i-th antenna according to: ui=Zi(ρioptΓi2−I)−1Zi†wi where Zi is a unitary matrix with columns being singular vectors of Di, where ρiopt is a scalar, where Γi2 is a diagonal matrix with diagonal entries being singular values of Di, where I is an identity matrix, where Zi† is a conjugate transpose of Zi, where wi is a complex vector for the i-th antenna, and where Di is a positive semi-definite matrix derived based on the channel matrix and a current value of the precoder.
  • 20. The system of claim 19, wherein to initialize the precoding algorithm, the transmitter is further configured to initialize the precoding algorithm to the complex matrix such that Σk=1Ns∥uki∥2=pi, where Ns is a number of streams to be transmitted, uki is a precoder for a k-th stream and the i-th antenna, and pi is a power constraint on the i-th antenna.
CROSS-REFERENCE TO RELATED APPLICATION(S) AND CLAIM OF PRIORITY

The present application is related to U.S. Provisional Patent Application No. 61/529,575, filed Aug. 31, 2011, entitled “METHODS AND APPARATUS TO TRANSMIT AND RECEIVE SIGNALS USING MULTIPLE ANTENNAS”; U.S. Provisional Patent Application No. 61/531,469, filed Sep. 6, 2011, entitled “METHODS AND APPARATUS FOR BEAMFORMING USING MULTIPLE ANTENNAS WITH ELEMENT-WISE POWER CONSTRAINTS”; U.S. Provisional Patent Application No. 61/533,644, filed Sep. 12, 2011, entitled “TRANSMISSION SCHEMES FOR MULTI-ANTENNA SYSTEMS WITH PER-ANTENNA POWER CONSTRAINTS”; and U.S. Provisional Patent Application No. 61/540,284, filed Sep. 28, 2011, entitled “METHODS FOR TRANSMITTING SIGNALS IN MULTI-ANTENNA SYSTEMS WITH PER-ANTENNA POWER CONSTRAINTS”. Provisional Patent Application Nos. 61/529,575, 61/531,469, 61/533,644 and 61/540,284 are assigned to the assignee of the present application and is hereby incorporated by reference into the present application as if fully set forth herein. The present application hereby claims priority under 35 U.S.C. §119(e) to U.S. Provisional Patent Application Nos. 61/529,575, 61/531,469, 61/533,644 and 61/540,284.

Provisional Applications (4)
Number Date Country
61529575 Aug 2011 US
61531469 Sep 2011 US
61533644 Sep 2011 US
61540284 Sep 2011 US