The present application relates to the filed of power control in communication systems. More specifically, it relates to decentralized power control of a multi-antenna access point using game theory.
Power control is typically employed in uplink wireless channels in order to guarantee a sufficient strength of the user's signal while limiting its interfering effect on signals belonging to other users. Optimal power control mechanisms require the access point (AP) to be able to directly control the power transmitted by mobile stations (MSs). This direct control cannot be guaranteed in some wireless networks, such as in systems complying with the cognitive radio principle, where competitive behavior is expected to be predominant.
Heretofore, game theory, a mathematical framework thoroughly investigated and employed in economic field, has not been used as a paradigm for modeling the performance of wireless networks that involve multiple nodes (i.e., MSs) not controlled by some central authority. As these independent nodes (players in the game-theoretic jargon) have goals that are usually in conflict with each other, their selfish behavior might lead to extremely poor network performance. Game theory allows to predict the possible outcomes of interaction (game) between the competitive MSs, in terms of Nash Equilibria (NE). Therefore, it can define a set of rules to be enforced on the players that would lead to more desirable outcome.
Prior power control systems and methods in wireless communication networks have not include decentralized power control and game theory.
According to one aspect described in the present application, decentralized power control is provided in a wireless network using game theory. Although the mobile stations (MSs) are not directly controlled by the access point (AP), the game they participate in, along with its Nash Equilibria (NE), is strongly dependent on the network parameters set by the AP (for example, available bandwidth and number of AP antennas). Therefore, an exemplary system design requires the AP to set those parameters in a manner that provokes the most desirable power allocation (NE) from the MSs. The framework where one agent (set of MSs) acts subject to the strategy that the other agent (AP) chose (with the latter aware that his action is observed), is referred to as a Stackelberg game. Moreover, the corresponding optimal pair of system parameters and power allocation is referred to as a Stackelberg Equilibrium (SE).
According to another aspect described in the present application, there are two network models. The first model assumes that the MSs' actions are dictated by the transmission power minimization under minimum capacity (transmission rate) constraints, while the second model is concerned with maximizing the power efficiency of the MSs. The service provider (AP) is consumer-oriented, and it aims at maximizing the users' preferences, while saving on investments such as bandwidth and network infrastructure (namely, AP antennas).
According to yet still another aspect, a method for decentralized power control of a multi-antenna access point in a wireless network includes setting network parameters as N, number of access point antennas, and G, processing gain, receiving a Nash Equilibrium for each of a plurality of mobile stations, and calculating network parameters with the received Nash Equilibrium to maximize a long-term revenue function.
According to yet still another aspect, a system for decentralized power control of a multi-antenna access point in a wireless network includes an access point antenna having a processor programmed to calculate network parameters and a plurality of mobile stations. Each of the plurality of mobile stations have a processor programmed to calculate a Nash Equilibrium of a non-cooperative game K, P {ui ()} played by the plurality of mobile stations. The processor of the access point antenna calculates network parameters to maximize a long-term revenue function based on the Nash Equilibrium from the plurality of mobile stations.
According to further still another aspect, a system having a set of single antenna mobile stations transmitting in the same time frequency towards an access point using asynchronous code-division access with a processing gain includes an access point and a set of mobile stations. The access point maximizes a long-term revenue function which depends on number of antennas and processing gain. The set of mobile stations in communication with the access point receives the number of antennas and processing gain from the access point and each of the set of mobile stations provides a Nash Equilibrium for the long-term revenue function based on the number of antennas and processing gain. The access point calculates a number of antennas and processing gain together with the Nash Equlibrium for each of the set of mobile stations that maximize the long-term revenue function.
These and other features, aspects and advantages of the present invention will become apparent from the following description, appended claims, and the accompanying exemplary embodiments shown in the drawings, which are briefly described below.
Exemplary embodiments are described below with reference to the accompanying drawings. It should be understood that the following description is intended to describe exemplary embodiments of the invention, and not to limit the invention.
A set of K single-antenna MSs transmit in the same time-frequency resource towards an AP with transmission powers Pi, i=1, . . . , K, using asynchronous code-division access with processing gain G≧1. The set of all transmission powers is P=(P1, P2, PK)Tε where is the set of allowed MSs' powers, and the maximum transmission power per user is denoted as Pmax. The AP is equipped with N (receiving) antennas, and the independent identically distributed (iid) complex Gaussian channel gains between ith MS and jth AP antenna are denoted as hij. Using a vector notation, the set of channels between user i and N antennas is hi=(hi1, . . . , hiN)T, while the set of all channel gains is given by N×K matrix H=(h1, h2, . . . , hK). Matched filtering (MF) is used at the AP with no interference cancellation. White Gaussian noise at any of the AP antennas is independent, with single-sided power spectral density No. Interference coming from other users' signals is modelled as Gaussian noise. Assuming that the station are sending “Gaussian codewords” and, without loss of generality, that the used bandwidth is G Hz, the maximum achievable rate for the ith MS, Ci (in bit/sec), can be written as:
C
i(P,H,N,G)=log2(1+SINRi), (1)
where the Signal to Noise plus Interference Ratio for the ith MS, SINRi, at the output of the MF is easily shown to be:
In (1), the achievable rate Ci is dependent on the set of transmission powers P, channel gain matrix H and the parameters set by the AP, N and G.
In a communication system, shown in
The goal of each MS is to maximize its own (instantaneous) utility function ui(P;N,G,H), i=1, . . . , K, defined as to reflect MS's preferences, usually in terms of achievable transmission rate and/or consumed power. The degree of freedom of each MS, say ith, is its transmission power Pi, while the parameters N and G, and the channel matrix H, are given. To emphasize this point, the notation ui(Pi,P−i;N,G,H), is used where P−i stands for the vector containing all but the ith element of P (i.e., it denotes the set of other MSs' strategies). Furthermore, the MSs are independent and behave in a selfish and rational manner, with goals typically in direct conflict. The whole set of MSs can be presented as one entity that receives as input the network parameters set by the AP (N and G), and produces an output defined by a Nash Equilibrium (NE). {circumflex over (P)}(N, G,H)=({circumflex over (P)}1, {circumflex over (P)}2, . . . , {circumflex over (P)}K)T, of the non-cooperative game κ,ρ,{ui(•)} played by MSs (see
The interaction between AP 14 and the set of MSs 12 described above can be studied in the framework of Stackelberg games. The AP 14 represents the authority of the game (Stackelberg leader), playing the first move by setting the network parameters (N and G) towards the aim of increasing its revenue function ρ(N,G). The MSs 12 on the other side (Stackelberg follower) respond with the NE {circumflex over (P)}(N,G,H) of their non-cooperative game. In principle, this interchange of parameters and MS game outcomes continues until the Stackelberg Equilibrium (SE) is reached, i.e., until the AP 14 finds the set of parameters (N and G) that, together with the corresponding NEs of the MS game, maximize its long-term (i.e., average over channel fading H) revenue function ρ(N,G).
In the following, two game models are presented. In the first game, the MSs (follower) tackle the problem of minimizing the transmission power under minimum transmission rate constraint, while in the second they aim at the (unconstrained) maximization of power efficiency (bit/sec/W). For each game, the AP optimizes the network utility (in terms of collective MSs' preferences) per invested system resource, i.e., per antenna and bandwidth. Performance of the considered distributed models is assessed by comparison with the corresponding centralized scenarios.
Minimizing the Power Under Capacity Constraints
For given network parameters N and G, the goal of the MSi is to minimize its own transmission power Pi under minimum transmission rate constraint, Ci,min:
minimize Pi,i=1, . . . , K
subject to Ci(P,H,N,G)≧Ci,min, (3)
P1ε[0,Pmax],
This problem can be formulated as the non-cooperative power control game (NPG) K,P,{ui(Pi,P−i)}), where K={1, 2, . . . , K} denotes the set of K players (MSs), the players' set of strategies P reads
P={P|P
iε[0,Pmax],Ci(P,H,N,G)≧Ci,min,∀iεK}, (4)
and the ith player's utility function is defined as
u
i(Pi,P−i)=−Pi,iεK. (5)
Notably, the strategy sets for different users are coupled according to (4). Furthermore, the parameters set by the AP, i.e., N and G, and the channel gains H, influence the game through its constraints and not through its utility ui(Pi,P−i). In a game theory framework, a strictly concave utility function is preferred, such that Equation (5) is replaced with:
u
i(Pi,P−i)=−log2 Pi,iεK, (6)
where the base 2 of the log function is chosen purely for the sake of consistency with the definition of capacity (1).
The NPG K,P,{ui(•)}), is shown to be an (exact) potential game. Analysis of the game, namely the assessment, existence and uniqueness of NEs, can be simplified for the class of potential games. For a strategic game, say (Ki,Pi,{uii(•)}), to be a potential game, there needs to exist a function Ui:Pi→R such that for all iεK! and (Pii,Pi−i), (Pii,Pi−i)εPi, it satisfies either uii(Pii,Pi−i)−uii(Piii,Pi−i)=Ui(Pii,Pi−i), in which case it is called an exact potential game; or uii(Pii,Pi−i)−uii(Piii,Pi−i)>0=Ui(Piii,Pi−i)>0, in which case it is an ordinal potential game. The function Ui(•) is called a potential function. The NPG (K,P,U), has the following potential function:
Assuming the optimization problem (3) is feasible, the set of strategies P is compact. Furthermore, U(P) is a continuous and strictly concave function on the interior of P. It follows that a strategy Popt that maximizes the potential U(P), Popt=arg maxPU(P), is also a NE of the NPG (K, P,{ui(•)}). Furthermore, since the set P is also convex (in fact, it is a cone), the optimal Popt and therefore the NE, {circumflex over (P)}(N, G,H)=Popt, is unique.
Both Gauss-Seidel and Jacobi algorithms, implementing best response, better response or the gradient projection rule, are guaranteed to reach the NE of the potential game at hand. Here, the Gauss-Seidel algorithm is detailed with the best response rule. The MSs play sequentially, and at the (t+1)th iteration the ith MS updates its transmission power following:
P
i
t+1=min(Pi*,Pmax) (8)
where Pi* is the minimum power satisfying the constraint Ci=Ci,min (see (1) and (2)):
The converging point of the algorithm is the NE strategy set {circumflex over (P)}(N,G,H), where {circumflex over (P)}(N,G,H)=Popt.
The revenue function of the AP accounts for the preferences of the service provider, e.g., profit (if it is charging the users for the service while investing in equipment) or quality of service (measured in SINR ratios, achievable rates, the probability of error, etc.). Here, it is assumed that a service provider, following the users' interest, strives to minimize the total power expenditure. However, it is also interested in reducing the cost of the two primary resources: number of antennas and bandwidth. The following revenue function measures the overall average network utility per system resource:
The expectation EH[•] is taken with respect to fading, since decentralized power control by the MSs is operated according to the instantaneous channel realization, while the system optimization is based on (long-term) channel statistics. Note that the revenue function in (10) depends on the NE of the MS game {circumflex over (P)}i, which in turn is a function of parameters N and G, set by the AP.
In some embodiments, the AP is able to control optimally not only the network parameters N and G, but also the MS transmission power, P(N,G,H), toward the goal of maximizing (10) (where the NE {circumflex over (P)}(N,G,H) is substituted with the variable P(N,G,H)). From the discussion above, the decentralized solution of the power control (NE) for given N and G is the one that maximizes the potential (7). Comparing (7) with (10), it can be seen that decentralized and centralized solution coincide in this case.
The system performance results in this section are obtained for the following parameters: E[hij|2]=1,Ci,min=1 bit/sec, Pmax=2 W and the average Signal to Noise Ratio (defined as SIN R for N=1, K=1 and P=Pmax) is S N R=13 dB.
The revenue function versus the number of antennas N for G=2 and different number of users K is presented in
The optimal network parameter N (or G) set by the AP in the SE for fixed G (or N), is presented in
Maximizing the Power Efficiency
Instead of minimizing the power under the minimum transmission rate constraint as above, here MSs' preference is considered the maximization of power efficiency:
Under the assumption of selfish and rational MSs, problem (11) can be cast as a non-cooperative power control game (NPG) (K,P,{ui(•)}), where K={1, 2, . . . , K} denotes the set of K players (MSs), the players' set of strategies P reads
P={P|Piε[0,Pmax],∀iεK},
and the ith player's utility function is defined as
While this utility function strongly reflects the pragmatic preferences of the MSs, it needs a slight modification in order to avoid singularity at Pi=0, while preserving quasi-concavity on P:
where Pc could be any conveniently chosen constant (for instance, it could account for the power consumed by electronic circuitry of MS). The utility defined in (13) depends on AP parameters N and G, as well as the channel gains H.
In order to reach the NE, the Jacobi algorithm is used, where all the users update their strategy in a parallel fashion using the Newton's method:
where α is some conveniently chosen small number and dit is chosen as
The convergence point of the algorithm is the NE of the game, {circumflex over (P)}(N, G,H).
The AP has preferences compatible with the MSs. Therefore, it aims at maximizing the (overall) power efficiency, averaged over fading, while accounting for the resource expenditure:
For the centrally optimal solution, the problem maximizes the revenue function (15), by assuming that the AP can also control the set of the MSs' powers P(N,G,H). Therefore, the maximization is carried out with respect to G,N and P(N,G,H). This task can be performed numerically. As shown below, in this case the decentralized solution has degraded performance as compared to the centralized scenario.
The Detailed Description provides an analysis of an exemplary design of a multi-antenna access point with decentralized power control in the uplink channel. The optimal solution, in terms of number of antennas and bandwidth, has been studied by modeling the problem as a Stackelberg game between the access point and competitive mobile stations. In this framework, it has been shown that a larger number of users motivates the provider (i.e., access point) to invest, as the overall performance enhancement well balances the costs. It was discussed, however, that in certain decentralized scenarios the system may not efficiently cope with large amount of user. Furthermore, the well-known tradeoff between system resources, bandwidth and antennas, was confirmed.
The foregoing description of exemplary embodiments has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the present invention to the precise form disclosed, and modifications and variations are possible in light of the above teachings or may be acquired from practice of the present invention. The embodiments were chosen and described in order to explain the principles of the present invention and its practical application to enable one skilled in the art to utilize the present invention in various embodiments and with various modifications as are suited to the particular use contemplated.
The present application claims priority to U.S. Provisional Application Ser. No. 60/874,143 entitled “Method and System for Optimal Design of a Multi-Antenna Access Point with Decentralized Power Control Using Game Theory” filed on Dec. 11, 2006, and incorporated herein by reference in its entirety.
The present application relates to research supported in part by the National Science Foundation under Grant CNS-0626611. The United States government may have rights in the work.
Number | Date | Country | |
---|---|---|---|
60874143 | Dec 2006 | US |