Radio communication systems provide users with the convenience of mobility along with a rich set of services and features. This convenience has spawned significant adoption by an ever growing number of consumers as an accepted mode of communication for business and personal uses in terms of communicating voice and data (including textual and graphical information). A continual challenge in such communication systems involve allocating resources, while minimizing transmission power levels and ensuring fairness among the users (or terminals). These parameters and competing interests result in a computationally “hard” problem.
Therefore, there is a need for an approach to provide a method and system for efficiently allocating resources in a communication system. A utility-based resource management scheme is provided by exploiting optimal queue and channel state information using a scheduler associated with an algorithm, to schedule optimized traffic with minimum transmit power during communications.
According to one embodiment of the invention, a method comprises determining queuing state information of a queue. The method also comprises determining channel state information of a channel of a wireless network; and allocating transmission resources to a plurality of wireless devices based on the queuing state information and the channel state information. Each of the wireless devices is configured to operate over the wireless network. The allocation is performed according to a utility-based cross-layer resource management framework that transforms a utility function into an equivalent bipartite graph to concurrently maximize throughput and fairness in servicing the queue.
According to another embodiment of the invention, an apparatus comprises a queue; a channel conditioner configured to determine channel state information of a channel of a wireless network. The apparatus also comprises a scheduler configured to allocate transmission resources to a plurality of wireless devices, configured to operate over the wireless network, based on state information of the queue and the channel state information. The allocation is performed according to a utility-based cross-layer resource management framework that transforms a utility function into an equivalent bipartite graph to concurrently maximize throughput and fairness in servicing the queue.
According to yet another embodiment of the invention, a method comprises receiving a resource allocation over a wireless network, wherein the resource allocation is among a plurality of resource allocations performed according to a utility-based cross-layer resource management framework that transforms a utility function into an equivalent bipartite graph to concurrently maximize throughput and fairness in servicing a queue. The resource allocation is based on the queuing state information and channel state information correspond to a channel of the wireless network.
According to an exemplary embodiment, an apparatus comprises a processor configured to receive a resource allocation over a wireless network, wherein the resource allocation is among a plurality of resource allocations performed according to a utility-based cross-layer resource management framework that transforms a utility function into an equivalent bipartite graph to concurrently maximize throughput and fairness in servicing a queue. The resource allocation is based on the queuing state information and channel state information correspond to a channel of the wireless network.
Still other aspects, features, and advantages of the invention are readily apparent from the following detailed description, simply by illustrating a number of particular embodiments and implementations, including the best mode contemplated for carrying out the invention. The invention is also capable of other and different embodiments, and its several details can be modified in various obvious respects, all without departing from the spirit and scope of the invention. Accordingly, the drawings and description are to be regarded as illustrative in nature, and not as restrictive.
The embodiments of the invention are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings:
An apparatus, method, and software for providing resource allocation in a communication network are disclosed. In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the invention. It is apparent, however, to one skilled in the art that the embodiments of the invention may be practiced without these specific details or with an equivalent arrangement. In other instances, well-known structures and devices are shown in block diagram form in order to avoid unnecessarily obscuring the embodiments of the invention.
Although the embodiments of the invention are discussed with respect to a wireless packet data network, it is recognized by one of ordinary skill in the art that the embodiments of the inventions have applicability to any type of communication system and other equivalent data transmission structures.
As shown in
The channel conditioner 115 is configured to detect channel condition, and also to perform necessary baseband processing, including processing any digitized received signal to extract the information or data bits conveyed in the received signal, typically including demodulation, decoding, and error correction operations. Alternatively, feedback can be used from sub-carrier channel measurements taken by the mobiles in order to estimate the channel for scheduling purposes.
In this exemplary embodiment, each sector of antenna array 117 comprises multiple antenna elements that enable antenna array 117 to use well-known beamforming techniques to implement multiple access signal over the OFDMA channels, for example, communicating concurrently in the same frequency bandwidth to multiple spatially separated subscribed mobile stations 101 using independent spatially directed beams.
Traditionally, significant focus has been placed on instantaneous throughput maximization at each decision period (or epoch) to address the multi-user water-filling problem that achieves Shannon capacity under power constraints at each decision epoch. The approach is throughput-optimal in both the short and long term when each user has an infinite or sufficient amount of data in the buffer. However, when the user does not have sufficient data in the buffer, this approach actually fails to consider the impact of the allocation decision to the future state of the system. By contrast, the approach, according to certain embodiments of the invention, provides realistic resource allocation policies in consideration of queue occupancies.
Other conventional systems are more fully described in G. Song and Y. (G.) Li, entitled “Cross-layer optimization for OFDM wireless networks Part I—theoretical framework,” IEEE Trans. On Commun., vol. 4, No. 1, March 2005, pp. 614-624[1]; G. Song and Y. (G.) Li, entitled “Cross-layer optimization for OFDM wireless networks Part II—Algorithm Development,” IEEE Trans. On Commun., vol. 4, No. 1, March 2005, pp. 625-634[2]; S. Kittipiyakul and T. Javidi, entitled “A fresh look at subcarrier allocation in OFDMA systems,” 43rd IEEE Conference on Decision and Control, Volume 3, pp. 3289-3294, December 2004[3]; Z. Jiang, Y Ge and Y. (G.) Li, entitled “Max-utility wireless resource management for best effort traffic,” IEEE Transactions on Communications, vol. 47, no. 6, pp. 884-895, June 1999. [4]; all of which are incorporated herein by reference in their entireties.
Generally, with wireless applications, a reliable data transmission rate, r, is the most important factor in determining the satisfaction of users. Therefore, a utility function of the data rates, U(r), should be a non-decreasing function of its argument, as it measures the user's satisfaction with his assigned data rate. In particular, in traditional network optimization, where the objective is to maximize the aggregate throughput, the utility is a linear function of the data rate—i.e., U(r)=r. However, in general, a utility function is simply a concave “nonlinear” function of the data rate or other, measurable network resources (such as transmit power level).
The utility functions can serve as an optimization objective for the adaptive physical and MAC layer techniques. Consequently, they can be used to optimize radio resource allocation for different applications and to bridge the physical, MAC, and higher layers. In practice, utility functions cannot always be obtained through theoretical derivations, but may be estimated from subjective surveys. For example, for best effort traffic, a well-accepted utility function that is derived based on survey material is
U(r)=0.16+0.8 ln(r−0.3) (1)
where r is in units of k b/s.
In order to prevent assigning too much resource to the user with good channel conditions, the utility function is typically selected such that its slope decreases with an increase in the data rate. This prevents users that have been assigned a high proportion of the network resources from always receiving at the same level—thereby providing a mechanism to balance access amongst all users.
In the conventional case, for which U(r)=r and the goal is to maximize the sum throughput
for all users that are in the active set, i ε Θ, the slope of the utility curve is a constant value of one for each user (i.e. U′(r)=1). Consequently, for a fixed transmit power level, maximum-utility is achieved by assigning each sub-carrier to the user with the best channel quality, which may result in a long-term policy that unfairly allocates resources to users who typically have good channel quality and starve those users whose channel conditions are systematically less favorable (e.g., those user who are located at the cell edge). However, it is the “derivative” of the utility function, U′(ri) that actually controls the threshold of comparison for the distribution of resources. Hence, by using a non-linear utility function whose slope decreases with an increase in of the desired service, a more equitable resource distribution policy can be constructed.
The invention, according to certain embodiments, provides an improvement over the conventional systems by considering a more practical transmission scheme in which the packets from multiple users can be, for instance, time-multiplexed for transmission over the same sub-carrier during each downlink frame. Adaptive modulation and encoding is used to determine the coding rate for each packet, which depends on the channel quality of each user. Hence, each downlink frame may include a variable number of symbols.
An approach is provided for addressing the problem of how to fairly and efficiently allocate resources (sub-carriers) in a radio communication system, such as an OFDMA (Orthogonal Frequency Division Multiple Access) system, in order to simultaneously maximize the long-term average packet throughput and fairly service the user's queues. In addition, the approach can also be utilized to determine the minimum transmit power necessary to reliably send the packets during each downlink OFDMA frame.
OFDMA, also referred to as Multi-User-OFDM, is being considered as a modulation and multiple access method for 4th generation wireless networks. OFDMA is an extension of Orthogonal Frequency Division Multiplexing (OFDM), which is currently the modulation of choice for high speed data access systems such as IEEE 802.11a/g wireless LAN (WiFi) (Wireless Fidelity) and IEEE 802.16a/d/e wireless broadband access systems (WiMAX) (Worldwide Interoperability for Microwave Access). OFDMA allows multiple users to transmit simultaneously on the different subcarriers.
In other words, certain exemplary embodiments of the invention provide resource allocation and scheduling for an OFDMA broadband wireless network through a utility-based cross-layer resource management framework that exploits queue and channel state information at the base station in order to schedule best effort traffic during each frame. For example, during each decision epoch, an efficient and fair sub-carrier allocation policy is determined for maximizing a concave utility function of the long-term average service rates (packet throughputs), queue occupancies and minimum required downlink transmit power. That is, the joint optimization problem of downlink transmit power, fair queue servicing and sub-carrier allocation is considered.
As will be made more apparent later, various exemplary embodiments of the invention address a utility-based maximum weighted matching on a bipartite graph, wherein queue occupancy or waiting times can be taken into account. In addition, the problem of determining the downlink power control is also addressed. The invention, according to certain embodiments, offers flexibility that can be readily used by an OFDMA system in which multiple packets from different users (at possibly different coding rates) can be concatenated and transmitted during a single frame by the same sub-carrier.
Various exemplary embodiments of the invention provide certain key properties of the utility function that allows us to substitute a maximum weight matching optimization for the nonlinear combinatorial cross layer optimization problem. The resulting flexibility in the graph-theoretic problem space allows for consideration of a downlink scenario in which the scheduler 107 can assign multiple packets to be sent to different users over each sub-carrier during each OFDMA frame. In addition, this alternate construction permits taking queue occupancy/packet waiting times naturally into consideration. The invention, according to certain embodiments, allows different packets to be weighted individually, (for example, according to waiting time in the queue).
In accordance with one embodiment, a utility-based cross-layer PHY (Physical Layer) transceiver design and MAC (Media Access Control) scheduling framework is provided for maximizing system throughput. This approach, for example, can be applied the IEEE 802.16e (“WiMAX”) (Worldwide Interoperability for Microwave Access) standards and other (such as 3.9G) standards.
While some examples of the invention may relate most particularly to down link, the examples of the invention are not restricted for use with downlink for WiMAX OFDMA wireless networks, and are generally applicable to other types of networks. The invention, in exemplary embodiments, may also be used, with certain modifications that are known to those skilled in the art, for uplink applications as well.
Accordingly, in step 201, queuing state information is determined for consideration in the resource allocation procedure. In addition, the channel state information is determined, per step 203. Further, transmission power levels of the mobile stations 101 for reliable communications can also be factored in the resource allocation process (step 205). Next, the transmission resources (e.g., sub-carriers) are allocated, as in step 207, based on these determined factors (i.e., queuing state and channel state, and optionally transmission power levels). The mobile stations 101 subsequently transmit data over the allotted sub-carriers (step 209).
The process, in according with some embodiments, modifies the utility framework so that the scheduler 107 can fairly allocate sub-carriers in consideration of channel conditions and queue states (backlog or packet waiting time). Namely, an analytical framework for constructing an equivalent graph theoretical approach is developed to provide utility-based sub-carrier allocation. Such dual problem construction provides a natural mechanism by which queue state and transmit power information can also be exploited in the decision framework—i.e., construction of the utility-based optimization problem as an equivalent problem of Maximum Weight Matching on a bipartite graph. This approach provides a new solution space for algorithm development beyond non-linear optimization or linear programming. This approach turns an NP (Nondeterministic Polynomial)-hard problem into a problem that can be solved in polynomial time.
It is noted that many conventional approaches have been proposed to address the problem of resource allocation and scheduling for the downlink of an OFDMA broadband wireless network in which each sub-carrier can be used to transmit a variable number of packets to multiple users within each downlink frame. The packets that arrive at the base station 100 for the users are separately queued, so that each queue is associated with one and only one user. Once scheduled for transmission over a particular sub-carrier, the packets from multiple users are retrieved from the queues and then multiplexed, to be sent over the selected sub-channel during the next downlink frame. The number of packets that can be sent to any user over any particular sub-carrier is jointly determined by the channel quality and the transmit power level that is allocated to each sub-carrier.
As mentioned, during each decision epoch, an efficient and fair sub-carrier allocation policy is determined as to maximize a concave utility function of the long-term average service rates (packet throughputs for the users) and queue occupancies under constraints on the maximum power that can be used over each sub-carrier. Once the allocation is made, the minimum transmit power for reliably sending the packets over the channel, and the corresponding channel coding rate, is determined.
By way of example, the cross-layer optimization mechanism considers key aspects of the PHY and MAC layers to provide resource allocation in the network. Under this framework, the resource allocation is performed in a manner that optimally balances the fairness and efficiency of the allocation policy. In an OFDMA wireless network, this is a non-linear optimization problem which, ultimately, can result in the optimal sub-carrier assignment to different users, the determination of the uplink/downlink transmit power levels and the assignment of data rates, as well as other allocations.
According to an embodiment for implementing utility based cross-layer optimization, a utility function maps the network resources that a user utilizes into a real number that can be used as a metric to quantify this satisfaction. In this report, a utility function is used for the cross-layer optimization and for balancing the efficiency and fairness of wireless network resource allocation in an OFDMA wireless network.
The process of
As seen in the
The approach converts an NP-hard problem into one that is solvable in O(|V|3) time (where V denotes the number of vertices in the equivalent bipartite graph). This scheme also reduces the interference to other cells by taking into account queue backlogs and channel qualities for each user in the assignment of the downlink transmit power levels to each sub-carrier. This step prevents the network from wasting transmission power when there is insufficient data to send. Thus, it allows the base station 100 to make a more efficient use of its transmission power and also reduces the amount of interference power to neighboring sites.
Gradient-based scheduling is known to be an asymptotically optimal strategy for maximizing a concave utility function. The approach provides a convenient mechanism for constructing a dual graph theoretical resource allocation problem. The algorithm provides a mechanism by which each individual packets in each user's queue can be assigned a weight that may be proportional, for example, to its position in the queue or to its waiting time in the queue or to its priority in the queue. A utility-based weight can also be imposed based on the minimum amount of downlink transmit power that it takes to reliably send each packet over each sub-carrier. In the problem formulation, these weights are taken into consideration along with the utility-based metric of long-term average packet rates in order optimize the scheduling and transmit power assignment. Thus, the total utility for each user is actually a product of the individual utilities that are related to: packet throughput, downlink transmit power level and queue backlog.
The algorithm, in one embodiment, extends conventional optimization algorithms by considering an OFDMA network in which multiple users can send packets over the same sub-carrier during each frame (in non-overlapping time slots). In this formulation, the frame contains a variable number of packets from each user, which may be encoded at different data rates (the selection of which depends on the channel quality for each user).
In a weighted bipartite graph, each edge has an associated value. The graph 303 includes weights of 3 for the following edges: (X1, Y1), (X1, Y3) and (X3, Y1), while edges (X1, Y2), (X2, Y2) and (X3, Y2) have weights of 2. Weighting of 1 is seen for edges (X2, Y1) and (X3, Y3). Without loss of generality, edges of weight 0 (e.g., edge (X2, Y3)) can be added to make a complete weighted graph. As seen in the maximum weighted bipartite matching graph 305, the sum of the values of the edges in the matching have a maximal value.
A “maximum weighted matching” is a matching D such that (1) each vertex that is included in the matching has one and only one edge in the matching (i.e., one vertex in X that is included in the matching cannot be matched to more than one vertex in Y), and (2) the sum of the edge weights in the matching D is a maximum over all such matchings. Given the sum of weights for the matching D,
a maximum weighted matching satisfies:
An illustration of matched/free vertices is provided in the graph 401, a “matching” D which includes the matched vertices (X2,Y2), (X5,Y3), (X4,Y4) and (X6,Y6)—is shown. The other vertices are free. Graph 403 shows alternating paths: Y1, X2, Y2, X4, Y4, X5, Y3, and X3.
l(x)+l(y)≧w(x,y), ∀xεX, yεY. (3)
The “equality graph” with respect to a labeling, l, is G=(V, El) where
El={(x,y): l(x)+l(y)=w(x,y)} (4)
If l is feasible and D is a perfect matching in El, then D is a max-weight matching. This is known as the Kuhn-Munkres (KM) theorem.
The base station 100 has knowledge of the queue backlog and channel state conditions (e.g., per-sub-carrier) for each user, which allows the AMC (adaptive modulation coding) controller to select the appropriate modulation format and forward error correcting scheme on a frame-by-frame basis. According to one embodiment, a block fading channel model is assumed, where the channel state remains constant during a frame but varies between different frames. Essentially, the signal-to-noise ratio (SNR) experienced by user i at a particular sub-channel, k, dictates the selected modulation and encoding mode. Thus, at an encoding rate of R, bits per symbol, a packet is mapped to a symbol-block containing Np/Rn symbols.
At the physical layer, the processing unit is a frame 509, which includes pilot(s) 511 and control parts 513. One or more data fields 515 are carried with the frame 509.
It is also assumed that the transmit power allocated to the kth sub-carrier during the nth downlink frame, p(n,k), is constrained such that 0≦P(n,k)≦Pmax (k). Consequently, the total power, Ptot (n) that may be used for downlink transmissions over the nth downlink frame is also bounded as follows:
Under the above assumptions, the maximum number of packets per frame that sub-carrier k can reliably transmit to user i can be expressed as a function of the signal-to-noise ratio at that sub-carrier. For example, with adaptive Quadrature Amplitude Modulation (QAM), the number of packets that sub-carrier k can transmit for user i is given by:
β is a constant related to a targeted bit-error-rate (BER) by the relation
β indicates the difference between the SNR (Signal-to-Noise Ratio) needed to achieve a certain data transmission rate for a practical system and the theoretical limit, respectively, and is referred to as the SNR gap. N0,i (n,k) denotes the interference and background noise that is measured by the ith user during the nth frame over the kth sub-carrier.
The AMC concept is further illustrated in
Also, it is assumed that the channel is frequency flat, and remains invariant during each frame, but can vary from frame to frame. This is a block fading channel model, which is applicable to slowly-varying (quasi-static) wireless channels. AMC is adjusted on a frame-by-frame basis. Furthermore, channel state information is available at the base station 100.
The packet arrival processes 700 to users' queue backlog during each frame are independent across frames. At the base station 100, the data link layer packets that are destined for a particular user are buffered into a queue 113 until they are serviced. In an exemplary embodiment, packets are not serviced within the same frame in which they arrive.
In the queuing system 700, bi[n] denotes the instantaneous backlog of the ith user's queue at the beginning of the nth service time (frame). The queue evolution equation is given by bi[n+1]=bi[n]−min(bi[n], Qi[n])+ai[n], where Qi[n] denotes the number of packets that are removed from the queue 703 during the nth frame and ai[n] represents the total number of arrivals during the nth frame.
The scheduler 107, which makes a sub-carrier assignment once during every frame based on each user's total channel quality and queue length, does not waste service rate. Hence, the scheduler 107 is constrained not to schedule more packets to be sent to a user than there are currently backlogged in the queue 113. This constraint appears in as the term min(bi[n], Qi[n]) in the queue evolution.
The amount of transmit power that is allocated to the kth sub-carrier is upper-bounded by a maximum proportion of the total downlink transmit power which is allocated to that sub-carrier. However, the network should not waste power. Hence, if the maximum proportion of power would actually service more packets than are currently available for transmission, then the base station 100 is constrained to reduce the level to the minimum value required in order to service the queue backlogs of all of the active users.
By way of example, utility-based optimization is considered based on the long-term average packet rate, i.e., the weighted average packet throughput over many epochs, rather than an instantaneous optimization. In general, an optimal long-term rate policy trades off between the desire to get maximum throughput and balance fairness during the current epoch and the desire to achieve the same in the future. The second goal is necessary for more fully exploiting multi-user diversity.
By exploiting an exponentially weighted low-pass time-window, the average long-term data rate for user i can be updated as follows:
where ρw>0 is a fixed (small) parameter and ri[n] is the instantaneous packet rate. Therefore, the optimization problem should be expressed as maximizing the total utility with respect to average packet rates, that is
where r[n] is the packet rate vector [r1[n], r2[n], . . . , rM[n]] and Cπ(H) is the instantaneous feasible data rate region at time n, which is determined by the current channel states, H:
H=[H1[1], . . . , H1[K], . . . , HM[1], . . . , HM[K]] (10)
and the allocation constraints of the dynamic sub-carrier allocation policy, π.
As indicated by equation (8), the long term average packet rate is also a function of the instantaneous packet rate. Hence the optimization problem in equation (9) can be equivalently expressed as
This transformation of variables enables instantaneous optimization based on V(·) with respect to the “instantaneous” packet rates instead of U(·) with respect to the long-term average packet rates. The marginal utility function is given by
If the weighting factor, ρw is small enough, then
From this approximation, it can be concluded that the current marginal utility values are totally determined by the previous resource allocation. Using a first-order Taylor formula to approximate the derivative in (14) and again assuming that ρw is small, it follows that
Since all variables which are a function of the (n−1)-st frame are fixed at time n, and hence do not really affect the optimization, the optimization problem becomes one having a linear objective function as follows:
Where the substitution, U1 (
Although there are non-linear combinatorial optimization methods to solve equation (16), a different approach is taken by reformulating the utility-based rate optimization problem into a problem that can be solved using graph theory. In this equivalent problem formulation, each queue is represented as a “set” of nodes that require servicing and each sub-carrier is represented as a “set” of server nodes on a graph.
The number of packets that can be reliably sent to user i during the nth frame is simply the sum of the total number of packets that can be reliably sent over its assigned frequency set, Di, to which user i is assigned. Hence, given a frame duration of Ts, the following results
with Δf being the sub-carrier spacing (in Hz) and wi[k, n]ε{0, 1} being the indicator that sub-carrier k is one of the sub-carriers that has been allocated to user i. However, this optimization problem can be massaged so that there is more flexibility to facilitate assigning the same sub-carrier to different users during each frame. First, the number of packets that can be transmitted over the kth sub-carrier is considered—
pi[k,n]=ci[k,n]Δf·Ts. (18)
Under maximum power, the base station transmits the maximum number of packets to the ith user over the kth sub-channel. Hence, the number of packets is always upper-bounded by
denotes the maximum number of packets that can be sent.
Letting Qi[k,n] denote the actual number of packets that are serviced from the ith queue for downlink transmission over the kth sub-carrier during the nth frame, the following queuing service constraints apply:
Under these two conditions, it follows that the maximum number of packets that can be serviced from queue i during the nth frame is
mi[n]=min(maxk(pi*[k,n]),bi[n]) (21)
Thus, in the equivalent graph construction of this problem, each queue can be represented by a set of mi[n] expanded “user nodes” on a graph.
In addition to the above two conditions, the following constraint takes the network (i.e., per sub-carrier) capacity into consideration: Letting N denote the maximum number of packets (of any length after AMC) that can be transmitted by a sub-carrier during the current frame then, Qi[k, n]≦N.
Hence, each sub-carrier can be represented by a set of N expanded “server nodes” on a graph to represent the N available time slots per frame. Therefore, the optimization problem becomes
With the new formulation of equation (22), several observations are pertinent:
Although the problem construction in (22) takes many important aspects of the channel state and the queue model into consideration, there is no real mechanism in place to help “balance” the queues 113 to ensure a more equitable queue distribution length after servicing nor is there a way to take waiting time into account. In the former case, when queue occupancy is the primary concern, a non-negative factor can be introduced into the optimization which effectively weights the connectivities between the rth expanded queue node (r=1, . . . , mi[n]) and all of the expanded sub-carrier nodes to which it is connected (i.e. for which it can at least send one packet reliably) by a factor that is proportional to the number of packets waiting behind it in the queue.
In the latter case, when packet waiting time is the primary concern, a non-negative quantity can be introduced which weights the connectivities between the rth expanded queue node (r=1, . . . , mi[n]) and all of the expanded sub-carrier nodes to which it is connected (i.e., for which it can at least send one packet reliably by a factor that is proportional to the time that the packet has spent waiting in the queue 113.
Therefore, when the utility-based optimization which takes both channel state and queue occupancy into account, the objective function becomes
Conversely, when the objective involves both channel state and waiting time, the objective function can be written as
with Wir[n] being the time that the rth buffered packet has spent in the queue. It is noted that this is a novel reconstruction of the cross-layer utility optimization problem that combines a utility-based weight with a queue state-related weight in order to better address the key issues of fairness and efficiency. According to the construction of (23) and (24), it may be assumed that the transmission power level has been pre-determined (or set to the maximum level) in order to determine the per-subcarrier capacity for the upcoming transmission frame.
Alternately, the power efficiency of the downlink transmission is taken into consideration by constructing a utility function that measures the network's satisfaction with the amount of downlink power that is necessary to reliably transmit each packet over each individual sub-carrier. Letting Pi1[n, k] represent the minimum amount of power that is necessary to send one packet over the kth sub-carrier to the ith user during the nth frame and let Ui,2 (·) be a convex utility function that measures the network's satisfaction with a certain transmit power level, then, the utility function is defined as:
Ui,2(P1[n,k]) (25)
This quantity measures the network's satisfaction with having to expend a certain amount of power to transmit. Hence, the final optimization also weights each packet with this transmission-power based utility function.
This optimization problem may be solved by finding the equivalent bipartite graph construction for this problem.
An equivalent bipartite construction can be achieved as follows. First, associated with each queue i, mi[n] expanded nodes, labeled as ai1, ai2, . . . aim
U={a11, a12, . . . , a1m
If |U|≠|V|, then one can add virtual extended nodes to the set of smaller cardinality until |U|=|V|. Letting E={(aim,vkr)} represent the set of edges representing connectivities, these connectivities indicate whether the channel can be used to reliably send at least one packet during the nth frame.
Letting ψ: E, ψ(aim, vkr)=eim[n]·Ui′(
It is noted that if Pmax (k) results in a channel packet capacity that exceeds the queue back logs of all of the active users that are connected to sub-carrier k, that the network can decrease the transmit power level to the minimum required in order to reliably send all of the packets in the queues. This results in a power saving at the base station 100 and less interference to other cells (in the case of a cellular network).
Furthermore, the amount of power that it takes to send one packet to user i (i=1, . . . , 3) over the kth sub-carrier (k=1, 2) is P11[n,k]. Since User 3 (Queue #3) does not have any packets to send, Queue #3 does not have connectivity to either sub-carrier (edge weights are zero). Graph 803 shows connectivities and edge weights for this example. It is noted that User 1 (Queue #1) does not have connectivity to sub-carrier 2. This implies that, due to channel quality, that the maximum transmit power level that can be used over the 2nd sub-carrier (Pmax (2)) is insufficient for the base station 100 to send User 2 at least one packet (again, edge weights are zero).
The corresponding edge weights 805 are shown. In this exemplary scenario, the queue backlog as the queue-dependent weight is used, although also it is contemplated that packet waiting time or another suitably defined quantity can be used.
The “maximum weight matching” on the equivalent bipartite graph is determined, wherein the scheduler 107 is required to allocate the sub-carriers (or, equivalently, assign a one or zero to wir[k, m, n]) in order to maximize the weight of the assigned edges in the graph. It is noted that maximum weight matching on a bipartite graph allows only one assignment per edge, which implies that each expanded sub-carrier node can only service one packet. This constraint prevents packets from being sent simultaneously (in time) over the same sub-carrier. However, by expanding each sub-carrier into a number of expanded server nodes, the packets from different users can be time-multiplexed and sent over a single sub-carrier during a single downlink frame.
According to one embodiment, the maximum weight matching problem can be solved using the Hungarian method or another suitably defined algorithm. For illustrative purposes, the use of the Hungarian method to solve this problem is explained.
This approach involves generating an initial labeling l and matching D in El. If D is perfect, then the process ends. Otherwise, a free vertex, a ε X, is selected, and the following is set: S={a}, T=Ø.
If Nl(S)=T, then the process update the labels (forcing Nl(S)≠T) using the following set of equations:
If Nl(S)≠T, then select v ε Nl(S)−T. If v is free, then a−v is an augmenting path. The process augments D and the vertex selecting step is revisited. If v is matched (assuming a certain z), the “alternating tree” is extended as follows: S=S∪{z}, T=T∪{v}. Next, the steps associated with determining whether Nl(S)=T are repeated.
The above process is then applied in order to determine the optimal sub-carrier assignment and transmit power level, given the queue backlogs or packet waiting times in the queue.
An exemplary application is now explained.
Initially, the maximum allowed power for the downlink transmission at each sub-carrier is selected. Under this power allocation, User 1 can send three packets and is connected to (i.e. can transmit over) sub-carriers #1 and 2, User 2 can send one packet over sub-carrier 2 but cannot transmit any packets over sub-carrier 1 (due to channel quality) and User 3 can send one packet over either sub-carrier 1 or sub-carrier 2. The network's utility of the power level that it takes to reliably send each individual packet over the sub-carriers is given by: U1,2(P11[n,1])=0.8, U1,2(P11[n,2])=0.4, U2,2(P21[n,2])=0.3, U3,2(P31[n,1])=0.9, U3,2(P31[n,2])=0.7.
Hence, the weight of the edge of each extended user node, aim, to each extended sub-carrier node, k, is the product Ui′(
The initial labeling in graph 901 is as follows:
The associated equality graph is shown in
The equality edges, El are indicated as solid lines. It is noted that the dotted lines are not edges in the equality graph, but are shown here for illustrative purposes. The number of packets that can be sent over each sub-carrier is determined by assuming that the network uses Pmax(k) over the kth sub-carrier. If Pmax(k) is more than is required in order to service all of the packets that are candidates for transmission over sub-carrier k, then it is reduced to the minimum value that is necessary to service all of the candidate packets.
Edge weights and labels in graph 1001 are shown as follows:
The algorithmic steps are shown in
Upon executing the resource allocation process, sub-carrier assignment results in subcarrier #1 servicing packet 1 from Queue #1 and packet 1 from Queue #3. Subcarrier #2 transports packet 1 from Queue #2 and the packet 2 from the Queue #1.
The above assignments account for optimal power efficiency. After the final assignment, the power used to transmit over each sub-carrier can be reduced to the minimum required to reliably transmit the number of packets that have been allocated for transmission over that sub-carrier. This is true since the original power allocation to the kth sub-carrier is Pmax(k). If Pmax(k) would result in a waste of power (i.e., if it was more than sufficient to empty the user's queues for all users who could send at least one packet over sub-carrier k), then it would be reduced to the minimum required to send all of the packets that are candidates for transmission over that sub-carrier. After assigning the packets to the sub-carriers, the transmit power can be reduced once more to be the minimum required to send all of the packets that were assigned for transmission over the k-th sub-carrier.
One of ordinary skill in the art would recognize that the processes for providing cross-layer optimization may be implemented via software, hardware (e.g., general processor, Digital Signal Processing (DSP) chip, an Application Specific Integrated Circuit (ASIC), Field Programmable Gate Arrays (FPGAs), etc.), firmware, or a combination thereof. Such exemplary hardware for performing the described functions is detailed below with respect to
The computing system 2000 may be coupled via the bus 2001 to a display 2011, such as a liquid crystal display, or active matrix display, for displaying information to a user. An input device 2013, such as a keyboard including alphanumeric and other keys, may be coupled to the bus 2001 for communicating information and command selections to the processor 2003. The input device 2013 can include a cursor control, such as a mouse, a trackball, or cursor direction keys, for communicating direction information and command selections to the processor 2003 and for controlling cursor movement on the display 2011.
According to various embodiments of the invention, the processes described herein can be provided by the computing system 2000 in response to the processor 2003 executing an arrangement of instructions contained in main memory 2005. Such instructions can be read into main memory 2005 from another computer-readable medium, such as the storage device 2009. Execution of the arrangement of instructions contained in main memory 2005 causes the processor 2003 to perform the process steps described herein. One or more processors in a multi-processing arrangement may also be employed to execute the instructions contained in main memory 2005. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions to implement the embodiment of the invention. In another example, reconfigurable hardware such as Field Programmable Gate Arrays (FPGAs) can be used, in which the functionality and connection topology of its logic gates are customizable at run-time, typically by programming memory look up tables. Thus, embodiments of the invention are not limited to any specific combination of hardware circuitry and software.
The computing system 2000 also includes at least one communication interface 2015 coupled to bus 2001. The communication interface 2015 provides a two-way data communication coupling to a network link (not shown). The communication interface 2015 sends and receives electrical, electromagnetic, or optical signals that carry digital data streams representing various types of information. Further, the communication interface 2015 can include peripheral interface devices, such as a Universal Serial Bus (USB) interface, a PCMCIA (Personal Computer Memory Card International Association) interface, etc.
The processor 2003 may execute the transmitted code while being received and/or store the code in the storage device 2009, or other non-volatile storage for later execution. In this manner, the computing system 2000 may obtain application code in the form of a carrier wave.
The term “computer-readable medium” as used herein refers to any medium that participates in providing instructions to the processor 2003 for execution. Such a medium may take many forms, including but not limited to non-volatile media, volatile media, and transmission media. Non-volatile media include, for example, optical or magnetic disks, such as the storage device 2009. Volatile media include dynamic memory, such as main memory 2005. Transmission media include coaxial cables, copper wire and fiber optics, including the wires that comprise the bus 2001. Transmission media can also take the form of acoustic, optical, or electromagnetic waves, such as those generated during radio frequency (RF) and infrared (IR) data communications. Common forms of computer-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, CDRW, DVD, any other optical medium, punch cards, paper tape, optical mark sheets, any other physical medium with patterns of holes or other optically recognizable indicia, a. RAM, a PROM, and EPROM; a FLASH-EPROM, any other memory chip or cartridge, a carrier wave, or any other medium from which a computer can read.
Various forms of computer-readable media may be involved in providing instructions to a processor for execution. For example, the instructions for carrying out at least part of the invention may initially be borne on a magnetic disk of a remote computer. In such a scenario, the remote computer loads the instructions into main memory and sends the instructions over a telephone line using a modem. A modem of a local system receives the data on the telephone line and uses an infrared transmitter to convert the data to an infrared signal and transmit the infrared signal to a portable computing device, such as a personal digital assistant (PDA) or a laptop. An infrared detector on the portable computing device receives the information and instructions borne by the infrared signal and places the data on a bus. The bus conveys the data to main memory, from which a processor retrieves and executes the instructions. The instructions received by main memory can optionally be stored on storage device either before or after execution by processor.
A radio network 2100 includes mobile stations 2101 (e.g., handsets, terminals, stations, units, devices, or any type of interface to the user (such as “wearable” circuitry, etc.)) in communication with a Base Station Subsystem (BSS) 2103. According to one embodiment of the invention, the radio network supports Third Generation (3G) services as defined by the International Telecommunications Union (ITU) for International Mobile Telecommunications 2000 (IMT-2000).
In this example, the BSS 2103 includes a Base Transceiver Station (BTS) 2105 and Base Station Controller (BSC) 2107. Although a single BTS is shown, it is recognized that multiple BTSs are typically connected to the BSC through, for example, point-to-point links. Each BSS 2103 is linked to a Packet Data Serving Node (PDSN) 2109 through a transmission control entity, or a Packet Control Function (PCF) 2111. Since the PDSN 2109 serves as a gateway to external networks, e.g., the Internet 2113 or other private consumer networks 2115, the PDSN 2109 can include an Access, Authorization and Accounting system (AAA) 2117 to securely determine the identity and privileges of a user and to track each user's activities. The network 2115 comprises a Network Management System (NMS) 2131 linked to one or more databases 2133 that are accessed through a Home Agent (HA) 2135 secured by a Home AAA 2137.
Although a single BSS 2103 is shown, it is recognized that multiple BSSs 2103 are typically connected to a Mobile Switching Center (MSC) 2119. The MSC 2119 provides connectivity to a circuit-switched telephone network, such as the Public Switched Telephone Network (PSTN) 2121. Similarly, it is also recognized that the MSC 2119 may be connected to other MSCs 2119 on the same network 2100 and/or to other radio networks. The MSC 2119 is generally collocated with a Visitor Location Register (VLR) 2123 database that holds temporary information about active subscribers to that MSC 2119. The data within the VLR 2123 database is to a large extent a copy of the Home Location Register (HLR) 2125 database, which stores detailed subscriber service subscription information. In some implementations, the HLR 2125 and VLR 2123 are the same physical database; however, the HLR 2125 can be located at a remote location accessed through, for example, a Signaling System Number 7 (SS7) network. An Authentication Center (AuC) 2127 containing subscriber-specific authentication data, such as a secret authentication key, is associated with the HLR 2125 for authenticating users. Furthermore, the MSC 2119 is connected to a Short Message Service Center (SMSC) 2129 that stores and forwards short messages to and from the radio network 2100.
During typical operation of the cellular telephone system, BTSs 2105 receive and demodulate sets of reverse-link signals from sets of mobile units 2101 conducting telephone calls or other communications. Each reverse-link signal received by a given BTS 2105 is processed within that station. The resulting data is forwarded to the BSC 2107. The BSC 2107 provides call resource allocation and mobility management functionality including the orchestration of soft handoffs between BTSs 2105. The BSC 2107 also routes the received data to the MSC 2119, which in turn provides additional routing and/or switching for interface with the PSTN 2121. The MSC 2119 is also responsible for call setup, call termination, management of inter-MSC handover and supplementary services, and collecting, charging and accounting information. Similarly, the radio network 2100 sends forward-link messages. The PSTN 2121 interfaces with the MSC 2119. The MSC 2119 additionally interfaces with the BSC 2107, which in turn communicates with the BTSs 2105, which modulate and transmit sets of forward-link signals to the sets of mobile units 2101.
As shown in
The PCU 2136 is a logical network element responsible for GPRS-related functions such as air interface access control, packet scheduling on the air interface, and packet assembly and re-assembly. Generally the PCU 2136 is physically integrated with the BSC 2145; however, it can be collocated with a BTS 2147 or a SGSN 2132. The SGSN 2132 provides equivalent functions as the MSC 2149 including mobility management, security, and access control functions but in the packet-switched domain. Furthermore, the SGSN 2132 has connectivity with the PCU 2136 through, for example, a Frame Relay-based interface using the BSS GPRS protocol (BSSGP). Although only one SGSN is shown, it is recognized that that multiple SGSNs 2131 can be employed and can divide the service area into corresponding routing areas (RAs). A SGSN/SGSN interface allows packet tunneling from old SGSNs to new SGSNs when an RA update takes place during an ongoing Personal Development Planning (PDP) context. While a given SGSN may serve multiple BSCs 2145, any given BSC 2145 generally interfaces with one SGSN 2132. Also, the SGSN 2132 is optionally connected with the HLR 2151 through an SS7-based interface using GPRS enhanced Mobile Application Part (MAP) or with the MSC 2149 through an SS7-based interface using Signaling Connection Control Part (SCCP). The SGSN/HLR interface allows the SGSN 2132 to provide location updates to the HLR 2151 and to retrieve GPRS-related subscription information within the SGSN service area. The SGSN/MSC interface enables coordination between circuit-switched services and packet data services such as paging a subscriber for a voice call. Finally, the SGSN 2132 interfaces with a SMSC 2153 to enable short messaging functionality over the network 2150.
The GGSN 2134 is the gateway to external packet data networks, such as the Internet 2113 or other private customer networks 2155. The network 2155 comprises a Network Management System (NMS) 2157 linked to one or more databases 2159 accessed through a PDSN 2161. The GGSN 2134 assigns Internet Protocol (IP) addresses and can also authenticate users acting as a Remote Authentication Dial-In User Service host. Firewalls located at the GGSN 2134 also perform a firewall function to restrict unauthorized traffic. Although only one GGSN 2134 is shown, it is recognized that a given SGSN 2132 may interface with one or more GGSNs 2133 to allow user data to be tunneled between the two entities as well as to and from the network 2150. When external data networks initialize sessions over the GPRS network 2150, the GGSN 2134 queries the HLR 2151 for the SGSN 2132 currently serving a MS 2141.
The BTS 2147 and BSC 2145 manage the radio interface, including controlling which Mobile Station (MS) 2141 has access to the radio channel at what time. These elements essentially relay messages between the MS 2141 and SGSN 2132. The SGSN 2132 manages communications with an MS 2141, sending and receiving data and keeping track of its location. The SGSN 2132 also registers the MS 2141, authenticates the MS 2141, and encrypts data sent to the MS 2141.
A radio section 2215 amplifies power and converts frequency in order to communicate with a base station, which is included in a mobile communication system (e.g., systems of
In use, a user of mobile station 2201 speaks into the microphone 2211 and his or her voice along with any detected background noise is converted into an analog voltage. The analog voltage is then converted into a digital signal through the Analog to Digital Converter (ADC) 2223. The control unit 2203 routes the digital signal into the DSP 2205 for processing therein, such as speech encoding, channel encoding, encrypting, and interleaving. In the exemplary embodiment, the processed voice signals are encoded, by units not separately shown, using the cellular transmission protocol of Code Division Multiple Access (CDMA), as described in detail in the Telecommunication Industry Association's TIA/EIA/IS-95-A Mobile Station-Base Station Compatibility Standard for Dual-Mode Wideband Spread Spectrum Cellular System; which is incorporated herein by reference in its entirety.
The encoded signals are then routed to an equalizer 2225 for compensation of any frequency-dependent impairments that occur during transmission though the air such as phase and amplitude distortion. After equalizing the bit stream, the modulator 2227 combines the signal with a RF signal generated in the RF interface 2229. The modulator 2227 generates a sine wave by way of frequency or phase modulation. In order to prepare the signal for transmission, an up-converter 2231 combines the sine wave output from the modulator 2227 with another sine wave generated by a synthesizer 2233 to achieve the desired frequency of transmission. The signal is then sent through a PA 2219 to increase the signal to an appropriate power level. In practical systems, the PA 2219 acts as a variable gain amplifier whose gain is controlled by the DSP 2205 from information received from a network base station. The signal is then filtered within the duplexer 2221 and optionally sent to an antenna coupler 2235 to match impedances to provide maximum power transfer. Finally, the signal is transmitted via antenna 2217 to a local base station. An automatic gain control (AGC) can be supplied to control the gain of the final stages of the receiver. The signals may be forwarded from there to a remote telephone which may be another cellular telephone, other mobile phone or a land-line connected to a Public Switched Telephone Network (PSTN), or other telephony networks.
Voice signals transmitted to the mobile station 2201 are received via antenna 2217 and immediately amplified by a low noise amplifier (LNA) 2237. A down-converter 2239 lowers the carrier frequency while the demodulator 2241 strips away the RF leaving only a digital bit stream. The signal then goes through the equalizer 2225 and is processed by the DSP 2205. A Digital to Analog Converter (DAC) 2243 converts the signal and the resulting output is transmitted to the user through the speaker 2245, all under control of a Main Control Unit (MCU) 2203—which can be implemented as a Central Processing Unit (CPU) (not shown).
The MCU 2203 receives various signals including input signals from the keyboard 2247. The MCU 2203 delivers a display command and a switch command to the display 2207 and to the speech output switching controller, respectively. Further, the MCU 2203 exchanges information with the DSP 2205 and can access an optionally incorporated SIM card 2249 and a memory 2251. In addition, the MCU 2203 executes various control functions required of the station. The DSP 2205 may, depending upon the implementation, perform any of a variety of conventional digital processing functions on the voice signals. Additionally, DSP 2205 determines the background noise level of the local environment from the signals detected by microphone 2211 and sets the gain of microphone 2211 to a level selected to compensate for the natural tendency of the user of the mobile station 2201.
The CODEC 2213 includes the ADC 2223 and DAC 2243. The memory 2251 stores various data including call incoming tone data and is capable of storing other data including music data received via, e.g., the global Internet. The software module could reside in RAM memory, flash memory, registers, or any other form of writable storage medium known in the art. The memory device 2251 may be, but not limited to, a single memory, CD, DVD, ROM, RAM, EEPROM, optical storage, or any other non-volatile storage medium capable of storing digital data.
An optionally incorporated SIM card 2249 carries, for instance, important information, such as the cellular phone number, the carrier supplying service, subscription details, and security information. The SIM card 2249 serves primarily to identify the mobile station 2201 on a radio network. The card 2249 also contains a memory for storing a personal telephone number registry, text messages, and user specific mobile station settings.
While the invention has been described in connection with a number of embodiments and implementations, the invention is not so limited but covers various obvious modifications and equivalent arrangements, which fall within the purview of the appended claims. Although features of the invention are expressed in certain combinations among the claims, it is contemplated that these features can be arranged in any combination and order.
This application claims the benefit of the earlier filing date under 35 U.S.C. § 119(e) of U.S. Provisional Application Ser. No. 60/841,319 filed Aug. 31, 2006, entitled “Method and Apparatus For Providing Resource Allocation Using Utility-Based Cross-Layer Optimization,” the entirety of which is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
60841319 | Aug 2006 | US |