RATE ALLOCATION METHOD AND APPARATUS FOR OPTIMIZATION OF ADAPTIVE WIRELESS VIDEO STREAMING

Information

  • Patent Application
  • 20150245318
  • Publication Number
    20150245318
  • Date Filed
    February 25, 2014
  • Date Published
    August 27, 2015
Abstract
A method and an apparatus for adaptively allocating available bandwidth for network users. The method is particularly beneficial in improving a viewing experience for mobile devices accessing the network, while also maximizing the number of supported users within the network. An adaptively adjusted control parameter is used in conjunction with a utility function to assign a provisional rate for users entering the network. Based on the assigned provisional rate, the method then admits prospective new network users, if enough free capacity exists to service the new user at the provisional rate.
Description
BACKGROUND OF THE INVENTION

1. Field of the Invention


Example embodiments relate generally to wireless communication, and more particularly to a method and apparatus for adaptively allocating available bandwidth for network users. The method may be particularly applied to hypertext transfer protocol (HTTP)-based video streaming where a lower variability in rate adjustments with smoother temporal evolution of streaming rates may improve a user's viewing experience while maximizing the number of supported users.


2. Related Art


As shown in FIG. 1, an inherent characteristic of conventional wireless networks is user heterogeneity, in the sense that users 10 close to a base station 1 enjoy strong channels, while distant users 20 receive far weaker signals. To increase the quality-of-experience (QoE) for all users 10/20, ideally a higher time-average rate would be applied to all users. However, because network resources are limited, a more realistic approach is to account for the fact that users have heterogeneous channel conditions, and hence different resource requirements for a given bit rate.


Video traffic in particular is experiencing tremendous growth that is partially fueled by the proliferation of online video content and the steady expansion in transmission bandwidths. The amount of video traffic is forecast to double annually in the next several years, and is expected to account for the dominant share of wireline and wireless Internet traffic.


The huge growth in traffic volume goes hand in hand with a shift towards hypertext transfer protocol (HTTP)-based adaptive streaming mechanisms, which allow the video rate to be adjusted to the available network bandwidth. In view of these trends, it is critical to design video streaming mechanisms that use the available network resources efficiently and provide an adequate quality-of-experience (QoE) to the users. While a comprehensive video quality perception metric is hard to define, it is widely agreed that the QoE improves not only with a higher time-average rate, but also with a smoother temporal evolution. Typically a fundamental trade-off arises between these two criteria, because a higher time-average rate entails a more responsive scheme that makes more aggressive rate adjustments, compromising smoothness. Measurement experiments indicate that currently deployed schemes do not necessarily perform well in that regard, and may induce high variability and even oscillations as a result of interactions among several rate-adaptive users. This has triggered numerous proposals for enhancements, involving a variety of techniques, ranging from multipath solutions, network caching, traffic shaping and layered coding to improved bandwidth estimation at the client side in conjunction with dynamic rate selection at the server side.


The above challenges are particularly pertinent in wireless networks where the available bandwidth is not only relatively limited, but also inherently uncertain and time-varying due to fading and user mobility. On the other hand, resource allocation mechanisms in wireless cellular networks offer greater capabilities for controlling the user throughput, especially since the wireless link is commonly the bottleneck in the end-to-end path. In particular, scheduling algorithms at the base stations tend to provide support for specifying suitable target throughputs for the various users, and thus ensuring a smooth streaming rate. The proposed enhancements of HTTP-based adaptive streaming mechanisms mentioned above have exploited a wide range of approaches for mitigating the impact of variations in network bandwidth at the client and/or server, but have not leveraged the option to exercise direct control over the user throughput at a network element.


SUMMARY OF INVENTION

Example embodiments provide a method and apparatus for adaptively allocating available bandwidth for network users, especially for hypertext transfer protocol (HTTP)-based video streaming where the quality-of-experience (QoE) for viewers may be particularly impacted by variations in rate adjustments. In particular, the method may iteratively adapt a control parameter based on real-time aggregate load information for all network users, each time a network user enters or leaves the network. This control parameter may in turn be used to determine a provisional rate for prospective new users that are initially attempting to enter the network. By iteratively adapting the control parameter over time (and, in turn, iteratively adapting the provisional rate for prospective new users of the network), the aggregate long-term rate utility for the network may be optimized to ensure smoothness over longer time scales. This method may be implemented without the knowledge of channel and traffic parameters of the network users.


In one embodiment, a rate allocation method includes adapting, by a processor, a control parameter value as user equipment (UE) enter a network sector based on a total load for tagged UE of the network sector and a determination to tag a prospective new UE, wherein the tagged UEs of the network sector correspond with all current and prospective network sector UEs that entered the network sector at an instance when the processor determined that the free capacity of the network sector is larger than or equal to a capacity threshold; assigning, by the processor, a provisional rate for the prospective new UE of the network sector based on the control parameter value and a determined marginal pay-off for admitting the prospective new UE; determining, by the processor, free capacity of the network sector; admitting, by the processor, the prospective new UE if enough free capacity exists in order for the network sector to service the prospective new UE at the provisional rate, wherein this method is accomplished without the knowledge of channel and traffic statistics of the network sector.


In another embodiment a device includes a processor, configured to, adapt a control parameter value as user equipment (UE) enter a network sector based on a total load for tagged UE of the network sector and a determination to tag a prospective new UE, wherein the tagged UEs of the network sector correspond with all current and prospective network sector UEs that entered the network sector at an instance when the processor determined that the free capacity of the network sector is larger than or equal to a capacity threshold; assign a provisional rate for the prospective new UE of the network sector based on the control parameter value and a determined marginal pay-off for admitting the prospective new UE; determine free capacity of the network sector; admit the prospective new UE if enough free capacity exists in order for the network sector to service the prospective new UE at the provisional rate, wherein the processor does not require knowledge of channel and traffic statistics of the network sector.





BRIEF DESCRIPTION OF THE DRAWINGS

The above and other features and advantages of example embodiments will become more apparent by describing in detail, example embodiments with reference to the attached drawings. The accompanying drawings are intended to depict example embodiments and should not be interpreted to limit the intended scope of the claims. The accompanying drawings are not to be considered as drawn to scale unless explicitly noted.



FIG. 1 is a simplified depiction of groups of users in a conventional wireless network;



FIG. 2 is a simplified depiction of groups of user equipment (UE) in a wireless network, according to an example embodiment;



FIG. 3 is a diagram illustrating an example structure of the user equipment (UE), according to an example embodiment; and



FIG. 4 is a flowchart of a rate allocation method that determines throughput, according to an example embodiment.





DETAILED DESCRIPTION

While example embodiments are capable of various modifications and alternative forms, embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It should be understood, however, that there is no intent to limit example embodiments to the particular forms disclosed, but on the contrary, example embodiments are to cover all modifications, equivalents, and alternatives falling within the scope of the claims. Like numbers refer to like elements throughout the description of the figures.


Before discussing example embodiments in more detail, it is noted that some example embodiments are described as processes or methods depicted as flowcharts. Although the flowcharts describe the operations as sequential processes, many of the operations may be performed in parallel, concurrently or simultaneously. In addition, the order of operations may be re-arranged. The processes may be terminated when their operations are completed, but may also have additional steps not included in the figure. The processes may correspond to methods, functions, procedures, subroutines, subprograms, etc.


Methods discussed below, some of which are illustrated by the flow charts, may be implemented by hardware, software, firmware, middleware, microcode, hardware description languages, or any combination thereof. When implemented in software, firmware, middleware or microcode, the program code or code segments to perform the necessary tasks may be stored in a machine or computer readable medium such as a storage medium, such as a non-transitory storage medium. A processor(s) may perform the necessary tasks.


Specific structural and functional details disclosed herein are merely representative for purposes of describing example embodiments. This invention may, however, be embodied in many alternate forms and should not be construed as limited to only the embodiments set forth herein.


It will be understood that, although the terms first, second, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another. For example, a first element could be termed a second element, and, similarly, a second element could be termed a first element, without departing from the scope of example embodiments. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.


It will be understood that when an element is referred to as being “connected” or “coupled” to another element, it can be directly connected or coupled to the other element or intervening elements may be present. In contrast, when an element is referred to as being “directly connected” or “directly coupled” to another element, there are no intervening elements present. Other words used to describe the relationship between elements should be interpreted in a like fashion (e.g., “between” versus “directly between,” “adjacent” versus “directly adjacent,” etc.).


The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of example embodiments. As used herein, the singular forms “a,” “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises,” “comprising,” “includes” and/or “including,” when used herein, specify the presence of stated features, integers, steps, operations, elements and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or groups thereof.


It should also be noted that in some alternative implementations, the functions/acts noted may occur out of the order noted in the figures. For example, two figures shown in succession may in fact be executed concurrently or may sometimes be executed in the reverse order, depending upon the functionality/acts involved.


Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which example embodiments belong. It will be further understood that terms, e.g., those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.


Portions of the example embodiments and corresponding detailed description are presented in terms of software, or algorithms and symbolic representations of operation on data bits within a computer memory. These descriptions and representations are the ones by which those of ordinary skill in the art effectively convey the substance of their work to others of ordinary skill in the art. An algorithm, as the term is used here, and as it is used generally, is conceived to be a self-consistent sequence of steps leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of optical, electrical, or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.


In the following description, illustrative embodiments will be described with reference to acts and symbolic representations of operations (e.g., in the form of flowcharts) that may be implemented as program modules or functional processes, including routines, programs, objects, components, data structures, etc., that perform particular tasks or implement particular abstract data types and may be implemented using existing hardware at existing network elements. Such existing hardware may include one or more Central Processing Units (CPUs), digital signal processors (DSPs), application-specific integrated circuits (ASICs), field programmable gate arrays (FPGAs), computers, or the like.


It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise, or as is apparent from the discussion, terms such as “processing” or “computing” or “calculating” or “determining” or “displaying” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical, electronic quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.


Note also that the software implemented aspects of the example embodiments are typically encoded on some form of program storage medium or implemented over some type of transmission medium. The program storage medium may be any non-transitory storage medium such as magnetic (e.g., a floppy disk or a hard drive) or optical (e.g., a compact disk read only memory, or “CD ROM”), and may be read only or random access. Similarly, the transmission medium may be twisted wire pairs, coaxial cable, optical fiber, or some other suitable transmission medium known to the art. The example embodiments are not limited by these aspects of any given implementation.


Section I
Introduction

Because the key aim of the target user rates is to ensure smoothness over longer time scales, the rate allocation method according to example embodiments purposefully does not account for fast variations in channel conditions, and instead focuses on the impact of slower dynamics arising from the arrivals (service requests) and departures (service completions) of users. The target rates for the various users can be selected from either a continuous range or a discrete set, and adapted at any point in time based on the user dynamics. Admission control may also be incorporated into the rate allocation method, by allowing a user to be assigned a zero target rate upon arrival.


A sample path upper bound is first established for an objective function (the aggregate average utility reduced by a rate variation penalty). The upper bound is characterized through the optimal value of a relatively simple mathematical program, either a linear program or a concave optimization problem, depending on whether the set of admissible target rates is discrete or continuous. The upper bound is asymptotically achievable (in a ratio sense) in a regime where the offered traffic and capacity grow large in proportion. Asymptotic optimality may be achieved by ‘static’ policies which either deny service to a user upon arrival or assign it a fixed target rate (in accordance with the optimal solution of the above-mentioned mathematical program), without requiring advance knowledge of the duration.


The asymptotic optimality of static rate assignment policies provides a natural paradigm for the design of sound, lightweight approaches, only involving the calculation of the asymptotically optimal rate assignments. The latter task is elementary when all the relevant system parameters are known, but in practice several parameters will be subject to uncertainty and variation over time, raising estimation challenges and accuracy issues. Instead, the fact that the asymptotically optimal rate assignment policy exhibits a specific structure may be exploited, allowing it to be represented through just a single variable, where the load induced on the system is unity. Leveraging these two properties, an approach for learning and tracking the asymptotically optimal rate assignments by adapting a single variable so as to drive the observed load to unity may be developed. Instant example embodiments rely on observations of the occupancy state, without any explicit knowledge or estimates of the system parameters. Instant example embodiments perform well, even in scenarios with a relatively moderate amount of capacity and offered traffic.
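The idea of steering a single variable until the observed load sits at unity can be illustrated with a small sketch. Everything in it is an assumption made for illustration rather than the update rule of the example embodiments: the name `nu` for the control variable, the rule of picking the rate that maximizes `U(r) - nu * c * r`, the logarithmic utility, and the fixed step size are all hypothetical.

```python
import math

def pick_rate(nu, c, rates, utility):
    """Pick the rate with the largest net pay-off utility(r) - nu * c * r.

    Returns 0.0 (service denied) when no candidate rate has a positive
    net pay-off. `nu` is a hypothetical control variable acting as a
    price per unit of capacity.
    """
    best_rate, best_payoff = 0.0, 0.0
    for r in rates:
        payoff = utility(r) - nu * c * r
        if payoff > best_payoff:
            best_rate, best_payoff = r, payoff
    return best_rate

def update_control(nu, observed_load, step=0.05):
    """Raise nu when the observed load exceeds 1, lower it otherwise.

    The result is clipped at zero so the price never becomes negative.
    """
    return max(0.0, nu + step * (observed_load - 1.0))

# Illustrative concave utility with U(0) = 0 (an assumption).
utility = lambda r: math.log(1.0 + r)
rates = [0.5, 1.0, 2.0]

low_price_rate = pick_rate(nu=0.1, c=0.2, rates=rates, utility=utility)
high_price_rate = pick_rate(nu=5.0, c=0.2, rates=rates, utility=utility)
```

With a low value of `nu` the highest rate is selected, while a high value makes every rate unprofitable and the user is assigned a zero rate. No channel or traffic parameters other than the per-class capacity requirement enter the update, which only reacts to the observed load.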


Setting target rates at a base station is commonly supported by existing wireless scheduling algorithms, and complementary with bandwidth estimation algorithms at a client device and dynamic rate selection at a network server. The resulting throughput smoothing makes the latter tasks substantially lighter, and the joint control reduces the potential for oscillations and harmful interactions among separately controlled competing adaptive players.


Section II
Model Description and Problem Solving Formulation

Consider a single wireless downlink channel shared by streaming users from a finite set of classes indexed by s∈S. The various classes could reflect heterogeneity among users and video objects in terms of wireless channel characteristics, available compression rates, or Quality Rate (QR) dependencies. Let τi be the arrival time of the ith user, i.e., the time when the user requests a particular video, with τ1≧0. Denote by s(i) the index of the class to which the ith user belongs. Let σi be the service time of the ith user, i.e., the nominal duration of the video of interest.


Rate ri(t) is assigned to the ith user at time t≧0. Rate ri(t) should not be thought of as an instantaneous transmission rate; rather, it represents a target for the average throughput over a time scale which is long compared to the slot level on which wireless schedulers operate, and commensurate with the buffering depth at the video client. The value of ri(t) only has significance for t∈[τi, τi+σi], but for notational convenience the value may be defined for all t≧0. The rate assignments for class-s users are constrained to a finite set Rs:






$$ r_i(t) \in R_{s(i)} \cup \{0\} \quad \text{for all } t \ge 0 \tag{1} $$


The rates in Rs may correspond to the various compression rates available for videos streamed by class-s users. The constraint ri(t)=0 is not explicitly imposed for t∉[τi, τi+σi], since this will not be directly relevant in the utility optimization problem of the instant example embodiments. The capacity requirement per unit bit rate for class-s users is cs, which would depend on channel quality characteristics such as the Signal-to-Interference-and-Noise Ratio. Assuming the total capacity to be normalized to unity, the rate assignments may then satisfy the further constraint:













$$ \sum_{i=1}^{\infty} c_{s(i)}\, r_i(t) \le 1 \quad \text{for all } t \ge 0. \tag{2} $$







The quantity ρs=1/cs may be interpreted as the transmission rate of a class-s user if it were assigned the full resources.
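The normalized capacity constraint (2) can be checked with a few lines of code. The following is a minimal sketch; the class labels `near` and `far`, the numerical values of cs, and the function names are assumptions chosen only for illustration.

```python
def total_load(assignments, capacity_per_rate):
    """Sum of c_s(i) * r_i over the current rate assignments.

    assignments:       list of (class_id, rate) pairs, one per user.
    capacity_per_rate: dict mapping class_id to c_s, the capacity
                       requirement per unit bit rate of that class.
    """
    return sum(capacity_per_rate[s] * r for s, r in assignments)

def is_feasible(assignments, capacity_per_rate, tol=1e-12):
    """True when the assignments respect the unit total capacity."""
    return total_load(assignments, capacity_per_rate) <= 1.0 + tol

# Hypothetical classes: users near the base station need less capacity
# per unit bit rate than distant users.
c = {"near": 0.05, "far": 0.20}
active = [("near", 4.0), ("near", 2.0), ("far", 2.0)]
load = total_load(active, c)
```

Here the three users occupy roughly 0.7 of the capacity, so the assignment is feasible, whereas adding one more distant user at rate 2.0 would push the load to about 1.1 and violate the constraint.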


The QoE experienced by a streaming user depends on the time-average rate utility as well as the temporal variability. To achieve an optimal trade-off between these two criteria, it is assumed that the QoE metric for the ith user during the time interval [0,T], τi≦T, can be expressed as:






$$ A_i(0,T) - \gamma V_i(0,T) \tag{2a} $$


Ai(0,T) represents the time-average rate utility, Vi(0,T) is a penalty term for the rate variability, and γ is a nonnegative weight factor. Based upon this understanding, it is supposed that






$$ A_i(0,T) = \int_{t=\tau_i}^{\tau_i+\sigma_i(T)} U_{s(i)}\bigl(r_i(t)\bigr)\,dt \tag{2b} $$


where σi(T)=min{σi,T−τi} and Us(•) is a concave utility function with Us(0)=0. The function Us(•) (which can be obtained using well-known methods for QoE-based multi-stream scalable video over networks) may be chosen to reflect class-related QR dependencies, such that Us(r(t)) provides a proxy for the instantaneous quality for a class-s user's video with compression rate r(t). These QR dependencies will be video-dependent, where a ‘high-detail’ video with a lot of scene changes (e.g., an action movie) will have a ‘steeper’ QR dependency compared to a ‘slower’ video.


Vi(0,T) may be an arbitrary nonnegative function of {ri(t)}t∈[0,T], with the property that Vi(0,T)=0 when ri(t)≡ri for all t∈[τi, τi+σi(T)]. For example, Vi(0,T) could represent the time standard deviation of quality over the interval [0,T], as shown below:











$$ V_i(0,T) = \left( \frac{1}{\sigma_i(T)} \int_{t=\tau_i}^{\tau_i+\sigma_i(T)} \left( U_{s(i)}\bigl(r_i(t)\bigr) - \frac{A_i(0,T)}{\sigma_i(T)} \right)^{2} dt \right)^{1/2} \tag{2c} $$







The metric Ai(0,T)−γVi(0,T) may fit into any well-known video QoE model.
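For a rate trajectory that is piecewise constant, the integrals in (2b) and (2c) reduce to weighted sums, which makes the trade-off between average utility and variability easy to see. The sketch below assumes such a trajectory; the function name, the `(duration, rate)` representation, and the identity utility used in the example are illustrative choices, not part of the original description.

```python
import math

def qoe_metric(segments, utility, gamma):
    """Return (A, V, A - gamma * V) for a piecewise-constant trajectory.

    segments: list of (duration, rate) pairs covering the service interval.
    utility:  concave function U with U(0) = 0 (assumed by the caller).
    gamma:    nonnegative weight on the variability penalty.
    """
    sigma = sum(duration for duration, _ in segments)
    if sigma <= 0:
        raise ValueError("total duration must be positive")
    # Time-integrated utility, as in (2b).
    A = sum(duration * utility(rate) for duration, rate in segments)
    mean_utility = A / sigma
    # Time standard deviation of the utility, as in (2c).
    variance = sum(duration * (utility(rate) - mean_utility) ** 2
                   for duration, rate in segments) / sigma
    V = math.sqrt(variance)
    return A, V, A - gamma * V

identity = lambda r: r  # simplest concave utility with U(0) = 0
steady = qoe_metric([(10.0, 2.0)], identity, gamma=2.0)
bursty = qoe_metric([(5.0, 1.0), (5.0, 3.0)], identity, gamma=2.0)
```

Both trajectories deliver the same integrated utility of 20, but the trajectory that switches between rates 1 and 3 incurs a variability term of 1 and therefore scores 18 instead of 20 when γ=2.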


The maximum achievable net aggregate QoE during the time interval [0,T] may thus be expressed as the optimal value OPTγ(T) of the optimization problem:










$$ \max \sum_{i=1}^{I(T)} \bigl[ A_i(0,T) - \gamma V_i(0,T) \bigr], \tag{2d} $$







with I(T)=sup{i:τi≦T}, subject to the constraints:












$$ r_i(t) \in R_{s(i)} \cup \{0\} \quad \text{for all } t \in [0,T] \tag{3} $$

and

$$ \sum_{i=1}^{I(T)} c_{s(i)}\, r_i(t) \le 1 \quad \text{for all } t \in [0,T]. \tag{4} $$







Observe that OPTγ(T) is nonincreasing in γ, and in particular bounded from above by OPT0(T) (i.e., net aggregate utility) where the penalty for the rate variability is entirely dropped.


Remark 1


For non-trivial choices of Vi(0,T) (such as time variance), determining the rate assignments that achieve OPTγ(T) would require a clairvoyant or off-line optimization scheme with full knowledge of the sample path in terms of the arrival times and service times. In practice, one would use online schemes that need to make rate assignment decisions without any prior knowledge of future arrival times or residual service times. As will be shown, however, under mild assumptions, suitable online schemes asymptotically achieve the same long-term value for (1/T)OPTγ(T), for any γ>0, as clairvoyant schemes can achieve for (1/T)OPT0(T), even when the latter are exempt from a penalty for rate variability.





Remark 2


The above formulation allows for zero rate assignments, i.e., ri(t)=0 for some time epoch t∈[τi, τi+σi(T)], which may be interpreted as a service request being denied or a flow being aborted. Thus it may not be meaningful in practice to change the rate assignment from a zero to a non-zero value. In the above formulation, we do not explicitly rule out such rate changes, and achieving OPTγ(T) might require such actions in certain cases. However, the rate assignment policies that we develop never make such changes, and in fact never change the initial rate assignment at all. Yet, they asymptotically achieve the same long-term value for (1/T)OPTγ(T), for any γ>0, as schemes that are allowed to make such changes and are not penalized for rate variability can achieve for (1/T)OPT0(T).





Remark 3


Consider a myopic rate assignment policy, which at any point in time t assigns rates so as to maximize Σi:t∈[τi, τi+σi]Us(i)(ri(t)), subject to the constraints (1) and (2), disregarding the effect of rate variations. It can be shown that the utility achieved by such a myopic policy approaches OPT0(T) as γ↓0 for any given sample path with a bounded number of arrivals during [0,T].


Remark 4


Consider the class of fixed-rate assignment policies, which satisfy the further constraint that ri(t)≡ri for all t∈[τi, τi+σi(T)], and assume that Vi(0,T)>0 unless ri(t)=ri for all t∈[τi, τi+σi(T)]. It can be shown that OPTγ(T) is achievable by a policy from this class as γ→∞ for any given sample path with a bounded number of arrivals during [0,T].


Remark 5


The service time σi of the ith user could depend on the rate allocated to the user over time. For instance, poor rate allocations could lead to low quality or rebuffering, and result in the user viewing the video for a shorter duration. However, good models are needed for such user behavior and, even with good models, such systems may be difficult to analyze. Also, note that σi and the amount of time for which the ith user is served could potentially be different. For instance, if the user is allocated high rates, the user could quickly download the video and essentially leave the system. However, most of the current players try to ‘match’ the video compression rates with the allocated rates to exploit better channels for better quality. In those settings, σi serves as a good approximation of the amount of time for which the ith user is served.


Section III
Utility Upper Bounds

In order to derive upper bounds for the maximum achievable net aggregate utility (which upper bounds maximum achievable net aggregate QoE), some bounds hold in a sample path sense, but for other results, a statistical assumption may be made:


A1: class-s users arrive as a renewal process with mean interarrival time αs


Or, a stronger Markovian assumption may be made:


A2: class-s users arrive as a Poisson process of rate λs,


where either of the arrival assumptions above is used in conjunction with:


B: class-s users have i.i.d. service times with mean βs


The offered traffic associated with class s∈S is defined, under Assumptions A1 and B or A2 and B, as ρs=βs/αs or ρs=λsβs, respectively. Denote by J(t)=|{i:τi≦t<τi+σi}| the number of ‘potentially’ active users at time t. Note that the process {J(t)}t≧0 evolves over time as the number of customers in an infinite-server queue, i.e., a G/G/∞ or M/G/∞ queue under Assumptions A1 or A2, respectively.
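The offered traffic of a class is simply the arrival rate multiplied by the mean service time, whichever arrival assumption is used. A minimal sketch follows; the function name, argument names, and numerical values are assumptions made for illustration.

```python
def offered_traffic(mean_service_time, mean_interarrival=None, arrival_rate=None):
    """Offered traffic of one class under Assumption B plus A1 or A2.

    Supply exactly one of:
      mean_interarrival: alpha_s, mean time between arrivals (A1), or
      arrival_rate:      lambda_s, Poisson arrival rate (A2).
    mean_service_time is beta_s.
    """
    if (mean_interarrival is None) == (arrival_rate is None):
        raise ValueError("give exactly one of mean_interarrival, arrival_rate")
    if arrival_rate is None:
        if mean_interarrival <= 0:
            raise ValueError("mean_interarrival must be positive")
        arrival_rate = 1.0 / mean_interarrival
    return arrival_rate * mean_service_time

# Hypothetical class: videos last 120 s on average.
renewal_case = offered_traffic(120.0, mean_interarrival=30.0)   # beta / alpha
poisson_case = offered_traffic(120.0, arrival_rate=0.05)        # lambda * beta
```

In the first case one request arrives every 30 s on average, giving an offered traffic of about 4; in the second case the Poisson rate of 0.05 requests per second gives about 6. In the infinite-server interpretation, these values are the mean numbers of potentially active users of the class.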


Two scenarios may be distinguished depending on whether the rate sets Rs are (D) discrete, or (C) continuous, i.e., intervals of the form [rsmin, rsmax]. For compactness, in the discrete case, we denote by K=Σs∈SMs, with Ms=|Rs|<∞, the total number of class-rate tuples. For later use, the set I(s,T)={i:s(i)=s,τi≦T} may be defined to contain the indices of the users that belong to class s and arrive before time T.


A. Scenario D
Proposition 1:

In case of discrete rate sets, OPTγ(T) is bounded from above sample path wise by the optimal value OPT-UB-D(T) of the linear program LP-UB-D(T):







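$$ \max_{(x_{sr})_{s\in S,\, r\in R_s} \in \mathbb{R}_+^{K}} \; \sum_{s\in S} \sum_{r\in R_s} x_{sr}\, U_s(r) $$

$$ \text{subject to} \quad \sum_{r\in R_s} x_{sr} \le \sum_{i\in I(s,T)} \sigma_i \quad \forall s\in S $$

$$ \sum_{s\in S} \sum_{r\in R_s} c_s\, r\, x_{sr} \le T. $$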




Under Assumptions A1 and B,

$$ \frac{1}{T}\,\text{OPT-UB-D}(T) \to \text{OPT-UB-D} $$

in probability as T→∞, where OPT-UB-D is the optimal value of the linear program LP-UB-D:










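$$ \max_{(f_{sr})_{s\in S,\, r\in R_s} \in \mathbb{R}_+^{K}} \; \sum_{s\in S} \sum_{r\in R_s} \rho_s\, f_{sr}\, U_s(r) \tag{5} $$

$$ \text{subject to} \quad \sum_{r\in R_s} f_{sr} \le 1 \quad \forall s\in S \tag{6} $$

$$ \sum_{s\in S} \sum_{r\in R_s} c_s\, r\, \rho_s\, f_{sr} \le 1 \tag{7} $$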







Proof:


Let σir be the amount of time during [τi, τi+σi(T)] that the ith user (τi≦T) is assigned rate r∈Rs(i). Let xsr=Σi∈I(s,T)σir be the total amount of time during [0,T] that any class-s user is assigned rate r∈Rs. Noting that Us(0)=0, Ai(0,T) may be written as Σr∈Rs(i)σirUs(i)(r), and thus










$$ \sum_{i=1}^{I(T)} A_i(0,T) = \sum_{s\in S} \sum_{r\in R_s} x_{sr}\, U_s(r) $$









Invoking the rate constraints (3) and observing that Σr∈Rs(i)σir≦σi(T)≦σi, the following is obtained:














$$ \sum_{r\in R_s} x_{sr} \le \sum_{i\in I(s,T)} \sigma_i \quad \forall s\in S \tag{8} $$







Further, integrating the capacity constraints (4) over t∈[0,T], the following is obtained:













$$ \sum_{s\in S} \sum_{r\in R_s} c_s\, r\, x_{sr} \le T \tag{9} $$







Thus, any rate assignment solution that satisfies constraints (3) and (4) will satisfy constraints (8) and (9) as well. Therefore, OPT0(T) is bounded from above by the maximum value of Σs∈SΣr∈RsxsrUs(r) subject to the constraints (8) and (9), which concludes the proof of the first statement of the proposition.
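The integration step that leads from the capacity constraints (4) to constraint (9) can be spelled out as follows. This is a reconstruction of the intermediate algebra from the definitions of σir and xsr given in the proof, relying only on the rates being nonnegative and on τi+σi(T)≦T; it is not text from the original derivation.

```latex
\begin{aligned}
\sum_{s\in S}\sum_{r\in R_s} c_s\, r\, x_{sr}
  &= \sum_{i=1}^{I(T)} c_{s(i)} \sum_{r\in R_{s(i)}} r\, \sigma_{ir}
   = \sum_{i=1}^{I(T)} c_{s(i)} \int_{t=\tau_i}^{\tau_i+\sigma_i(T)} r_i(t)\, dt \\
  &\le \int_{t=0}^{T} \sum_{i=1}^{I(T)} c_{s(i)}\, r_i(t)\, dt
   \;\le\; \int_{t=0}^{T} 1\, dt = T .
\end{aligned}
```

The first inequality holds because each user's service interval lies inside [0,T] and the rates are nonnegative; the second applies the capacity constraint (4) at every time t.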


In order to establish the second statement, it is noted that, introducing variables fsr=xsr/(ρsT), the linear program LP-UB-D(T) may be equivalently stated as











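$$ \max_{(f_{sr})_{s\in S,\, r\in R_s} \in \mathbb{R}_+^{K}} \; \sum_{s\in S} \sum_{r\in R_s} \rho_s\, f_{sr}\, U_s(r) $$

$$ \text{subject to} \quad \sum_{r\in R_s} \rho_s\, f_{sr} \le \frac{1}{T} \sum_{i\in I(s,T)} \sigma_i \quad \forall s\in S $$

$$ \sum_{s\in S} \sum_{r\in R_s} c_s\, r\, \rho_s\, f_{sr} \le 1 \tag{9a} $$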







The key renewal theorem and the law of large numbers imply that

$$ \frac{1}{T} \sum_{i\in I(s,T)} \sigma_i \to \rho_s $$

in probability as T→∞.


As reflected in the proof of the above proposition, the linear program LP-UB-D may be interpreted as follows. The decision variables fsr represent the fractions of class-s users that are assigned rate r∈Rs, so their sum cannot exceed unity as expressed by the constraint (6). Further observe that ρs is the mean number of class-s users with an active request at an arbitrary epoch. Thus the objective function (5) represents the expected utility rate at an arbitrary epoch, while the constraint (7) stipulates that the ‘load’, i.e., the expected total capacity requirement, cannot exceed unity.
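The interpretation of LP-UB-D can be made concrete by evaluating the objective (5) and checking the constraints (6) and (7) for a candidate set of fractions fsr. The sketch below does only that and does not solve the linear program; the class labels, parameter values, identity utilities, and function name are all assumptions chosen for illustration.

```python
def evaluate_lp_ub_d(f, rho, c, utility, tol=1e-12):
    """Evaluate a candidate solution of LP-UB-D.

    f:       dict mapping (class_id, rate) to the fraction f_sr.
    rho:     dict mapping class_id to the offered traffic rho_s.
    c:       dict mapping class_id to the capacity requirement c_s.
    utility: dict mapping class_id to the utility function U_s.
    Returns (objective, load, feasible).
    """
    # Objective (5): expected utility rate at an arbitrary epoch.
    objective = sum(rho[s] * frac * utility[s](r) for (s, r), frac in f.items())
    # Left-hand side of (7): expected total capacity requirement.
    load = sum(c[s] * r * rho[s] * frac for (s, r), frac in f.items())
    # Left-hand side of (6): fractions assigned per class.
    per_class = {}
    for (s, _), frac in f.items():
        per_class[s] = per_class.get(s, 0.0) + frac
    feasible = (all(frac >= 0 for frac in f.values())
                and all(total <= 1.0 + tol for total in per_class.values())
                and load <= 1.0 + tol)
    return objective, load, feasible

rho = {"near": 4.0, "far": 2.0}
c = {"near": 0.05, "far": 0.20}
utility = {"near": lambda r: r, "far": lambda r: r}

# All near users get rate 2.0; half of the far users get rate 1.0.
candidate = {("near", 2.0): 1.0, ("far", 1.0): 0.5}
objective, load, feasible = evaluate_lp_ub_d(candidate, rho, c, utility)
```

For this candidate the expected utility rate is 9 and the load is about 0.6, so the solution is feasible with capacity to spare. Serving all far users at rate 4.0 instead would raise the load to about 2 and violate (7).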


Scenario C
Continuous Rate

For nonnegative real numbers r and a, denote











$$P_s(r,a)=\begin{cases} a\,U_s\!\left(\dfrac{r}{a}\right), & \text{if } a>0,\\[4pt] 0, & \text{if } a=0.\end{cases}\qquad (9b)$$







Note that Ps(•,•) is a concave function.


Proposition 2:

In case of continuous rate sets, OPT0(T) is bounded from above sample path wise by the optimal value OPT-UB-C(T) of the convex optimization problem CP-UB-C(T):











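$$\begin{aligned}
\max_{(x_s^+,\,r_s^+)_{s\in S}\in\mathbb{R}_+^{2|S|}}\quad & \sum_{s\in S} x_s^+\, U_s(r_s^+)\\
\text{sub}\quad & r_s^{\min} \le r_s^+ \le r_s^{\max} \quad \forall s\in S\\
& x_s^+ \le \sum_{i\in I(s,T)}\sigma_i \quad \forall s\in S\\
& \sum_{s\in S} c_s\, r_s^+\, x_s^+ \le T
\end{aligned}\qquad (9c)$$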







Also, under Assumption A1 and B,








$$\frac{1}{T}\,\text{OPT-UB-C}(T) \to \text{OPT-UB-C}$$





in probability as T→∞, where OPT-UB-C is the optimal value of the convex optimization problem CP-UB-C:










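$$\max_{(a_s,\,r_s)_{s\in S}\in\mathbb{R}_+^{2|S|}}\ \sum_{s\in S}\rho_s\, P_s(r_s,a_s) \qquad (10)$$

$$\text{sub}\quad a_s r_s^{\min} \le r_s \le a_s r_s^{\max} \quad \forall s\in S \qquad (11)$$

$$a_s \le 1 \quad \forall s\in S \qquad (12)$$

$$\sum_{s\in S} c_s\,\rho_s\, r_s \le 1 \qquad (13)$$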







Proof:


Let σi0 be the amount of time during [τi, τi+σi(T)] that the ith user (τi≦T) is assigned rate 0 and σi+=σi(T)−σi0 the amount of time that it is assigned some rate r∈Rs(i). Let xs0=Σi∈I(s,T)σi0 and xs+=Σi∈I(s,T)σi+. Noting that Us(0)=0, Jensen's inequality implies that:














$$\sum_{i\in I(s,T)} A_i(0,T) \le x_s^+\, U_s(r_s^+), \quad\text{with} \qquad (13a)$$

$$r_s^+ = \frac{1}{x_s^+}\sum_{i\in I(s,T)}\int_{t=\tau_i}^{\tau_i+\sigma_i(T)} r_i(t)\,dt, \qquad (13b)$$







so that rs+∈Rs. Invoking the rate constraints (3) and observing that σi+≦σi(T)≦σi, the following is obtained:











$$x_s^+ \le \sum_{i\in I(s,T)}\sigma_i \quad \forall s\in S \qquad (14)$$







Further, integrating the capacity constraints (4) over t∈[0,T], the following is obtained:













$$\sum_{s\in S} c_s\, r_s^+\, x_s^+ \le T. \qquad (15)$$







Thus, any rate assignment solution that satisfies constraints (3) and (4) will satisfy constraints (14) and (15) as well. Therefore, OPT0(T) is bounded from above by the maximum value of Σs∈Sxs+Us(rs+) subject to the constraints rs+∈Rs for all s∈S, (14) and (15), which concludes the proof of the first statement of the proposition.


In order to establish the second statement, we note that, introducing variables rs=asrs+, as=xs+/(ρsT), the convex optimization problem CP-UB-C(T) may be equivalently stated as:











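$$\begin{aligned}
\max_{(a_s,\,r_s)_{s\in S}\in\mathbb{R}_+^{2|S|}}\quad & \sum_{s\in S}\rho_s\, P_s(r_s,a_s)\\
\text{sub}\quad & a_s r_s^{\min} \le r_s \le a_s r_s^{\max} \quad \forall s\in S\\
& a_s\,\rho_s \le \frac{1}{T}\sum_{i\in I(s,T)}\sigma_i \quad \forall s\in S\\
& \sum_{s\in S} c_s\,\rho_s\, r_s \le 1.
\end{aligned}\qquad (15a)$$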







The key renewal theorem and the law of large numbers imply that








$$\frac{1}{T}\sum_{i\in I(s,T)}\sigma_i \to \rho_s$$





in probability as T→∞.


Remark 6


Inspection of the above proof shows that the upper bound OPT-UB-C(T) applies for discrete rate sets as well, and even if Ai(0,t) were replaced by







$$\sigma_i^+\, U_{s(i)}\!\left(\frac{1}{\sigma_i^+}\int_{t=\tau_i}^{\tau_i+\sigma_i(T)} r_i(t)\,dt\right).$$





However, the upper bound will not be asymptotically achievable in general for discrete rate sets.


Section IV
Asymptotic Optimality
OPT-UB-D is Asymptotically Achievable

Under Assumptions A2 and B, the upper bound OPT-UB-D for the long-term average utility rate (as provided in Proposition 1) is asymptotically achievable (in a ratio sense) in a regime where the offered traffic and transmission capacity grow large in proportion. More specifically, consider a sequence of systems, each indexed by a positive integer n. Each system has the same set of user classes S, the same rate sets Rs and the same utility functions Us(•). In the n-th system the class-s capacity requirement is cs(n)=cs/n, and the arrival rate and mean service time are such that ρs(n)=λs(n)βs(n)=nρs. Let OPT-D(n) be the maximum achievable long-term average utility rate in the n-th system, and let OPT-UB-D(n) be the optimal value of the linear program LP-UB-D for the n-th system. Since the number of ‘potentially’ active users evolves as the number of customers in an infinite-server queue, it follows that the system is regenerative. The renewal-reward theorem then implies that the long-term average utility rate Uπ(n) achieved in the n-th system by a stationary policy π exists, and can further be used to argue that OPT-D(n) is well-defined as well.


Proposition 3:

Under Assumption A2 and B,











$$\lim_{n\to\infty}\frac{\text{OPT-D}(n)}{\text{OPT-UB-D}(n)} = 1 \qquad (15b)$$







Proposition 1 implies OPT-D(n)≦OPT-UB-D(n) for all n. By definition, OPT-D(n) is no less than the long-term average utility rate Uπ(n) achieved in the n-th system by any given policy π. Also, substituting cs(n)=cs/n and ρs(n)=nρs in the linear program LP-UB-D for the n-th system yields OPT-UB-D(n)=n×OPT-UB-D. Thus it suffices to show that








$$\liminf_{n\to\infty}\frac{U^{\pi}(n)}{n\times\text{OPT-UB-D}} \ge 1$$




for some specific policy π. There are several candidate policies π for which the latter can be shown.


Option (i)
Complete Partitioning

For every class-rate pair (s,r), define Csr(n)=└nρsfsr*┘, where (fsr*)s∈S,r∈Rs is the optimal solution to the linear program LP-UB-D (where usually, only |S|+1 of these variables will in fact be non-zero, as will be shown later). Consider a policy π which assigns an arriving class-s user a nominal rate r∈Rs with probability fsr*. In the n-th system the user is then admitted and provided that rate for its entire duration, if the total number of class-s users with that rate assignment remains below Csr(n), or otherwise rejected. Note that the aggregate capacity requirement for admitted users in the n-th system is bounded from above at all times by:













$$\sum_{s\in S}\sum_{r\in R_s} c_s^{(n)}\, r\, C_{sr}^{(n)} \le \sum_{s\in S}\sum_{r\in R_s} c_s\, r\, \rho_s f_{sr}^{*} \le 1 \qquad (15c)$$







By virtue of (7), the capacity constraints are obeyed. In effect, this yields K (or typically only |S|+1) independent Erlang-B loss systems with offered loads nρsfsr*, and capacities Csr(n). Since each of these systems is critically loaded, it can be easily shown that the blocking probabilities for each of the class-rate pairs (s,r) tend to zero as n→∞. Thus, for any ε>0, the expected number of active class-s users with rate assignment r∈Rs in the n-th system will be larger than (1−ε)nρsfsr* for n sufficiently large. Noting that the utility enjoyed by each of these users is Us(r), it may be deduced that:











$$U^{\pi}(n) \ge (1-\varepsilon)\, n \sum_{s\in S}\sum_{r\in R_s}\rho_s f_{sr}^{*}\, U_s(r) = (1-\varepsilon)\, n\times\text{OPT-UB-D} \qquad (15d)$$







When n is sufficiently large,








$$\liminf_{n\to\infty}\frac{U^{\pi}(n)}{n\times\text{OPT-UB-D}} \ge 1-\varepsilon$$





is implied. Since ε>0 can be arbitrarily small, the statement follows.
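The vanishing of the blocking probability in a critically loaded Erlang-B system can be checked numerically. The following is a minimal sketch, not part of the described method: the function names and the illustrative value 0.5 for ρs fsr* are assumptions, and the standard Erlang-B recursion is used to evaluate the blocking probability of a loss system with offered load nρsfsr* and capacity ⌊nρsfsr*⌋.

```python
import math


def erlang_b(offered_load: float, servers: int) -> float:
    """Erlang-B blocking probability, computed with the standard recursion
    B(a, c) = a*B(a, c-1) / (c + a*B(a, c-1)), starting from B(a, 0) = 1."""
    blocking = 1.0
    for c in range(1, servers + 1):
        blocking = offered_load * blocking / (c + offered_load * blocking)
    return blocking


def blocking_at_scale(n: int, rho_f: float = 0.5) -> float:
    """Blocking probability for one class-rate pair in the n-th system.

    rho_f stands for rho_s * f_sr^* (an assumed illustrative value); the
    offered load is n*rho_f and the capacity is floor(n*rho_f), i.e. the
    system is critically loaded.
    """
    load = n * rho_f
    return erlang_b(load, math.floor(load))


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, blocking_at_scale(n))
```

Under these assumptions the printed blocking probabilities decrease with n, which is the behaviour the proof relies on.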


Option (ii)
Complete Sharing

Consider a policy π which assigns an arriving class-s user a nominal rate r∈Rs with probability fsr*. In the n-th system the user is then admitted and provided that rate for its entire duration, if the total capacity required by all active users remains below 1, or otherwise rejected. In effect, this yields an Erlang loss system with K (or typically only |S|+1) classes with offered loads nρsfsr* and per-user capacity requirements rcs, and total available capacity n. Since Σs∈SΣr∈Rs cs r ρs fsr*≦1 by virtue of (7), this system is at most critically loaded, and it can be easily shown that the blocking probabilities for each of the class-rate pairs (s,r), s∈S, r∈Rs, tend to zero as n→∞. The statement then follows in a similar way as in the previous case.


As reflected in the proof of the above proposition, asymptotic optimality is achieved by ‘static’ rate assignment policies, which either deny service to a user or assign it a fixed rate independent of the state of the system at the time of arrival, without requiring advance knowledge of the duration. The latter features make such policies straightforward to implement. The specific complete partitioning and complete sharing policies primarily serve the purpose of mathematical tractability, and may not necessarily be ideal from a practical perspective. The asymptotic optimality of static rate assignment policies, however, yields a useful principle for the design of heuristic approaches. The only caveat is that solving the linear program LP-UB-D requires knowledge of the relevant channel and traffic parameters, such as the per-class arrival rates λs, the interarrival times αs, and/or the class-dependent mean service times βs of users. Estimation of these channel and traffic parameters requires knowledge of the distribution of the signal strength and/or signal-to-noise-and-interference ratios in each cell of the network, as well as statistics on the type of user device (e.g., small or large screen sizes, etc.) and statistics on the type of video content requested (e.g., low or high motion), which may be cumbersome to collect. In practice, several of these channel and traffic parameters will be subject to uncertainty and variation over time, raising estimation challenges. In order to guide the design of such policies, the structural properties of the optimal solution to the linear program LP-UB-D may be examined and exploited, as described below.


OPT-UB-C is Asymptotically Achievable

Under Assumptions A2 and B, the upper bound OPT-UB-C for the long-term average utility rate as provided in Proposition 2 is asymptotically achievable (in a ratio sense) in a regime where the offered traffic and transmission capacity grow large in proportion. More specifically, let us consider a sequence of systems, each indexed by a positive integer n. Each system has the same set of user classes S, the same rate limits rsmin and rsmax, and the same utility function Us(•) as described above. In the n-th system the class-s capacity requirement is cs(n)=cs/n, and the arrival and service rate parameters are such that ρs(n)=λs(n)βs(n)=nρs. Let OPT-C(n) be the maximum achievable long-term average utility rate in the n-th system, and let OPT-UB-C(n) be the optimal value of the convex optimization problem CP-UB-C for the n-th system. As before, the regenerative nature of the system and the renewal-reward theorem ensure that the long-term average utility rate Uπ(n) achieved in the n-th system by a stationary policy π, and in particular OPT-C(n), exists.


Proposition 4:

Under Assumptions A2 and B,











$$\lim_{n\to\infty}\frac{\text{OPT-C}(n)}{\text{OPT-UB-C}(n)} = 1 \qquad (15e)$$







Proof:


Proposition 2 implies that OPT-C(n)≦OPT-UB-C(n) for all n. By definition, OPT-C(n) is no less than the long-term average utility rate Uπ(n) achieved in the n-th system by any given policy π. Also, substituting cs(n)=cs/n and ρs(n)=nρs in the convex optimization problem CP-UB-C for the n-th system yields that OPT-UB-C(n)=n×OPT-UB-C. Thus it suffices to show that








$$\liminf_{n\to\infty}\frac{U^{\pi}(n)}{n\times\text{OPT-UB-C}} \ge 1$$




for some specific policy π. There are several candidate policies π for which the latter can be shown.


Option (i)
Complete Partitioning

For every class s∈S with as*>0, define Cs(n)=└nrss┘, and consider a policy π which assigns each arriving class-s user a nominal rate rs*/as*, where (as*,rs*)s∈S is the optimal solution to the convex optimization problem OPT-UB-C. In the n-th system the user is then admitted and provided that rate for its entire duration, if the total number of active class-s users remains below Cs(n), or otherwise rejected. Note that the aggregate capacity requirement of the admitted users in the n-th system is bounded from above at all times by














$$\sum_{s\in S} c_s^{(n)}\, C_s^{(n)} \le \sum_{s\in S} c_s\,\rho_s\, r_s^{*} \le 1, \qquad (15f)$$







By virtue of (13), the capacity constraints are obeyed. In effect, this yields |S| independent Erlang-B loss systems with offered loads nρs and capacities Cs(n). It can be easily shown that the blocking probability for class s∈S tends to 1−as* as n→∞. Thus, for any ε>0, the expected number of active class-s users in the n-th system will be larger than (1−ε)nas*ρs for n sufficiently large. Noting that the utility enjoyed by each of these users is








$$U_s\!\left(\frac{r_s^{*}}{a_s^{*}}\right),$$




it may be deduced that:












$$U^{\pi}(n) \ge (1-\varepsilon)\, n \sum_{s\in S} a_s^{*}\,\rho_s\, U_s\!\left(\frac{r_s^{*}}{a_s^{*}}\right) = (1-\varepsilon)\, n\times\text{OPT-UB-C}, \qquad (15g)$$







When n is sufficiently large,








$$\liminf_{n\to\infty}\frac{U^{\pi}(n)}{n\times\text{OPT-UB-C}} \ge 1-\varepsilon$$





may be implied. Since ε>0 can be taken arbitrarily small, the statement follows.


Option (ii)
Complete Sharing with Pre-Filtering

Consider a policy π which assigns each arriving class-s user a nominal rate rs*/as* if as*>0 and zero otherwise. In the n-th system the user is then admitted and provided that rate for its entire duration with probability as*, if the total capacity required by all active users remains below 1, or otherwise rejected. In effect, this yields an Erlang loss system with |S| classes with offered loads nρsas*rs* and per-user capacity requirements rs*/as*, and total available capacity n. Since Σs∈Scsρsrs*≦1 by virtue of (13), this system is at most critically loaded, and it can be easily shown that the effective blocking probability for class s∈S tends to 1−as* as n→∞. The statement then follows in a similar way as in the previous case.


Like in the discrete-rate case, the complete partitioning and complete sharing policies are primarily considered for the sake of transparency, and may not necessarily be ideal for practical implementation purposes, but the asymptotic optimality of static rate assignment policies provides a strong basis for the design of heuristic approaches. As before, the only caveat is that solving the convex optimization problem CP-UB-C requires knowledge of various channel and traffic parameters that will be subject to uncertainty and variation over time, raising all sorts of estimation challenges. We will later revisit these issues and present parsimonious, online policies which retain the simplicity of a static rate assignment, but do not require any explicit knowledge or estimates of the traffic statistics. In order to guide the design of such policies, we will examine and exploit the structural properties of the optimal solution to the convex optimization problem CP-UB-C.


Section V
Structural Properties of (Asymptotically) Optimal Policies

As an initial step towards the design of parsimonious, online rate assignment algorithms we now proceed to examine the structural properties of the optimal solutions to the linear program LP-UB-D and the convex optimization problem CP-UB-C. For brevity, we will henceforth refer to these optimal solutions as the asymptotically optimal rate assignments.


Structure of Optimal Solution to LP-UB-D

With regard to the structure of the optimal solution to the linear program LP-UB-D, it will be convenient to use the representation Rs={rs(1), rs(2), . . . rs(Ms)}, with rs(1)<rs(2)< . . . <rs(Ms). In the case that Σs∈Scsρsrs(Ms)≦1, the optimal solution is easily seen to be







$$f_{s\,r_s^{(M_s)}} = 1$$




for all s∈S, and we henceforth assume that Σs∈Scsρsrs(Ms)>1.


For compactness, the notation ΔUsm=Us(rs(m))−Us(rs(m−1)) and ΔRsm=rs(m)−rs(m−1), with rs(0)≡0 is introduced. Define






$$Q_{sm} = \frac{\Delta U_{sm}}{c_s\,\Delta R_{sm}}, \qquad (15h)$$


which may be interpreted as a marginal payoff coefficient, i.e., the ratio of the improvement in the rate utility and the increment in the required capacity when increasing the rate of a class-s user from rs(m−1) to rs(m). The concavity of the function Us(•) implies that Qsm−1≧Qsm, m=2, . . . Ms. Denoting $x_{sm}=f_{s\,r_s^{(m)}}$, $y_{sm}=\sum_{n=m}^{M_s} x_{sn}$, and noting that Us(0)≡0, the following may be defined:

















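$$\sum_{r\in R_s} r\, f_{sr} = \sum_{m=1}^{M_s} r_s^{(m)} x_{sm} = \sum_{m=1}^{M_s} x_{sm}\sum_{n=1}^{m}\Delta R_{sn} = \sum_{m=1}^{M_s}\Delta R_{sm}\sum_{n=m}^{M_s} x_{sn} = \sum_{m=1}^{M_s}\Delta R_{sm}\, y_{sm}, \quad\text{and} \qquad (16)$$

$$\sum_{r\in R_s} f_{sr}\, U_s(r) = \sum_{m=1}^{M_s} x_{sm}\, U_s\!\left(r_s^{(m)}\right) = \sum_{m=1}^{M_s} x_{sm}\sum_{n=1}^{m}\Delta U_{sn} = \sum_{m=1}^{M_s}\Delta U_{sm}\sum_{n=m}^{M_s} x_{sn} = \sum_{m=1}^{M_s}\Delta U_{sm}\, y_{sm} \qquad (17)$$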







Thus the linear program LP-UB-D may be equivalently stated as











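$$\begin{aligned}
\max_{(y_{sm})_{s\in S,\,m\in\{1,\dots,M_s\}}\in\mathbb{R}_+^{K}}\quad & \sum_{s\in S}\rho_s\sum_{m=1}^{M_s}\Delta U_{sm}\, y_{sm}\\
\text{sub}\quad & y_{s1} \le 1 \quad \forall s\in S\\
& y_{sm} \le y_{s,m-1} \quad m=2,\dots,M_s,\ \forall s\in S\\
& \sum_{s\in S} c_s\,\rho_s\sum_{m=1}^{M_s}\Delta R_{sm}\, y_{sm} \le 1
\end{aligned}\qquad (18)$$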







The constraints ys1≦1 and ysm≦ysm−1, m=2, . . . , Ms, imply ysm≦1 for all m=1, . . . , Ms. By adopting the latter weaker constraints, the linear program LP-UB-D may be viewed as a relaxed version of a knapsack problem, for which it is well-known that the optimal solution is obtained by selecting items in decreasing order of the relative reward ratio until the capacity is exhausted. Specifically, let π(•,•) be a one-to-one mapping from (s,m)s∈S,m∈{1, . . . ,Ms} to {1, 2, . . . , K}, and with minor abuse of notation, define bπ(s,m)=csρsΔRsm, vπ(s,m)=ρsΔUsm, and zπ(s,m)=ysm. By replacing the constraints ys1≦1 and ysm≦ysm−1, m=2, . . . , Ms, by ysm≦1 for all m=1, . . . , Ms, the linear program LP-UB-D takes the form:











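$$\begin{aligned}
\max_{(z_1,\dots,z_K)\in\mathbb{R}_+^{K}}\quad & \sum_{k=1}^{K} v_k z_k\\
\text{sub}\quad & z_k \le 1 \quad k=1,\dots,K\\
& \sum_{k=1}^{K} b_k z_k \le 1
\end{aligned}\qquad (18a)$$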







which amounts to the continuous relaxation of a knapsack problem. Without loss of generality, it may be supposed that the mapping π(•,•) is such that








$$\frac{v_1}{b_1} \ge \frac{v_2}{b_2} \ge \dots \ge \frac{v_K}{b_K}.$$





Let KR be the size of the set {vk/bk:k=1, . . . , K}. Thus, KR≦K is the number of distinct values of vk/bk, or equivalently, the number of distinct values of Qsm. Let πR(•,•) be the mapping from (s,m)s∈S,m∈{1, . . . ,Ms} to {1, 2, . . . , KR} such that πR(s1,m1)<πR(s2,m2) ⇔ Qs1m1>Qs2m2. For k=1, . . . , KR, define $\bar b_k=\sum_{l\in\pi_R^{-1}(k)} b_l$, $\bar v_k=\sum_{l\in\pi_R^{-1}(k)} v_l$, and QR(k)=Qsm for (s,m)∈πR−1(k).


Denote $W=\{w\in[0,1]^{K_R}: w_k=1 \text{ if } w_{k+1}>0\}$. For a given w∈W, define






$$z_{\pi(s,m)} = y_{sm} = w_{\pi_R(s,m)}, \quad m=1,\dots,M_s,\ s\in S, \qquad (18b)$$


and











$$f_{s\,r_s^{(m)}} = y_{sm} - y_{s,m+1}, \quad m=1,\dots,M_s,\ s\in S \qquad (18c)$$







Below is an auxiliary lemma.


Lemma 1

With the above definitions,














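$$\sum_{s\in S}\sum_{r\in R_s} c_s\,\rho_s\, r\, f_{sr} = \sum_{k=1}^{K} b_k z_k = \sum_{k=1}^{K_R}\bar b_k w_k, \quad\text{and} \qquad (19)$$

$$\sum_{s\in S}\sum_{r\in R_s}\rho_s f_{sr}\, U_s(r) = \sum_{k=1}^{K} v_k z_k = \sum_{k=1}^{K_R}\bar v_k w_k. \qquad (20)$$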







The above two identities follow directly from (16) and (17) along with the definition of zπ(s,m) and ysm in terms of wπR(s,m).


The next lemma provides a useful characterization of the optimal solution to the linear program LP-UB-D. For a given w∈W, define LD(w)=Σs∈SΣr∈Rscsρsrfsr as the ‘load’ associated with w, with fsr defined in terms of w as specified above.


Lemma 2

If LD(w)=1, then the variables fsr constitute an optimal solution to the linear program LP-UB-D.


It may easily be verified that the variables fsr provide a feasible solution to the linear program LP-UB-D. In order to establish optimality, note that if LD(w)=1, then the identity (19) implies that













$$\sum_{k=1}^{K} b_k z_k = \sum_{k=1}^{K_R}\bar b_k w_k = 1 \qquad (20a)$$







Since w∈W means wk=1 if wk+1>0, it then follows that (zk*)k=1, . . . ,K with







$$z_k^{*} = w^{*}_{\pi_R(\pi^{-1}(k))}$$





is an optimal solution to the above continuous relaxation of the knapsack problem. Because the latter problem is a relaxed version of the linear program LP-UB-D, the identity (19) then ensures that the variables fsr in fact constitute an optimal solution.


As a final observation, note that if $\sum_{s\in S} c_s\rho_s r_s^{(M_s)}=\sum_{s\in S}\sum_{m=1}^{M_s} c_s\rho_s\Delta R_{sm}=\sum_{k=1}^{K_R}\bar b_k\ge 1$, then there exists a vector w*∈W for which LD(w*)=1, given by










$$w_k^{*}=\begin{cases} 1, & k < K_R^{*},\\[4pt] \left(1-\sum_{l=1}^{K_R^{*}-1}\bar b_l\right)\Big/\,\bar b_{K_R^{*}}, & k = K_R^{*},\\[4pt] 0, & k > K_R^{*},\end{cases}\qquad (21)$$







with






$$K_R^{*} = \min\left\{k: \sum_{l=1}^{k}\bar b_l \ge 1\right\} \qquad (21a)$$







The above discussion shows that determining the asymptotically optimal rate assignments is in fact no harder than solving a knapsack problem. The only caveat is that the coefficients of the knapsack problem involve various channel and traffic parameters. While the rate sets Rs and the capacity requirements cs can essentially be thought of as a matter of definition (specification of classes), the corresponding amounts of offered traffic ρs, would in practice typically involve uncertainty and variation over time, raising measurement-related trade-offs. Instead adaptive, online algorithms may be developed which do not require any explicit knowledge or estimates of the traffic statistics.
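The knapsack interpretation can be illustrated with a short greedy computation. The sketch below is an assumption-laden illustration rather than the claimed method: the dictionary layout, the function names `greedy_lp_ub_d` and `load`, and the example numbers are hypothetical. It builds the items (s,m) with weights bπ(s,m)=csρsΔRsm and payoffs Qsm, fills them in decreasing order of Qsm until the unit capacity is exhausted, and converts the resulting ysm into assignment fractions via fsrs(m)=ysm−ysm+1. Ties between equal Qsm values are broken by insertion order here instead of being grouped as in the definition of πR.

```python
def greedy_lp_ub_d(classes, capacity=1.0):
    """Greedy solution of the relaxed knapsack form of LP-UB-D.

    Each class is a dict with keys 'c' (capacity requirement), 'rho'
    (offered traffic), 'rates' (strictly increasing) and 'utils'
    (concave utility values at those rates, with U(0) = 0).
    Returns f[s][m], the fraction of class-s users assigned rates[m].
    """
    items = []
    for s, cl in enumerate(classes):
        prev_r, prev_u = 0.0, 0.0
        for m, (r, u) in enumerate(zip(cl["rates"], cl["utils"])):
            b = cl["c"] * cl["rho"] * (r - prev_r)   # weight b_k
            v = cl["rho"] * (u - prev_u)             # value v_k
            items.append((v / b, s, m, b))           # v/b equals Q_sm
            prev_r, prev_u = r, u
    # Stable sort: equal ratios keep increasing m within a class.
    items.sort(key=lambda it: -it[0])

    y = [[0.0] * len(cl["rates"]) for cl in classes]
    remaining = capacity
    for _, s, m, b in items:
        if remaining <= 1e-12:
            break
        take = min(1.0, remaining / b)
        y[s][m] = take
        remaining -= take * b

    # f_{s,m} = y_{s,m} - y_{s,m+1}, with y_{s,M_s+1} taken as 0.
    return [
        [ys[m] - (ys[m + 1] if m + 1 < len(ys) else 0.0) for m in range(len(ys))]
        for ys in y
    ]


def load(classes, f):
    """Load L_D = sum_s sum_r c_s * rho_s * r * f_sr."""
    return sum(
        cl["c"] * cl["rho"] * r * f_sm
        for cl, fs in zip(classes, f)
        for r, f_sm in zip(cl["rates"], fs)
    )
```

For instance, with two hypothetical classes `{"c": 1.0, "rho": 0.5, "rates": [1.0, 2.0], "utils": [1.0, 1.5]}` and `{"c": 1.5, "rho": 0.4, "rates": [1.0, 2.0], "utils": [1.0, 1.5]}`, the greedy pass fully serves the first class at its lowest rate and admits a fraction of the second class at its lowest rate, so that the load equals 1.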


Remark 7


Based on the insight into the structure of the optimal solution to linear program LP-UB-D, there are two further trunk reservation type policies that can be shown to be asymptotically optimal, in addition to the complete partitioning and sharing policies identified in the proof of Proposition 3.


Structure of Optimal Solution to CP-UB-C

We now examine the structure of the optimal solution to the convex optimization problem CP-UB-C. In the case that Σs∈Scsρsrsmax≦1, the optimal solution is easily seen to be rs*=rsmax for all s∈S, and we henceforth assume that Σs∈Scsρs rsmax>1. First of all, observe that the problem may be equivalently expressed as











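$$\begin{aligned}
\max_{(y_s)_{s\in S}\in\mathbb{R}_+^{|S|}}\quad & \sum_{s\in S}\rho_s\, V_s(y_s)\\
\text{sub}\quad & \sum_{s\in S}\rho_s\, y_s \le 1
\end{aligned}\qquad (21b)$$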







where Vs(ys) is the optimal value of the optimization problem










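$$\max_{(a_s,\,r_s)\in\mathbb{R}_+^{2}}\ a_s\, U_s\!\left(\frac{r_s}{a_s}\right) \qquad (22)$$

$$\text{sub}\quad a_s r_s^{\min} \le r_s \le a_s r_s^{\max} \qquad (23)$$

$$a_s \le 1 \qquad (24)$$

$$c_s\, r_s \le y_s \qquad (25)$$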







Further the function may be defined












$$\hat U_s(r)=\begin{cases}\dfrac{r}{r_s^{\min}}\, U_s\!\left(r_s^{\min}\right), & r < r_s^{\min},\\[6pt] U_s(r), & r \ge r_s^{\min}.\end{cases}\qquad (25a)$$







It may be demonstrated that Vs(ys) equals the optimal value of the optimization problem:










$$\max_{\hat r_s\in\mathbb{R}_+}\ \hat U_s(\hat r_s) \qquad (26)$$

$$\text{sub}\quad \hat r_s \le r_s^{\max} \qquad (27)$$

$$c_s\,\hat r_s \le y_s \qquad (28)$$







Moreover, the optimal solution as*, rs* of problem (22)-(25) and the optimal solution r̂s* of problem (26)-(28) are related as rs*=max{rsmin, r̂s*}, as*=min{1, r̂s*/rsmin}. In order to prove this, it may first be noted that, for a given rs, the objective function (22) is increasing in as because the function Us(•) is concave:




















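$$\frac{\partial}{\partial a_s}\left[a_s\, U_s\!\left(\frac{r_s}{a_s}\right)\right] = U_s\!\left(\frac{r_s}{a_s}\right) - \frac{r_s}{a_s}\, U_s'\!\left(\frac{r_s}{a_s}\right) \ge 0 \qquad (28a)$$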







Thus, for given rs, the objective function is optimized by maximizing as subject to the constraints (23) and (24), i.e., taking as=min{1,rs/rsmin}, where we invoke the fact that rs≦rsmax, because otherwise there is no feasible solution. Substitution then shows that the objective function may be written as Ûs(rs), and the statement follows.
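The substitution step can be checked numerically. The following minimal sketch is illustrative only: the helper names `u_hat` and `time_share` and the logarithmic utility are assumptions, not taken from the source. It evaluates Ûs(•) from its piecewise definition and verifies that asUs(rs/as) with as=min{1,rs/rsmin} gives the same value.

```python
import math


def u_hat(u, r, r_min):
    """Extended utility: linear interpolation between 0 and U(r_min)
    below r_min, and the original utility U(r) at or above r_min."""
    if r < r_min:
        return (r / r_min) * u(r_min)
    return u(r)


def time_share(r, r_min):
    """Admission fraction a = min{1, r / r_min} and the nominal rate r / a
    given to admitted users for an average per-user rate r > 0."""
    a = min(1.0, r / r_min)
    return a, r / a


if __name__ == "__main__":
    # Assumed concave utility with U(0) = 0 (illustrative choice).
    utility = lambda r: math.log1p(r)
    for rate in (0.25, 0.5, 1.0, 2.0):
        a, nominal = time_share(rate, r_min=1.0)
        print(rate, a, nominal, a * utility(nominal), u_hat(utility, rate, 1.0))
```

Below rsmin only a fraction as of the users is admitted, each at rate rsmin; at or above rsmin every user is admitted at rate rs.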


It may be concluded that the optimization problem CP-UB-C may be equivalently expressed as the optimization problem CP-UB-C-EQ given below











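$$\begin{aligned}
\max_{(\hat r_s)_{s\in S}\in\mathbb{R}_+^{|S|}}\quad & \sum_{s\in S}\rho_s\,\hat U_s(\hat r_s)\\
\text{sub}\quad & \hat r_s \le r_s^{\max} \quad \forall s\in S\\
& \sum_{s\in S} c_s\,\rho_s\,\hat r_s \le 1
\end{aligned}\qquad (29)$$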







Given an optimal solution (rs′)s∈S, the necessary KKT conditions (which in fact are also sufficient because of the concavity) imply that there exist Lagrange multipliers κ≧0 and (vs)s∈S≧0 such that












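$$\rho_s\,\hat U_s^{(-)}(r_s') \ge \kappa\, c_s\,\rho_s + v_s, \qquad (29a)$$

$$\rho_s\,\hat U_s^{(+)}(r_s') \le \kappa\, c_s\,\rho_s + v_s, \qquad (29b)$$

$$\kappa\left(\sum_{s\in S} c_s\,\rho_s\, r_s' - 1\right) = 0, \qquad (29c)$$

$$v_s\left(r_s' - r_s^{\max}\right) = 0, \qquad (29d)$$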







where Ûs(−)(•) and Ûs(+)(•) denote the left- and right-derivatives of the function Ûs(•), respectively.


The various classes may now be partitioned into four categories:





(i) s∈S1, with rs′<rsmin, so that as=rs′/rsmin, rs=rsmin, vs=0, and Ûs′(rs′)=Us(rsmin)/rsmin=κcs;  (29e)





(ii) s∈S2, with rs′=rsmin, so that as=1, rs=rsmin, vs=0, and Ûs(+)(rsmin)=Us′(rsmin)≦κcs≦Ûs(−)(rsmin)=Us(rsmin)/rsmin;  (29f)





(iii) s∈S3, with rs′∈(rsmin,rsmax), so that as=1, rs=rs′, vs=0, and Ûs′(rs′)=Us′(rs)=κcs;  (29g)





(iv) s∈S4, with rs′=rsmax, so that as=1, rs=rsmax, and Ûs′(rs′)=Us′(rs)≧κcs.  (29h)


Section VI
Design of Adaptive Rate Allocation Policies

We now discuss how the insights in the structural properties of the optimal solutions to the linear program LP-UB-D and the convex optimization problem CP-UB-C can be leveraged to design adaptive, online rate assignment algorithms.


Subsection A. “Scenario D” (Discrete Rate)

Sub-subsection A1. Preliminary Results:


We first use the structural properties (obtained in the section ‘Structure of optimal solution to LP-UB-D’, above) to establish that the asymptotically optimal rate assignment fractions fsr* only depend on the offered traffic through a single scalar variable θ*. This fact will play an instrumental role in the design of adaptive rate assignment algorithms.


Let G(•) be a function on W (as defined in the section ‘Structure of optimal solution to LP-UB-D,’ above) of the form:











$$G(w) = h_0 + \sum_{k=1}^{K_R} h_k(w_k), \qquad (29i)$$







where h0 is a constant coefficient and each of the functions hk(•) is strictly decreasing and continuous. Further denote $\theta^{\min}=G(1_{K_R})=h_0+\sum_{k=1}^{K_R}h_k(1)$ and $\theta^{\max}=G(1_0)=h_0+\sum_{k=1}^{K_R}h_k(0)$, where 1k is the KR-dimensional vector whose first k components are 1 and whose remaining KR−k components are 0, k=0, . . . , KR.


Lemma 3


The function G(•) is injective, i.e., for any θ∈[θmin,θmax], there exists a unique w(θ)=G−1(θ)∈W such that G(w(θ))=θ, which may be obtained as











$$w_k(\theta)=\begin{cases} 1, & k < K(\theta),\\[4pt] h_{K(\theta)}^{-1}\!\left(\theta - G\!\left(1_{K(\theta)}\right) + h_{K(\theta)}(1)\right), & k = K(\theta),\\[4pt] 0, & k > K(\theta),\end{cases}\qquad (29j)$$







with






K(θ)=min{k:1≦k≦KR,G(1k)≦θ}.  (29k)


It is easily verified by substitution that G(w(θ)) equals











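$$\begin{aligned}
& h_0 + \sum_{k=1}^{K(\theta)-1} h_k(1) + h_{K(\theta)}\!\left(h_{K(\theta)}^{-1}\!\left(\theta - G\!\left(1_{K(\theta)}\right) + h_{K(\theta)}(1)\right)\right) + \sum_{k=K(\theta)+1}^{K_R} h_k(0)\\
&\quad = h_0 + \sum_{k=1}^{K(\theta)} h_k(1) + \theta - G\!\left(1_{K(\theta)}\right) + \sum_{k=K(\theta)+1}^{K_R} h_k(0) = \theta.
\end{aligned}\qquad (29l)$$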







Further observe that any vector w∈W, w≠w(θ), is either component wise smaller or larger than w(θ), and strictly so in at least one component. Since each of the functions hk(•) is strictly decreasing, it then follows that G(w) is either strictly smaller or larger than G(w(θ)), i.e., G(w)≠θ.


Based on the above lemma, assignment fractions fsr(θ) may be introduced and defined in terms of the variables wk(θ) (similar to the ‘Structure of optimal solution to LP-UB-D’ section, above). The latter assignment fractions may be interpreted as follows. Users of the class(es) s∈S for which there is an ms(θ)∈{1, . . . , Ms} such that (s,ms(θ))∈πR(K(θ)) are either assigned a nominal rate rs(ms(θ)) with probability wk(θ)(θ) or rate rs(ms(θ)−1) otherwise. Users of all the other classes s∈S are always assigned a nominal rate rs(ms(θ)), where ms(θ)=max{0}∪{m:1≦m≦MsR*s,m)≦K(θ)}, which may be thought of as the largest rate rs(m) (or possibly zero) for which the marginal payoff Qsm is sufficiently high.


Define $L_D(\theta)=\sum_{s\in S}\sum_{r\in R_s} c_s\, r\,\rho_s f_{sr}(\theta)$ as the ‘load’ associated with θ, and denote by $L_D^{\max}=\sum_{s\in S} c_s\rho_s r_s^{(M_s)}$ the maximum possible load (i.e., the load if all users were assigned the highest possible rate for their respective classes). Further define





θ*=min{θ∈[θmin,θmax]:LD(θ)≦1}.  (29m)


The next lemma collects a few useful properties of the function LD(•).


Lemma 4:




(i) The function LD(•) is strictly decreasing and continuous on [θmin,θmax], with LD(θmin)=LDmax and LD(θmax)=0;





(ii) If LDmax<1, then θ*=θmin;





(iii) If LDmax≧1, then LD(θ*)=1.  (29n)


Invoking the identity (19), the following may be written











$$L_D(\theta) = \sum_{k=1}^{K_R}\bar b_k\, w_k(\theta) \qquad (29o)$$







It follows directly from the definition that each of the variables wk(θ) is continuous and decreasing in θ, and strictly so for at least one value of k for any θ. This proves Part (i). Parts (ii) and (iii) are direct consequences of Part (i). The next lemma provides a useful optimality result based on Lemmas 2 and 4.


Lemma 5

The variables fsr(θ*) constitute an optimal solution to the linear program LP-UB-D.


If LDmax<1, then Part (ii) of Lemma 4 implies that θ*=θmin, so that wk(θ*)=1 for all k=1, . . . , KR. This in turn means that








$$f_{s\,r_s^{(M_s)}}(\theta^{*}) = 1$$




for all s∈S, which was identified as the optimal solution in this case in section ‘Structure of optimal solution to LP-UB-D’, described above.


If LDmax≧1, then Part (iii) of Lemma 4 implies that LD(θ*)=1, which in turn means that LD(w(θ*))=1. It then follows from Lemma 2 that the variables fsr(θ*) are optimal.


Lemma 6

If LDmax≧1, then







$$\theta_0 = Q_{\pi_R^{-1}(K(\theta^{*}))}$$







is the optimal dual variable corresponding to constraint (7) of the linear program LP-UB-D.


For each s∈S, let







$$m_s = \min\left\{m: f_{s\,r_s^{(m)}}(\theta^{*}) > 0\right\}$$






and










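$$\eta_s=\begin{cases} 0 & \text{if } \sum_{r\in R_s} f_{sr}(\theta^{*}) < 1,\\[6pt] \rho_s\left(U_s\!\left(r_s^{(m_s)}\right) - \theta_0\, c_s\, r_s^{(m_s)}\right) & \text{otherwise.}\end{cases}\qquad (29p)$$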







We claim that (ηs)s∈S and θ0 are the optimal dual variables corresponding to the constraints (6) and (7), respectively. Indeed, the variables (ηs)s∈S and θ0 are dual feasible, and invoking the definition of fsr(θ*) and the fact that LD(θ*)=1, it can be checked that they satisfy the complementary slackness conditions:













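$$\begin{aligned}
& f_{sr}(\theta^{*})\left(\rho_s\, U_s(r) - \theta_0\, c_s\,\rho_s\, r - \eta_s\right) = 0 \quad \forall r\in R_s,\ s\in S,\\
& \theta_0\left(\sum_{s\in S}\sum_{r\in R_s} c_s\, r\,\rho_s\, f_{sr}(\theta^{*}) - 1\right) = 0,\\
& \eta_s\left(\sum_{r\in R_s} f_{sr}(\theta^{*}) - 1\right) = 0 \quad \forall s\in S,
\end{aligned}\qquad (29q)$$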







which in particular means that θ0 must be optimal.


Sub-Subsection A2. An Adaptive Rate Allocation Scheme: ARA-D


Lemma 5 reduces the problem of determining the asymptotically optimal rate assignment fractions to that of finding the value of θ* with associated load LD(θ*)=1. We now proceed to develop an adaptive scheme for performing the latter task in an online manner, without any explicit knowledge of the amounts of offered traffic ρs. For convenience, we take the function G(•) to be affine, with

$$h_0 = Q_R(0) > Q_R(1) \quad \text{and} \quad h_k(w_k) = w_k \bigl(Q_R(k) - Q_R(k-1)\bigr), \qquad (29r)$$

where QR(k)=Q(s,m) for (s,m)∈πR−1(k), 1≦k≦KR. In particular, $\theta_{\min} = G(1_{K_R}) = Q_R(K_R)$ and $\theta_{\max} = G(1_0) = Q_R(0)$. We henceforth make the further natural assumption that any rates that cannot possibly be supported have been removed from the rate sets, i.e., rs(Ms)≦ps for all s∈S. The main thrust of the adaptive scheme is to treat θ as a control parameter and use estimates of LD(θ) to drive it towards the value θ* for which LD(θ*)=1. There are various potential ways in which the value of θ can be adapted and in which LD(θ) can be estimated. The specific scheme that we develop below, termed ARA-D, has the key advantage that it naturally extends to scenarios where no discrete user classes are specified.


The ARA-D scheme is comprised of two closely coupled components: the rate assignment component ARA-DR(θ) and the θ-learning component ARA-DA. The ARA-DR(θ) component uses the θ values produced by the ARA-DA component to assign a nominal rate to each arriving user. In turn, the ARA-DA component adapts the θ values based on observations of the occupancy state generated by the rate decisions of the ARA-DR(θ) component, and serves to find the value θ* for which LD(θ*)=1.


We now proceed to describe both components of the ARA-D scheme in more detail. Denote by Nsr(t) the number of class-s users provided rate r in the system at time t, and define the free capacity at time t as

$$m(t) = 1 - \sum_{s \in S} \sum_{r \in R_s} c_s\, r\, N_{sr}(t). \qquad (29s)$$







Rate Assignment Component: ARA-DR(θ)

Given the value of the control parameter θ, ARA-DR(θ) assigns an arriving class-s user a nominal rate r∈Rs with probability fsr(θ) as defined previously. Denote by θi the value of θ used for the ith arriving user with arrival epoch τi, and denote by Ri the nominal rate assigned to that user. If Ri is either zero or larger than the capacity constraint allows for, i.e., cs(i)Ri>m(τi), then the ith user is blocked, i.e., ri(t)=0 for all t∈[τi,τi+σi]. Otherwise, the ith user is admitted and provided rate Ri for its entire duration.
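As an illustration of the admission rule just described, the following minimal Python sketch computes the free capacity of Equation (29s) from an occupancy table and applies the blocking test. The class labels, the dictionary layout and the function names are assumptions made for this example only and do not appear in the specification.

```python
from typing import Dict, Tuple

# Hypothetical containers: c maps a class label s to its capacity coefficient c_s;
# occupancy maps (s, r) to N_sr(t), the number of class-s users served at rate r.


def free_capacity(c: Dict[str, float], occupancy: Dict[Tuple[str, float], int]) -> float:
    """Free capacity m(t) = 1 - sum over s and r of c_s * r * N_sr(t), as in (29s)."""
    return 1.0 - sum(c[s] * r * n for (s, r), n in occupancy.items())


def admit(c_s: float, nominal_rate: float, m: float) -> bool:
    """Admit unless the nominal rate is zero or c_s * R_i exceeds the free capacity m."""
    return nominal_rate > 0.0 and c_s * nominal_rate <= m


c = {"far": 0.1, "near": 0.05}
occupancy = {("far", 2.0): 3, ("near", 4.0): 1}
m = free_capacity(c, occupancy)  # load is 0.1*2*3 + 0.05*4*1 = 0.8, so m is about 0.2
print(round(m, 6), admit(c["far"], 1.0, m), admit(c["far"], 3.0, m))
```

In this sketch an admitted user would simply be added to the occupancy table at its nominal rate and removed again on departure.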


Note that the assignment fractions fsr(θ) can be efficiently calculated given the value of θ for the above choice of G(•). Specifically, we have KR(θ)=min{1≦k≦KR: QR(k)≦θ}. Users of the class(es) s∈S for which there is an ms(θ)∈{1, . . . , Ms} such that (s,ms(θ))∈πR−1(KR(θ)) are assigned either a nominal rate rs(ms(θ)) with probability

$$\frac{Q_R(K_R(\theta)-1) - \theta}{Q_R(K_R(\theta)-1) - Q_R(K_R(\theta))} \qquad (29t)$$

or rate rs(ms(θ)−1) otherwise. Users of all the other classes s∈S are always assigned a nominal rate rs(ms(θ)), where ms(θ)=max({0}∪{m: 1≦m≦Ms, Qsm>θ}), i.e., the largest rate rs(m) (or possibly zero) for which the marginal payoff Qsm is strictly higher than θ, or equivalently QR(KR(θ)).
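The randomization in Equation (29t) can be sketched in a few lines of Python. The sketch assumes that the ordered marginal payoffs QR(0)>QR(1)> . . . >QR(KR) are available as a list, and that KR(θ) is the smallest index k with QR(k)≦θ; the list values and the function name are hypothetical and serve only as an illustration.

```python
from typing import Sequence, Tuple


def mixing_probability(q_r: Sequence[float], theta: float) -> Tuple[int, float]:
    """Return (K_R(theta), probability of the rate r_s(m_s(theta))) per (29t).

    q_r[k] plays the role of Q_R(k) and must be strictly decreasing in k.
    """
    if not q_r[-1] <= theta <= q_r[0]:
        raise ValueError("theta must lie in [Q_R(K_R), Q_R(0)]")
    # Assumed threshold rule: smallest k >= 1 whose marginal payoff is at most theta.
    k = next(i for i in range(1, len(q_r)) if q_r[i] <= theta)
    p = (q_r[k - 1] - theta) / (q_r[k - 1] - q_r[k])
    return k, p


q_r = [80.0, 60.0, 20.0, 5.0]  # hypothetical values of Q_R(0), ..., Q_R(3)
k, p = mixing_probability(q_r, 30.0)
print(k, p)
```

With probability p the marginal class receives rs(ms(θ)), and with probability 1−p the next lower rate rs(ms(θ)−1), so that raising θ from QR(KR(θ)) towards QR(KR(θ)−1) gradually shifts that class to the lower rate.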


Adaptive Learning Component: ARA-DA

The role of ARA-DA is to use estimates of LD(θ) to adapt the control parameter θ and drive it to the value θ* such that LD(θ*)=1. The key challenge here is that the occupancy state only provides an indication of the carried traffic, and that blocking must be properly taken into account to obtain an (unbiased) estimate of the actual offered traffic. In order to achieve that, a tagging procedure may be used, involving tagging the ith user when the free capacity m(τi) at the arrival epoch of that user is larger than mmin=maxs∈Scsrs(Ms)≦1. This choice implies that tagging is independent of the class of a user, and that every user that gets tagged will either be admitted, or assigned a zero rate and be blocked. In other words, blocked users with non-zero nominal rate assignments will never be tagged. This ensures that the nominal rates assigned to tagged users can be tracked without having to monitor blocked users.
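The tagging rule can be illustrated with a short Python sketch. The class labels and numerical values are hypothetical, and the indicator uses the non-strict comparison m(τi)≧mmin.

```python
from typing import Dict


def tagging_threshold(c: Dict[str, float], max_rate: Dict[str, float]) -> float:
    """m_min = max over classes s of c_s * r_s(M_s), the largest single-user demand."""
    return max(c[s] * max_rate[s] for s in c)


def is_tagged(m_at_arrival: float, m_min: float) -> bool:
    """A user is tagged when the free capacity seen on arrival is at least m_min."""
    return m_at_arrival >= m_min


c = {"far": 0.1, "near": 0.05}        # hypothetical capacity coefficients c_s
max_rate = {"far": 2.0, "near": 6.0}  # hypothetical highest rates r_s(M_s)
m_min = tagging_threshold(c, max_rate)  # max(0.2, 0.3), i.e. about 0.3
print(is_tagged(0.5, m_min), is_tagged(0.25, m_min))
```

Because every possible demand cs·r is at most mmin, a tagged user with a non-zero nominal rate always fits into the free capacity, which is why tagged users never need to be tracked as blocked.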


Letting the 0-1 variable T̂i=I{m(τi)≧mmin} indicate whether the ith user is tagged or not, and denoting by N̂sr(t) the number of tagged class-s users provided rate r at time t, the load associated with tagged users at time t may be defined as

$$\hat n(t) = \sum_{s \in S} \sum_{r \in R_s} c_s\, r\, \hat N_{sr}(t). \qquad (29u)$$







This may be readily calculated without maintaining any detailed state information or tracking blocked users.


When the jth user arrives, ARA-DA computes the value θj+1 as follows:

$$\theta_{j+1} = \Bigl[\theta_j + \varepsilon \bigl(\hat n(\tau_j) - \hat T_j\bigr)\Bigr]_{\theta_{\min}}^{\theta_{\max}} \qquad (30)$$

with step size ε>0, an arbitrary initial value θ1, and $[x]_a^b$ equal to a, x or b if x<a, a≦x≦b or x>b, respectively. Although the value of θj+1 is already known at the time of the arrival of the jth user, it is worth stressing that ARA-DR ‘lags’ by one time step, in the sense that the value of θj is used for its nominal rate assignment.
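A minimal Python sketch of the projected update rule (30) is given below. The function and argument names are illustrative assumptions; tagged_load stands for the tagged load observed at the arrival epoch τj and tagged for the 0-1 tagging indicator of the jth user.

```python
def project(x: float, lo: float, hi: float) -> float:
    """[x]_lo^hi: clip x to the interval [lo, hi]."""
    return min(max(x, lo), hi)


def update_theta(theta: float, tagged_load: float, tagged: bool,
                 eps: float, theta_min: float, theta_max: float) -> float:
    """One step of (30): theta + eps * (tagged load - tag indicator), then projection."""
    indicator = 1.0 if tagged else 0.0
    return project(theta + eps * (tagged_load - indicator), theta_min, theta_max)


# Tagged arrival while the tagged load is below one: theta is nudged downwards.
print(update_theta(30.0, 0.8, True, 0.5, 5.0, 80.0))
# A large upward step is clipped at theta_max.
print(update_theta(79.0, 0.9, False, 10.0, 5.0, 80.0))
```

The update needs only two scalars per arrival, which is consistent with the remark that no detailed state information has to be maintained.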


As may be recognized in the update rule (30), ARA-DA balances the load associated with the tagged users with the fraction of users which are tagged. This, in turn, steers the load of the system to unity and drives the value of θ to θ*, as will be proved next under some suitable assumptions.


Convergence Proof

We make Assumption A2 (Poisson arrival processes) and a slightly strengthened version of Assumption B, namely that the service times of the various users are independent and have phase-type distributions. This is only a mild restriction, since the class of phase-type distributions is well known to be dense in the set of all distributions for positive random variables.


In order to establish convergence, the stationary behavior of the system if it operated under ARA-DR(θ) with a fixed value θ may be considered. Specifically, let â(θ) be the fraction of users which get tagged and let $(N^{\theta}_{sr})_{s \in S,\, r \in R_s}$ be a random vector with the stationary distribution of the occupancy state. Then the PASTA property implies

$$\hat a(\theta) = \lim_{J \to \infty} \frac{\sum_{j=1}^{J} \hat T_j}{J} = P\Bigl[\sum_{s \in S} \sum_{r \in R_s} c_s\, r\, N^{\theta}_{sr} \le 1 - m_{\min}\Bigr]. \qquad (31)$$







Consider the following projected ordinary differential equation (see Section 4.3 of H. J. Kushner, G. G. Yin (2003), Stochastic Approximation and Recursive Algorithms and Applications; referred to in the remainder of this specification as “Kushner & Yin”), which captures the smoothed evolution of {θj}j≧1 under the update rule (30):

$$\frac{d\theta(t)}{dt} = \hat a(\theta(t)) \bigl(L_D(\theta(t)) - 1\bigr) + z(\theta(t)) \qquad (32)$$







where z(θ) is determined as

$$z(\theta) = \begin{cases} -\hat a(\theta_{\min}) \bigl(L_D(\theta_{\min}) - 1\bigr) & \text{if } L_D(\theta_{\min}) < 1,\ \theta = \theta_{\min}; \\ 0 & \text{otherwise.} \end{cases} \qquad (32a)$$







Note that [LD(θ)−1]+ may be interpreted as the excess load when operating under a fixed value θ. For each t, the term z(θ(t)) is nonnegative, and guarantees that

$$\frac{d\theta(t)}{dt} \ge 0$$

for θ(t)=θmin, thus ensuring that θ(t), like θj, cannot fall below θmin.


Lemma 7

For any initial value θ(0)∈[θmin,θmax],

$$\lim_{t \to \infty} \theta(t) = \theta^* \qquad (32b)$$







Note that the ODE in (32) is well posed, as follows from the well-known solution to the ODE (see Theorem 4.1 of Chapter 8 of Kushner & Yin).


Consider the following (Lyapunov) function

$$V(\theta) = \tfrac{1}{2} (\theta - \theta^*)^2 \qquad (32c)$$







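Since dV(θ(t))/dt=(θ(t)−θ*)dθ(t)/dt by the chain rule, and since (θmin−θ*)z(θmin)≦0, it follows from (32) that

$$\frac{dV(\theta(t))}{dt} \le \hat a(\theta(t)) \bigl(\theta(t) - \theta^*\bigr) \bigl(L_D(\theta(t)) - 1\bigr) \qquad (33)$$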







It follows from Lemma 4 that LD(θ)−1 is either strictly negative or strictly positive when θ is larger or smaller than θ*, respectively, and hence (θ(t)−θ*)(LD(θ(t))−1)<0 as long as θ(t)≠θ*. Noting that â(θ)>0, we conclude that

$$\frac{dV(\theta(t))}{dt} \begin{cases} = 0, & \theta = \theta^*, \\ < 0, & \theta \ne \theta^*. \end{cases} \qquad (33a)$$







In order to complete the proof, it suffices to verify that the functions LD(•) and â(•) (appearing in the upper bound (33) for dV(θ(t))/dt) are continuous. It follows from Lemma 4 that the function LD(•) is continuous, so it only remains to be established that â(•) is a continuous function. Using (31), we can conclude that â(θ) is a linear function of the distribution of $(N^{\theta}_{sr})_{s \in S,\, r \in R_s}$. Since the service times of the various users have phase-type distributions, the system behavior can be described as a finite irreducible Markov process. Further, the transition rate matrix associated with this Markov process is a piecewise linear function of θ. Then, using the well-known perturbation bound (given in Section 3.1 of G. E. Cho, C. D. Meyer (2001), Comparison of Perturbation Bounds for the Stationary Distribution of a Markov Chain, Lin. Alg. Appl. 335 (1-3), 137-150; referred to herein as “Cho & Meyer”), it may be concluded that the stationary distribution of this Markov process is a continuous function of θ. Using this observation, we conclude that â(•) is a continuous function.


The next theorem states that for a small enough step size ε and sufficiently large number of iterates, virtually all of the iterates obtained by the ARA-D scheme will belong to an arbitrarily close neighborhood of θ*.


Theorem 1


For any δθ>0, the fraction of iterates (θj)1≦j≦T/ε that take values in (θ*−δθ,θ*+δθ) goes to one in probability as ε↓0 and T→∞.


The result follows from the well-known Theorem 4.1 (in Chapter 8 of Kushner & Yin) and the asymptotic stability of θ* established in Lemma 7 once we have verified that all the conditions of Theorem 4.1 are satisfied in the present setting.


The set H, the variables θjε, Yjε, Zjε, ξjε, and the sigma algebra Fjε used in the exposition of Theorem 4.1 (Kushner & Yin) correspond to certain variables in the present setting as described next. To see the correspondence, it will be helpful to compare the update rule (30), repeated below,

$$\theta_{j+1} = \Bigl[\theta_j + \varepsilon \bigl(\hat n(\tau_j) - \hat T_j\bigr)\Bigr]_{\theta_{\min}}^{\theta_{\max}} \qquad (33b)$$

against the update rule considered in Theorem 4.1 (Kushner & Yin), stated below:

$$\theta^{\varepsilon}_{j+1} = \Pi_H\bigl(\theta^{\varepsilon}_j + \varepsilon Y^{\varepsilon}_j\bigr) = \theta^{\varepsilon}_j + \varepsilon \bigl(Y^{\varepsilon}_j + Z^{\varepsilon}_j\bigr). \qquad (33c)$$


Also, note that the decision for the jth user depends only on θj and the free capacity just before its arrival.


In the present setting H=[θmin,θmax], and the variables θjε correspond to the variables θj. Also, for each j,

$$Y^{\varepsilon}_j = \hat n(\tau_j) - \hat T_j, \qquad (33d)$$


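while

$$Z^{\varepsilon}_j = \begin{cases} \dfrac{\theta_{\min} - \theta^{\varepsilon}_j}{\varepsilon} - Y^{\varepsilon}_j, & \text{if } \theta^{\varepsilon}_j + \varepsilon Y^{\varepsilon}_j < \theta_{\min}, \\[2ex] \dfrac{\theta_{\max} - \theta^{\varepsilon}_j}{\varepsilon} - Y^{\varepsilon}_j, & \text{if } \theta^{\varepsilon}_j + \varepsilon Y^{\varepsilon}_j > \theta_{\max}, \\[2ex] 0, & \text{otherwise.} \end{cases} \qquad (33e)$$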







Note that Zjε plays a role similar to that of z(•) in (32). The ‘memory’ random variables ξjε correspond to

$$\bigl(N_{sr}(\tau_j),\ \hat N_{sr}(\tau_j)\bigr)_{s \in S,\, r \in R_s} \qquad (33f)$$

Let Fjε be a sigma algebra that measures at least $\{\theta^{\varepsilon}_i,\ Y^{\varepsilon}_{i-1},\ \xi^{\varepsilon}_i,\ i \le j\}$ for each j, and let Ejε[•] denote the expectation with respect to Fjε.


Next, we verify that the conditions (A1.1), (A1.4), (A1.5), (A1.7), (A4.1)-(A4.7) and (A4.3.1), stated in Theorem 4.1 (Kushner & Yin) as sufficient conditions, are satisfied in the present setting. Condition (A1.1) (uniform integrability of {Yjε:ε,j}) is satisfied since Nsr is bounded from above, and N̂sr≦Nsr for each s∈S, r∈Rs. Conditions (A1.4) and (A1.5) are satisfied if we let βjε=0 and gjε(θ,ξ)=g(θ,ξ) for each j, where

$$g(\theta, \xi) = \sum_{s \in S} \sum_{r \in R_s} c_s\, r\, \hat N_{sr} - \hat T_j, \qquad (34)$$

for

$$\xi = \bigl(N_{sr},\ \hat N_{sr}\bigr)_{s \in S,\, r \in R_s} \qquad (34a)$$







Condition (A1.7) is satisfied due to the boundedness of

$$\bigl(N_{sr}(\cdot),\ \hat N_{sr}(\cdot)\bigr)_{s \in S,\, r \in R_s} \qquad (34b)$$

Condition (A4.1) holds since we can define the transition function Pjε(•,•|θ) using the rules for admitting and tagging users, satisfying

$$P\bigl\{\xi^{\varepsilon}_{j+1} \in \cdot \mid \mathcal{F}^{\varepsilon}_j\bigr\} = P^{\varepsilon}_j\bigl(\xi^{\varepsilon}_j,\ \cdot \mid \theta^{\varepsilon}_j\bigr), \qquad (34c)$$

provided the service times are exponentially distributed. When the service times have phase-type distributions, the above arguments can be generalized by appropriately modifying the memory variables to account for the (finite number of) phases involved in each service. Hence $\xi_{j+1} = (N_{sr}(\tau_{j+1}), \hat N_{sr}(\tau_{j+1}))_{s \in S,\, r \in R_s}$ ‘depends’ only on ξj and on the decision for the jth user, which in turn depends only on θj and ξj. Also, note that Pjε(•,•|θ) is measurable since the probability transition function is continuous in ξjε and θjε. Condition (A4.2) is satisfied since the transition function Pjε(•,•|θ) is independent of j and ε. Condition (A4.3) holds since the probability transition function Pjε(•,•|θ) is continuous in ξjε and θjε. Condition (A4.4) is satisfied because g(θ,ξ) defined in (34) is continuous in θ and ξ. In addition, (ξj(θ):j≧i) is the process evolving under a fixed value θ with initial condition ξi(θ)=ξi. Condition (A4.5) holds since g(θ,ξ) is bounded. Condition (A4.6) is satisfied (because (ξj(θ):j≧i) is a stationary Markov process) if we take:

$$\bar g(\theta) = \hat a(\theta) \bigl(L_D(\theta) - 1\bigr). \qquad (34d)$$

Also, note that ḡ(θ) can be viewed as the expected value of (34) if the occupancy processes were stationary, and evolved under a fixed value θ. Condition (A4.7) holds since ξjε is bounded.


Thus, we have verified that all the conditions stated in Theorem 4.1 (Kushner & Yin) are satisfied.


Next, we consider a modification of the ARA-DA scheme, termed ARA-DA-DSZ, which uses decreasing step sizes instead of the constant step size ε used in (30). When the jth user arrives, ARA-DA-DSZ computes the value θj+1 as follows:

$$\theta_{j+1} = \Bigl[\theta_j + \varepsilon_j \bigl(\hat n(\tau_j) - \hat T_j\bigr)\Bigr]_{\theta_{\min}}^{\theta_{\max}} \qquad (34e)$$


The step size sequence (εj)j≧1 satisfies the conditions (DSZ.1)-(DSZ.2) listed below.

(DSZ.1) $\varepsilon_j \ge 0$ for each j, $\lim_{j \to \infty} \varepsilon_j = 0$ and $\sum_{j=1}^{\infty} \varepsilon_j = \infty$.  (34f)

(DSZ.2) There is a sequence of integers (αj)j≧1 converging to infinity such that

$$\lim_{j \to \infty}\ \sup_{0 \le j' \le \alpha_j} \Bigl| \frac{\varepsilon_{j+j'}}{\varepsilon_j} - 1 \Bigr| = 0 \qquad (34g)$$

Note that (DSZ.2) requires that the step sizes change slowly. The sequence (εj)j≧1 with $\varepsilon_j = 1/j$ satisfies conditions (DSZ.1)-(DSZ.2).
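Condition (DSZ.2) can be checked numerically for the step sizes εj=1/j. The Python sketch below evaluates the supremum in (34g) for a few values of j, taking αj equal to the integer square root of j; this choice of αj is an assumption made for the illustration, being just one integer sequence that tends to infinity.

```python
import math


def step(j: int) -> float:
    """Decreasing step size eps_j = 1 / j."""
    return 1.0 / j


def dsz2_gap(j: int, alpha_j: int) -> float:
    """sup over 0 <= j' <= alpha_j of |eps_{j+j'} / eps_j - 1|, the quantity in (34g)."""
    return max(abs(step(j + i) / step(j) - 1.0) for i in range(alpha_j + 1))


# alpha_j = floor(sqrt(j)) is an illustrative choice that grows without bound.
gaps = [dsz2_gap(j, math.isqrt(j)) for j in (10, 1_000, 100_000)]
print(gaps)
```

For this sequence the gap equals αj/(j+αj), which shrinks towards zero as j grows, while the harmonic series Σ1/j diverges as required by (DSZ.1).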


The next result states that for ARA-D with the component ARA-DA replaced with ARA-DA-DSZ, almost all the iterates obtained will belong to an arbitrarily close neighborhood of θ*. Let t0=0 and $t_j = \sum_{j'=0}^{j-1} \varepsilon_{j'}$. Let ω(t) be the value of j such that tj≦t<tj+1.


Theorem 2


For any δθ>0, the fraction of iterates (θj)1≦j≦ω(T) that take values in (θ*−δθ,θ*+δθ) goes to one in probability as T→∞.


The result follows from well-known Theorem 4.3 (Chapter 8 of Kushner & Yin) and the asymptotic stability of θ* established in Lemma 7 once we have verified that all the conditions of Theorem 4.3 are satisfied in the present setting.


The conditions (5.1.1) and (A2.8) of Theorem 4.3 are satisfied since the step size sequence (εj)j≧1 satisfies (DSZ.1)-(DSZ.2). We can verify that the conditions (A2.1), (A2.2), (A2.4), (A2.6), (A4.3), (A4.7), (A4.8), (A4.9), (A4.10), (A4.11), (A4.12) and (A4.3.1) given in Theorem 4.3 are satisfied by using arguments identical or quite similar to those used in the proof of Theorem 1 above for verifying the conditions (A1.1), (A1.5), (A1.7), (A1.4), (A4.3), (A4.7), (A4.1), (A4.2), (A4.4), (A4.5), (A4.6) and (A4.3.1) of Theorem 4.1 (Kushner & Yin), respectively.


Subsection B. “Scenario C”

We first use the structural properties obtained in section ‘Structure of optimal solution to CP-UB-C’ (above) to establish that the asymptotically optimal rate assignments only depend on the offered traffic through a single scalar variable θ*, which in fact corresponds to the Lagrange multiplier associated with the capacity constraint (29). This fact will play an instrumental role in the design of adaptive rate assignment algorithms.


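For a given θ∈ℝ+, define

$$r_s^{-}(\theta) = \sup\{\, r \in R_s \cup \{0\} : p_s U_s^{(-)}(r) > \theta \,\}, \qquad (34h)$$

and

$$r_s^{+}(\theta) = \sup\{\, r \in R_s \cup \{0\} : p_s U_s^{(-)}(r) \ge \theta \,\}, \qquad (34i)$$

with the convention that Us(−)(0)=∞ for all s∈S, ensuring that rs−(θ) and rs+(θ) are well-defined for all θ∈ℝ+.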


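Note that Us(−)(r)=Us(rsmin)/rsmin for all r∈(0,rsmin] and equals the left derivative of Us(•) for all r∈(rsmin,rsmax], so

$$r_s^{-}(\theta) = r_s^{+}(\theta) = 0 \quad \text{if } \theta > p_s U_s^{(-)}(r_s^{\min}), \qquad (34j)$$

$$r_s^{-}(\theta) = 0,\quad r_s^{+}(\theta) = r_s^{\min} \quad \text{if } \theta = p_s U_s^{(-)}(r_s^{\min}), \qquad (34k)$$

and

$$r_s^{-}(\theta) = r_s^{+}(\theta) = r_s^{\max} \quad \text{if } \theta \le p_s U_s^{(-)}(r_s^{\max}). \qquad (34l)$$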


In case Us(−)(•) is strictly decreasing on (rsmin,rsmax], i.e., Us(•) is strictly concave, we have rs−(θ)=rs+(θ) for all θ∈(psUs(−)(rsmax),psUs(−)(rsmin)), and let

$$r_s^{0}(\theta) = r_s^{-}(\theta) = r_s^{+}(\theta) \qquad (34m)$$


If in addition the left- and right-derivatives of Us(•) are equal, i.e., Us′(•) exists and is continuous, then we must have psUs′(rs0(θ))=θ. Define LC−(θ)=Σs∈Scsρsrs−(θ) and LC+(θ)=Σs∈Scsρsrs+(θ) as the lower and upper ‘load’ values associated with θ. Denote by LCmax=Σs∈Scsρsrsmax the maximum possible load, i.e., the load if all users were assigned the highest possible rate for their respective classes. Further define

$$\theta^* = \inf\{\, \theta \in \mathbb{R}_+ : L_C^{-}(\theta) \le 1 \,\}, \qquad (34n)$$

so that θ*=0 when LCmax≦1. When LCmax>1, θ* may be equivalently defined as θ*=sup{θ∈ℝ+: LC+(θ)≧1}.
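To make the threshold structure of the rate map concrete, the following Python sketch evaluates the upper rate rs+(θ) for one hypothetical utility function, Us(r)=log(1+r), on the rate set {0}∪[rsmin,rsmax]. The utility function, the numerical values and the function name are assumptions chosen for illustration; ps is simply passed in as a given parameter.

```python
import math


def r_plus(theta: float, p_s: float, r_min: float, r_max: float) -> float:
    """Largest rate in {0} U [r_min, r_max] with p_s * U^(-)(r) >= theta,
    for the hypothetical utility U(r) = log(1 + r).

    U^(-)(r) equals U(r_min) / r_min on (0, r_min] and U'(r) = 1 / (1 + r) above r_min.
    """
    if theta > p_s * math.log1p(r_min) / r_min:
        return 0.0                 # even the minimum rate is not worth its cost
    if theta <= p_s / (1.0 + r_max):
        return r_max               # the maximum rate is still worthwhile
    if theta >= p_s / (1.0 + r_min):
        return r_min               # jump region: only the minimum rate qualifies
    return p_s / theta - 1.0       # interior solution of p_s * U'(r) = theta


for theta in (8.0, 6.0, 4.0, 1.0):
    print(theta, r_plus(theta, p_s=10.0, r_min=1.0, r_max=4.0))
```

The four sample values of θ fall in the four regimes in turn: zero rate, the minimum rate, an interior rate and the maximum rate, which mirrors the case distinction in (34j)-(34l).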


The next lemma provides a useful optimality result.


Lemma 8:


Let r̂s(θ*)∈[rs−(θ*),rs+(θ*)], with the further property that Σs∈Sρscsr̂s(θ*)=1 in case LCmax≧1.

Then the variables (r̂s(θ*))s∈S constitute an optimal solution to the problem CP-UB-C-EQ, and θ* is the optimal Lagrange multiplier associated with the capacity constraint (29).

If LCmax<1, then θ*=0, so that r̂s(θ*)=rs−(θ*)=rsmax for all s∈S, which was identified as the optimal solution in this case in section ‘Structure of optimal solution to CP-UB-C’ (above). It is also easily seen that the optimal Lagrange multiplier equals 0 in this case.


By construction and the fact that Σs∈Sρscsr̂s(θ*)=1, the variables r̂s(θ*) constitute a feasible solution. In order to prove optimality, for each s∈S let

$$v_s = \begin{cases} 0 & \text{if } \hat r_s(\theta^*) < r_s^{\max}, \\ \rho_s \bigl( U_s^{(-)}(\hat r_s(\theta^*)) - \theta^* c_s \bigr) \ge 0 & \text{if } \hat r_s(\theta^*) = r_s^{\max}. \end{cases} \qquad (34o)$$

Then (vs)s∈S and θ* are the optimal Lagrange multipliers associated with CP-UB-C-EQ. Indeed, it is easily verified that the necessary KKT conditions are satisfied by construction, and hence the variables r̂s(θ*) in fact constitute an optimal solution.


While the variables (r̂s(θ*))s∈S in the above lemma constitute an optimal solution to the problem CP-UB-C-EQ, it is important to observe that it may occur that r̂s(θ*)∈(0,rsmin), meaning that these variables may not directly provide feasible rate assignments. As the next lemma states, however, the same expected utility can be obtained through a probabilistic scheme, which only uses rate assignments rs−(θ*) and rs+(θ*), which are feasible by definition.


Lemma 9:


If r̂s(θ*)∈(rs−(θ*),rs+(θ*)), then

$$\hat U_s\bigl(\hat r_s(\theta^*)\bigr) = q_s(\theta^*)\, U_s\bigl(r_s^{-}(\theta^*)\bigr) + \bigl(1 - q_s(\theta^*)\bigr)\, U_s\bigl(r_s^{+}(\theta^*)\bigr), \qquad (34p)$$

with

$$q_s(\theta^*) = \frac{r_s^{+}(\theta^*) - \hat r_s(\theta^*)}{r_s^{+}(\theta^*) - r_s^{-}(\theta^*)} \in [0, 1] \qquad (34q)$$







Lemmas 8 and 9 reduce the problem of determining the asymptotically optimal rate assignments to that of finding the value of θ*. It is worth observing here that the existence of r̂s(θ*)∈[rs−(θ*),rs+(θ*)] satisfying Σs∈Sρscsr̂s(θ*)=1 is ensured by the fact that LC−(θ*)≦1≦LC+(θ*) in case LCmax≧1.
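The probabilistic scheme of Lemma 9 amounts to choosing the mixing weight so that the average assigned rate equals the target rate. The Python sketch below illustrates this with hypothetical numbers (a lower rate of 0, an upper rate of 1 Mb/s and a target of 0.4 Mb/s); the function name and the utility values are assumptions for the example.

```python
def mixing_weight(r_minus: float, r_plus: float, r_hat: float) -> float:
    """Weight q on the lower rate, per (34q): q = (r_plus - r_hat) / (r_plus - r_minus)."""
    if not r_minus < r_hat < r_plus:
        raise ValueError("r_hat must lie strictly between r_minus and r_plus")
    return (r_plus - r_hat) / (r_plus - r_minus)


r_minus, r_plus, r_hat = 0.0, 1.0, 0.4
q = mixing_weight(r_minus, r_plus, r_hat)

# Assigning r_minus with probability q and r_plus otherwise reproduces the target mean rate.
mean_rate = q * r_minus + (1.0 - q) * r_plus

# Expected utility of the mixture for hypothetical utilities U(0) = 0 and U(1) = 6.
expected_utility = q * 0.0 + (1.0 - q) * 6.0
print(q, mean_rate, expected_utility)
```

Only the two feasible rates are ever assigned, yet the long-run average rate matches the infeasible target, which is what allows the optimum to be attained with feasible assignments.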


In developing an adaptive scheme, termed ARA-C, for finding the value of θ* in an online manner without any explicit knowledge of the amounts of offered traffic ρs, the further natural assumption is made that any rates that cannot possibly be supported have been removed from the rate sets, i.e., rsmax≦ps for all s∈S.


The main thrust of the adaptive scheme is to treat θ as a control parameter and use estimates of LC(θ) to drive it towards the value θ*. Specifically, the ARA-C scheme is comprised of two closely coupled components: the rate assignment component ARA-CR(θ) and the θ-learning component ARA-CA. The ARA-CR(θ) component uses the θ values produced by the ARA-CA component to assign a nominal rate to each arriving user. In turn, the ARA-CA component adapts the θ values based on observations of the occupancy state generated by the rate decisions of the ARA-CR(θ) component, and serves to find the value θ*.


Proceeding to describe both components of the ARA-C scheme in more detail, the free capacity in the system at time t may be defined as:

$$m(t) = 1 - \sum_{i=1}^{\infty} c_{s(i)}\, r_i(t). \qquad (34r)$$







ARA-CR(θ)

Given the value of the control parameter θ, ARA-CR(θ) assigns an arriving class-s user a nominal rate rs−(θ). Note that the nominal rate assignment can be computed even more easily than in the case of ARA-DR(θ), and does not even require any knowledge of the capacity requirements of any other classes. In fact, ARA-CR(θ) can be readily applied to assign nominal rates to individual users when there are no classes defined. In addition, ARA-CR(θ) can be applied in the case of discrete rate sets in section ‘Alternative schemes for Scenario D: ARA-A’ (below), when we use a linear interpolation of the utility function, which will always result in rs−(θ) belonging to Rs (or being 0). Denote by θi the value of θ used for the ith arriving user with arrival epoch τi, and denote by Ri the nominal rate assigned to that user. If Ri is either zero or larger than the capacity constraint allows for, i.e., cs(i)Ri≧m(τi), then the ith user is blocked, i.e., ri(t)=0 for all t∈[τi,τi+σi]. Otherwise, the ith user is admitted and provided rate Ri for its entire duration. Thus, the free capacity in the system at time t may be expressed as

$$m(t) = 1 - \sum_{i=1}^{\infty} c_{s(i)}\, R_i\, A_i\, I\{\tau_i \le t \le \tau_i + \sigma_i\}, \qquad (34s)$$

where the 0-1 variable Ai=I{m(τi)≧mmin} indicates whether the ith user is admitted or not.


ARA-CA

The role of ARA-CA is to use estimates of LC(θ) to adapt the control parameter θ and drive it to the value θ*. ARA-CA operates in a similar fashion as ARA-DA, and in particular uses a similar tagging procedure, where the ith user is tagged when the free capacity m(τi) at the arrival epoch of that user is larger than mmin=maxs∈Scsrsmax≦1. Just like before, this choice implies that tagging is independent of the class of a user, and ensures that the nominal rates assigned to tagged users can be tracked without having to monitor blocked users.

Let the 0-1 variable T̂i=I{m(τi)≧mmin} indicate whether the ith user is tagged or not. Define the load associated with tagged users at time t as

$$\hat n(t) = \sum_{i=1}^{\infty} c_{s(i)}\, R_i\, \hat T_i\, I\{\tau_i \le t \le \tau_i + \sigma_i\}. \qquad (34t)$$







This can be readily calculated without maintaining any detailed state information or tracking blocked users.


When the jth user arrives, ARA-CA computes the value θj+1 according to the exact same update rule (30) as ARA-DA, except that the term n̂(τj) is now calculated from the above formula. Although the value of θj+1 is already known at the time of the arrival of the jth user, it is worth stressing that ARA-CR ‘lags’ by one time step, in the sense that the value of θj is used for its nominal rate assignment.

Just like ARA-DA, ARA-CA balances the load associated with the tagged users with the fraction of users which are tagged. This, in turn, steers the load of the system to unity in a certain sense and drives the value of θ to θ*.


A rigorous convergence proof is, however, far more complicated than for the ARA-D scheme, since the load fails to be continuous as a function of the control parameter θ. Note that the nominal rate assignment rs−(θ) for class-s users is discontinuous as it jumps from 0 to rsmin (or higher) around θ=psUs(−)(rsmin), and may have other discontinuity points as well when Us(•) is not strictly concave. While the nominal rate assignment may be continuous elsewhere, even a slight change could lead to a change in the maximum number of class-s users that can be supported, and hence still cause the load to be discontinuous.

Since the load is discontinuous, the problem departs from the usual assumptions in stochastic approximation, and falls outside the well-known methodological framework of Kushner & Yin. The complications caused by the discontinuity are further exacerbated by the fact that the evolution of the system occupancy under the ARA-C scheme cannot be described in terms of a finite-dimensional Markov process because of the continuum of possible rate assignments.


Despite the fundamental obstacles in establishing a rigorous proof, it is expected that the convergence statement of Theorem 1 actually continues to hold for the ARA-C scheme. If convergence occurs, then it must either be the case that θ*≦mins∈SpsUs(−)(rsmax), so that rs−(θ*)=rsmax for all s∈S, in case LCmax<1, or, in case LCmax≧1, the observed average load must tend to unity, because otherwise ARA-CA would induce a persistent drift in θ, i.e., it must be the case that

$$\sum_{s \in S} \rho_s\, c_s\, \bar r_s = 1, \qquad (34u)$$

with

$$\bar r_s = \lim_{J \to \infty} \frac{\sum_{j=1}^{J} R_j\, I\{s(j) = s\}}{\sum_{j=1}^{J} I\{s(j) = s\}} \qquad (34v)$$

denoting the long-term average nominal rate assigned to class-s users.


For classes s∈S for which Us(−)(•) is continuous around θ*, we simply have r̄s=rs−(θ*). For other classes, we will have r̄s=qsrs−(θ*)+(1−qs)rs+(θ*), i.e.,

$$q_s = \frac{r_s^{+}(\theta^*) - \bar r_s}{r_s^{+}(\theta^*) - r_s^{-}(\theta^*)}, \qquad (34w)$$

with

$$q_s = \lim_{J \to \infty} \frac{\sum_{j=1}^{J} I\{\theta_j > \theta^*\}\, I\{s(j) = s\}}{\sum_{j=1}^{J} I\{s(j) = s\}} \qquad (34x)$$

denoting the long-term fraction of iterates that are (just) above θ*, i.e., the long-term fraction of class-s users that are assigned a nominal rate rs−(θ*). In particular, for any class s for which psUs(−)(rsmin)=θ*, we will have rs−(θ*)=0, rs+(θ*)=rsmin, and qs=1−r̄s/rsmin. Lemmas 8 and 9 then imply that the nominal rate assignments coincide with the asymptotically optimal ones.


Alternative Schemes for Scenario D: ARA-A

As noted earlier, under ARA-DR(θ) the users of most classes are assigned the largest possible nominal rate for which the marginal payoff is strictly higher than θ, or equivalently QR(KR(θ)). Typically only some users of one particular class receive a larger nominal rate assignment so as to make full use of the available capacity. This is also borne out by the fact that QR(KR(θ*)) can be shown to be the optimal dual variable corresponding to the capacity constraint (18). This suggests the following somewhat simpler alternative rate assignment scheme, which we will refer to as ARA-AR(θ). Given the value of the control parameter θ, ARA-AR(θ) assigns an arriving class-s user a nominal rate 0 if Qs1<θ and rs(θ)=max{rs(m): Qsm≧θ} otherwise. The latter scheme, like ARA-C, has the key advantage that the rate assignment does not involve any parameters of other user classes, and can be applied in conjunction with ARA-DA as described before, yielding the scheme ARA-A. In particular, interpreting Qsm as a left-derivative, ARA-C can be viewed as an extension of ARA-A.
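The ARA-AR(θ) rule lends itself to a very short implementation. The Python sketch below assumes that the marginal payoff Qsm is the utility gain of the mth rate step divided by the extra capacity that step consumes; that definition is given outside this section, so it is treated here as an assumption, as are the function names and the sample numbers.

```python
from typing import List, Sequence


def marginal_payoffs(rates: Sequence[float], utilities: Sequence[float],
                     c_s: float) -> List[float]:
    """Assumed Q_sm: (U(r_m) - U(r_{m-1})) / (c_s * (r_m - r_{m-1})), with r_0 = 0, U(0) = 0."""
    payoffs = []
    prev_rate, prev_utility = 0.0, 0.0
    for rate, utility in zip(rates, utilities):
        payoffs.append((utility - prev_utility) / (c_s * (rate - prev_rate)))
        prev_rate, prev_utility = rate, utility
    return payoffs


def ara_a_rate(rates: Sequence[float], payoffs: Sequence[float], theta: float) -> float:
    """Largest rate whose marginal payoff is at least theta, or 0 if there is none."""
    eligible = [rate for rate, payoff in zip(rates, payoffs) if payoff >= theta]
    return max(eligible) if eligible else 0.0


rates, utilities, c_s = [1.0, 2.0], [6.0, 8.0], 0.1  # hypothetical class parameters
payoffs = marginal_payoffs(rates, utilities, c_s)     # roughly [60, 20]
for theta in (10.0, 30.0, 70.0):
    print(theta, ara_a_rate(rates, payoffs, theta))
```

Note that the rule uses only the parameters of the arriving user's own class, which is the advantage highlighted above.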


Based upon the understanding of example embodiments described above, FIG. 2 depicts groups of user equipment (UE) 10a/20a in a wireless network 40 interfacing with a base station 30, in accordance with an example embodiment. The base station 30 may include, for example, a data bus 30a, a transmitting unit 30b, a receiving unit 30c, a memory unit 30d, and a processor 30e (see FIG. 3).


The transmitting unit 30b, receiving unit 30c, memory unit 30d, and processor 30e may send data to and/or receive data from one another using the data bus 30a.


The transmitting unit 30b is a device that includes hardware and any necessary software for transmitting signals including, for example, control signals or data signals via one or more wired and/or wireless connections to other network elements in the wireless network 40.


The receiving unit 30c is a device that includes hardware and any necessary software for receiving wireless signals including, for example, control signals or data signals via one or more wired and/or wireless connections to other network elements in the wireless network 40.


The memory unit 30d may be any device capable of storing data including magnetic storage, flash storage, etc.


The processor 30e may be any device capable of processing data including, for example, a microprocessor configured to carry out specific operations based on input data, or capable of executing instructions included in computer readable code including, for example code stored in the memory unit 30d. For example, the processor 30e is capable of implementing a sniffing function which allows the base station 30 to receive data broadcasted by one or more other base stations, for example, by synchronizing with the one or more other base stations in the same known manner as a UE. Further, the processor 30e is capable of analyzing subframes received from another BS in order to estimate which subframes are ABS subframes and which subframes are not.


According to at least some example embodiments, operations described herein as being performed by a base station 30 may be performed by a device having the structure of base station 30 illustrated in FIG. 3. For example, the memory unit 30d may store executable instructions corresponding to each of the operations described with reference to FIG. 4 as being performed by the base station 30. Further, the processor 30e may be configured to perform each of the operations described with reference to FIG. 4 as being performed by a base station, for example, by executing executable instructions stored in the memory unit 30d. Further, any other base station (not shown) that may be in the wireless network 40 may also have the structure and/or operation of the base station 30.


Based on the understanding of the network 40, a discussion of an example rate allocation method that determines the aggregate long-term rate utility, as shown in FIG. 4, is described herein. The method of FIG. 4 may be accomplished by the processor 30e that may be located within any network node (such as base station 30) of the wireless network 40. It should be understood that the method of FIG. 4 may be re-initiated each time a user arrives or departs from the network, and therefore, this method may be performed on a continuous basis to continuously refine and adapt the control parameter θ, as described herein. The control parameter θ may be assigned an arbitrary positive initial value, at the very outset of the method, before the processor 30e initially starts operating. For the purpose of the FIG. 4 discussion, the network is initially assumed to be empty, such that there are no active UEs accessing the network when the processor 30e starts the method shown in FIG. 4. Therefore, the free capacity m (as defined in Equation 29s) is initialized to be unity (i.e., 1, indicating a maximum amount of free capacity being available), and the aggregate load of tagged users n̂ (as defined in Equation 29u) is initialized to 0 (as there are initially no tagged UEs). Also for purposes of the FIG. 4 discussion, it should be understood that this method may be independently applied to each individual sector of a network cell, and therefore, for purposes of this FIG. 4 discussion the term “network” may be equated with the term “network sector.”


Discrete Rates (“Scenario D”)

In order to fully appreciate the various method steps of FIG. 4, a simplified, illustrative example may be assumed where the network 40 is supporting four classes of users (S=4) that may enter and exit the network over time. Class-1 UEs 20a (see FIG. 2) may be located relatively far from serving base station 30, and these UEs may be requesting low-grade video content (e.g., low-motion video on a small screen). For the sake of example, the possible rate assignments for these class-1 UEs may be assumed to be one of three choices: 1 Mb/s, 2 Mb/s, or 0 Mb/s (in the event the processor 30e determines that a user will not be admitted). For the class-1 UEs 20a, users that are assigned the rate of 1 Mb/s will enjoy a corresponding utility level U1(1) that may be assigned a numerical value of 6. The value of this utility level Us(r) (which may be any positive value in an unbounded range for a UE class s as a function of the service rate r) may be determined by a network operator using any well-known method of quantitatively defining the quality-of-experience (QoE) as perceived by the UEs at a given rate. The relationship between QoE and rate is dependent on content as well as type of UE device (i.e., the value may change based on the viewing of low- or high-quality motion video, whether the UE device has a small or large screen, etc.). Following along with the illustrative example, UEs that are assigned the rate of 2 Mb/s will enjoy a corresponding utility level that may be assigned a numerical value of 8. UEs assigned a zero (0 Mb/s) rate will have a utility level of zero (0). Therefore, a class-1 utility function for the class-1 UEs may be mathematically represented by U1(0)=0, U1(1)=6 and U1(2)=8.


The capacity requirement coefficient cs is a coefficient that may be assigned to a class of UEs that are collocated together within a network cell, where the capacity requirement coefficient cs is a function of the signal strength that the UE experiences (which depends on the user's proximity to a serving base station along with other propagation conditions). In particular, the capacity requirement coefficient cs represents the fraction of the total available resources at the serving base station required to support a 1 Mb/s service rate for a single class-s user, and may be measured in units of s/Mb. For example, cs=0.1 s/Mb represents that a fraction 0.1 of the total available resources at the serving base station is required in order to support a 1 Mb/s service rate for a single class-s UE. Allocating a 2 Mb/s service rate to a single class-s UE would then require a fraction 0.2 of the total available resources at the serving base station. In the illustrative example, it is assumed that for the class-1 UEs, the capacity requirement coefficient is c1=0.1. Therefore, allocating a 1 Mb/s service rate to a single class-1 UE requires a fraction 0.1 of the total available resources of the base station (where this is derived by 1 Mb/s*0.1=0.1). Also, allocating a 2 Mb/s service rate to a single class-1 UE requires a fraction 0.2 of the total resources (where this is derived by 2 Mb/s*0.1=0.2).


Meanwhile, another class of UEs (class-2 UEs 20a) may also be factored into this illustrative example. The class-2 UEs are also located relatively far from the serving base station 30 (FIG. 3), but these UEs are watching high-grade video content (e.g., high-motion video on a larger screen). The class-2 UEs enjoy similar signal strength conditions, as compared to the class-1 UEs, and for this reason the capacity requirement coefficient for class-2 UEs is identical to that for class-1 UEs (i.e., c2=0.1). However, because of the high-grade video content, the possible rate assignments for the class-2 UEs are 2 Mb/s, 4 Mb/s, or zero (0 Mb/s). The correspondingly assigned utility levels are 7 (for 2 Mb/s) and 12 (for 4 Mb/s), respectively; hence the class-2 utility function is given by U2(0)=0, U2(2)=7 and U2(4)=12.


In this illustrative example, a third user class is presumed. Class-3 UEs 10a (FIG. 2) are located relatively close to the serving base station 30, and these UEs request low-grade video content. Since class-3 UEs request similar video content, as compared to class-1 UEs, the possible rate assignments and corresponding utility levels are identical to those for class-1 UEs. Because of the more favorable signal strength conditions, however, the capacity requirement coefficient c3 for class-3 UEs is only c3=0.05, meaning that allocating a 1 Mb/s service rate to a single class-3 UE only requires a fraction 0.05 of the total available resources at the serving base station (rather than 0.1 as for class-1 and class-2 users). Similarly, allocating a 2 Mb/s service rate to a single class-3 UE requires a fraction 0.1 of the total available resources at the serving base station. For purposes of the illustrative example, a fourth user class is presumed, where class-4 UEs 10a are located relatively close to the serving base station, but watch high-grade video content. The capacity requirement coefficient c4 for class-4 UEs is identical to that for class-3 UEs, and the possible rate assignments and corresponding utility levels are the same as for the class-2 UEs.
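
The four classes of the illustrative example can be collected in a small table. The following is a minimal Python sketch under stated assumptions: the names `UEClass`, `CLASSES`, `max_rate` and `capacity_required` are hypothetical and do not appear in the description; only the numerical values (utility levels and capacity requirement coefficients) are taken from the example above.

```python
from dataclasses import dataclass


@dataclass
class UEClass:
    """One user class of the illustrative example (hypothetical container)."""
    c: float        # capacity requirement coefficient c_s, in s/Mb
    utility: dict   # maps a rate in Mb/s to the utility level U_s(r)

    def max_rate(self):
        # Largest rate that may be assigned to a UE of this class.
        return max(self.utility)

    def capacity_required(self, rate):
        # Fraction of the base station resources consumed at the given rate.
        return self.c * rate


# Values copied from the illustrative example; keys are the class indices s.
CLASSES = {
    1: UEClass(c=0.10, utility={0: 0, 1: 6, 2: 8}),    # far, low-grade video
    2: UEClass(c=0.10, utility={0: 0, 2: 7, 4: 12}),   # far, high-grade video
    3: UEClass(c=0.05, utility={0: 0, 1: 6, 2: 8}),    # near, low-grade video
    4: UEClass(c=0.05, utility={0: 0, 2: 7, 4: 12}),   # near, high-grade video
}

for s, ue_class in CLASSES.items():
    r_max = ue_class.max_rate()
    print(s, r_max, round(ue_class.capacity_required(r_max), 3))
```

Keeping the utility function as a rate-to-utility mapping mirrors the way the example lists U1(0), U1(1) and U1(2); other representations would serve equally well.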


As shown in step S100 of FIG. 4, a prospective new class-s UE may arrive within the network. In this step, the processor 30e determines which network class (from among S possible classes available) the new UE belongs to, prior to assigning a provisional rate for the new UE.


In step S102, the processor 30e may assign a provisional rate to a new UE that requests service from the network based on the current value of the control parameter θ, which may be considered as a marginal resource value threshold. It should be appreciated that, as an increasing number of new UEs later arrive at or depart from the network, and the method of FIG. 4 is repeated, this control parameter θ becomes iteratively refined over a period of time of network use (see the updating of control parameter θ, in steps S110 and S112, below), in which case the most currently available value of control parameter θ may be adopted in this step (that is to say, the initial arbitrary control parameter θ value becomes replaced, as the method of FIG. 4 is repeated and the network becomes optimized over time). Therefore, based on a current value of control parameter θ (the current value either being the initial arbitrary value, or an updated value that is determined over time), the processor 30e may assign the new UE a provisional rate r, as described in the “Rate Assignment Component: ARA-DR(theta)” portion of Subsection A2 of this document (see Section VI). This provisional rate r may be considered a place-holder rate, as the new UE is not actively using the network at this time. In particular, assuming a concave rate utility function, the provisional rate r may be set such that the marginal rate utility for the new UE equals the current value of the control parameter θ, or the provisional rate r is set to the largest amount for which the marginal rate utility for the new UE is no less than the current value of the control parameter θ. Note that this assignment of a provisional rate r does not require actual traffic load information or signal strength statistics for any of the UE classes.


Rate assignments may be explained using the illustrative example, as follows. For the illustrative example, it is assumed that a new UE belongs to class-1, and a current value of the control parameter θ may be presumed to be θ=40. If the processor 30e were to attempt to assign the UE a provisional rate of 1 Mb/s, the marginal pay-off Q11 (equation 15h) of adding the new UE may be determined by (U1(1)−U1(0))/(c1(1−0))=(6−0)/0.1=60. Because the calculated marginal pay-off Q11 is larger than 40, the processor 30e therefore will determine that there is a utility benefit in adding the new UE at a rate of 1 Mb/s. If the new UE were to be assigned a provisional rate of 2 Mb/s, the marginal pay-off Q12 would then be (U1(2)−U1(1))/(c1(2−1))=(8−6)/0.1=20, which is smaller than 40. Therefore, the processor 30e would determine that there is no utility benefit in adding the new UE at this rate. Thus, the processor 30e would determine that the provisional rate assignment for the new UE shall be 1 Mb/s. However, had the current value of the control parameter θ not been 40, but instead only 10, then the processor 30e would assign a provisional rate of 2 Mb/s.


On the other hand, had the current value of the control parameter not been 40, but instead 70, then even the marginal pay-off for a 1 Mb/s rate assignment (the lowest rate available, according to the illustrative example) would not have been sufficiently large, and therefore in such a scenario the processor 30e would make a ‘zero rate’ assignment (indicating that the UE will ultimately be blocked from the network, as described further herein). Likewise, had the new UE not belonged to class 1, but instead to class 2, then the marginal pay-off Q21 for a 2 Mb/s rate assignment would be (U2(2)−U2(0))/(c2(2−0))=(7−0)/(0.1×2)=35, and the rate assignment would have been 0 even for the current value of the control parameter θ=40.
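
The rate assignments worked through above can be reproduced with a short Python sketch. It is illustrative only: the function name `provisional_rate` is hypothetical, the utility function is assumed to be concave (so that marginal pay-offs decrease with rate), and the comparison uses "no less than θ" as stated for step S102.

```python
def provisional_rate(utility, c, theta):
    """Return the largest rate whose marginal pay-off is no less than theta.

    utility: mapping from rate (Mb/s) to utility level, including rate 0.
    c:       capacity requirement coefficient of the UE class (s/Mb).
    theta:   current value of the control parameter.
    Returns 0 (the 'zero rate') when even the lowest rate falls short.
    Assumes a concave utility, so the scan may stop at the first shortfall.
    """
    rates = sorted(utility)          # e.g. [0, 1, 2]
    chosen = rates[0]                # start from the zero rate
    for prev, cur in zip(rates, rates[1:]):
        # Marginal pay-off of moving from rate `prev` up to rate `cur`.
        payoff = (utility[cur] - utility[prev]) / (c * (cur - prev))
        if payoff < theta:
            break
        chosen = cur
    return chosen


U1 = {0: 0, 1: 6, 2: 8}     # class-1 utility function from the example
U2 = {0: 0, 2: 7, 4: 12}    # class-2 utility function from the example

print(provisional_rate(U1, 0.1, 40))   # pay-offs 60 and 20 against theta=40
print(provisional_rate(U1, 0.1, 10))
print(provisional_rate(U1, 0.1, 70))
print(provisional_rate(U2, 0.1, 40))   # pay-off 35 against theta=40
```

The four calls correspond, in order, to the four cases discussed in the text: 1 Mb/s, 2 Mb/s, the zero rate for θ=70, and the zero rate for a class-2 UE at θ=40.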


In step S104, the processor 30e then determines the free capacity (m) based on the capacity requirements of all current network UEs, as defined in Equation 29s (shown above), where this determination is derived by simply adding the capacity requirements (at their maximum service rate) for all the current UEs of the network. It is important to note that this free capacity (m) is based on actual, real-time user demand. For example, suppose that the network currently supports 3 class-1 UEs (2 at a rate of 1 Mb/s and 1 at a rate of 2 Mb/s), 0 class-2 UEs, 2 class-3 UEs (at a rate of 2 Mb/s), and 1 class-4 UE (at a rate of 4 Mb/s). Thus N11=2, N12=1, N22=0, N24=0, N31=0, N32=2, N42=0 and N44=1. The total capacity requirement for all the current UEs is then 0.1×1×2+0.1×2×1+0.1×2×0+0.1×4×0+0.05×1×0+0.05×2×2+0.05×2×0+0.05×4×1=0.8, and thus the current free capacity is m=1−0.8=0.2. It should be understood that in an actual system implementation, the free capacity (m) may not necessarily be repeatedly calculated, but rather this information may be maintained in a storage array, and only modified when a UE is admitted into the network (i.e., the free capacity would be decreased by the capacity requirement of that particular UE) or a UE leaves the network (i.e., the free capacity would be increased by the capacity requirement of that specific UE), such that the current free capacity (m) value may simply be accessed, without the need for continually recalculating this value.
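
As a check on the arithmetic, the free capacity of this example can be computed directly from the per-class, per-rate user counts. This is a minimal sketch; the names `free_capacity`, `C` and `active` are hypothetical, and the total capacity is normalized to 1 as in the description.

```python
# Capacity requirement coefficients c_s of the illustrative example (s/Mb).
C = {1: 0.10, 2: 0.10, 3: 0.05, 4: 0.05}

# Number of active UEs keyed by (class s, rate in Mb/s); zero counts omitted.
active = {(1, 1): 2, (1, 2): 1, (3, 2): 2, (4, 4): 1}


def free_capacity(active, c):
    """Return m = 1 - sum over (s, rate) of c_s * rate * count."""
    used = sum(c[s] * rate * count for (s, rate), count in active.items())
    return 1.0 - used


m = free_capacity(active, C)
print(round(m, 6))   # approximately 0.2, matching m = 1 - 0.8
```

In a running system the sum would not be recomputed on every arrival; as noted above, the stored value of m would simply be adjusted on each admission and departure.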


In step S104, the processor 30e then compares the current free capacity (m) against a capacity threshold m_min = max_{s∈S} c_s r_s^{(M_s)} ≤ 1, as described in the “Adaptive Learning Component: ARA-DA” portion of Subsection A2 of this document, within Section VI. The capacity threshold m_min represents the largest amount of capacity required to support any single UE at the UE's maximum allowed rate. If the current free system capacity (m) exceeds the capacity threshold, then the method proceeds to step S106 in order to tag the new UE (and later, in step S118, the new tagged UE is then admitted into the network). The purpose for tagging the new UE is simply to ensure that the UE is included in all future load estimates in an unbiased manner, which does not discriminate based on class identities. It should be understood that the decision on whether or not to tag a UE depends on the system state at the time of the UE's arrival, but not on the class or the provisional rate assignment of the UE.
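
For the four classes of the illustrative example, the capacity threshold and the tagging decision can be sketched as follows. The function names are hypothetical, and the strict comparison follows the wording of this step ("exceeds"); the claims use "equal to or greater than", so the boundary case is an implementation choice.

```python
# (c_s, maximum allowed rate in Mb/s) for classes 1 to 4 of the example.
CLASS_LIMITS = {1: (0.10, 2), 2: (0.10, 4), 3: (0.05, 2), 4: (0.05, 4)}


def capacity_threshold(class_limits):
    """Largest capacity needed by any single UE at its maximum allowed rate."""
    return max(c * r_max for c, r_max in class_limits.values())


def should_tag(m, m_min):
    """Tag the arriving UE when the free capacity exceeds the threshold.

    The decision depends only on the system state (m), not on the class
    or the provisional rate of the arriving UE.
    """
    return m > m_min


m_min = capacity_threshold(CLASS_LIMITS)   # class 2 dominates: 0.1 * 4
print(round(m_min, 6))
print(should_tag(1.0, m_min))   # empty network: the arrival is tagged
print(should_tag(0.2, m_min))   # free capacity 0.2: the arrival is not tagged
```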


In step S110, once the new UE has been tagged, the processor 30e may then update control parameter θ (via Equation 30). The determination of the updated control parameter θ first involves calculating the current aggregate load of all tagged UEs (i.e., the total capacity requirement n̂ for all the tagged current UEs). In order to calculate the aggregate load n̂ of all tagged UEs, for example, suppose that the network supports 3 class-1 UEs (2 at a rate of 1 Mb/s, only one of which is tagged, and 1, also tagged, at a rate of 2 Mb/s), 0 class-2 UEs, 2 class-3 UEs (at a rate of 2 Mb/s), one of which is tagged, and 1 tagged class-4 UE (at a rate of 4 Mb/s). Thus, counting only tagged UEs, N11=1, N12=1, N22=0, N24=0, N31=0, N32=1, N42=0 and N44=1. Therefore, the total capacity requirement n̂ for all the tagged current UEs is then 0.1×1×1+0.1×2×1+0.1×2×0+0.1×4×0+0.05×1×0+0.05×2×1+0.05×2×0+0.05×4×1=0.6. It should be understood that in an actual system implementation, the aggregate load n̂ for the tagged UEs would not be repeatedly calculated but maintained in a storage array, and only modified when a tagged UE is admitted into the network (increased by the capacity requirement of that particular UE), or when a tagged UE leaves the network (decreased by the capacity requirement of that specific UE), such that the current value can simply be accessed, without the need for continuously recalculating this value.


Suppose that the current value of the control parameter is θ=40, and that the step size is ε=0.08. It should be understood that the step size ε relates to a magnitude of change that may be implemented in determining the value of control parameter θ, where the step size ε may be appropriately chosen by a network operator. A smaller value of the step size ε may provide higher accuracy in determining control parameter θ, with the trade-off that a smaller step size ε will cause the method of FIG. 4 to converge on the optimized control parameter θ value more slowly and be less responsive in tracking changes in system statistics, such as gradual variations in the offered load or spatial distribution of UEs over the course of a day. Based on the assumed control parameter θ=40 and step size ε=0.08, the updated value of the control parameter θ is obtained (via Equation 30, above) by adding 0.08×(0.6−1)=−0.032, yielding θ=39.968, where the value 0.6 refers to the aggregate load n̂ for all the tagged UEs, and the value of 1 represents the indicator variable value T̂ (which may be a value of 0 or 1, where 1 indicates the UE is tagged and 0 indicates the UE is not tagged). The purpose of this update is to continue to adaptively refine the control parameter θ over time (and use the refined control parameter θ value in step S102, the next time a new UE attempts to enter the network).
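
The update can be written as a one-line rule. The sketch below assumes the form θ ← θ + ε·(n̂ − T̂), which is inferred from the worked numbers in this example rather than quoted from Equation 30; the function name `update_theta` is hypothetical.

```python
def update_theta(theta, n_hat, tagged, eps):
    """Return the adapted control parameter after one arrival.

    theta:  current control parameter value
    n_hat:  aggregate load of all tagged UEs before the arrival is counted
    tagged: True if the arriving UE is tagged (indicator T-hat = 1)
    eps:    step size chosen by the operator
    """
    t_hat = 1 if tagged else 0
    return theta + eps * (n_hat - t_hat)


# Tagged arrival: 40 + 0.08 * (0.6 - 1) = 39.968
print(round(update_theta(40.0, 0.6, True, 0.08), 6))
# Untagged arrival (the case handled in step S112): 40 + 0.08 * 0.6 = 40.048
print(round(update_theta(40.0, 0.6, False, 0.08), 6))
```

Because n̂ never exceeds 1, a tagged arrival can only lower θ or leave it unchanged, while an untagged arrival can only raise it or leave it unchanged.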


After the new UE is tagged (step S106), and the control parameter θ has been updated (step S110), in step S114 the processor 30e may update the total load n̂ for all tagged UEs (as defined in Equation 29u) by adding the capacity requirement for the newly tagged UE to the load for all other tagged UEs.


In step S117, the processor 30e determines whether the new UE has been given a zero rate assignment (as described in step S102, above). In the event that the new UE has been assigned a zero rate, then the new UE is blocked from the network (at step S130). In the event that the new UE has not been assigned a zero rate, then the newly tagged UE is admitted into the network (in step S118), where the new UE is allocated the provisional rate (as determined in step S102).


As shown in FIG. 4, if in step S104 the processor 30e determines that the current free system capacity (m) does not exceed the capacity threshold m_min, then the processor 30e determines that the new UE shall not be tagged, as indicated in step S108. In this event, in step S112, the processor 30e updates the control parameter θ, in a similar fashion as step S110 (although this time, the update of the control parameter θ accounts for the fact that the new UE is not tagged, and therefore the processor 30e incrementally increases the control parameter θ, rather than decreasing this value). For example, suppose again that the current value of the control parameter is θ=40, and the step size is ε=0.08. Then, the updated value of the control parameter may be obtained by adding 0.08×0.6=0.048, yielding θ=40.048, where the value 0.6 refers once again to the aggregate load n̂ for all the tagged UEs and the value of the 0-1 indicator variable T̂ is now 0, because the new UE is not tagged. Just like in step S110, the purpose of this updated control parameter θ value is to continue to adaptively refine the value over time (and use the refined control parameter θ value in step S102, the next time a new UE attempts to enter the network).


Following step S112, the processor 30e then determines, in step S115, whether the new UE has been given a zero rate assignment (as described in step S102, above). In the event that the new UE has been assigned a zero rate, then the new UE is blocked from the network (at step S130). In the event that the new UE has not been assigned a zero rate, then the method proceeds to step S116.


In step S116, the processor 30e compares the current free capacity (m) against the provisional capacity requirement assigned to the new UE. The provisional capacity requirement of the new UE is calculated as the product of the provisional rate assignment of the new UE (as determined in S102) and the capacity requirement coefficient cs. If the current free capacity (m) exceeds the provisional capacity requirement assigned to the new UE, the processor 30e determines to admit the new UE (as shown in step S118), and the UE is allocated the provisional rate (as determined in step S102). Otherwise, the new UE is blocked, and not admitted to the network, as shown in step S130.


In the event that the new UE is admitted (step S118), then in step S120 the processor 30e re-calculates the free capacity (m) (as defined in Equation 29s). This determination of the free capacity (m) now accounts for the newly admitted UE, using the provisional rate for the new UE (as calculated in step S102). This re-calculation of the free capacity (m) may be accomplished by decreasing the previous value of the free capacity (m) by the capacity requirement of the new UE.


At the termination of this method associated with the arrival of a new UE (either following step S120, after the re-calculation of the free capacity, or following step S130, when the user is blocked), the method may then be restarted upon the arrival of another new UE (step S100), or upon the departure of a UE (step S122, described below).
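
The arrival branch of FIG. 4 (steps S104 through S130) can be summarized in a compact Python sketch. It is a simplified illustration under stated assumptions, not a definitive implementation: `SectorState` and `on_arrival` are hypothetical names, the provisional rate from step S102 is passed in as an argument, the θ update uses the form θ ← θ + ε·(n̂ − T̂) inferred from the worked example, and strict comparisons follow the wording of the description ("exceeds").

```python
from dataclasses import dataclass


@dataclass
class SectorState:
    theta: float        # control parameter
    eps: float          # step size
    m_min: float        # capacity threshold
    m: float = 1.0      # free capacity (1 = empty sector)
    n_hat: float = 0.0  # aggregate load of tagged UEs


def on_arrival(state, c, rate):
    """Process one arrival; returns (admitted, tagged).

    c:    capacity requirement coefficient of the UE's class
    rate: provisional rate assigned in step S102 (0 means 'zero rate')
    """
    need = c * rate                               # provisional capacity requirement
    tagged = state.m > state.m_min                # S104: tag (S106) or not (S108)
    t_hat = 1 if tagged else 0
    state.theta += state.eps * (state.n_hat - t_hat)   # S110 / S112
    if tagged:
        state.n_hat += need                       # S114
    if rate == 0:                                 # S117 / S115: zero rate
        return False, tagged                      # S130: blocked
    if not tagged and not state.m > need:         # S116: capacity check
        return False, tagged                      # S130: blocked
    state.m -= need                               # S118 admit, S120 update m
    return True, tagged


state = SectorState(theta=40.0, eps=0.08, m_min=0.4)
print(on_arrival(state, c=0.1, rate=1))           # first arrival in an empty sector
print(round(state.theta, 6), round(state.m, 6), round(state.n_hat, 6))
```

A tagged UE needs no separate capacity check in this sketch because tagging already implies that the free capacity exceeds the largest requirement of any single UE.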


When a UE of the network departs, the processor 30e may determine which network class (from among S possible classes) the departing UE belongs to, prior to updating or recalculating the free capacity (as defined in Equation 29s). This determination of the free capacity (m) now accounts for the vacancy of the departing UE, using the capacity requirement for that UE as determined by the provisional rate that was allocated to the UE upon arrival (as previously calculated in step S102) for the duration of time that the network serviced the UE along with the capacity requirement coefficient for the class of the UE. Specifically, the calculation of the free capacity (m) may be accomplished by increasing the previous value of the free capacity (m) by this capacity requirement of the departing UE.


In step S126, the processor 30e determines whether the departing UE is a tagged user. In the event that the UE is not tagged, this example method ends. However, in the event that the departing UE is a tagged UE, then in step S128 the processor 30e updates the total load n̂ for all tagged UEs (as defined in Equation 29u) by decreasing the previous value of the aggregate load for all tagged UEs by the capacity requirement for the departing UE. This capacity requirement is determined by the provisional rate that was allocated to that UE upon arrival, for the duration of time that the network serviced the UE, along with the capacity requirement coefficient for the class of the UE.
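
The departure bookkeeping reduces to two adjustments. The following sketch is illustrative; the function name `on_departure` is hypothetical, and it assumes that the rate and tag status assigned on arrival have been stored for each active UE.

```python
def on_departure(m, n_hat, c, rate, tagged):
    """Return the updated (m, n_hat) after a UE leaves the sector.

    m:      free capacity before the departure
    n_hat:  aggregate load of tagged UEs before the departure
    c:      capacity requirement coefficient of the departing UE's class
    rate:   rate allocated to the UE upon arrival (Mb/s)
    tagged: True if the departing UE was tagged on arrival
    """
    need = c * rate
    m += need                 # release the UE's share of capacity
    if tagged:
        n_hat -= need         # S128: remove its load from the tagged total
    return m, n_hat


# A tagged class-1 UE served at 2 Mb/s leaves a sector with m=0.2, n_hat=0.6.
m, n_hat = on_departure(0.2, 0.6, c=0.1, rate=2, tagged=True)
print(round(m, 6), round(n_hat, 6))
```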


Following the termination of this method for the departing UE, either after the processor 30e determines that the UE is not a tagged UE (step S126), or following the updating of the total load n̂ for all tagged UEs (step S128), the method of FIG. 4 may be reinitiated when a new UE arrives in the network (step S100), or when another UE departs from the network (step S122).


Continuous Rates (“Scenario C”)

Having described the method of FIG. 4 using a discrete set of possible target rates for users (as described above), it should be understood that this same method may also be implemented using a continuous range of potential rates for UEs. Such a method would be identical to the description of FIG. 4 (above), with only some modifications to step S102. Specifically, in step S102, rather than having the processor 30e determine the marginal pay-off Qsm (equation 15h) for adding a UE, by trying each discrete rate in order to find provisional rates with marginal pay-offs Qsm that are larger than control parameter θ, the processor 30e may merely calculate the provisional rate based on equation (34h) to be the largest rate for which the marginal pay-off is larger than the control parameter θ. In the continuous rate scenario, the marginal pay-off is defined as the left-derivative of the utility function Us for the class of the new UE, divided by the capacity requirement coefficient cs for the class s of the new UE. After the processor 30e makes this determination of the provisional rate, and assigns this provisional rate to the user in step S102, use of this provisional rate in the remaining steps of FIG. 4 is identical to the steps described above with regard to the discrete rate scenario.
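
One way to picture the continuous case is a bisection search over the rate. The sketch below is not equation (34h); it is a generic illustration that assumes a concave, differentiable utility (so the marginal utility is non-increasing), uses "no less than θ" at the boundary, and uses a purely hypothetical utility U(r)=10·ln(1+r) for the demonstration. The function name `continuous_rate` and the parameter `r_max` are likewise assumptions.

```python
def continuous_rate(marginal_utility, c, theta, r_max, tol=1e-9):
    """Largest rate r in [0, r_max] with marginal_utility(r) / c >= theta.

    marginal_utility: derivative of the class utility function, assumed
                      non-increasing in r (concave utility)
    c:                capacity requirement coefficient of the class
    theta:            current control parameter value
    Returns 0.0 (zero rate) if the pay-off at r = 0 is already below theta.
    """
    if marginal_utility(0.0) / c < theta:
        return 0.0
    if marginal_utility(r_max) / c >= theta:
        return r_max
    lo, hi = 0.0, r_max           # pay-off >= theta at lo, < theta at hi
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if marginal_utility(mid) / c >= theta:
            lo = mid
        else:
            hi = mid
    return lo


def du(r):
    # Derivative of the hypothetical utility U(r) = 10 * ln(1 + r).
    return 10.0 / (1.0 + r)


# Pay-off is 100 / (1 + r) for c = 0.1, so theta = 40 gives r close to 1.5.
print(round(continuous_rate(du, 0.1, 40.0, r_max=4.0), 6))
```

For utilities whose derivative can be inverted in closed form, the search can be replaced by a direct formula clipped to the allowed rate range.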


Example embodiments having thus been described, it will be obvious that the same may be varied in many ways. Such variations are not to be regarded as a departure from the intended spirit and scope of example embodiments, and all such modifications as would be obvious to one skilled in the art are intended to be included within the scope of the following claims.

Claims
  • 1. A rate allocation method, comprising: adapting, by a processor, a control parameter value as user equipment (UE) enter a network sector based on a total load for tagged UE of the network sector and a determination to tag a prospective new UE, wherein the tagged UEs of the network sector correspond with all current and prospective network sector UEs that entered the network sector at an instance when the processor determined that the free capacity of the network sector is larger than or equal to a capacity threshold; assigning, by the processor, a provisional rate for the prospective new UE of the network sector based on the control parameter value and a determined marginal pay-off for admitting the prospective new UE; determining, by the processor, free capacity of the network sector; admitting, by the processor, the prospective new UE if enough free capacity exists in order for the network sector to service the prospective new UE at the provisional rate, wherein this method is accomplished without the knowledge of channel and traffic statistics of the network sector.
  • 2. The method of claim 1, wherein the adapting comprises: assigning an initial arbitrary value to be the control parameter value, iteratively adjusting the initial arbitrary value each time a UE enters the network sector by, determining a load demand for all tagged UEs of the network sector, wherein the load demand is based on a load of all tagged UEs at their respective service rates, selecting a step size value for the control parameter, determining an increment value based on the determined load and the step size, adding the increment value to one of the initial arbitrary value and the current control parameter value to arrive at a current control parameter value.
  • 3. The method of claim 2, wherein the assigning of the provisional rate for the prospective new UE includes, determining the marginal pay-off for admitting the prospective new UE for each of a plurality of available provisional rates for the prospective new UE, selecting the largest of the available provisional rates that has a marginal pay-off that exceeds the control parameter to be the provisional rate for the prospective new UE.
  • 4. The method of claim 2, wherein, the assigning the provisional rate for the prospective new UE includes, determining the largest rate for which the marginal pay-off for admitting the prospective new UE exceeds the control parameter, wherein this determined largest rate is the provisional rate for the prospective new UE, the determining of the free capacity of the network sector includes, summing a real-time load demand of all current UEs at the current UE's service rate.
  • 5. The method of claim 4, wherein the admitting of the prospective new UE includes, comparing the determined free capacity of the network sector to the capacity threshold, wherein the capacity threshold is the largest amount of capacity that is required to support any particular network sector UE at a maximum allowed rate for the particular network sector UE, tagging the prospective new UE if the determined free capacity is equal to or greater than the capacity threshold.
  • 6. The method of claim 5, further comprising: incrementally increasing the current control parameter value by a positive increment value based on the step size and the total load of the tagged UEs, if the prospective new UE is not tagged; and incrementally decreasing the current control parameter value by a negative increment value based on the step size and the total load of the tagged UEs, if the prospective new UE is tagged.
  • 7. The method of claim 6, wherein the admitting of the prospective new UE includes, performing the following steps if the prospective new UE is tagged, incrementally increasing the total load for tagged UEs by a provisional capacity requirement for the prospective new UE, the provisional capacity requirement being equal to the provisional rate of the prospective new UE multiplied by a capacity requirement coefficient, wherein the capacity requirement coefficient is a function of a signal strength experienced by the prospective new UE, admitting the prospective new UE as a newly tagged-and-admitted UE, incrementally decreasing the free capacity of the network sector by the provisional capacity requirement for the prospective new UE.
  • 8. The method of claim 6, wherein the admitting of the prospective new UE includes, performing the following steps if the prospective new UE is not tagged, comparing the free capacity of the network sector against the provisional capacity requirement for the prospective new UE, if the free capacity of the network sector equals or exceeds the provisional capacity requirement for the prospective new UE, admitting the prospective new UE to the network sector, and incrementally decreasing the free capacity of the network sector by the provisional capacity requirement for the prospective new UE, blocking the prospective new UE from the network sector, if the free capacity of the network sector is less than the provisional capacity requirement for the prospective new UE.
  • 9. The method of claim 6, wherein the assigning of the provisional rate for the prospective new UE includes, assigning the prospective new UE a zero-rate, if the marginal pay-off for admitting the prospective new UE does not exceed the control parameter value for a lowest rate that is available for the prospective new UE, blocking the prospective new UE from the network sector, if the prospective new UE is assigned a zero-rate.
  • 10. The method of claim 2, further comprising: if a UE departs the network sector, performing the steps of, incrementally increasing the free capacity of the network sector by a provisional capacity requirement for the departing UE, the provisional capacity requirement being equal to the provisional rate of the departing UE multiplied by a capacity requirement coefficient, wherein the capacity requirement coefficient is a function of a signal strength experienced by the departing UE, if the departing UE is a tagged UE, incrementally decreasing the total load for tagged UEs by the provisional capacity requirement for the departing UE.
  • 11. A device, comprising: a processor, configured to, adapt a control parameter value as user equipment (UE) enter a network sector based on a total load for tagged UE of the network sector and a determination to tag a prospective new UE, wherein the tagged UEs of the network sector correspond with all current and prospective network sector UEs that entered the network sector at an instance when the processor determined that the free capacity of the network sector is larger than or equal to a capacity threshold; assign a provisional rate for the prospective new UE of the network sector based on the control parameter value and a determined marginal pay-off for admitting the prospective new UE; determine free capacity of the network sector; admit the prospective new UE if enough free capacity exists in order for the network sector to service the prospective new UE at the provisional rate, wherein the processor does not require knowledge of channel and traffic statistics of the network sector.
  • 12. The device of claim 11, wherein the processor adapts the control parameter value by being further configured to, assign an initial arbitrary value to be the control parameter value, iteratively adjust the initial arbitrary value each time a UE enters the network sector by, determine a load demand for all tagged UEs of the network sector, wherein the load demand is based on a load of all tagged UEs at their respective service rates, select a step size value for the control parameter, determine an increment value based on the determined load and the step size, add the increment value to one of the initial arbitrary value and the current control parameter value to arrive at a current control parameter value.
  • 13. The device of claim 12, wherein the processor assigns the provisional rate for the prospective new UE by being further configured to, determine the marginal pay-off for admitting the prospective new UE for each of a plurality of available provisional rates for the prospective new UE, select the largest of the available provisional rates that has a marginal pay-off that exceeds the control parameter to be the provisional rate for the prospective new UE.
  • 14. The device of claim 12, wherein, the processor assigns the provisional rate for the prospective new UE by being further configured to, determine the largest rate for which the marginal pay-off for admitting the prospective new UE exceeds the control parameter, wherein this determined largest rate is the provisional rate for the prospective new UE.
  • 15. The device of claim 14, wherein the processor admits the prospective new UE by being further configured to, compare the determined free capacity of the network sector to the capacity threshold, wherein the capacity threshold is the largest amount of capacity that is required to support any particular network sector UE at a maximum allowed rate for the particular network sector UE, tag the prospective new UE if the determined free capacity is equal to or greater than the capacity threshold.
  • 16. The device of claim 15, wherein the processor is further configured to, incrementally increase the current control parameter value by a positive increment value based on the step size and the total load of the tagged UEs, if the prospective new UE is not tagged; and incrementally decrease the current control parameter value by a negative increment value based on the step size and the total load of the tagged UEs, if the prospective new UE is tagged.
  • 17. The device of claim 16, wherein the processor admits the prospective new UE by being further configured to, perform the following steps if the prospective new UE is tagged, incrementally increase the total load for tagged UEs by a provisional capacity requirement for the prospective new UE, the provisional capacity requirement being equal to the provisional rate of the prospective new UE multiplied by a capacity requirement coefficient, wherein the capacity requirement coefficient is a function of a signal strength experienced by the prospective new UE, admit the prospective new UE as a newly tagged-and-admitted UE, incrementally decrease the free capacity of the network sector by the provisional capacity requirement for the prospective new UE.
  • 18. The device of claim 16, wherein the processor admits the prospective new UE by being further configured to, perform the following steps if the prospective new UE is not tagged, compare the free capacity of the network sector against the provisional capacity requirement for the prospective new UE, if the free capacity of the network sector equals or exceeds the provisional capacity requirement for the prospective new UE, admit the prospective new UE to the network sector, and incrementally decrease the free capacity of the network sector by the provisional capacity requirement for the prospective new UE, block the prospective new UE from the network sector, if the free capacity of the network sector is less than the provisional capacity requirement for the prospective new UE.
  • 19. The device of claim 16, wherein the processor assigns the provisional rate for the prospective new UE by being further configured to, assign the prospective new UE a zero-rate, if the marginal pay-off for admitting the prospective new UE does not exceed the control parameter value for a lowest rate that is available for the prospective new UE, block the prospective new UE from the network sector, if the prospective new UE is assigned a zero-rate.
  • 20. The device of claim 12, wherein the processor is further configured to, if a UE departs the network sector, perform the steps of, incrementally increase the free capacity of the network sector by a provisional capacity requirement for the departing UE, the provisional capacity requirement being equal to the provisional rate of the departing UE multiplied by a capacity requirement coefficient, wherein the capacity requirement coefficient is a function of a signal strength experienced by the departing UE, if the departing UE is a tagged UE, incrementally decrease the total load for tagged UEs by the provisional capacity requirement for the departing UE.