The present invention relates to network traffic management. In particular, the present invention relates to a system and method for resource allocation in wireless or wired networks.
Ideally, wireless networks should be optimized to deliver the best Quality of Service (in terms of reliability, delay, and throughput) to subscribers with the minimum expenditure in resources. Such resources include transmitted power, transmitter and receiver complexity, and allocated frequency spectrum. Over the last few years, there has been an ever increasing demand for wireless spectrum resulting from the adoption of throughput hungry applications in a variety of civilian, military, and scientific settings. Because the available spectrum is non-renewable and limited, this demand motivates the need for efficient wireless networks that maximally utilize the spectrum. Although there is a severe shortage in the spectrum, it is well-documented now that a significant fraction of the available spectrum is under-utilized [1]. This, in fact, is the main motivation for the cognitive networking framework where secondary users are allowed to use the spectrum in the off time, where the primary users are idle, in an attempt to maximize the spectral efficiency [2]. Unfortunately, the cognitive radio approach is still facing significant regulatory and technological hurdles [3], [4] and, at best, offers only a partial solution to the problem.
This limitation of the cognitive radio approach is intimately tied to the main reason behind the under-utilization of the spectrum; namely the large disparity between the average and peak traffic demand in the network. As an example, in a typical cellular network, one can easily see that the traffic demand in the peak hours is much higher than that at night, which inspires the different rates offered by cellular operators. Now, the cognitive radio approach assumes that the secondary users will be able to utilize the spectrum in the off-peak times but, unfortunately, at those particular times one may expect the secondary traffic characteristics to be similar to that of the primary users (e.g., at night most of the primary and secondary users are expected to be idle).
In the traditional approach, wireless networks are constructed assuming that the subscribers are equipped with dumb terminals with very limited computational power. It is obvious that the new generation of smart devices enjoy significantly enhanced capabilities in terms of both processing power and available memory. Moreover, according to Moore's law predictions, one should expect the computational and memory resources available at the typical wireless device to increase at an exponential rate. This observation should inspire a similar paradigm shift in the design of wireless networks whereby the capabilities of the smart wireless terminals are leveraged to maximize the utility of the frequency spectrum, a non-renewable resource that does not scale according to Moore's law.
The introduction of smart phones has resulted in a paradigm shift in the dominant traffic in mobile cellular networks. While the primary traffic source in traditional cellular networks was real-time voice communication, one can argue that a significant fraction of the traffic generated by the smart phones results from non-data-requests (e.g., file downloads). This feature allows for more degrees of freedom in the design of a scheduling algorithm.
There is a need for a new approach to the resource allocation aspect of the problem and a new paradigm that offers spectral gains in a variety of relevant scenarios. More specifically, there is a need for a proactive resource allocation framework that exploits the predictability of daily usage of wireless devices to smooth out the traffic demand in the network, and hence, reduce the required resources to achieve a certain point on the Quality of Service (QoS) curve.
The disclosed system and method comprises a proactive resource allocation framework in which the predictability of user behavior is exploited to balance the network traffic over time, and hence, significantly reduce the bandwidth required to achieve a given blocking/outage probability. The disclosed proactive resource allocation framework avoids limitations associated with off-peak demand, and hence, achieves a significant reduction in the peak to average demand ratio without relying on out of network users. It is based on a simple model in which smart devices are assumed to predict the arrival of new requests and submit them to the network T time slots in advance. Using tools from large deviation theory, the resulting prediction diversity gain is quantified to establish that the decay rate of the outage event probabilities increases linearly with the prediction duration T. This model is then generalized to incorporate the effect of prediction errors and the randomness in the prediction lookahead time T. In the cognitive networking scenario, the appropriate use of proactive resource allocation by the primary users results in more spectral opportunities for the secondary users at a marginal, or no, cost in the primary network outage.
In alternative embodiments, functionality is provided on smart devices and/or at servers. The allocation techniques may be adapted for use in wired as well as wireless networks.
Usage of wireless devices is highly predictable. This claim is supported by a growing body of evidence that range from the recent launch of Google Instant to the interesting findings on predictable mobility patterns [5]. In this context, a relevant example is the fact that preference for a particular news outlet is not expected to change frequently. So, if the smart phone observes that the user is downloading CNN, for example, in the morning for a sequence of days in a row then it can safely anticipate that the user will be interested in the CNN again the following day. Coupled with the fact that the most websites are refreshed at a relatively slow rate, as compared with the dynamics of the underlying wireless network, one can now see the potential for scheduling early downloads of the predictable traffic to reduce the peak to average traffic demand by maximally exploiting the available spectrum in the network idle time.
It is important to observe here the temporal and spatial scales at which this predictability phenomenon exhibits itself. First, there is a growing body of evidence that behavioral patterns can be accurately predicted at the single user level. On the temporal scale, the requests are largely predictable at the scale of the application layer (e.g., minutes and hours) which is much slower than the dynamics of the physical, medium access, and network layers. This property is a key enabler for exploiting capacity enhancing techniques that introduce delays at the same time scale.
The disclosed system and method improves the spectral efficiency of wireless or wired networks through the judicious exploitation of the predictable behavior of users. More specifically, in the current paradigm, traffic requests are considered urgent, at the time scale of the application layer, and hence, have to be served upon initiation by the network users in order to satisfy the required QoS metrics. However, if the user devices can predict the requests to be generated by the corresponding users and submit them in advance, then the network has the flexibility in scheduling these requests over an expanded time horizon as long as the imposed deadlines are not violated. When a predictive network serves a request before its deadline, the corresponding data is stored in cache memory of the user device and, when the request is actually initiated, the application pulls the information directly from the memory instead of accessing the network. It is worth noting that not all applications, although predictable, can be served prior to their time of initiation. For example, some multimedia traffic maybe predictable, but can only be served on a real-time basis as they are based on live interactions between users. However, predicting these types of requests can still be considered an advantage, as the network may schedule other non-real-time requests while taking into account the predicted real-time requests in a way that enhances the QoS of all applications.
Unless otherwise stated, a simplified model of a single cell slotted wireless network where the aggregate requests are allowed to arrive only at the beginning of each slot is used. The number of arriving requests at time slot n>0 is denoted by Q(n) which is assumed to be ergodic and to follow a Poisson distribution with rate λ. All requests are assumed to have the same amount of required resources which is taken to be unity. That is, each request has to be totally served in a single slot by consuming one unit of resource. Moreover, the wireless network has a fixed capacity C per slot. Furthermore, it is assumed that a predictive wireless network can anticipate the arrival of each request by an integer number of time slots in advance. That is, if q(n), 1≦q≦Q(n), is the ID of a request predicted at the beginning of time slot n, the predictive network has the capability of serving this request no later than the next Tq(n) slots. Hence, when a request q(n) arrives at a predictive network, it has a deadline or expiration time at time slot Dq(n)=n+Tq(n) as shown in the prediction model of
In the non-predictive network, all arriving requests at the beginning of time slot n have to be served in the same time slot n, i.e., if q(n) is a non-predicted request, its deadline or expiration time is Dq(n)=n meaning that Tq(n)=0. It is assumed that an outage event occurs at a certain time slot if and only if at least one of the requests in the system expires in this slot. The model operates as the time scale of the application layer at which: 1) the current paradigm, i.e., non-predictive networking, treats all the requests as urgent; 2) each slot duration is in the order of minutes and possibly hours; and 3) the system capacity is fixed because the channel fluctuation dynamic are averaged out at this time scale.
The probability of outage, P(outage), is the performance metric under a scaling regime whereby λ and C increase such that the ratio
is kept at a constant value γ, 0≦γ≦1. In other words, λ is scaled as Cγ for each choice of γ. Under this assumption, the diversity gain is defined as
for both the non-predictive and predictive networks.
The diversity gain for the two networks when both witness the same arrival process is Q(n), n>0 per slot. The difference is in only the deadlines of the arriving requests. The deadline for a request q(n) is slot n when the network is non-predictive, and is n+Tq(n) when the network is predictive with Tq(n)=1,2, . . . . In general, as the system capacity C grows, the outage probability is expected to decrease. Tools of large deviation theory [6], [7] may be used to characterize d(γ), which quantifies the achievable diversity-multiplexing tradeoff, in different scenarios. The following result determines the prediction diversity gain for the deterministic look-ahead time case, i.e., Tq(n)=T ∀q(n).
Theorem 1.
The diversity gain of proactive scheduling for the above model with T-slot prediction equals
d
P(γ)=(1+T)(1−γ).
Noting that the diversity gain of the non-predictive scenario is obtained as a special case by setting T=0, i.e., dN(γ)=(1−γ), this result reveals that proactive scheduling offers a multiplicative gain of (1+T) in the achievable diversity advantage.
Proof.
Start with the non-predictive benchmark corresponding to T=0. In this case, the outage probability in any slot n corresponds to the event {Q(n)>C}, which can be expressed as
For large values of C, tightest Chernoff bound [6] can be used to upper bound the outage probability as
P
N(outage)≦eC−C
Furthermore, from (2), it is obvious that
Hence, by taking the log of the upper and lower bounds on PN(outage) in (3), (4) and dividing by −C log C it follows directly that the diversity gain of the non-predictive network is equal to
d
N(γ)=1−γ. (5)
For T>0, it is easy to see that the First-In-First-Out (FIFO), or equivalently Earliest Deadline First (EDF), scheduling policy minimizes the outage probability in this simple scenario. To characterize the diversity gain, the following two events to upper and lower bound the outage event are defined
In the steady state, i.e., when n→∞, shown in [8]
Pr(Ld(n))≦Pp(outage)≦Pr(Ud(n)).
Further
Combining these two relationships results in the diversity gain expression:
d
P(γ)=(1+T)(1−γ).
In a more general case, Tq(n), 0≦q≦Q(n), n>0 is a sequence of i.i.d. nonnegative integer-valued random variables defined over a finite support Tmin, Tmin+1, . . . , Tmax. First, consider the scenario in which probability mass function (PMF) of Tq(n) does not scale with C and establish the critical dependence of the achievable diversity gain on Tmin>0.
Lemma 2.
Let the PMF of Tq(n) be given by
and the probabilities pk's are constants that do not depend on C. Then,
d
P(γ)=(1+Tmin)(1−γ).
Proof:
A lower bound on the outage probability can be obtained by considering only the fraction of the requests corresponding to Tmin whereas an upper bound can be obtained by making Tq(n)=Tmin ∀q(n). Both bounds have the same decay rate corresponding to the stated diversity advantage.
It is clear that the diversity gain of random T scenario is dominated by the requests with T=Tmin, and hence, under the previous assumptions the system does not experience any prediction diversity gains when Tmin=0.
When the fraction of requests corresponding to Tmin decays as C grows, which is reasonable to expect in many emerging applications as most of the new demand corresponds to predictable and delay tolerant data traffic, then the proactive resource allocation framework is able to harness improved prediction diversity gains. This can be viewed as follows. To illustrate the idea, assume that Tmin=0 and pT
d
P(γ)=1+α−γ (7)
as long as 1+α−γ is smaller than 2(1−γ) or equivalently, α≦1−γ. Otherwise, the diversity gain is determined by the requests with T=1 and is given by
d
P(γ)=2(1−γ). (8)
This argument is extended in [8] for more general distributions of the look-ahead time T.
The proactive resource allocation paradigm significantly enhances the prediction diversity gain under the assumption of perfect, i.e., error free, prediction. The effect of prediction error on the prediction diversity gain may be quantified. Consider the deterministic T scenario, and assume that the traffic of the non-predictive system is characterized by the process Q(n), n>0 which represents the number of arriving requests at the beginning of time slot n with T=0. This process is Poisson with rate Cγ. Moreover, the system is operating according to the Shortest Deadline First scheduling policy. The model differentiates between the following two prediction error events.
The network mistakenly predicts a request and serves it resulting in an increase in the traffic load.
The predictive network fails to predict a request and, as a consequence, it encounters an urgent arrival with Tq(n)=0.
Therefore, the arriving requests QE(n), n>0 can be regarded as the superposition of two arrival processes: 1) Q′(n) corresponding to the predicted request at the beginning of time slot n with deadline n+T; and 2) Q″(n) corresponding to the urgent requests arriving requests at the beginning of time slot n and must be served instantaneously. The judicious design of the prediction algorithm aims to strike the optimal balance between these two events. This point is illustrated in the following special case: Q′(n) is Poisson with rate Cγ′, where γ′ε, and Q″(n) is Poisson with rate Cγ″, γ″≦γ such that
C
γ′
+C
γ″
≧C
γ. (9)
The constraint γ″≦γ follows directly from the fact that the arrival rate of the urgent requests cannot exceed the arrival rate of requests in the error free scenario. On the other hand, the constraint (9) reflects the fact prediction errors can only increase the arrival rate. In this model, a necessary and sufficient condition for perfect prediction is γ′=γ and γ″=−∞ resulting in QE(n)=Q′(n)=Q(n+T). The lookahead time T is a function of (γ′,γ″) reflecting the fact that more aggressive prediction algorithms result in a larger T at the expense of introducing larger prediction errors. Finally, assume that, given γ′ and γ″, both processes Q′(n) and Q″(n) are independent.
By setting γ′=a′γ and γ″=a″γ, the diversity gain of the predictive network will be given by
d
P(γ)=min{(1+T(α′,α″))(1−max{α′,α″}γ),1−″γ}. (10)
If max{α′, α″}=α″ the diversity of the predictive network becomes dP(γ)=1−α″γ. However, since α″≦1 and from (9), max{α′, α″}=α″ if and only if α′=α″=1 corresponding to the scenario where the predictive mechanism is useless. Therefore, in the following case where α′≧α″ in which case the prediction diversity gain is given by
d
P(γ)=min{(1+T(α′,α″))(1−α′γ),1−α″γ} (11)
implying that the predictive system achieves a strictly improved diversity gain over the non-predictive system if and only if,
min{(1+T(α′,α″))(1−α′γ),1−α″γ}>1−γ. (12)
An upper bound on the prediction diversity, for a given (α′,α″), corresponds to case where the optimum operating point for the two quantities inside the min{.} are equal, i.e.,
Hence, for a given (α′,α″), a prediction algorithm that achieves (14) is optimal in terms of the achievable prediction diversity and there is no benefit in increasing T further. Based on that, the achievability of prediction diversity gains hinges on the existence of prediction algorithms that satisfy the following necessary conditions
At this point, the previous model for prediction errors illustrates the tradeoff between the two types of error events identified earlier. More accurate models may be developed that reflect the nature of the traffic requests and the dynamics of the employed prediction algorithms. Referring to
Moreover, for the two simulated random T scenarios, although Tmin is chosen to be 2 and 1, the corresponding outage probability curves are upper bounded by the outage probability of the predictive case with deterministic T=2. This actually may be a consequence of the small values of C in
Referring to
Pr(T=t)=pt=(T
where Tmin≦t≦Tmax. The predictive system is then simulated for different values of p and the outage probability results are depicted. Moreover, the uniform distribution of T over the interval in=0 to Tmax=5 is plotted on the same
The previous example embodiment demonstrates the potential gains that can be leveraged from the proactive resource allocation framework when all the requests belong to the same class of QoS. In another example embodiment, a network has two QoS classes that can be considered as primary and secondary users sharing the same resources. The effect of prediction by the primary user only on the prediction diversity gain of the secondary network is evaluated. Clearly, the analysis can be extended to allow for prediction by the secondary user as well; but the example is limited to this special case for simplicity. It is assumed that the number of secondary arrivals at the beginning of time slot n is Qs (n), where Qs(n) follows a Poisson distribution with rate λs=Cγ
The outage probability of the secondary user and its diversity gain when the primary user is non-predictive is analyzed. At the beginning of time slot n, the system is supposed to witness Qp(n)+Qs(n) arriving requests with deadline is slot n, i.e., must be served in the same slot of arrival. The primary system has a fixed capacity C per slot. In order to enhance the utilization of its resources, the primary user allows secondary requests to be served by the remaining resources from serving the primary requests. Thus, at slot n, the remainder of C−Qp(n) is assigned to serve the secondary requests. The following result characterizes the achievable diversity gain in this scenario.
Theorem 3.
In the non-predictive scenario, the primary and secondary diversity are equal and given by
d
N
s(γp,γs)=dNp(γp,γs)=1−γp. 17)
Proof:
The outage probability of the primary system PNp(outage) is identical to the one analyzed in the previous section. As a result, the primary diversity gain is given by
d
N
p(γp,γs)=1−γp. (18)
The secondary system encounters an outage at a given slot when the remaining resources from serving the primary requests at this slot are less than the number of arriving secondary requests at the beginning of the same slot. Thus, if the primary network suffers an outage in a certain slot with at least one arriving secondary request, the secondary system goes in outage as well. The secondary system, consequently, encounters an outage at slot n if and only if
Q
p(n)+Qs(n)>C and Qs(n)>0.
If probability of the secondary network when the primary network is non-predictive is denoted by PNs(outage), hence
P
N
s(outage)=Pr(Qp(n)+Qs(n)>C,Qs(n)>0). (19)
The two random variables Qp(n)+Qs(n) and Qs(n) are dependent but their joint distribution can simply be obtained by transformation of variables. By setting Y=Qp(n)+Qs(n) and U=Qs(n), the exact expression of PNs(outage) is
The diversity gain of the secondary system coexisting with a non-predictive primary network is defined by
For large values of C, the outer sum of the right hand side of (20) is dominated by γ=C+1. However, the inner sum is not dominated by a single value of u because of (y−u)! u! in the denominator. Consequently, as C→∞, PNs(outage) can be written as
Characterizing dNs(γp,γs) from (21) is, however, difficult, so another approach is used based on the asymptotic behavior of upper and lower bounds on PNs(outage).
The upper bound on PNs(outage) is determined as follows. Since Pr (,)≦Pr() with equality if and only if A⊂, then
P
N
s(outage)≦Pr(Qp(n)+Qs(n)>C). (22)
The random variable Qp(n)+QS(n) has a Poisson distribution with mean Cγ
The event that there is at least one secondary arrival with a primary outage at slot n is considered a sufficient but not necessary condition on a secondary outage at slot n. That is,
L
N
s(n){Qp(n)>C,Qs(n)>0}, n→∞.
Note the event LNs(n) is not necessary for a secondary outage at slot n as there may be Qp(n)<C but Qs(n)>C−Qp(n) which results in a secondary outage at slot n too.
Furthermore, at steady state, Pr(LNs(n)) becomes independent of n as both arrival processes, Qp(n) and Qs(n), are stationary, hence use Pr(LNs) instead. Since LNs(n) is a sufficient T condition for a secondary outage, then PNs(outage)≧Pr(LNs). Hence,
Therefore
yielding
d
N
s(γp,γs)≦1−γp. (28)
From (24), (28), it follows that
d
N
s(γp,γs)=1−γp. (29)
Hence, the diversity gain of the secondary network in a primary non-predictive mode is the same as the diversity gain of the primary network although the arrival rate of secondary requests is strictly smaller than the primary arrival rate.
In this case, the system predicts the primary arrivals by T time slots in advance. It is assumed that T is deterministic and fixed for all primary requests, i.e., the deadline for the primary requests Qp(n) is n+T. The system, however, is assumed to be non-predictive for the secondary requests, i.e., the deadline for the secondary requests Qs(n) is n. When the system dedicates all the per-slot capacity C to serve the primary requests, according to the EDF policy, secondary requests arriving at the beginning of time slot n are served if and only if C is strictly larger than the number of primary requests existing in the system at the beginning of this slot. Unfortunately, this service policy does not enhance the outage performance of the secondary system although it minimizes the outage probability of the primary. The main reason is the large variations in the number of served primary requests per slot that takes on values from 0 to C. These variations are quite close to the variations in the number of served primary requests per slot in case of non-predictive primary network.
Referring to
The main idea motivating the first service policy is to minimize the probability of the dominant outage event instead of minimizing the overall outage probability. Thus, the diversity gain of the primary network is not affected while creating more opportunities for secondary requests. Consequently, the outage probability of the secondary network is enhanced at the same diversity gain of the primary network.
Service Policy 2 (SP2):
The primary network is assigned a fixed capacity per slot of C−└Cβ┘ where β<1. It uses this fixed capacity to serve as much as possible of primary requests in the system according to the shortest deadline request policy.
Clearly SP2 achieves the optimal primary diversity advantage, i.e., dPp(γp)=(1+T)(1−γp). Moreover, it is shown, numerically, in the following that the outage probability of the secondary network is improved because of the dedicated capacity of └Cβ┘. At this point, it is observed that SP2 allocates a fixed capacity per slot to the primary network. However, due to the variability of the arrival process, one may expect some performance gains if the service policy adaptively decides on the allocated capacity for the primary network based on the number of requests in the system at each slot and their deadlines. This intuition motivates the following policy.
Service Policy 3 (SP3): Let Np(n) be the number of the primary requests in the system at the beginning of time slot n, and Ndp(n) be the number of these requests whose deadline is slot n. Then, the capacity of the primary network at slot n is calculated as
min{C,Ndp(n)+f×(Np(n)−Ndp(n))}
where 0≦f≦1. After that, the network serves the primary requests according to the EDF policy.
The performance of SP3 is highly dependent on the design parameter f. At f=0, the system, at steady state, is serving only the requests whose deadline is the current slot. In this case the system is similar to the non-predictive network in terms of primary and secondary outage probabilities. At f=1, the system is very selfish, and hence, achieving the optimal primary outage probability. The following numerical results, however, show that intermediate values for f result in significant improvement in the secondary outage while keeping the primary outage probability almost indistinguishable from the optimal one.
The performance of a network with primary and secondary users has been evaluated numerically with the same parameters of
In
The present disclosure relates to a paradigm for resource allocation that exploits the predictability of user behavior to minimize the spectral resources (e.g., bandwidth) needed to achieve certain QoS metrics. Unlike the tradition reactive resource allocation approach, in which the network can only start serving a particular user request upon its initiation, the disclosed resource allocation approach anticipates future requests which allows the network more flexibility in scheduling those potential requests over an extended period of time. By adopting the outage (blocking) probability as the QoS metric, the proactive resource allocation framework may achieve significant spectral efficiency gains in several interesting scenarios. More specifically, the notion of prediction diversity gain may be used to quantify the gain offered by the proposed resource allocation algorithm under different assumption on the performance of the traffic prediction technique. Moreover, in a network with two QoS classes, prediction at one side only does not only enhance its diversity gain, but it also improves the outage probability performance of the other user.
In one embodiment, a smart phone application runs on the Android and/or the iPhone-iPad operating systems. In an alternative embodiment, the application executes at a server. This application uses machine learning techniques to build a profile for the smart phone user. This profile is used to build a list of anticipated requests by the user in the near future. The application locates nearby free (or authorized) Wi-Fi hotspots through the location information available from the GPS service and the Wi-Fi service (or any similar technique). When the smart phone or other user device is in the range of such hotspots, it automatically pre-fetches the date corresponding to the anticipated user requests. These data are cached in the local memory of the device and used later upon the initiation of the actual request by the user. The user benefits by: 1) lowering the amount of data it downloads from the network and the associated monthly cost; 2) enhancing the experience by reducing the probability of experiencing outages. The network also benefits because some of the peak data requests are rerouted via the Wi-Fi networks. Based on the user profile, the information refreshment rate and the local memory management technique are optimized. Finally, the application includes a recommendation system that makes the appropriate suggestions to the user based on the available data in the local memory, the user preferences (in terms to anticipated cost, data plan, etc. . . . ), and the current network conditions.
The previously described application is a representative implementation of the technology when both the supply and demand are variable. In this example, the varying supply corresponds to the probabilistic nature of the availability of Wi-Fi networks in the range of the smart phone or device. By predicting the future user demand, the disclosed system and method is able to facilitate a better matching between the supply and demand resulting in improving the welfare of both the network user and operator. It is worth noting that the application may be further optimized based on the context information available from the multitude of sensors and GPS incorporated in the smart device. Other operating systems (beyond Android, iPhone, and iPad) may be used to empower smart wireless or wired devices.
Therefore, while certain embodiments of the present invention are described in detail above, it is to be understood that the scope of the invention is not to be considered limited by such disclosure, and modifications are possible without departing from the spirit of the invention as evidenced by the following claims: REFERENCES
This application claims priority to U.S. Provisional Patent Application No. 61/387,245 filed on Sep. 28, 2010, which is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US11/53740 | 9/28/2011 | WO | 00 | 9/18/2013 |
Number | Date | Country | |
---|---|---|---|
61387245 | Sep 2010 | US |