The present invention relates generally to communication networks, and more specifically to admission control and scheduling techniques for controlling the transfer of data through such networks.
A very important and challenging problem in the design of high-speed communication networks is that of providing Quality of Service (QoS) guarantees, usually specified in terms of loss probabilities or packet delays in the network. For example, the control of packet delays is often of crucial importance, particularly for real-time applications such as video delivery systems, wireless networks, multimedia networks, call centers, etc. A basic decision that has to be made in such contexts is that of connection admission control, i.e., one has to determine when a new user can be admitted to the system, while still fulfilling the QoS requirements of all users already in the system. Moreover, users already in the system have to be scheduled in the most efficient manner so as to maximize the number of users that can be admitted.
Exemplary admission control and scheduling techniques are disclosed in U.S. Pat. No. 5,838,663, issued Nov. 17, 1998 in the name of inventors A. Elwalid, D. Mitra and R. H. Wentworth and entitled “Method for Admission Control and Routing by Allocating Network Resources in Network Nodes,” which is incorporated by reference herein. For example, this patent discloses a technique that may be utilized in an admission control element of a router, switch or other network processing element to determine the maximum number of connections that can be admitted, for a given buffer size, without introducing undue amounts of loss or delay in the network traffic. However, in certain circumstances it is possible that this technique may substantially underestimate the maximum number of admissible connections, e.g., in the case of large buffer sizes. A need therefore exists for improved techniques which can provide a better estimate of the maximum number of admissible connections.
In addition, with regard to the problem of resource allocation, it is well known that there is a tradeoff between bandwidth and buffer space. More particularly, an input that is fed to a network link having unlimited bandwidth capacity requires zero buffer space and vice versa. However, conventional techniques have been unable to determine in an efficient manner the optimum tradeoff between bandwidth and buffer space. It would therefore also be desirable if improved techniques were available for determining this tradeoff. Such techniques could be used, for example, in network configuration, in the above-noted admission control, as well as in the scheduling of users already admitted to a network.
The invention provides improved techniques for controlling the transmission of packets over a communication link in a network.
In an illustrative embodiment, data packets from multiple input sources are admitted to or otherwise processed in a node of the network based at least in part on a determined number of supportable connections N. The multiple input sources may be modeled as deterministic on-off sources subject to dual leaky bucket regulation, such that a given source has associated therewith a peak rate p, a mean rate r and a maximum burst period T. The multiple sources may comprise homogeneous sources, each having the same peak rate p, mean rate r and maximum burst period T. The multiple sources may alternatively comprise heterogeneous sources of different types i, i=1, 2, . . . K, wherein a given source of type i has peak rate pi, mean rate ri, and maximum burst period Ti, and Ni denotes the number of sources of type i.
In accordance with one aspect of the invention, a determination is made regarding the number of supportable connections N, for a given buffer size in a node of the network, that will permit maintenance of one or more specified quality of service requirements for packet transmission. This determination in the case of homogeneous input sources utilizes a combined approximation of the maximum number of supportable connections for a range of buffer sizes, the combined approximation being generated as a combination of a first approximation for small buffer sizes and a second approximation for large buffer sizes. The first approximation may be a square root approximation configured to reflect positive correlations between the multiple sources, while the second approximation may be a Brownian bridge approximation configured to reflect negative correlations between the multiple sources. The combined approximation may be a concave curve combination of first and second curves corresponding to the respective first and second approximations.
In accordance with another aspect of the invention, an optimal operating point on a buffer-capacity tradeoff curve is determined for each of a number of different types of heterogenous input sources. The tradeoff curve specifies the variation in the buffer size as a function of the capacity of a link over which the packet transmission connections are made, and the operating point designates the given buffer size and corresponding capacity for a particular number of connections. The determination of the optimal operating point may involve partitioning a buffer size B and link capacity C into (Bi, Ci) for the i different types of sources in conjunction with maximizing N for a given connection mix vector.
In accordance with yet another aspect of the invention, the optimal operating point may be determined in accordance with a proportional control algorithm. For example, the proportional control algorithm may reduce a multiplier λ if an actual buffer utilization is higher than an actual bandwidth utilization, by an amount proportional to the absolute difference in the utilizations, and increase λ if an actual buffer utilization is lower than an actual bandwidth utilization, also by an amount proportional to the absolute difference in the utilizations. As a result, the proportional control algorithm converges at the optimal operating point without requiring explicit determination of an admissible region.
The techniques of the present invention may be implemented at least in part in the form of one or more software programs running on a router, switch or other type of programmable network processing element.
The present invention will be illustrated below in conjunction with exemplary scheduling techniques implemented in a node of a communication network. It should be understood, however, that the invention is not limited to use with any particular type of communication network device or network configuration. The disclosed techniques are suitable for use with a wide variety of other systems and in numerous alternative applications. A given network node may correspond, by way of example and without limitation, to a router, switch or other network processing element. The term “node” as used herein should therefore be understood to encompass any network processing element capable of utilizing one or more of the techniques of the invention.
The term “packet” as used herein is intended to include any grouping or other arrangement of data suitable for transmission over a communication network link. A given “packet” in accordance with this definition may therefore include only a single bit, but more typically will include multiple bits arranged in accordance with a specified packet format.
The node 10 includes a buffer 15 that receives packets associated with the inputs 11. Although only a single buffer is shown in this example, it will be appreciated by those skilled in the art that the node may include multiple such buffers. Also associated with the node 10 is an admission control element 16 and a scheduler 18. The admission control element 16 controls the admission of connections within the node 10, e.g., determines which of the inputs 11 are permitted to deliver packets into the buffer 15, while the scheduler 18 retrieves packets from the buffer 15 for transmission over the link 14 in accordance with a specified scheduling policy. One or both of the elements 16 and 18 operate in accordance with the techniques of the invention, to be described in greater detail below, but these elements may otherwise be configured in a well-known conventional manner. For example, such elements may be implemented using one or more software programs stored in a memory of the node and executed by a processor of the node, as will be illustrated in conjunction with
The processor 110 may represent, e.g., a microprocessor, a computer, a central processing unit (CPU), an application-specific integrated circuit (ASIC), as well as portions or combinations of these and other suitable processing devices. The memory 112 may represent, e.g., an electronic memory, a magnetic or optical disk-based memory, as well as portions or combinations of these and other memory devices. For example, the memory 112 may represent a memory associated on a common integrated circuit with the processor 110, or a separate electronic memory integrated with the processor 110 into a computer or other device, and may be used to implement the buffer 15 of
The present invention provides techniques for determining an appropriate number of sources that may be admitted by a given network node, such as node 10 described in conjunction with
The invention will be illustrated using an example model in which the inputs 11 of
The data traffic from the deterministic on-off sources is assumed to be regulated, i.e., subject to admission control, in accordance with a well-known dual leaky bucket (DLB) technique. This technique specifies the following three parameters for a given source: a peak rate p, a maximum average or sustainable rate r (also referred to herein as the mean rate), and a maximum allowed burst period or on-time T (which is defined as the maximum time that the source is allowed to send consecutively at the peak rate p).
The sources may be heterogenous, i.e., of different types i, i=1, 2, . . . K. A given source of type i has mean rate ri, peak rate pi and on-time Ti, and Ni denotes the number of sources of type i. The total number of sources is given by N, where
In the case of homogeneous traffic, all of the sources have the same traffic descriptor (r, p, T), and the resources B and C can be rescaled by the number of inputs N as follows:
B≡Nb and C≡Nc.
It should be understood that the above-noted deterministic on-off source assumption, and any other assumptions made herein, are for purposes of illustration only, and should not be construed as limiting the scope of the invention in any way. Alternative embodiments of the invention may be configured in which one or more of the described assumptions do not apply.
In general, the invention in the illustrative embodiment involves determining the probability of the actual buffer content exceeding the maximum buffer size B, this probability being denoted herein as L(B, C), or, equivalently, the probability that packet delay D exceeds B/C. For further simplification of the description, it will be assumed that
ΣiNiri<C<ΣiNipi,
which excludes certain trivial situations.
With regard to the above-noted case of homogeneous traffic, we have determined that the loss curve, i.e., the decay rate of the loss probability as a function of b, is convex at a specific b, if the packet arrivals are negatively correlated on a time scale related to overflow. Conversely, the loss curve is concave at a specific b, if the packet arrivals are positively correlated on the time scale related to overflow.
As noted above, deterministic on-off sources exhibit positive correlation on short time scales and negative correlation on larger time scales. These correlations can be visualized by examining the variance of the traffic A(t). As a preliminary matter, the distribution of A(t) can be made explicit as follows. The period of the source is denoted by S:=T+T′. We assume that T<T′, but an analogous reasoning applies to the case that T≧T′. Because an epoch U of the start of the burst is uniformly distributed on the interval [0, S], the following three cases can be distinguished:
One can then calculate the variance σ2(t):=p2VarX(t), where VarX(t) is given by:
Between 0 and S, a plot of the variance σ2(t) is first convex, then concave, and finally convex again. It should be noted that concavity indicates negative correlations, and, conversely, convexity indicates positive correlations.
The first part of the above-noted loss curve is concave, due to the positive correlations on the short time scale, whereas the second part will be convex, because of the negative correlations on the somewhat longer time scale. The present invention provides specific approximations for both of these parts of the loss curve, and a combined approximation which provides a good fit for a variety of different types of traffic over both large and small buffer sizes. For small values of b, we will use a “square root” small buffer approximation, to be described below, in which the positive correlations are exploited. For larger values of b we will use an approximation, also to be described below, based on the well-known Brownian bridge, which naturally incorporates the negative correlations. The combined approximation is formed as a concave curve combination of the small and large buffer approximations.
In this case, the positive correlations between the input traffic sources are dominant. This observation is utilized in this section to specify an expression defining the maximum number of sources to be admitted, e.g., in the form of a defined admissible region.
A. Approximation
First define
It can be shown that the amount of traffic I(b) generated by a single one of the homogeneous sources increases rapidly for small b as √{square root over (b)}:
Notice that this formula reflects the positive correlations: I(b) is concave for small b. This leads to the loss approximation
B. Admissible Region
For small values of B it is expected that the number of admissible sources grows rapidly. If e−δ is the required loss fraction, we have to solve
where NB is the Brownian bridge approximation of the number of admissible flows for a given buffer size B. Implicit differentiation with respect to B yields
where ′ denotes derivative with respect to the argument C/N. This yields √{square root over (B)}·dN/dB→K/2 when B↓0, with
Hence, NB grows quickly for small B, i.e., as N0+K√{square root over (B)}; N0 follows from N0·α(C/N0)=−δ
This sections deals with an approximation of the loss probability for large values of the buffer size B, and a corresponding expression for the admissible region.
A. Approximation
Let B(t) be Brownian motion with drift EB(t)=μt and VarB(t)=σ2t. It can be verified that, for tε[0, S],
E(B(t)|B(S)=μS)=μt, and
where (B(t)|B(S)=μS) denotes the Brownian bridge for tε[0, S].
Now consider traffic generated by a single stream of a well-known N·D/D/1 queue, i.e., a queue fed by N input sources, each of which emits a packet every, say, S units of time. Let Ā(t) be the number of packets generated by such a stream in the interval [0, t). It can be shown that EĀ(t)=t/S and Var Ā(t)=t(S−t)/S2. In other words, for μ=1/S and σ2=1/S, the first two moments of Ā(t) and the Brownian bridge coincide. In the central limit regime, i.e., when the load of the queue is relatively high, the mean and variance essentially determine the moment generating function of the input process:
This justifies the use of the Brownian bridge as an approximation of the N·D/D/1 queue. The probability distribution of the Brownian bridge is explicitly known in the art.
For deterministic on-off sources, VarA(t) is not proportional to t(S−t), but it does have the property that it drops to 0 at time S. We can therefore choose μ and σ2 of the Brownian bridge to fit the mean and to conservatively bound the variance VarA(t) with a function proportional to t(S−t). The resulting Brownian bridge can then be applied on the corresponding process to get a conservative estimate.
By way of example, we can choose μ=rS=pT. The conservative, i.e., dominating, variance should satisfy σ2BB(t)≧σ2(t) for all tε[0, S]. If 0≦T≦S/2, this may be done by choosing σ2BB(t)=σ2t(S−t) where
having equality at t=0, S/2, and T, and strict inequality elsewhere. T should be replaced by S−T in the formula S/2<T≦S.
Let Bt(•) be a Brownian bridge with parameters μBB(•) and σ2BB(•). Because of the conservative choice of τ2, and by direct application of the overflow probability formula for the Brownian bridge, we obtain the following expression for the loss probability:
B. Admissible Region
For the N·D/D/1 queue with impulses of size pτ every S units of time, the Brownian bridge approximation looks like
For a given loss probability L=e−δ, we obtain the maximum number of admissible sources NB as
As will be described below, this expression provides improved results relative to conventional techniques for determining the admissible region.
The link rate or capacity C for each of these examples is assumed to be 150 Mbps. As is apparent from the above table, the classes are arranged in order of increasing burstiness. The plots in
Each of these figures includes a number of individual curves, which correspond to (1) the above-described Brownian bridge approximation (BB) for large buffer size, (2) an asymptotically exact but computationally difficult conventional measure (exact), (3) the conventional approach of A. Elwalid, D. Mitra and R. H. Wentworth (EMW) as described in the above-cited U.S. Pat. No. 5,838,663, (4) the above-described approximation for small buffer size (SmallBuf), and (5) a concave envelope of BB and SmallBuf.
The terms “small” and “large” as used herein with regard to buffer size are intended to include without limitation those buffer sizes for which the respective SmallBuf and BB approximations provide a good approximation of the exact measure.
It can be seen that the conventional EMW approach generally performs well for small buffer sizes, but loses its efficiency for larger buffer sizes, as was mentioned previously. The Brownian bridge approximation of the present invention, however, is close to the exact curve for large buffer size, e.g, buffer sizes greater than about 2.5 Mb, as can be seen from
The concave envelope provides a unified approximation. The portion of the SmallBuf curve for small buffer sizes is merged with the portion of the BB curve for large buffer sizes, by joining the two portions with a linear segment such that a concave envelope is formed. It is apparent that the concave envelope curve provides a good fit for all of the example traffic classes, and is nearly always still conservative.
The case of heterogeneous traffic will now be considered. It is assumed that there are multiple classes of regulated sources sharing common resources C and B. The sources are homogeneous within each class, but may differ in traffic parameters, e.g., DLB parameters, across classes. The difference in parameters generally implies that the deterministic on-off profiles for each class may have different period on-times and amplitudes.
In accordance with the invention, we optimally partition (B, C) into (Bi, Ci) among the classes, given a fixed number Ni of sources in each class. The term “optimal” in this context refers to maximizing the size of the admissible region by maximizing N for a given connection mix vector
Partitioning of resources, which can support diverse QoS requirements by protecting individual classes, generally loses the multiplexing advantage obtained by sharing across classes. However, partitioning is still very efficient in the practical case of small number of classes K, each with large number of connections Ni>>K∀i.
We will initially formulate a general version of the above-noted partitioning problem, followed by a specialized solution for the Brownian bridge. We conclude this section by describing the relevance of the partitions to conventional generalized processing sharing (GPS) and shared buffer management.
A. Solution to the General Problem
Consider the admissible region specified by
Ri(Ci, Bi, Ni)=δi:=−log Li
for each class i with distinct traffic parameters. For a given Ni, δi pair, the above equation defines a convex Bi versus Ci tradeoff curve (i.e., a buffer-bandwidth tradeoff curve, more generally referred to herein as a buffer-capacity tradeoff curve), for the concave envelope introduced in the previous section, as well as for the small and large buffer approximations individually. To find the admissible region we then seek to maximize N subject to
At the above-noted optimum, all of the loss constraints in this formulation would hold with equality, as otherwise it would be possible to reduce Bi and/or Ci for the corresponding class and thereby admit more connections. The fact that N needs to be integral may be ignored for simplicity, since this is of minor consequence when N>>1. For fixed N, in particular the optimal value, the buffer size B can be determined as a function of the other parameters, i.e., Bi=Bi(Ci, Nηi, δi). Now consider the intermediate problem:
This is a standard convex minimization to which well-known Strong-Lagrangian principles can be applied, which yields the following Kuhn-Tucker conditions:
for some global (class-dependent) non-negative Lagrange multiplier λ. It follows that the maximum value of N retains feasibility of the following conditions for some λ, which represents the slope of each of the Bi versus Ci tradeoff curves at the optimal operating point:
As indicated above, the operating region in
within the operating region for some i, as occurs for two of the curves in
The multiclass problem has thus been reduced to a two variable optimization on N, λ, which can be solved using a simple bisection search procedure on λ for fixed N followed by an outer bisection on N. This general procedure is used to obtain the admissible region for the concave fit, but further simplification is possible if the Brownian bridge approximation is adequate, as will be described below.
The general procedure is applicable to all traffic types with convex tradeoff curves, of which leaky bucket regulated traffic is just an example. In fact, the procedure holds for all traffic types, since the tradeoff curve has been shown to be convex for all traffic types, as is described in the above-cited U.S. Provisional Application No. 60/244,956, and in an article by K. Kumaran and M. Mandjes, “The buffer-bandwidth trade-off curve is convex,” Queueing Systems, Vol. 38, No. 4, August 2001, pp. 471–483, which is incorporated by reference herein.
Thus, it can be shown that the above-noted convexity holds under more general circumstances, motivated by the supposition of “diminishing returns” in buffer-capacity tradeoffs. This refers to the intuitive observation that, for a fixed number of connections and specified performance target, one obtains diminishing savings in bandwidth as buffer space is increased, and vice versa. Note that the tradeoff curves could be obtained by measurements when traffic characteristics are not available explicitly.
The above techniques can also be utilized to provide a control algorithm to achieve the optimal operating point without explicitly computing the admissible region. The per-class (Bi, Ci) tradeoff curves, along with measurements of buffer and bandwidth utilizations, are the main drivers of this control. By way of example, the following simple proportional control converges at the unique optimal value of λ, which necessarily exists if the problem is at all feasible:
Reduce (Increase)λ, if the buffer utilization is higher (lower) than the bandwidth utilization, by an amount proportional to the absolute difference in the utilizations. Reducing λ would decrease the Bi and increase Ci for every class, while increasing λ does the opposite. Hence the above proportional control algorithm efficiently reallocates resources to achieve balanced buffer and bandwidth usage overall.
It should be noted that the above-described partitioning solution neglects possible inter-class sharing of resources, which could lead to significant inefficiency when the number of classes is large. Alternative embodiments of the invention could be configured to take such inter-class sharing or resources into account.
B. Brownian Bridge
Applying the above-described partitioning technique to this special case, we seek to maximize N subject to
It can be shown that the necessary convexity conditions hold here, and that these equations can be solved nearly in closed form, except for numerically solving for λ in
where the optimum value for N is given by
for λ satisfying this equation, and where
C. Relation to GPS and Buffer Management
The optimal partitioning results described above may be used to set GPS weights and to perform buffer management in an integrated fashion. More particularly, given the optimal (Bi, Ci) split, the GPS weight Φi for each class i connection (assuming per connection queuing) can be set as φi=Ci/(NiC). If the queuing is per-class, the weights would be Φi=Ci/C for class i. However to guarantee QoS requirements, the corresponding buffer space generally must be guaranteed as well. This can be accomplished using the well-known conventional technique of virtual partitioning, for which the per-class nominal allocations are given by
from the optimal partition. The techniques of the invention can thus be used to integrate GPS, buffer management and admission control in a heterogeneous traffic setting, which has previously proved a challenge.
Each of these figures includes a number of individual curves, which correspond to (1) the above-described Brownian bridge approximation, (2) a first bounding line formed by linear fit to corner points of the “exact” approach referred to in conjunction with
It is apparent from the figures that the Brownian bridge and concave envelope approximations of the present invention improve on the EMW approach. These approximations also generally correspond well to the results for the exact approach, while being computationally much simpler. The discrepancy between the exact and concave envelope curves in
The present invention in the illustrative embodiment described herein provides improved approximations for controlling traffic from homogeneous regulated sources. As was described above, these approximations also provide the basis for techniques to optimally partition resources among heterogeneous classes in accordance with the invention. Advantageously, the techniques of the invention are fast, accurate, and simple.
The techniques of the invention are suitable for use in a wide variety of communication network applications, including wireless networks, high-speed multimedia networks, Internet Protocol (IP) networks, etc.
As indicated previously, the above-described embodiments of the invention are intended to be illustrative only. For example, the invention can be implemented in a wide variety of different data processing devices, using software, hardware or combinations of software and hardware. In addition, the admission region approximations and buffer-capacity tradeoff curve optimizations described herein in conjunction with the illustrative embodiments may be varied to accommodate the particular requirements of a given application. Moreover, the invention may be used in conjunction with many different types of QoS guarantees other than those specifically described herein. These and numerous other alternative embodiments may be devised by those skilled in the art without departing from the scope of the following claims.
The present application claims the priority of U.S. Provisional Application No. 60/244,956, filed Nov. 1, 2000 in the name of inventors K. Kumaran and M. Mandjes and entitled “Admission Control for Regulated Connections in a Communication Network,” which is incorporated by reference herein.
Number | Name | Date | Kind |
---|---|---|---|
5838663 | Elwalid et al. | Nov 1998 | A |
6046981 | Ramamurthy et al. | Apr 2000 | A |
6222824 | Marin et al. | Apr 2001 | B1 |
6314085 | Saranka | Nov 2001 | B1 |
6856946 | Dathe et al. | Feb 2005 | B1 |
6907467 | Venemans | Jun 2005 | B1 |
20020163887 | Suni | Nov 2002 | A1 |
Number | Date | Country | |
---|---|---|---|
20020097716 A1 | Jul 2002 | US |
Number | Date | Country | |
---|---|---|---|
60244956 | Nov 2000 | US |