The present invention generally relates to connection admission control and more particularly to connection admission control in packet-oriented, multi-service networks with relatively strict delay and loss requirements.
Connection admission control (CAC) is generally a question of controlling the number of connections using a given set of resources in a communication network, thereby ensuring that admitted connections have access to the resources that are required to fulfil their Quality of Service (QoS) requirements. On the link level, CAC serves to restrict the number of connections simultaneously present on a transport link in the network. This means that new connections may be rejected in order to protect connections that are already admitted for transport over the link.
The issue of connection admission control in packet-oriented networks with limited transmission resources and relatively strict delay and loss requirements, such as higher generation radio access networks, is generally quite complex. With the introduction of multi-service networks, such as Universal Mobile Telecommunications System (UMTS) and similar communication networks, it becomes even more difficult to find an efficient CAC strategy that works in the multi-service environment and at the same time fulfils practical requirements such as limited computational complexity and high accuracy.
The practical requirements on the CAC algorithm mainly imply that the CAC decisions need to be taken fast, because hundreds or thousands of connections may arrive to a network node each second, and that the CAC algorithm has to make relatively accurate estimates of the resource requirements so that the CAC decisions are not too conservative, nor too optimistic.
In the prior art, connection admission control is fairly simple, based on the concept of effective bandwidths. This generally means that each individual connection is assigned a bandwidth value that represents the “effective” resource usage of the connection during its lifetime. When a new connection arrives to the node, the effective bandwidth of the connection is estimated based on factors such as the traffic characteristics and the QoS requirements. Subsequently, the CAC algorithm checks whether the sum of effective bandwidths of the admitted connections and the new connection exceeds the link capacity or not. The algorithm is so simple that the CAC decisions can be taken on-line. This approach thus satisfies the requirement on limited computational complexity.
Unfortunately, CAC based on effective bandwidths is generally not capable of ensuring that the QoS requirements in a multi-service traffic environment are actually fulfilled since the associated set or region of admissible traffic mixes with a single linear boundary is not a sufficiently accurate estimate of the true admissible region. The linear admissible region obtained from the effective-bandwidth algorithm is highly dependent on how well the assigned effective bandwidths represent the actual resource usage of the connections, and even a slight misestimation of the required resources may result in QoS degradations (underestimation) or a significant waste of valuable resources (overestimation).
A comparison of different connection admission control algorithms in ATM/AAL2 based third generation mobile access networks can be found in reference [1].
Reference [2] relates to a connection admission control strategy for ATM core switches and describes an effective-bandwidth algorithm for constant-bit-rate (CBR) connections as well as a connection admission control approach for variable-bit-rate (VBR) traffic. In the latter case, when statistically multiplexible VBR traffic (S-VBR) and non-statistically multiplexible VBR traffic (NS-VBR) are intermixed, the admissible region boundary is approximated by two piecewise linear segments corresponding to respective cell-loss-limited regions.
Reference [3] consider the problem of bounding the loss rates of the aggregation of ON-OFF sources in a bufferless model.
The present invention overcomes these and other drawbacks of the prior art arrangements.
It is a general object of the present invention to provide an efficient connection admission control strategy for a multi-service traffic environment.
It is particularly important that the connection admission control is highly accurate, thus providing optimized resource utilization, while still ensuring that the QoS requirements of the entire multi-service traffic mix are met. It is thus an object of the invention to estimate the true admissible region as accurately as possible.
It is also an object of the invention to provide a computationally efficient connection admission control algorithm, thus allowing on-line decisions on the acceptance of new connections.
These and other objects are met by the invention as defined by the accompanying patent claims.
The present invention is generally based on the recognition that the true admissible region for a multi-service traffic mix can be well approximated by a construction of a non-linear admissible region and one or more linear admissible regions. This makes it possible to accurately control admission of a new connection onto a transport link by checking whether the multi-service traffic mix defined by previously admitted connections together with the new connection is contained within an intersection of the non-linear admissible region and the linear admissible region(s), and admitting the connection only if the traffic mix is contained within the intersection of regions.
By properly identifying the non-linear admissible region and the linear admissible region or regions according to the QoS requirements of the mixed services and the traffic characteristics, accurate decisions on the admittance of new connections can be taken to optimize the resource utilization while ensuring the QoS requirements.
In general, the admissible regions in the construction of the “true” admissible region are related to respective QoS requirements such as packet delay and overload (packet loss).
In radio access networks such as the Universal Terrestrial Radio Access Network (UTRAN), and especially on those links connecting base stations and radio network controllers, the use of delay-limited linear admissible regions in combination with an overload-limited non-linear admissible region has turned out to be particularly beneficial.
For improved performance and flexibility, each service class in the multi-service traffic mix is preferably associated with a class-specific delay-limited admissible region. Each class-specific delay-limited admissible region is normally defined as a linear admissible region that contains a set of traffic mixes that fulfil a given class-specific packet delay requirement.
Advantageously, the overload-limited non-linear admissible region contains the set of traffic mixes for which the probability of temporarily overloading the queuing system associated with the transport link is smaller than a given target value.
It has been validated that the linear approximation of the delay-limited admissible regions is very accurate. The linearity of these admissible regions means that the evaluation of whether a given traffic mix is contained within each of the delay-limited regions can be performed in a computationally efficient manner, because efficient single-class approximations can be extended to multiple service classes.
The non-linearity of the overload-limited admissible region generally implies that the probability of overload must be evaluated individually for each traffic mix. In many cases, this results in far too heavy calculations for on-line evaluation. However, by exploiting the so-called statistical gain within the different classes and not (or only partially) between classes, a much more computationally efficient algorithm is obtained.
In certain situations, it is not necessary to use both the linear and non-linear admissible regions. In fact, if the delay requirements are loose or the link capacities are large enough, it may be sufficient to check whether the traffic mix is contained within the non-linear admissible region. In other circumstances, it may be sufficient to use a construction of multiple linear admissible regions. In effect, the intersection of multiple linear admissible regions generally defines a non-linear region, which has a piecewise linear boundary.
The invention offers the following advantages:
Other advantages offered by the present invention will be appreciated upon reading of the below description of the embodiments of the invention.
The invention, together with further objects and advantages thereof, will be best understood by reference to the following description taken together with the accompanying drawings, in which:
Throughout the drawings, the same reference characters will be used for corresponding or similar elements.
In general, the task of the connection admission control (CAC) function in a network node is to decide whether a new connection that arrives to the node can be accepted for transport over a link of a given capacity such that the Quality of Service (QoS) requirements of the new connection and the already admitted connections are not violated. In order to ensure that the amount of resources is sufficient to serve the traffic demands, and at the same time ensure that expensive resources are not wasted, reliable CAC methods are needed.
For a better understanding of the invention it may be useful to begin by describing the basic concept of conventional CAC based on effective bandwidths in more detail. For example, the CAC algorithm currently in use in the transport and control platform Cello from Ericsson assigns an effective bandwidth to each connection, and simply estimates the resource usage of all connections as the sum of their effective bandwidths. In multi-service networks, connections can normally be grouped into service classes based on their traffic descriptors, and it is thus possible to assign an effective bandwidth to each service class. When a new connection arrives to the Cello node, the effective bandwidth of the connection is calculated by means of a simple exponential formula. Subsequently, the following inequality is checked:
where N is the number of connections in the traffic mix defined by the admitted connections and the new connection, the EBi values are the effective bandwidths of the connections and C is the link capacity. In this way, it can be determined whether the sum of the effective bandwidths of the traffic mix under consideration exceeds the link capacity or not. If the estimated resource usage of the traffic mix does not exceed the link capacity, the new connection is admitted for transport on the associated link, otherwise the connection request is rejected.
However, experiments reveal that conventional CAC algorithms based on effective bandwidths are generally not capable of ensuring that the QoS requirements in a multi-service traffic environment are actually fulfilled since the linear admissible region thus obtained is not a sufficiently accurate estimate of the true admissible region, as schematically illustrated in
In contrast to the prior art, the invention preferably estimates the true admissible region by a construction of a generally non-linear admissible region and one or more linear admissible regions, as schematically illustrated for two service classes in
An efficient CAC algorithm can be devised by properly identifying the non-linear admissible region and the linear admissible region or regions according to the QoS requirements of the mixed services and the traffic characteristics of the network under consideration. In this way, the resource utilization can be optimized while ensuring the QoS requirements.
In the following, the invention will mainly be described with respect to radio access networks such as the Universal Terrestrial Radio Access Network (UTRAN) in third generation mobile communication systems or other similar future communication systems.
With the introduction of third generation mobile systems, such as the Universal Mobile Telecommunications System (UMTS), both equipment vendors and network operators face new challenges. In contrast to second generation systems, a packet-switched, multi-service network capable of fulfilling the specific requirements of the new radio interface technology called Wideband Code Division Multiple Access (WCDMA) is required.
Switching and multiplexing technologies used for the first releases of UTRAN for UMTS are based on Asynchronous Transfer Mode (ATM) in combination with the ATM Adaptation Layer type 2 (AAL2). Future releases will be deployed using also Internet Protocol (IP) technologies.
On transport links of UTRAN radio access networks, and especially on those links connecting base stations and radio network controllers, resource allocation is complex because the Quality of Service (QoS) requirements are quite strict and the amount of transmission resources is relatively low.
UMTS is a multi-service network, in which the various service classes generally have different QoS requirements. Packet delay is usually the most important performance measurement in the network, and for each service class, the delay budget for the whole system (end-to-end) determines the maximum acceptable delay in the UTRAN transport network. For example, the end-to-end maximum delay requirement for voice traffic is around 180 ms, and the maximum delay requirement in the UTRAN transport network is around 5-6 ms. The delay requirements for other services are not very different from that of voice. In case best effort services such as those provided in IP based networks, the delay requirements are relatively strict because the use of soft handover sets a practical limit on the packet delays. In addition, studies focusing on TCP performance show that the application-level throughput is significantly degraded if the delay on the Iub interface is larger than a few milliseconds.
The delay requirement is typically defined in a probabilistic manner:
Pr(Di>{tilde over (D)}i)<{tilde over (ε)}i, (2)
where Di is the delay of a packet of class-i, {tilde over (D)}i is the maximum (target) delay for class-i, Pr(Di>{tilde over (D)}i) is the probability of packet delay criteria violation and {tilde over (ε)}i is a small target probability. A maximum delay of 5 ms and a target probability of 1% thus translates into the requirement that the probability of a delay larger than 5 ms should be smaller than 1%.
The invention will now be described with respect to a particular example of a UTRAN network based on ATM/AAL2.
The role of the UTRAN network is basically to transport MAC frames from the RNCs 122 to the Node Bs 124 (in the Iub downlink direction) and to transport MAC frames from Node Bs 124 to RNCs 122 (in the Iub uplink direction). In the UTRAN transport network, MAC frames are encapsulated into Iub frames. The Iub framing overhead contains information used in Node Bs 124 to encode the frame into the appropriate radio frame format and to send it out on the radio interface at the right time (tout). Iub frames are segmented and packed into AAL2 CPS (Common Part Sublayer) packets, which are multiplexed into ATM cells. By AAL2 multiplexing, several AAL2 packets from different connections can be carried within an ATM cell. In an ATM network, cells are transported along a predefined path using the VPI/VCI (Virtual Path and Virtual Circuit Identifier) fields in the ATM header. The CID (Connection Identifier) field in the AAL2 header identifies a specific AAL2 connection within an ATM VC. the so-called CU timer (TCU) determines how long the multiplexer should wait for arriving AAL2 packets if an ATM cell is partly filled. Therefore, multiplexing efficiency also depends on the value of TCU, as realized in reference [5].
For admission control, we focus on situations when the transport links are highly loaded. In this region, the effect of TCU on delay performance is expected to be negligible. It is also assumed that the packing density does not depend on the Iub frame size. These assumptions are supported by the analysis provided in reference [6]. In UTRAN, a new AAL2 connection is set up for each new RAB. In general, the AAL2 CAC allocates resources for the new AAL2 connections in the transport network, and makes CAC decisions based on traffic descriptors and QoS parameters.
At this stage, it will be useful to introduce an adequate queuing model for the UTRAN transport network, with specific emphasis on the Iub interface.
An Exemplary Queuing Model for the UTRAN Network
As discussed earlier, the arrival pattern of Iub frames is determined by the MAC scheduler, which schedules MAC frames periodically according to the timing requirements of WCDMA. In other words, the traffic is shaped by the MAC scheduler such that the lowest time-scale behavior is periodic irrespectively of the type of application. UTRAN traffic can thus be modeled as a superposition of periodic traffic sources. However, the carried traffic is generally not seen as a continuous periodic packet flow, but rather the user/application level traffic model is reflected in the UTRAN transport network such that the carried traffic is modeled by a series of active and inactive intervals. These intervals will be referred to as ON (active) and OFF (inactive) periods.
It is desirable to establish a model, which allows us to derive the probability of packet delay requirement violation under the consideration of delays due to:
Assume that we have a system with the following input parameters:
Input parameters (i = 1, . . . , K)
The total delay of a packet from service class-i includes two parts: Di=Wi+Si, where Si is the service time and Wi is the waiting time in the queue. The service time is the time taken by the packet server to service a packet and send it out of the buffer. For example, if the bit rate of the output link is 100 Kbps (kilobits per second) then the service time for a packet that is 100 bits long will be 1 ms. The delay of lost packets is considered to be infinite.
The workload (the amount of unfinished work) plays an important role in a multi-class queue analysis because in contrast with the per-class waiting time, workload is a global measure and allows us to calculate the per-class waiting times in case of a First-In-First-Out (FIFO) service discipline. In a FIFO queue the waiting time is approximated by the workload. The workload and accordingly the waiting time strongly depend on the lengths of the ON and OFF periods. If the system is in an overload situation in which the input rate R of active connections exceeds the link capacity C, the workload has an increasing component. When the overload situation ends, there will be a decreasing component.
In general, delays on a time-scale longer than the transmission time interval (TTI) have to be studied by examining the workload accumulated over several TTI periods. The accumulated workload is burst-length dependent and generally describes the component of the workload due to the ON-OFF behavior. The development of the accumulated workload depends on the random nature of the traffic sources, i.e. the distribution of the ON and OFF periods, the dependency among the traffic sources and so forth.
In UTRAN, the delay requirements are relatively strict, typically shorter than or of the same order as the TTI, and therefore we are generally interested in the short time-scale behavior of the workload. Besides, the signaled traffic descriptors directly available from the network do not include any characterization of burstiness thus gives no possibility to properly characterize the long time-scale behavior of the workload. It may be possible to measure the burst lengths. However, this is generally of little interest in the evaluation of the probability of delay requirement violation in UTRAN.
Before describing the actual model construction it will be useful to examine the impact of buffer size on the workload. Systems with large buffers can absorb the packets generated during overload situations, resulting in large waiting times without losses. On the other hand, a moderate size buffer (˜TTImax), where TTImax=max [TTIi], is filled up quickly in an overload situation, and then packets suffer from at most a waiting time corresponding to the queue length or they get lost. The delay requirements in UTRAN are relatively strict (D˜TTI), which means that the waiting time reaches the predefined delay requirement very fast in an overload situation, and therefore the queue can not smooth out an overload situation efficiently even if considering infinite buffer lengths. We thus consider a system that can buffer packets at least the packet delay requirement but not significantly longer. It is furthermore assumed that in an overload situation, the delay of all packets is always larger than the delay requirement. Applying these assumptions we establish a combined model. Considering packets in the system, two types of delay requirement violation events can be observed:
The following relation expresses this decomposition:
and we define two measures as:
Simulations and experiments show that the model assumptions are perfectly valid on the time-scale corresponding to the transmission time interval (TTI). In this region the delays hardly depend on the length of the ON periods, and delay violations are dominated by the periodic packet emission.
Now, we are in the position to determine the main relation between the input parameters and a relevant CAC decision. The stationary probability Π(n) of a particular number of active connections n=(n1, n2, . . . , nK) is calculated using a multi-dimensional binomial distribution as follows:
From the model assumptions, it follows that packet loss probability can be well approximated by overload probability. The probability εilost that a packet of class-i arrives at an overload situation (R>C) and gets lost, can thus be calculated as:
where R(n) represents the input rate of the active connections in a given state n.
In case of a normal situation (R≦C) the waiting time is dominated by the periodic packet emission, because the transient period (due to the change of the number of active sources when R≦C) is approximately TTImax long and hardly influences the waiting time. Therefore the probability of a packet arriving later than its delay criteria but not getting lost can be calculated as:
Finally, the probability of delay criteria violation is the sum of the two probabilities as follows:
εi=εilost+εidelayed. (9)
An Exemplary CAC Algorithm for the UTRAN Network
When a new connection arrives, the CAC algorithm normally needs to check:
In accordance with a preferred embodiment of the invention, the true admissible region is approximated by an intersection of K regions (also referred to as hyper-planes) with linear borders and a region with a generally non-linear border. The K regions with linear borders are delay-limited and referred to as delay-limited linear admissible regions. The i-th delay-limited linear region preferably contains the mixes where the delay requirement of the i-th traffic class is fulfilled. The region with a non-linear border is overload-limited and referred to as an overload-limited non-linear admissible region. The overload-limited non-linear region preferably contains the mixes for which the probability of temporarily overloading the queuing system is smaller than a given target value. If the activity factor of each class is 1, the overload-limited border becomes linear. In this case, it is the border of the mixes that do not overload the system.
For example, assume that we have two service classes with different delay requirements. We are interested in the region where traffic mixes containing connections from both services can be accepted, and the admissible region is defined as the intersection of delay-limited and overload-limited regions, as illustrated in
If the activity factors are equal to 1 for both service classes, the overload-limited region becomes linear and contains the mixes that do not overload the buffer. In practice, however, not all activity factors are equal to 1. This means that the overload-limited region generally will be non-linear, containing traffic mixes that can overload the queuing system only with a small probability. If the activity factor one or more of the service classes is smaller than 1 (which means the service is bursty on a time scale that is larger than its TTI), the overload-limited region generally becomes concave as schematically illustrated in
The linear approximation of the delay-limited admissible regions is very accurate and means that the evaluation of whether a given traffic mix is contained within each of the delay-limited regions can be performed in a computationally efficient manner.
The non-linearity of the overload-limited admissible region generally implies that the probability of overload must be evaluated individually for each traffic mix.
Checking the Delay Violation Due to Packet Loss—the Overload-Limited Region
Although it is possible to use equation (7) to check the delay violation due to packet loss, the evaluation of equation (7) may be too demanding for on-line use (depending on the available processing resources). By exploiting the so-called statistical gain only within different classes and not between classes, a much more computationally efficient algorithm is obtained. Therefore, the following preferred strategy of the invention is proposed. Keeping in mind that loss probability and overload probability are more or less interchangeable according to the model assumptions, an upper bound for the target loss probability {tilde over (ε)}ilost is derived:
where we introduce Ai as a per-class limit on the number of simultaneously active connections in ON state. Preferably, the per-class limit Ai is defined as the number of connections from class-i such that the probability that more than Ai connections from class-i are active at the same time is small. The idea is to “distribute” the probability 1−{tilde over (ε)}ilost over the terms on the right hand side of the last inequality and to find minimal values of Ai such that the following sets of inequalities:
are fulfilled, where Kα is the number of classes with activity factor αi<1 and {tilde over (ε)}ilost is the target probability assigned to class-i. For traffic classes with α=1, Ai=Ni. This construction makes it possible to compute Ai values independently of each other, thus significantly reducing the computational complexity of equation (7). With enough memory for the CAC algorithm, one may wish to obtain the Ai values from tables calculated off-line, as will be described in more detail later on.
Using the per-class limits (A1, A2, . . . , AK), the necessary condition for accepting a given traffic mix (N1, N2, . . . , NK) is defined as:
where ρi is the average load generated by one active traffic source from class-i and defined as ρi=bi/TTIi. The evaluation of the inequalities (16) generally corresponds to the evaluation of whether the traffic mix is contained within the non-linear overload-limited region.
The statistical gains between classes can at least partially be taken into account by extending the basic algorithm in the following manner. It is clear that by using equations (14) and (15), only the statistical gain of multiplexing sources from the same class will be exploited. Keeping the property that Ai values can be obtained independently from each other, but taking into account partially statistical gains from multiplexing different classes one may proceed as follows:
1. Find Ai* for all i using eq. (14) with:
2. Repeat the following procedure until the MGi values are no longer increasing:
Consider the classes with αk>αi and ρk<ρi, ∀ i, k. If MGk>MGi, then let αi′:=αk and calculate MGi′ as described in step 1. If MGi′>MGi, then let MGi:=MGi′. (19-A)
Consider the classes with αk>αi and ρk≧ρi, ∀ i, k. If MGk>MG, then let MGi:=MGk. (19-B)
3. Finally, the Ai values are found as:
Ai=Ni(1−MGi). (20)
Checking the Delay Violation Due to Delayed Packets—the Delay-Limited Regions
There is no exact formula for evaluating the delay violation due to delayed packets, as recognized in reference [7]. We define the admissible region for checking the delay violation due to delayed (but not lost) packets by K hyper-planes. The i-th hyper-plane defines the region where the probability of a packet arriving later than its delay criteria but not getting lost is equal to or smaller than a given target probability, εidelayed≦{tilde over (ε)}idelayed. This means that each class-specific packet delay requirement requires that the probability of the class-specific packet delay being larger than a given class-specific maximum delay is smaller than a given target value. We propose a method for hyper-plane construction that uses only a single-class calculation for evaluating the corresponding delay violation. For this purpose, the following two measures are introduced:
The analysis of a constant service-time queue fed by ni homogeneous periodic (active) connections is mainly based on the results in reference [8], and the probability of packet delay criteria violation Pr(Dj(i)>{tilde over (D)}j|ni connections are active) is calculated in the following way:
where Dj(i) denotes the delay of a packet from class-j assuming that the delay of the associated queue comes from only class-i connections, {tilde over (D)}j is the target delay criteria of packets from class-j, and the following additional measures are introduced:
The proposed formula for determining the TN matrix is:
where the stability criteria is not needed, because it is included in the inequalities (16).
The TE matrix can be determined from the TN matrix as follows:
TEij=TNjj/TNij. (26)
Using traffic class equivalents, the necessary condition of accepting a given traffic mix (N1, N2, . . . , NK is defined as:
If we approximate by TNij the maximum number of connections from class-i if one additional connection from class-j is present in the system, as proposed above, the constant should be set to 1. Using a fixed constant equal to 1 has proved to be important in the case of priority scheduling (as discussed later on), but can be used for FIFO scheduling as well. In a more conservative approach, the constant is set to zero, assuming that we approximate by TNij the maximum number of connections from class-i if no additional connection from class-j is present in the system.
The evaluation of the inequalities (27) corresponds to the evaluation of whether the traffic mix is contained within each of the linear delay-limited regions.
Alternatively, the probability of packet delay criteria violation can be approximated by the following expression for the class-specific complementary distribution function Qi(x) of the workload in a FIFO queue, using a Brownian bridge approximation [9, 10]:
Consequently, instead of using equations (21)-(24), the probability of packet delay criteria violation is now determined as:
Compared to equations (21)-(24), equations (30) and (31) do not involve a lot of summations and therefore the probabilities of packet delay criteria violation can be calculated much faster. Once the probabilities of packet delay criteria violation has been calculated according to equations (30) and (31), the TN and TE matrices can be determined according to equations (25) and (26) and the inequalities (27) can be checked.
In yet another alternative embodiment of the invention, the following even faster approximation for determining the TN matrix is used:
The TN matrix is thus determined according to equations (32) and (33), and the TE matrix is determined as usual according to equation (26). Once the TN and TE matrices are determined, the inequalities (27) may be checked.
Naturally, by using the above fast approximations, the accuracy will be slightly reduced. The trade-off between computational complexity and accuracy has to be carefully evaluated by the system designer for each particular application.
The introduced queuing model for ATM/AAL2 and the associated CAC calculations can be more or less directly applied also in IP based UTRAN networks provided that the IP networks are CAC-enabled. The above model and the related methodology are applicable irrespectively of the transport technology used by the network.
Flow Chart
For a better overview of the CAC algorithm according to a preferred embodiment of the invention, reference is now made to the basic flow chart of
It can be noted that since TNij and TEij depend only on the service classes present in the system, but not on the actual number of connections, these matrices are generally not necessary to calculate on-line. The TN and TE matrices are only updated when a new service class is added. Consequently, if the connection belongs to a new service class (Y) not yet included in the TN and TE matrices, the relevant traffic descriptors and QoS requirements of this service class are added (step S2) to the general information database used by the CAC algorithm and the TN and TE matrices are updated (step S3). This update can usually be performed rapidly without having to re-compute the whole matrices. However, if the normal TE and TN update proves to be too slow for on-line use, a fast update (step S4) of these matrices is required in order to be able to make a fast decision on the acceptance of the connection. Such a fast update may for example be based on peak bandwidths, or by using equations (32) and (33) above. On the other hand, if the connection belongs to a service class already present in the system (N), then the TN and TE matrices are read (step S5) and the delay violation due to delayed packets is checked (step S6) according to formula (27).
The Ai values depend on Ni, αi, the target loss probabilities {tilde over (ε)}ilost and the number Kα of traffic classes with activity factors αi<1. The Ai values can be calculated on-line (step S7), or if on-line calculation turns out to be too slow, tables with pre-calculated values can be constructed in advance (step S8) and subsequently accessed during on-line evaluation. At least some Ai values can thus be pre-calculated for a range of different values of Ni and/or αi ({tilde over (ε)}ilost), and stored in memory tables, as illustrated below.
This means that for a number of a values (e.g. 0.1, 0.2, . . . , 0.9), tables with Ai values are stored, and for a certain [αi, Ni] pair, an Ai value can be read from the corresponding table for a certain target probability. If the memory consumption is too high, only a certain window [1, 2, . . . , Nmax] should be stored. This is normally sufficient, since the number of connections from a certain service class only changes in steps of one (+1: one connection arrives to the system, −1: one connection leaves the system). This means that the necessary window location and size can be tracked easily, and the tables can be updated whenever necessary. Other solutions using for example interpolation techniques can also be applied. Once the Ai values are obtained, the delay violation due to packet loss is checked (step S9) according to formula (16).
Finally, the results of the two delay violation checks are combined (step S10) to produce a final CAC decision. In this particular embodiment of the invention, a given traffic mix is accepted only if both sets of inequalities (16) and (27) are fulfilled.
If the update and the decision making parts are carefully separated, the decisions can surely be made on-line even for large systems with many traffic classes. However, the update speed needs to be checked carefully, because it is possible that also the updates can be performed on-line.
Example of Implementation of an Admission Controller
Connection admission control is normally exercised over output link resources of network nodes, accepting or rejecting connections in accordance with a CAC algorithm. The CAC algorithm, may for example be implemented as hardware, software, firmware or any suitable combination thereof. In ATM networks, for example, traffic descriptors are signaled to the network nodes, and the nodes make decisions as to whether connections can be admitted based on the signaled information. In general, taking the network perspective, a connection is only admitted if it is accepted by all nodes taking part in the end-to-end transmission of that connection.
Performance of the Proposed CAC
The performance of the invention has been evaluated in simulations. In a first simulation, the following input parameters were considered: C=1920 Kbps, TTI1=20 ms, b1=336 bits, α1=0.65, TTI2=10 ms, b2=1512 bits, α2=0.85, TTI3=40 ms, b3=2688 bits, α3=1, {tilde over (D)}1=5 ms, {tilde over (D)}2=8 ms, {tilde over (D)}3=20 ms, and {tilde over (ε)}=0.001. The admissible region obtained using the invention is illustrated in
The system with the traffic mixes on the surface was simulated to check if these mixes really fulfil the delay requirement.
The example of FIGS. 11A-B generally corresponds to that of FIGS. 10A-B, but now with a smaller link capacity C=1024 Kbps. In case of smaller links, the calculation of the TN matrix as proposed by equation (25) tends to be more conservative. In this case, values of TN may be relatively small, and therefore the fact that the elements of TN are integers may constitute a problem. For example, TNij=4 may result in a significantly smaller admissible region than TNij=5. This problem may be solved by using any of a number of conventional interpolation techniques, allowing the elements of TN to take real values.
It is important to understand that the preceding description is merely intended to serve as a framework for an understanding of the invention.
In certain situations, it is not necessary to use both the linear and non-linear admissible regions. In fact, if the delay requirements are loose or the link capacities are large enough, it may be sufficient to check whether the traffic mix is contained within the non-linear admissible region. In other circumstances, it may be sufficient to use a construction of multiple linear admissible regions. In effect, the intersection of multiple linear admissible regions generally defines a non-linear region (which is piece-wise linear).
By properly identifying linear and non-linear admissible regions, the invention can also be applied to other transport mechanisms including natural extensions and developments of the basic UTRAN concept. For other types of multi-service networks, the non-linear admissible region and/or the linear admissible region or regions have to be identified according to the network-specific traffic characteristics and QoS requirements.
Notes on Use with Priority Scheduling
It should be noted that the traffic delay also depends on the scheduling principle applied in the network. If packets of all services wait in the same queue (FIFO) and the packets are served in order of arrival, the most stringent delay requirement has to be met. This can be avoided by service differentiation, having different queues for services with different delay requirements. The delay-limited linear regions are equivalent to “effective bandwidths” being assigned to connections. Effective bandwidths calculated for FIFO scheduling can be extended directly for priority scheduling as proposed in reference [11]. With respect to the invention, this means that instead of a single linear region for each service class with FIFO scheduling, there will generally be multiple linear regions for each service class, depending on the number priority levels.
Prioritization means that a packet from a lower priority queue can be served only if all the higher priority queues are empty. Segmentation is used to minimize the influence of large low priority packets already in the server on high priority traffic. The segment size s is an additional model parameter. Wi and Si apply to the last segment instead of the whole packet. Wi cannot be calculated directly using the work-load of the system because higher priority packets can over-take lower priority packets. Similarly to the FIFO case, the delay violation events are lost and delayed packets. When calculating εilost and εidelayed, the only difference from the FIFO case is that from class-i point of view, the system is overloaded only if the input rate of traffic classes with higher (or equal) priority is higher than C.
We consider three cases depending on the priority levels of class-i and class-j. If class-i and class-j have the same priority level, then TNij is calculated in the same way as for FIFO scheduling set forth above. If class-i has higher priority we follow a different method. The event that the last segment (of size slast) of the class-j packet could not be served before time D is equivalent to the event that all segments before the last one could not be served before D′=D−slast/C. Formally, denote B(i)(0, t) the server availability in [0, t] seen by the class-j packet, if the higher priority traffic is given by class-i, when it has arrived at time 0, and then:
By using a conservative approximation of the server availability process: B(i)(0, t)≈t−Ai(0, t)/C and the Brownian bridge approximation of the arrival process, the following formula is obtained:
where Φ(x; μ, σ2) denotes the normal distribution. In order to demonstrate the accuracy of this approach, TNij values calculated using equation (35) are compared with exact values (C=920 Kbps, bi=320 bits, slast=320 bits, TTIi=20 ms, {tilde over (D)}=10 ms, {tilde over (ε)}jdelayed=0.1%) as illustrated below.
If class-i has lower priority, the effect of a segment possibly under service from class-i on delays of class-j packets is neglected. This means that the TNij values are set to infinity (∞).
In general, simulations have shown that priority scheduling is more advantageous than FIFO scheduling and the effect of the service differentiation is that the QoS requirements can be met at higher resource utilization.
Notes on Use in the Multiple-Links Scenario
The CAC algorithms proposed by the invention method has mainly been presented and evaluated for the single link scenario. In the multiple-links scenario, the overall CAC decision is composed of more than one Link Admission Control (LAC) decision. In practice, a method working in the multiple-links scenario is usually identical to the single-link algorithm, because no information on the resources along the end-to-end path is available. If the proposed methods are applied in the multiple-links scenario, an “overload-limited” region can be computed for the different links individually. Single-link effective bandwidths can be extended to the network level, e.g. as proposed in reference [12]. Essentially, the “effective bandwidths” calculated by the invention do not change in other links of the network. This means that the proposed single-link methods can be applied to the multiple-links scenario without modifications.
The embodiments described above are merely given as examples, and it should be understood that the present invention is not limited thereto. Further modifications, changes and improvements which retain the basic underlying principles disclosed and claimed herein are within the scope and spirit of the invention.
[12] The Output of a Switch, or Effective Bandwidths for Networks by Damon Wischik, Queuing Systems, Vol. 32, pp. 383-396, 1999.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/SE02/00468 | 3/13/2002 | WO |