The present invention relates to a method and apparatus for congestion control.
For on-demand streaming services congestion control is usually performed via Resource Admission Control. Such a system checks, either by monitoring the number of ongoing flows of which the characteristics are assumed to be known, or by monitoring the (momentary) bit rate of the aggregate of flows directly, whether or not the new flow of which the characteristics are assumed to be known, will still fit on all the links the flow will travel through. If this check gives a positive answer, the flow is accepted, otherwise the flow is rejected. This decision is enforced either at the application level, by not setting up the session, or by a policy enforcer at the edge of the network which blocks the traffic stemming from the rejected flow even if the session would have been set up. In such an architecture the user gets either the video in full quality or is denied the service.
An alternative way of congestion control is via scalable codecs. Therein each multimedia flow is encoded in layers of decreasing importance.
In contrast to the relying on Resource Admission Control, methods based on scalable codecs never deny the user access to the service, but the quality is sometimes lower than he/she aimed for.
A drawback of all these known methods is that there is no absolute guarantee that packets of higher importance are neither lost nor dropped.
To overcome the drawbacks a method according to the invention includes the steps of monitoring the amount of incoming flows to this node such that, based upon this amount of incoming flows and on a current number of accepted layers, a next number of accepted layers for entry into said node is determined by consultation of an action table.
In this way, by letting the number of accepted layers depend on the previous number, on the number of incoming flows and on an action table, a more accurate congestion control method is obtained while at the same time guaranteeing the more important packets to be preserved.
In an enhanced embodiment this action table is dynamically adjusted based on observed traffic towards said node.
This has the advantage that decisions will reflect the traffic status, allowing for a more precise determination of the allowed layers, even more improving the quality.
By dynamically adjusting the action table at regular intervals based upon traffic observed during an observation period, and whereby a Markov Decision Process is used for determining an optimum action table, even a better quality is obtained.
The action table may be centrally calculated by a network congestion controller and further communicated to said node, or can be stored locally within the node itself.
The action table can be updated locally within a node or can be centrally updated by a network congestion controller.
In case the action tables are updated within the nodes, a communication, between different nodes implementing said method, of the resulting action as determined by consultation of said action table can be performed, such that, in case conflicting actions arise, a further heuristic control for adapting the next number of accepted layers, per node, is executed within said nodes. In this way potentially conflicting situations arising between neighbouring nodes can be solved.
The present invention relates as well to a congestion control device for implementing the present method and to a network congestion controller for communicating action tables and their updates to certain embodiments of congestion control devices.
It is to be noticed that the term ‘coupled’, used in the claims, should not be interpreted as being limitative to direct connections only. Thus, the scope of the expression ‘a device A coupled to a device B’ should not be limited to devices or systems wherein an output of device A is directly connected to an input of device B. It means that there exists a path between an output of A and an input of B which may be a path including other devices or means.
It is to be noticed that the term ‘comprising’, used in the claims, should not be interpreted as being limitative to the means listed thereafter. Thus, the scope of the expression ‘a device comprising means A and B’ should not be limited to devices consisting only of components A and B. It means that with respect to the present invention, the only relevant components of the device are A and B.
The above and other objects and features of the invention will become more apparent and the invention itself will be best understood by referring to the following description of an embodiment taken in conjunction with the accompanying drawings wherein
a and b show examples of a decision table to be used in some embodiments of congestion control devices,
The description and drawings merely illustrate the principles of the invention. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the invention and are included within its spirit and scope. Furthermore, all examples recited herein are principally intended expressly to be only for pedagogical purposes to aid the reader in understanding the principles of the invention and the concepts contributed by the inventor(s) to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Moreover, all statements herein reciting principles, aspects, and embodiments of the invention, as well as specific examples thereof, are intended to encompass equivalents thereof.
It should be appreciated by those skilled in the art that any block diagrams herein represent conceptual views of illustrative circuitry embodying the principles of the invention. Similarly, it will be appreciated that any flow charts, flow diagrams, state transition diagrams, pseudo code, and the like represent various processes which may be substantially represented in computer readable medium and so executed by a computer or processor, whether or not such computer or processor is explicitly shown.
An embodiment of the method proposes a new type of congestion control via scalable codecs, referred to as choking-based congestion control. This method can be used in a number of networks such as the one which is schematically depicted in
In
According to an embodiment of the invention, at least one of the nodes comprises a congestion control device. In
Most embodiments of congestion control devices according to the invention assume that the multimedia flows are encoded in a scalable way. Scalable video coding is standardized, e.g., in annex G of ITU-T Rec. H.264 “Advanced video coding for generic audiovisual services” and implies that the encoded video flow is built up of a base layer and at least one enhancement layers. Many standard codecs, e.g. MPEG2, 4 have scalable extensions, but other proprietary schemes can be used as well. In packet-based transport the bit streams associated with each layer are packetized in such a way that, based on an identifier in the header of each packet, it is known to which layer it belongs to. An example may be the DiffSery CodePoint (DSCP) or Type of Service (ToS) byte within the IP header which can be used for this purpose, so to identify to which layer the packet belongs to. However other identifiers in other type of headers may be used as well.
When the decoder at the user's premises only receives the base layer, the video can be decoded in basic quality. The more layers the decoder receives, the better the quality of the decoded video. As illustrated in
Most embodiments of congestion control devices according to the invention also rely on a scheduling technique, performed by a scheduler in the individual node where it is part of. In
In accordance to the invention, an embodiment of the congestion control device will therefore include a controller which will be adapted to determine, at predetermined instances in time t, the value of l(t+1) for the next particular time instance t+1, based on the current value of l(t), based on the current number n(t) of incoming flows to the node and based upon a decision table T. This controller will thus be able to determine up to which layer l to support in the next time slot t+1. In
This decision table is denoted T in
Instead of increasing or decreasing by 1, these actions “+” or “−” can also mean that in other embodiments e.g. 2 or 3 layers less or more are allowed.
Another example of a table is shown in
1) no action is taken, keeping the number of allowed layers in the next slot to 6, e.g. l(t+1)=6) if the number of flows is between 140 and 170 boundaries included; (in accordance with the second displayed row of the table)
2) the action is to decrease the number of accepted layers, e.g. from 6 to 5, in the next slot if there are more than 170 flows; (in accordance with the third displayed row on the table), and
3) the action is to increase the number of accepted layers, e.g. from 6 to 7, if there are less than 140 flows (in accordance with the first displayed row on the table).
Similar considerations as to the previous table 5a can be made: so a “+” action can represent an increase with a predetermined number, which may be larger or equal to 1, while a “−” action represents a decrease with another predetermined number larger or equal than 1, which may be the same to the predetermined number for increasing, but which can also differ from that.
A table of the type of
In a simple embodiment such a decision table may be stored locally in the node or in the congestion control device itself.
A flowchart illustrating a detailed embodiment for implementing the different steps of the described method is shown in
In more complex embodiments, the congestion control device is adapted to regularly update this table, e.g. every 20 minutes for the previous example where the time instances for determining l(t) and l(t+1) were of the order of magnitude of seconds. This update can be based on the observed traffic, during an observation period, towards the network node, but this is not necessary. In order to update this table the congestion control device CCIM is adapted to model the traffic over a certain period of time, referred to as the observation period, which is thus much longer than the time slots for updating the layer acceptance number. More precisely, it determines some parameters of an a priori chosen traffic model. Such a model can be a Markov model, but it can also be another type as will be explained in the next paragraph.
In order to update the decision table, used for determining l(t+1), a method may for instance consist of the following procedure: besides the presently active decision action table a number of preselected or predetermined alternative decision tables are maintained. These could for instance be obtained by setting the thresholds in binned tables of the type of
V[l(t),n(t)]=R[l(t),n(t),a(l(t),n(t))]+V[l(t+1),n(t+1)] (1)
With V[l(t),n(t)] (and V[l(t+1),n(t+1)]) representing the value function when there are l layers accepted and n input flows at time t (and the future value function at time t+1 respectively)
R[l(t),n(t),a(l(t),n(t))] representing an instantaneous reward function
a(l(t),n(t)) represents the action taken when there are I layers allowed and n input flows at time t, as given in the decision table
An example of an instantaneous reward function is given by the following formula (2):
R[l(t),n(t),a(l(t),n(t))]=α·G(l(t))·n(t)−β·n(t)·max{(F−C)/F,0}−γ·n(t)·l{α(l(t),n(t))≠0} (2)
where
α, β and γ are positive constants with the following interpretations; α being the reward per time unit the operator gets per flow supported, β being the penalty the operator has to pay per lost packet and γ being the penalty the operator has to pay per quality change.
α·G(l(t)) is the reward associated with transporting the flows up to layer l in slot t, e.g., the price a single user is willing to pay to receive a video and the quality corresponding to layer l.
F is the traffic volume in slot t. For n(t) flows F corresponds to n(t)·l(t)
and
C is the link capacity, i.e., the amount of information that can be transmitted per slot t. Alternatively C can be chosen slightly smaller than the link capacity to better avoid overflow.
After the observation period the alternative decision table that has accumulated the highest value is promoted to the active decision table if it exceeds the value associated with the active decision table. In that case the active decision table is demoted to be one of the alternative decision tables. For this method to be efficient a lot of alternative action tables need to be evaluated. An alternative method that avoids this is described next.
For this alternative method first a transition matrix has to be created, which is based on the observation of the traffic. Such a transition matrix can contain entries representing the likelihood that the number of flows are increased/decreased from a certain value, indexed by the row of the matrix for this entry, to another value, indexed by the column of the matrix for this entry. In this case the transition matrix TRM [n(t),n(t+1)] is thus built based on the observed difference between the absolute number of flows in a current time slot t and the next time slot t+1. With this transition matrix as input, the decision table to be used for determining the l(t) can be updated based on e.g. Markov Decision Process theory, hereafter abbreviated by MDP theory, optimizing the average of a value function under the assumption that only three possible actions can be taken as these mentioned in a previous paragraph:
1) allow one or more layer extra in the next slot, as represented by action “+”,
2) allow the same amount of layers in the next slot, as represented by action “0” or
3) allow one or more layers less in the next time slot, as represented by action “−”,
as corresponding to the actions as related to the previously described decision table.
MDP theory allows to select the optimum action table for a Markov process described by such a transition matrix TRM[n(t),n(t+1)] and a given value function. A value function V(l(t),n(t)) may consist of the sum of an instantaneous reward and an expected future value, which itself depends on the transition matrix and the actions taken, as expressed by the following formula (3):
V[l(t),n(t)]=R[l(t),n(t),a(l(t),n(t))]+ΣiTRM[n(t),j]·V[l(t+1),j] (3)
With V[l(t),n(t)] representing the value function when there are l layers accepted and n input flows at time t
R[l(t),n(t),a(l(t),n(t))] representing an instantaneous reward function
a(l(t),n(t)) represents the action taken when there are I layers allowed and n input flows at time t, as given in the decision table
and Σi TRM[n(t),j]·V[l(t+1),j] representing the average future value associated with moving from state (l(t),n(t)) to (l(t+1),n(t+1)), where l(t+1) is the number of allowed layers in the next time slot determined by the action a(l(t),n(t)) and the current number of allowed layers l(t).
As a matter of fact, l(t+1)=l(t)+1{α(l,n)=“+”}−1{α(l,n)=“−”}, where 1A is the indicator function that takes the value 1 is the statement A is true and 0 otherwise.
An example of an instantaneous reward function R[l(t),n(t),a(l(t),n(t))] can be given by the already given formula (2). Another example of such a reward function can be given by the following formula (4):
R[l(t),n(t),a(l(t),n(t))]=α·G(l(t))−β·max{(F−C)/F,0}−γ·1{α(l(t),n(t))≠0} (4)
where
α, β and γ are positive constants
α·G(l(t)) is the reward associated with transporting the flows up to layer l in slot t, e.g., the price users are willing to pay to receive a video and the quality corresponding to layer l.
F is the traffic volume in slot t. For n flows F corresponds to n·l
and
C is the link capacity.
Within both expressions (2) and (4) max {(F−C)/F,0} is approximately equal to the packet loss during slot t, such that this second term is equal to the discount for lost packets. The last term is the discount associated with fluctuations (where 1{α(l,n)≠0} is an indicator function which takes the value 1 if α(l,n)=“0” and 0 otherwise) which discourages changing the maximum supported layer l too often from slot to slot.
This process is schematically illustrated in
Finally, the transition matrix is normalized such that all entries of a row are multiplied by the same number resulting in that the sum of the entries of each row is equal to 1. This enables the optimum decision table T to be obtained via, e.g., the iteration of the value function using an algorithm of the MDP theory.
This optimum decision table is the decision table that maximizes equations (1) given (2), as is explained in
This process of building the transition matrix and determining the optimum decision table may be done every observation period T, with in a preferred embodiment T>>time slot t. Remark that the period over which the traffic is observed does not need to be equal to T, but can be longer, e.g., 2 T, and can be adapted during the course of the process. In that respect the observation period and the table adaptation period can be different, and each be adjustable during the course of the process.
Based on this learned model an embodiment of such an enhanced controller is adapted to calculate the optimal decision table, for instance using Markov Decision Process (MDP) theory, for each node individually. Remark that, although the optimal decision tables only rely on the current amount of input flows n(t), the actions taken do take the likely future evolution of the traffic into account. For example the MDP theory allows anticipating the most likely future evolution if the observation period is chosen long enough for all possible events to have occurred and short enough with respect to known diurnal evolutions.
The effect of this learning process is illustrated in
A more detailed implementation for such a more complex congestion control device CCIM′ comprised in one node is shown in
In this respect it is to be mentioned that the functions of the various elements shown in the figures, including any functional blocks labeled as “controllers”, may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. Moreover, explicit use of the term “processor” or “controller” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (DSP) hardware, network processor, application specific integrated circuit (ASIC), field programmable gate array (FPGA), read only memory (ROM) for storing software, random access memory (RAM), and non volatile storage. Other hardware, conventional and/or custom, may also be included.
Apart from the previously described embodiments, other embodiments are possible where, in order to avoid potential contradicting decisions between neighbouring nodes, which can each implement the explained method, a coordination strategy is used. In a tree-based network architecture with local decision tables the simplest example for such a coordination strategy can be based on aggressive decreasing and careful increasing. More specifically, referring back to the network of
1) enforce every “−” action from the ANs and ignore the “+” action from the IM e.g. turn a “+” action in a “0” in the IM, if at least one AN has the action “−”;
2) if only the IM has the “−” action and all the AN have either “+” or “0” actions, enforce a “−” action on the AN that supports the highest number of flows;
3) If no AN asks for a “−” action and only one AN asks for a “+” action, allow this only if the IM did not ask for a “−” action;
4) If no AN asks for a “−” action and more than one AN ask for a “+” action, allow only one AN the “+” action, e.g., the one with the minimum I and maximal n, provide the IM did not ask for a “−” action.
The extension to multiple nodes can thus involve a local decision table per node, associating with each local state lk(t),nk(t) with k representing the index per node, the appropriate action in node k, plus a coordination or tie-breaker strategy. This involves some communication in between the different congestion control devices between the nodes, which is depicted by means of the dashed arrows between the individual congestion control devices in
Apart from this exchange of information another function of this global network congestion controller can be to determine the table updates. For that purpose it observes how the number nk(t) of flows evolves over time for each node k (=1 . . . K). This can be done via its knowledge of the network topology and exchange of information with the application provider. It is then adapted to build either a transition matrix for each node individually (to capture how nk evolves at node k) or a global transition matrix for the network (to capture how (n1(t), . . . , nk(t), . . . , nK(t) evolves), resulting from observations over an observation period T. It is further adapted to solve the MDP for this transition matrix for each node individually (with the same localized reward function and the same set of possible actions) or globally where the total reward is a weighted sum of the local rewards. A decision table per node results, which is then further communicated to the local nodes.
Even in the previous case, where the nodes individually determine and adapt their action table and in case a tie breaker procedure is needed a network congestion controller can calculate the possibly tie-broken actions itself and communicate them to the individual nodes or the network congestion controller can relay the decisions of the local nodes to each other such that they can consequently take further decisions on their own, provided that they have access to their own congestion state and provided that they know when a coordination action is needed.
Such an embodiment with a central or global network congestion controller NC is depicted in
In another embodiment the traffic can be observed via the bit rate on the links instead of measuring the number of flows.
While the principles of the invention have been described above in connection with specific apparatus, it is to be clearly understood that this description is made only by way of example and not as a limitation on the scope of the invention, as defined in the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
09290462 | Jun 2009 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2010/058120 | 6/10/2010 | WO | 00 | 3/2/2012 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/145982 | 12/23/2010 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6535557 | Saito et al. | Mar 2003 | B1 |
7209443 | Mukai et al. | Apr 2007 | B2 |
20010047423 | Shao et al. | Nov 2001 | A1 |
20030067872 | Harrell et al. | Apr 2003 | A1 |
20030195977 | Liu et al. | Oct 2003 | A1 |
20040194142 | Jiang et al. | Sep 2004 | A1 |
20070115841 | Taubman et al. | May 2007 | A1 |
20070165524 | Mascolo | Jul 2007 | A1 |
20090196194 | Paloheimo et al. | Aug 2009 | A1 |
20130114594 | Van Zijst | May 2013 | A1 |
Number | Date | Country |
---|---|---|
1968413 | May 2007 | CN |
1387585 | Feb 2004 | EP |
7-264197 | Oct 1995 | JP |
2003-078561 | Mar 2003 | JP |
2005-064970 | Mar 2005 | JP |
2005-204157 | Jul 2005 | JP |
2006-060448 | Mar 2006 | JP |
2007-504694 | Mar 2007 | JP |
WO 2006048842 | May 2006 | WO |
Entry |
---|
English Bibliography for JP Pat. App. Publication No. JP 7-264197, published Oct. 13, 1995, in Japanese, printed from Thomson Innovation on Jun. 21, 2013, 3 pp. |
English Bibliography for JP Pat. App. Publication No. JP 2003-078561, published Mar. 14, 2003, in Japanese, printed from Thomson Innovation on Jun. 21, 2013, 4 pp. |
English Bibliography for JP Pat. App. Publication No. JP 2005-064970, published Mar. 10, 2005, in Japanese, printed from Thomson Innovation on Jun. 21, 2013, 3 pp. |
English Bibliography for JP Pat. App. Publication No. JP 2005-204157, published Jul. 28, 2005, in Japanese, printed from Thomson Innovation on Jun. 21, 2013, 3 pp. |
English Bibliography for JP Pat. App. Publication No. JP 2006-060448, published Mar. 2, 2006, in Japanese, printed from Thomson Innovation on Jun. 21, 2013, 3 pp. |
English Bibliography for JP Pat. App. Publication No. JP 2007-504694, published Mar. 1, 2007, in Japanese, printed from Thomson Innovation on Jun. 21, 2013, 3 pp. |
Cohen et al., “Streaming Fine-Grained Scalable Video over Packet-Based Networks,” IEEE Telecommunications Conference, vol. 1, pp. 288-292, XP001195579, Nov. 27, 2000. |
International Search Report for PCT/EP2010/058120 dated Aug. 11, 2010. |
English Bibliography for Chinese Patent Application Publication No. CN1968413A, published May 23, 2007, printed from Thomson Innovation on Mar. 4, 2014, 8 pages. |
Number | Date | Country | |
---|---|---|---|
20120155258 A1 | Jun 2012 | US |