This application claims priority of Korean Patent Application No. 10-2019-0175007, filed on Dec. 26, 2019, in the KIPO (Korean Intellectual Property Office), the disclosure of which is incorporated herein entirely by reference.
The present disclosure relates to a system for network delay guarantee based on flow aggregates and interleaved regulators.
The key to the asynchronous traffic shaping (ATS) technology being standardized in IEEE 802.1 time sensitive network (TSN) task group (TG) is the theorem that a minimal interleaved regulator (IR), attached to a class-based first-in-first-out (FIFO) system does not increase delay upper bound while suppresses the burst accumulation.
However, the existing theorem is to apply the IR to the class-based FIFO system in a limited way, and thus there is a limit in that it is applicable only to a small-scale unit network due to the complexity.
In the related art, there is a need for a delay guarantee technique having a low complexity level that may be implemented in a large-scale network.
In order to accomplish the object, in an embodiment of the present disclosure, there is provided a system for network delay guarantee based on flow aggregates and interleaved regulators.
A system for network delay guarantee based on flow aggregates and interleaved regulators comprises at least one unit network configured to guarantee first-in-first-out (FIFO) for flow aggregates; and at least one interleaved regulator per flow aggregate located at an input port or an output port of the at least one unit network, wherein among flows passing the unit network, flows that share the same network input port and the same network output port are aggregated into a single flow aggregate.
In addition, the above means for solving the problem does not list all features of the present disclosure. Various features of the present disclosure and the advantages and effects thereof may be understood in more detail with reference to specific embodiments below.
According to an embodiment of the present disclosure, by placing IRs between unit networks and performing flow aggregate-based scheduling in the unit networks, the maximum delay may be guaranteed with low complexity even when the flow aggregates traverse multiple networks.
The above and other features and advantages will become more apparent to those of ordinary skill in the art by describing in detail exemplary embodiments with reference to the attached drawings, in which:
In the following description, the same or similar elements are labeled with the same or similar reference numbers.
The present invention now will be described more fully hereinafter with reference to the accompanying drawings, in which embodiments of the invention are shown. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “includes”, “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. In addition, a term such as a “unit”, a “module”, a “block” or like, when used in the specification, represents a unit that processes at least one function or operation, and the unit or the like may be implemented by hardware or software or a combination of hardware and software.
Reference herein to a layer formed “on” a substrate or other layer refers to a layer formed directly on top of the substrate or other layer or to an intermediate layer or intermediate layers formed on the substrate or other layer. It will also be understood by those skilled in the art that structures or shapes that are “adjacent” to other structures or shapes may have portions that overlap or are disposed below the adjacent features.
In this specification, the relative terms, such as “below”, “above”, “upper”, “lower”, “horizontal”, and “vertical”, may be used to describe the relationship of one component, layer, or region to another component, layer, or region, as shown in the accompanying drawings. It is to be understood that these terms are intended to encompass not only the directions indicated in the figures, but also the other directions of the elements.
Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
Preferred embodiments will now be described more fully hereinafter with reference to the accompanying drawings. However, they may be embodied in different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
First, before describing a system for network delay guarantee based on flow aggregates and interleaved regulators according to the present disclosure, the ATS technique according to the prior art will be described.
It may be proved based on Min-plus algebra that a maximum delay of the system according to the IR-attached TSN asynchronous approach as shown in
The mathematical symbols used therefor may be defined as follows.
(1) N={0, 1, 2, . . . }, N+={1, 2, 3, . . . }, and R+=[0, ∞), which are real numbers other than a negative number.
(2) F is a set of sequences, for n of n∈N+, if An∈[∞, +∞), A=(A1, A2, . . . ), A∈F.
(3) Finc is a subset of F and wide-sense increasing sequences. Namely, if A∈Finc, A∈F, which is equivalent to the case that for all n∈N+, An≤An+1.
(4) G is a sequence composed of positive integers
A packet sequence of one flow is expressed as (A, L), where A=(A1, A2, . . . ) is a sequence of packet observation time and L=(L1, L2, . . . ) is a sequence of packet length. For example, an observation time of the second packet of the corresponding flow is A2, and the length thereof is L2. A∈Finc, and L E G. Assuming that (A, L) is an input sequence and (D, L) satisfying D∈Finc is an output sequence in the system S, if A≤D, S is FIFO.
Here, a function (or, operator) H satisfying four conditions as follows may be put into consideration.
C1: Π is a function of Finc×G→F. Namely, Π receives a packet sequence (A, L) and converts into a time sequence.
C2: Π is causal. Namely, if Π(A, L)=A′, A′n depends on A1, . . . , An−1 and L1, . . . , Ln, but is independent from Am (m≥n) or Lm (m≥n+1). Namely, A′n does not depend on An.
C3: Π is homogeneous with respect to A. Namely, for a certain constant h∈R and a certain sequence A∈Finc, L∈G, Π(A+h, L)=Π(A, L)+h.
C4: Π is isotone with respect to A. Namely, for A′ of A′∈Finc, if A≤A′, for a certain sequence L∈G, Π(A, L)≤Π(A′, L).
Now, Π-Regularity may be defined.
Definition 1 (Pi-Regularity): for a function ∩ satisfying C1 to C4, if A≥∩(A, L), the flow packet sequence (A, L) is ∩-regular.
Here, ∩(A, L) may be interpreted as an arrival curve regulated by a leaky bucket or the like. Namely, A≥∩ (A, L) may be interpreted in the way that all packets arrive later than the arrival curve.
Further, the following may be proved.
Theorem 1 (Minimal Π-Regulator): if Π is a function satisfying C1 to C4 (namely, corresponding to a Pi-regular operator), the “minimal Π-regulator” is a FIFO system that converts an input flow packet sequence (A, L) to an output sequence (D, L), and at this time, D1=A1 and Dn=max{An, Dn−1, Π(D, L)n}. In this case, the following is established.
(1) The system defined as above is a Π-regulator for the corresponding flow.
(2) (Minimality) For any Π-regulator that converts (A, L) into (D′, L) and for all n∈N+, D′n≥Dn.
(3) The fact that the flow (A, L) is Π-regular is equivalent to D=A. Examples of the minimal Π-regulator include a minimal g-regulator, a packetized greedy shaper and the like, and the leaky bucket shaper corresponds to a special case of the packetized greedy shaper.
Next, a packet sequence (A, L, F) in which several flows are mixed may be considered. Ln is a length of an nth packet. Fn is a flow to which the nth packet belongs. If all flows are Π-Regular, IR is a FIFO system that makes the output sequences of all flows be Π-Regular. In this process, the transmission of inner packets may be delayed. Namely, all packets stored in the queue wait in the system until a packet at the first location (head) of the queue of the FIFO system satisfies the Π-Regular of the corresponding flow.
Meanwhile, the minimal IR may also be defined as follows.
Theorem 2: A packet sequence (A, L, F) may be put into consideration. If a Πf-Regulator is defined for each flow belonging to the packet sequence, the minimal IR is a FIFO system that converts the input packet sequence (A, L, F) to an output sequence (D, L, F), and at this time, D1=A1 and Dn=max{An, Dn−1, ΠFn (DFn, LFn)I(n)}. Here I(n) is an order to an nth packet in the flow to which the nth packet belongs. In this case, the following is established.
(1) The system defined as above is an IR for the packet sequence.
(2) (Minimality) For any IR that converts (A, L, F) into (D′, L, F) and all n∈N+, D′n≥Dn.
(3) The fact that all flows belonging to (A, L, F) are Π-regular is equivalent to D=A. Examples of the minimal IR include an urgency-based scheduler and the like.
Theorem 3: A packet sequence (A, L, F) is input to the FIFO system S. A flow f belonging to the corresponding sequence is Πf-Regular. An output sequence (D, L, F) is input to the minimal IR. In the corresponding minimal IR, a Πf function is applied to the flow f. If the output sequence of the minimal IR is (E, L, F), a maximum delay of the entire system where the minimal IR is attached to S is identical to a maximum delay of only the system S. Namely, supn∈N+(Dn−An)=supn∈N+(En−An). This may be explained as illustrated in
In order to establish the theorem of
(1) Every flow input to the FIFO system S must be Π-regular. In other words, there must be a limit in the average input speed and the maximum burst size. For example, an ingress characteristic, like passing a regulator such as a leaky bucket, must be provided.
(2) The FIFO system S outputs all input packets FIFO.
(3) The IR regulates every flow to reproduce the existing ingress characteristics. Namely, the Π-regular characteristic parameters (for example, the average input speed and the maximum burst size) are made identical to the case of inputting.
(4) (Minimal IR) The IR transmits immediately when the packet at the head of the queue meets the output condition. Such IR is called a minimal regulator.
(5) The IR may provide zero delay for packets satisfying the output condition. For example, if a packet comes in when the queue is empty, the packet may be cut-through.
Here, even though all of the above conditions are satisfied and the maximum delay is not increased, this is a property that holds only for the maximum delay of the entire system. That is, the maximum delay of a specific flow may increase. If the maximum delay of the entire FIFO system is D and the maximum delay of a specific flow f is Df (D>Df), the maximum delay of the corresponding flow is increased to D after the IR is applied.
In the embodiment of the present disclosure, the theorem that the maximum delay does not increase even in a more general situation may be satisfied by relaxing the condition (2) among the above conditions. The minimal IR defined in Theorem 2 is minimal with no special conditions for the packet sequence (A, L, F). It is sufficient if the Er-Regulator is defined for every individual flow of (A, L, F). Therefore, Theorem 4 below holds even if the condition (2) among the above conditions is changed as follows.
Restated condition (2): The system guarantees FIFO only for packets of an individual flow. Hereinafter, this will be called a “flow FIFO system”.
Theorem 4: At least one flow packet sequence (Ar, Lf) is input to the flow FIFO system S. The flow packet sequence f is Πf-Regular. An output sequence (D, L, F) of the flows is input to the minimal IR. The corresponding minimal IR regulates by applying a Πf function to the flow f. The output sequence of the minimal IR is called (E, L, F). The maximum delay of the entire system where the minimal IR is attached to the system S is identical to the maximum delay of only the system S. Namely for all flows f and n (n>0), supf,n(Dfn−Afn)=supf,n(Efn−Afn).
The minimal IR does not increase the maximum delay of the flow FIFO system S. Here, Afn is an ingress time of the nth packet of the flow f, Dfn is a departure time at the system S, and Efn is a departure time at the IR.
Proof: The maximum delay of the system S is assumed as d. Also, it is assumed that a virtual device for sending out packets of all flows f from the system S exactly at the time (Af+d) is provided. If the specific flow f is H-regular, it means that Af already satisfies the arrival curve, and thus, the output of the virtual device having the departure time (Af+d) also satisfies the same arrival curve. Namely, the virtual device is also a Π-regulator. Since the input of the virtual device is a packet sequence (D, L, F) of the mixed flows and d is the maximum delay of the system S, Dn≤(An+d) for all n. Since the departure time of the virtual device is (An+d), the virtual device is FIFO for the corresponding sequence.
Meanwhile, the minimal IR, as in Theorem 2 described above, has a delay less than or equal to any other H-regulator. Thus, Ef≤Af+d. Namely, the departure time of the minimal IR is always less than or equal to Af+d. This holds for all flows f, so Theorem 4 is proven.
Namely, according to Theorem 4, if the FIFO system is FIFO only for each flow, the existing theorem is established. According to an embodiment of the present disclosure, as the theorem is expanded as above, it is possible to use the IR and apply the theorem even in the following cases.
Therefore, the IR may be applied to a single network having various routes to which flows having various priorities are input.
By using the above characteristics, it is possible to design an end-to-end delay guarantee system as described below.
(1) Flows are divided into two kinds, namely high priority flows and low priority flows.
(2) The low priority flows are put in a single FIFO queue at the output port of all nodes and processed in a strict priority manner. If necessary, preemption may also be applied.
(3) The high priority flows may be handled as follows.
It contains a network and an ingress edge node of the adjacent network. Only the high priority traffic is depicted. The network in
Through the structure as described above, it is possible to ensure the maximum delay of all high priority flow aggregates, and also it is possible to ensure the maximum delay of flows belonging to the flow aggregate. The burst size increases due to the flow aggregation at the network input port and separation at the output port, but regains its original value while passing the IR.
In this case, the size of the unit network may be appropriately set in consideration of the demanded average delay and maximum delay. For example, as the number of relay nodes included in the unit network increases, namely as the unit network increases, the number of passes through IRs in the entire route of the flow may decrease and the average delay may decrease. However, in this case, the number of flows processed by the IR increases, and it may be a burden to process the flows at a line rate. In addition, increasing the number of flow aggregates in the unit network may also be a burden. Meanwhile, if the unit network is smaller, the number of IRs increases but the maximum delay of the flow may be reduced. However, in this case, it may be not preferable in terms of the average delay. Once the size of the unit network is set and IRs are arranged accordingly, it is required to exchange information such as flow information between the unit networks and maximum delay value required for admission control.
Hereinafter, the numerical analysis of the system designed as described above will be described.
If a flow i traverses only latency-rate (LR) servers Sj while passing the network (with total k LR servers), the maximum value of the end-to-end delay experienced by packets in the flow i may be expressed as Equation 1 below.
In addition, mathematical symbols frequently used are shown in Table 1 below.
The PGPS (packetized generalized processor sharing), which may be regarded as the most ideal scheduler, and the DRR (deficit round robin) based on a simple round robin may be included in the LR server. Here, the latency of the PGPS is given as Equation 2 below.
In addition, the latency of a general DRR including the case where the quantum value may be smaller than the maximum packet length is given as Equation 3 below.
Here, F is the sum of quantum values of active flows, N is the number of active flows, and the quantum refers to the amount of data serviced at one time, which is determined in proportion to the service rate allocated to each flow.
A FIFO scheduler is also an LR scheduler with the latency given as Equation 4 below.
where N is the number of active flows
The unit network shown in
First, in case of PGPS scheduler, from Equation 1 and Equation 2 the network delay of the flow-based framework is given as Equation 5 below.
Assuming for simplicity that σi=Li=Lmax=L, and ρi=r/nph, for all i. Similarly, for FA-based framework with the PGPS, there are ph FAs in an output port, and maximum burst of an FA is nL, therefore:
The difference between the two-maximum delay,
The difference is linearly proportional to the FA size (the number of flows in an FA) and maximum packet size, and exponentially proportional to the network size (the max number of hops in a network), and inversely proportional to the capacity of the link. The difference is zero when n=1, and positive for all n, h>1, which means the smaller bound by the FA framework
Meanwhile, in case of the DRR scheduler, from Equation 1 and Equation 3 we obtain in flow-based framework
For simplicity, assuming that φi=Li=Lmax=L, since the quantum values are determined to be proportional to the flows' arrival rates which are all identical for flow-based framework. Similarly, for the FA-based framework assuming that φi=Li=Lmax=L. Every FA has the same aggregated arrival rate. Therefore, with ph FAs in an output port, and maximum burst of an FA to be nL,
The difference
has a similar form with the one with PGPS schedulers. It is linearly proportional to the FA size and exponentially proportional to the single network size. The difference is zero when n=1.
It can be seen that the delay bound gains by applying FA framework is always positive for all n, h>1. Even if the networks are interconnected to from a bigger internetwork, in the identical flows case the gain becomes only larger, since the delay bounds of all the flows are identical.
We investigate the case where the network in
since we assume ρi=r for highest priority FIFO scheduler. There are h such nodes in a network, therefore the network delay bound is given as Equation 7 below.
The difference
is larger than 0 when n≥2, which means with more than just one flows in FA, the embodiment of the present disclosure performs better than ATS.
It can be seen that the embodiment of the present disclosure with DRR schedulers performs better than both flow-based framework and ATS when the h and n are large. This is because the dominant terms in max(DiF_DRR) and max(DiATS) are proportional to hnph, while the dominant term in max(DiFA_DRR) is proportional to hph and hph.
We will investigate the delay bound of the embodiment of the present disclosure with multiple networks interconnected. We will focus on how the delay bound changes with the choice of a single network size, given a fixed internetwork size. An example network for the analysis of the proposed framework is depicted in
Larger h means smaller d, n, and the number of minimal IRs. If h=E, then d=1, the networks are merged into a single network, and there is no interim IRs. If h=E and n=1, then there is no flow aggregation, which is identical to the IntServ framework. On the other hands, smaller h means smaller network size and more minimal IRs. If h=1, then IR resides at every node, which is similar to the ATS framework, except that the ATS uses FIFO scheduler.
Consider the end-to-end delay bounds of the IntServ, ATS, and the proposed framework. From the constants p, F, E, nph=F, hd=E, we get d=E/h, n=F/ph. First, for IntServ, since it has the “pay burst only once” property, from equation 5:
Second, for the embodiment of the present disclosure, from equation 6:
Similarly:
Now consider a case where p=2, E=16, F=65536=216, r=lGbps, L=10 Kbit. The RHS (Right hand side) of equation 9 give the following red curve in
The possible sets of parameters are {h=1, d=16, n=32768=28}, {h=2, d=8, n=214}, {h=4, d=4, n=212}, . . . , {h=8, d=2, n=28}, {h=16, d=1, n=1}. If h=16, then n=1, and the case becomes identical to an IntServ framework. With h=16, the total delay of IntServ is identical to that of the proposed framework, indeed. Also note that h=1 gives the same bound. Note that all the possible choices of h and d give smaller than or equal delay bounds than that of IntServ. They are always smaller than that of ATS. The optimal choice in this case is {h=8, d=2}, which gives 1.347 sec of delay bound, which is almost 8 times better than IntServ, and 16 times better than ATS.
This result is remarkable. By only diving a path into two parts, put a single IR in the path, and aggregating flows accordingly, we can reduce the delay bound as much as to ⅛.
While the present disclosure has been described with reference to the embodiments illustrated in the figures, the embodiments are merely examples, and it will be understood by those skilled in the art that various changes in form and other embodiments equivalent thereto can be performed. Therefore, the technical scope of the disclosure is defined by the technical idea of the appended claims The drawings and the forgoing description gave examples of the present invention. The scope of the present invention, however, is by no means limited by these specific examples. Numerous variations, whether explicitly given in the specification or not, such as differences in structure, dimension, and use of material, are possible. The scope of the invention is at least as broad as given by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2019-0175007 | Dec 2019 | KR | national |