The present application is related to the following U.S. Pat. applications: U.S. Ser. No. 09/607,013, now U.S. Pat. No. 6,715,005; U.S. Ser. No. 09/607,133, now abandoned; and U.S. Ser. No. 10/417,467, filed Apr. 16, 2003, for “MMPP Analysis of Network Traffic Using a Transition Window”. The content of these cross-referenced co-pending applications is hereby incorporated herein by reference.
This invention relates in general to the field of computer technology, and particularly to systems for the transfer of data. More specifically, the invention relates to the real-time modeling and analysis of data communication of self-similar network traffic at multiple levels.
The flow of information in a network is often called ‘traffic’. Units of information used in network communication are referred to as ‘packet’. Packets generally arrive at a point in the network at random intervals resulting in ‘bursts’ of traffic resulting in congestion and ‘idle’ periods in which traffic is somewhat more sparse.
Systems that use a network to communicate messages can derive significant benefits from analysis that provides to the system a characterization of the network traffic. The Poisson Process is widely utilized to model aggregate traffic from voice sources. A Markov Modulated Poisson Process (MMPP) is often utilized to model aggregate traffic from data sources. Network traffic has been shown to be self-similar, therefore, a method used to analyze network traffic should be able to display behavior that is bursty and self-similar.
The present method uses a multilevel model that utilizes the model claimed and described in co-pending patent application Docket Number RPS920030018US1 filed concurrently herewith and entitled MMPP ANALYSIS OF NETWORK TRAFFIC USING A TRANSITION WINDOW as the base and replicating the model once for each time-scale displayed by the self-similar traffic. A single level, 2 state MMPP model is shown in
The present method and system serve to model and analyze asynchronous network traffic that is bursty and self-similar using a Markov modulated Poisson process (MMPP) and self-similar traffic by making the MMPP model multilevel, where each level in the model represents a different time scale. By ‘self-similar’ is meant that the traffic displays the same characteristics of behavior (e.g. bursty or idle) at different time scales. This permits the same principles such as an MMPP model to be applied at each different scale. The model employs a transition window to determine the transition between states. This transition window is represented as [λBmax, λImin] wherein λBmax is the upper boundary for heavy traffic arrival in the bursty state and λImin is the lower boundary for light traffic arrival in the idle state.
The complexity of the model grows as the number of levels in the model increases. This is not a problem because a model with something in the order of four levels has been deemed to be adequate. For example, others have confirmed that TCP traffic has been is described by at most four time scales. The present invention describes an example of a three-level model, although the model is general enough to represent any number of time scales. This model is an effective means to provide for network traffic analysis either in batch mode or in real time.
The invention relates to an article comprising a computer-readable medium which stores computer-executable instructions for processing traffic flow patterns associated with network data transmission. The instructions cause a machine to: a) receive traffic pattern data associated with the network transmission of data packets relating to the times of arrival of network data packets; b) apply an MMPP algorithm to the received pattern to define the traffic as being in the bursty state or the idle state; and c) repeat the steps a) and b) one or more additional times at different time-scale levels. The different time levels are based on a bottom-up analysis and rely on the generation of a trace of a traffic pattern for a given time scale and the analysis of the trace to generate a trace of the next scale pattern. The algorithm utilizes a transition window to determine the transition between states. This transition window is represented as [λBmax, λImin] wherein λBmax is the upper boundary for heavy traffic arrival in the bursty state and λImin is the lower boundary for light traffic arrival in the idle state.
The system analyzes network traffic that is bursty and self similar. It employs an MMPP to model network traffic (in real time or in a batch model) at a first level representing a given time scale. It then repeats the process to model the network traffic at one or more additional levels representing time scales that differ from the time scale in the first step. Each level typically includes 2 states of network traffic behavior comprising a bursty state representing heavy traffic conditions and an idle state representing light traffic conditions. The system employs a transition window to determine the transition between states. This transition window is represented as [λBmax, λImin] wherein λBmax is the upper boundary for heavy traffic arrival in the bursty state and λImin is the lower boundary for light traffic arrival in the idle state. However, when the inter-arrival times for the bursty state and the idle state become approximately equal, the system defaults to a single state model. The analysis comprises the generation of a trace of a traffic pattern for a given time scale. This generated trace is then used to generate a trace of the next time scale pattern.
An MMPP model 110, also sometimes referred to as a bimodal sequencer, is shown in
Since the representation of the network traffic in a model is an approximation, the length of the burst during state PB is an approximation with burst edges that are defined somewhat arbitrarily. In practice, the burst length is defined to satisfy the requirements of the user process. As previously mentioned, these values are used as the transition criteria between states. These boundary values define a transition window [λBmax, λImin] that has as the left side the parameter λBmax and as the right hand side the parameter λImin. The first parameter λBmax determines an upper bound for the packet inter-arrival time for the bursty state and the parameter λImin determines a lower bound for the packet inter-arrival time for the idle state. For the bursty state, λBmax defines the probability ρB that a packet with inter-arrival time lower than λBmax belongs to the bursty state. Similarly, for the idle state, λImin defines the probability ρI that a packet with inter-arrival time higher than λImin belongs to the idle state. Based on these probabilities, a decision can be made for each arriving packet of the particular state transition induced by the arrival.
Algorithms are described that allow the model to track changes in the network traffic dynamically. As the network traffic characteristics change over time, the mean inter-arrival times for the bursty state (λBmean) and for the idle state (λImean) also change over time. For the model to track these changes over time, the values λBmax and λImin change in proportion to the changes in the traffic. The values λBmax and λImin define the sides of a transition window of length k=λImin−λBmax. The size of the transition window [λBmax,λImin] can be changed dynamically to be used in adaptive algorithms that control the process transition between states. For implementation in an algorithm used in that fashion, the transition window [λBmax, λImin] can grow larger and smaller by changing the value of λImin and λBmax accordingly. The specific value of the parameters used depends on the specific application of the algorithm.
Referring now to
The analysis generates a sequence of traces of the form τ1.τ2, . . . , τk, where τk={(Λk1,Pk1), (Λk2,Pk2), . . . ,(Λki,Pki)}. Each trace τk represents a different characteristic scale of the self-similar network traffic at a different scale and where, Λki=the time stamp of leading of the packet and Pki=the packet/burst size.
Packet arrivals and inter-burst transitions are detected in the following manner. Assume that packet Pi−1 presently belongs to burst state P1. Then, the task is to detect whether packet Pi belongs still to burst state P1 or to the idle state P2. The detection logic compares the incoming packet inter-arrival time λi with λBmax and λImin. Four cases are possible:
Case 1. λi<λBmax and λi<λImin: Pi is detected to belong to burst state PB.
Case 2. λi>λBmax and λi>λImin: Pi is detected to belong to idle state PI.
Case 3. λi>λBmax and λi<λImin: Pi is detected to be inside of the transition window [λBmax, λImin]
In case 3, the next state transition selected is dependent on the user process requirements. This method can be applied to improve the performance of the network attached devices as will be described hereinafter. In particular, the application of the transition window approach will be described for managing the synchronization process in low-latency, high-bandwidth networks.
Case 4. λi<λBmax and λi>λImin: This is not a valid combination because both events can not occur at the same time.
These four cases (304, 306, 308, and 310) are illustrated in
The traffic analysis algorithm has to accomplish several things:
1. Analyze the traffic to determine the type of traffic at the time. Then it has to adjust the parameters of the model to the traffic parameters.
2. For bursty self-similar traffic, but for other traffic as well, the algorithm analyzes the traffic to determine the parameters for each different time scale. Then it adjusts the parameters of the model to the traffic parameters.
3. It detects the changes to the traffic from time to time, adjusting the model parameters as needed.
Of course, these are all different aspects of the same problem, that of having the capability to represent highly variable network traffic.
Although typically the network traffic can be characterized as being bursty and self-similar, as described by the multi-scale MMPP model described above, the conditions of the traffic are such that a high variability can be expected from one time period to the other (minute to minute, hour to hour, day to day, week to week, etc.). It is important for the model used by the methodology to be able to capture this variability in a dynamic way, real-time. The model of the invention assumes bursty self-similar traffic, but it adapts to changes in the traffic from very light traffic, when traffic is not bursty, to more heavy bursty traffic, on to very heavy peaks where the traffic acquires self-similar characteristics. The model assumes the structure of a 2-state multilevel model, or a simpler 2-state single-level model or a single-state model, depending on the traffic. Between the two extremes, there is a continuum of conditions that are represented by the model. The model restructures itself adaptively to the changing conditions of the network traffic. For example, under light traffic conditions, the network traffic can be characterized by a simple Poisson distribution. As traffic intensity goes down, the mean inter-arrival time λBmean approaches that of the idle state inter-arrival time λImean to the point where the two are no longer distinguished by the model. When the characteristics of the traffic are such that λImean approximately equals λBmean, a single state in the MMPP model represents the network traffic.
For the purposes of the model, the following states are considered, along with the related parameters in
The theoretical maximum number of levels in the model depends on the number of different time scales displayed by the network traffic. That number is believed to be in the range of four levels. In practice, the scale will also be determined by one of the following conditions:
1. The length of the trace is too short to capture the information at scales beyond some number of levels.
2. The higher the time scale, both the storage requirements needed to handle a longer trace and the required computational times will be higher. Because of the storage and computational requirements for each level, some practical limit needs to be defined to the system.
3. The number of levels is defined as a system parameter. The time scales will not reflect self-similar traffic beyond some specified value. Therefore, the value should be established experimentally by the designer. Once an upper limit has been established for the different time scales, the procedure will track changes to the workload automatically and will adjust to the instantaneous burstiness of the workload. The method must then track traffic changes from one end of the spectrum to the other (a to d above).
Other, more complicated modeling schemes are possible. However, this example represents an adequate implementation of the invention.
As previously noted, the traffic burst analysis and trace generation consists of the sequential generation of traces of the form τ1, τ2, . . . , τk, where τk={(Λk1,Pk1), (Λk2,Pk2), . . . ,(Λki,Pki)}. Each trace τk represents a different characteristic scale of the self-similar network traffic at a different scale where Λki=time stamp of leading of the packet and Pki=the packet/burst size. This process applies a constructive or bottom-up approach (vs. a deconstructive or analytical approach, which is top-down.)
The input stream of each packet (i) is analyzed as shown in
Burst inter-arrival time, λ1i=Λ1i−Λ1i−1, as follows:
The trace τ1 is generated at 402 as follows. An incoming packet (i) is read at 404 and i is set to a value of 1. The leading edge of the packet i is detected at 406. If the leading edge is not found, a second attempt is made to detect it. If detected, the arrival time is stored at 408 as time stamp t=Λi. The arrival of the trailing edge of the packet is detected at 410 and the formula Pi=t−Λi representing the time interval between the detection of the leading edge and the trailing edge and the time stamp is calculated and recorded. If the trailing edge is not detected the first time, the process is repeated until detected. The packet size is then stored at 412. This process is repeated for each packet until the end of the trace is reached at 414. If the end is not reached, then the process is repeated for the next packet, i=i+1. The end of the trace is signaled at 418.
The four test cases are as follows:
Case 1. λi<λBmax and λi<λImin: Pi is detected to belong to burst state PB.
Case 2. λi>λBmax and λi>λImin: Pi is detected to belong to idle state PI.
Case 3. λi>λBmax and λi<λImin: Pi is detected to be inside of the transition window [λBmax, λImin]. In this case, the next state transition selected is dependent on the user process requirements.
Case 4. λi<λBmax and λi>λImin: This is not a valid combination because both can not occur.
The following trace is recorded into an ordered set with sequential format. Assume the following:
τ1={(Λ11,P11),(Λ12,P12), . . . , (Λ1i,P1i)},
Where,
Next, trace τj can be analyzed as shown in
From the analysis, the following trace is derived:
τ2={(Λ21,P21),(Λ22,P22), . . . ,(Λ21,P21)},
Each trace can thus be analyze to produce a higher level trace. In general, the following set of traces are derived:
τ1, τ2, τ3, . . . , τk,
where trace τk={(Λkl,Pk1), (Λk2,Pk2), . . . , (Λki,Pki)}, represents a different characteristic scale of the self-similar network traffic at a different scale.
Since the scale for each consecutive level is approximated by an exponential distribution, the following ordering is established:
λj−1max<λjmin<λji<λjmax<λj+1min
This relates to a 3 level, 6 state MMPP model to simulate heavy traffic that is bursty and self-similar as shown in
1. Use algorithm τA to analyze input stream and generate trace τ1={(Λ11, P11), (Λ12,P12), . . . ,(Λ1i,P1i)}.
2. Analyze parameters Λ1i (the time stamp of leading of the packet), and P1i (the packet size) from trace τ1. From this the inter-arrival time λi is computed. Thus, there are four possible cases:
Case 1. λi<λ1Bmax and λi<λ1Imin: Pi is detected to belong to burst state PB.
Case 2. λi>λ1Bmax and λi>λ1Imin: Pi is detected to belong to idle state PI.
Case 3. λi>λ1Bmax and λi<λ1Imin: Pi is detected to be inside of the transition window [λ1Bmax, λ1Imin]. In this case, the next state transition selected is dependent on the user process requirements.
Case 4. λi<λ1Bmax and λi>λ1I1min: This is not a valid combination because both can not occur.
These four cases are used as the test criteria in algorithm τB to generate Λ2i (the time stamp of leading of the burst), and P2i (the burst size). This analysis of trace τ1 generates trace τ2={(Λ21, P21), (Λ22,P22), . . . , (Λ2i,P2i)}.
2. Analyze parameters Λ2i (the time stamp of leading of the packet), and P2i (the packet size) from trace τ2. From this the inter-arrival time λi is computed.
Case 1. λi<λ2Bmax and λi<λ2Imin: Pi is detected to belong to burst state PB.
Case 2. λi>λ2Bmax and λi>λ2Imin: Pi is detected to belong to idle state PI.
Case 3. λi>λ2Bmax and λi<λ2Imin: Pi is detected to be inside of the transition window [λBmax, λImin]. In this case, the next state transition selected is dependent on the user process requirements.
Case 4. λi<λ2Bmax and λi>λ2Imin: This is not a valid combination because both can not occur.
These four cases are used as the test criteria in algorithm τB to generate Λ3i (the time stamp of leading of the burst), and P3i (the burst size). This analysis of trace τ2 generates trace τ3={(Λ31,P31),(Λ32,P32), . . . ,(Λ3i,P3i)},
3. Analyze parameters Λ3i (the time stamp of leading of the packet), and P3i (the packet size) from trace τ3. From this, inter-arrival time λi is computed. As explained before, four cases are possible:
Case 1. λi<λ3Bmax and λi<λ3Imin: Pi is detected to belong to burst state PB.
Case 2. λi>λ3Bmax and λi>λ3Imin: Pi is detected to belong to idle state PI.
Case 3. λi>λ3Bmax and λi<λ3Imin: Pi is detected to be inside of the transition window [λBmax, λImin]. In this case, the next state transition selected is dependent on the user process requirements.
Case 4. λi<λ3Bmax and λi>λ3Imin: This is not a valid combination because both can not occur.
These four cases are used as the test criteria in algorithm τB to generate Λ4i (the time stamp of leading of the burst), and P4i (the burst size). This analysis of trace τ3 generates trace τ4={(Λ4I,P41), (Λ42,P42), . . . , (Λ4i,P4i)},
This relates to a 2-level, 4-state MMPP model to simulate heavy traffic that is bursty and self-similar, as shown in
Use algorithm τA to analyze input stream and generate trace τ1={(Λ11,P11),(Λ12,P12), . . . ,(Λ1i,P1i)}.
Analyze parameters Λ1i (the time stamp of leading of the packet), and P1i (the packet size) from trace τ1. From this the inter-arrival time is λi computed. As explained before, there are four cases are possible:
Case 1. λi<λ1Bmax and λi<λ1Imin: Pi is detected to belong to burst state PB.
Case 2. λi>λ1Bmax and λi>λ1Imin: Pi is detected to belong to idle state PI.
Case 3, 4: These cases are the same as for Example 1.
These four cases are used as the test criteria in algorithm s to generate Λ2i (the time stamp of leading of the burst), and P2i (the burst size). This analysis of trace τ1 generates trace τ2={(Λ21,P21), (Λ22,P22), . . . ,(Λ2i,P2i)}.
Analyze parameters Λ2i (the time stamp of leading of the packet), and P2i (the packet size) from trace τ2. From this the inter-arrival time is λi computed. As explained before, four cases are possible:
Case 1. λi<λ2Bmax and λi<λ2Imin: Pi is detected to belong to burst state PB.
Case 2. λi>λ2Bmax and λi>λ2Imin: Pi detected to belong to idle state PI.
Case 3, 4: These cases are the same as for Example 1.
These four cases are used as the test criteria in algorithm τB to generate Λ3i (the time stamp of leading of the burst), and P3i (the burst size). This analysis of trace τ2 generates trace τ3={(Λ31,P31),(Λ32,P32), . . . ,(Λ3i,P3i)}.
This is directed to a 1-level, 2-state MMPP model to simulate heavy traffic that is bursty and self-similar, and is shown in
First, use algorithm τA to analyze input stream and generate trace τ1={(Λ11,P11), (Λ12,P12), . . . ,(Λ1i,P1i)}.
Then, analyze parameters Λ1i (the time stamp of leading of the packet), and P1i (the packet size) from trace τ1. From this the inter-arrival time is λi computed. As explained before, there are four cases are possible:
Case 1. λi<λ1Bmax and λi<λ1Imin: Pi is detected to belong to burst state PB.
Case 2. λi>λ1Bmax and λi>λ1Imin: Pi is detected to belong to idle state PI.
Case 3, 4: These cases are the same as for Example 1.
These four cases are used as the test criteria in algorithm τB to generate Λ2i (the time stamp of leading of the burst), and P2i (the burst size). This analysis of trace τ1 generates trace τ2={(Λ21,P21),(Λ22,P22), . . . ,(Λ2i,P2i)}.
1. Use algorithm τA to analyze input stream and generate trace τ1={(Λ11,P11),(Λ12,P12), . . . ,(Λ1i,P1i)}.
2. Analyze parameters Λ1i (the time stamp of leading of the packet), and P1i (the packet size) from trace τ1.
3. From this the inter-arrival time is λi computed. As explained before, there are four possible cases:
Case 1. λi<λ1Bmax and λi<λ1Imin: There are no packets detected that belong to burst state PB.
Case 2. λi>λ1Bmax and λi>λ1Imin and: All packet inter-arrivals are detected as belonging to idle state PI.
Case 3, 4: These cases are the same as for Example 1.
These four cases are used as the test criteria in algorithm τB to generate Λ21 (the time stamp of leading of the burst), and P2i (the burst size). Since there are no bursts in this traffic, trace τ2 does not exist.
A K-level, N-state MMPP model to simulate variable traffic is shown in
The conditions of the traffic are such that a high variability can be expected from one time period to the other (minute to minute, hour to hour, day to day, week to week, etc.). It is important for the model used by the methodology to be able to capture this variability in a dynamic way, real-time. The model of the invention assumes bursty self-similar traffic, but it adapts to changes in the traffic from very light traffic (pattern 1) when traffic is not bursty, to more heavy bursty traffic (pattern 2), on to very heavy peaks (pattern 3) where the traffic acquires self-similar characteristics. The model assumes the structure of a 2-state multilevel model, or a simpler 2-state single-level model or a single-state model, depending on the traffic. Between the two extremes, there is a continuum of conditions that are represented by the model. The model restructures itself adaptively to the changing conditions of the network traffic.
While the invention has been described in combination with specific embodiments thereof, there are many alternatives, modifications, and variations that are likewise deemed to be within the scope thereof. Accordingly, the invention is intended to embrace all such alternatives, modifications and variations as fall within the spirit and scope of the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
5276677 | Ramamurthy et al. | Jan 1994 | A |
5343465 | Khalil | Aug 1994 | A |
5883819 | Abu-Amara et al. | Mar 1999 | A |
5886907 | Abu-Amara et al. | Mar 1999 | A |
6058260 | Brockel et al. | May 2000 | A |
6075770 | Chang et al. | Jun 2000 | A |
6144639 | Zhao et al. | Nov 2000 | A |
6233226 | Gringeri et al. | May 2001 | B1 |
6304549 | Srinivasan et al. | Oct 2001 | B1 |
6373824 | Tang et al. | Apr 2002 | B1 |
6377992 | Fernández et al. | Apr 2002 | B1 |
6414942 | Ito et al. | Jul 2002 | B1 |
6526259 | Ho | Feb 2003 | B1 |
6715005 | Rodriguez et al. | Mar 2004 | B1 |
Number | Date | Country | |
---|---|---|---|
20040208131 A1 | Oct 2004 | US |