The present invention relates to a packet switched network, preferably an Internet Protocol (IP) network.
In particular, it relates to enhancing performance for data (e.g. IP) packets when a link layer protocol, adapted to fragment said data packets at each hop, is used within the network.
The PPP protocol is a link layer protocol developed to provide an auto configuring, multi-protocol mechanism for exchanging variable length packets over serial point-to-point links. One protocol that can be transported over PPP is the IP.
When both real-time critical traffic and low priority traffic are sent in the same IP network over PPP links, it is very important that the real-time critical traffic is not interfered by the low priority traffic in order to fulfil the delay requirements of the real-time critical traffic. This is of course dependent on the link technology that is being used. However, the present invention concerns preferably PPP links. Even if the delay requirements of the real-time critical traffic are important to fulfil, the lower priority traffic may also have delay requirements. Therefore, this lower priority traffic must also be considered.
If the low priority IP packets are long, the transmission of such packets may block the link for a long period of time. (The length of an IP packet may differ from e.g. 40 bytes to 1500 bytes.) This is solved by fragmentation of the IP packets at PPP level by using the PPP Multilink Protocol (MP), which is further described in K. Sklower et. al., “The PPP Multilink Protocol (MP)”, RFC 1990. Originally, the MP was a protocol that facilitated the use of several physical links as one logic link. One part of the standard also defines a method for splitting and recombining packets.
The fragmentation creates shorter packets, which decreases the blocking time for the real-time critical traffic (high priority traffic). The use of MP introduces extra queues at the link layer besides the original PPP queue. More than one queue at the link layer for a certain router introduces a need for a scheduler at the link layer as showed in
Each MP fragment comprises a PPP header, an MP header and a payload portion. There are two versions of the MP header; a long header and a short header. The long header has a size of four bytes and the short header has a size of two bytes. Which of the two headers that is used depends on the maximum number of MP fragments that is needed (2^10 or 2^24) for a fragmentation of one IP packet, since the difference between the two headers is the number of bits that indicates a sequence number. The sequence number indicates the order of the MP fragments of an IP packet. If a B-bit (Beginning) is marked in an MP fragment, it indicates that this is the beginning of the IP packet and if an E-bit (End) is marked, that indicates accordingly that the fragment is the end of the IP packet. The header keeps track on in what order the fragments should be reassembled by using a sequence number, which also results in that no re-ordering occurs between the reassembled IP packets.
When using MP for fragmentation, the fragmentation is performed per PPP link and the fragments must be reassembled to an IP packet at the end of every link as shown in
When MP fragments from several IP packets are transported in parallel, as in the present invention, it is impossible to reassemble them correctly if it is not possible to differentiate the MP fragments by employing different classes. This may be done by the MP Multi-Class Extension (MCE) protocol. The MCE allows a sender to fragment the packets of various priorities into multiple classes of fragments, and thereby allowing high-priority packets to be sent between fragments of lower priorities, as well as fragments belonging to different IP packets of same priority are allowed to exist mixed on a single link. MP multi-class extension bits 100 are located in the MP header as shown in
In MP pipelining, the long low-priority IP packets are fragmented and MP MCE framed, and in the present invention they are referred to as MP MCE fragments. The high-priority IP packets may only be PPP framed, but if the PPP link is a bundle (several physical links), reordering might occur between the PPP packets. An alternative is thus to also MP MCE frame these packets, in addition to the PPP frame. However, these packets are only framed, not fragmented. In the present invention they are referred to as PPP frames but it should be noted that MP MCE may also be used and they may hence also be MP MCE frames.
Furthermore,
U.S. Pat. No. 5,497,371 describes a packet transport system for mixed traffic in which a packet fragmentation protocol allows traffic of different classes to occupy a single physical link. Low priority packets are fragmentated while high priority packets are never fragmentated. In order to avoid potential fragment mis-ordering, packets are only allowed to be fragmentated on a trunk or a link, where the link acts like a first-in first-out buffer.
U.S. Pat. No. 6,072,772 shows a crossbar switch where fragmentation of packets with variable size into packets with a fixed size provides guarantees for deterministic bandwidth and delay.
U.S. Pat. No. 6,148,000 shows a switching apparatus for receiving and transmitting data units each segmented into a series of cells of data. The apparatus comprising a storage device for storing the series of cells received at each incoming port until the last cell of said series is received.
K. Sklower et. Al., “The PPP Multilink Protocol (MP)”, RFC 1900 proposes a method for splitting, recombining, and sequencing datagrams across multiple logical data links.
C. Bormann, “The Multi-Class Extension to Multi-Link PPP”, RFC 2686 proposes a fragment-oriented solution for the real-time encapsulation format part of the architecture. This document is regarded as the closest prior art.
Thus, the objective problem of the present invention is to enhance performance for data packets when a link layer protocol adapted to fragment said data packets, e.g. the MP is used within a packet switched network.
An advantage with the present invention is that the transmission time for a long low-priority data packet is decreased by pipelining the fragments of the data packet when a link layer fragmentation is used over several links after each other. The fragments are not reassembled into data packets at every intermediate router but instead directly sent to e.g. the last router in the pipeline where they are reassembled. This is achieved by dividing the IP packets into fragments having a size such that an entire IP header fits into one fragment, whereby the identity of the IP packet that the fragment belongs to is revealed.
Another advantage with the present invention is that the transmitting time is not just minimized for the high priority data packets but the delay qualities for the low priority data packets is also taken into consideration.
Yet another advantage with present invention is that MP MCE fragments belonging to several IP packets are able to coexist on common links provided that that the MP multi-class extension protocol is used.
a shows an egress interface without pipelining.
b shows an ingress interface without pipelining.
c shows fragmentation and reassembling of IP packets at each hop
a shows an egress interface with pipelining according to the present invention.
b shows an ingress interface of an intermediate node with pipelining according to the present invention.
c shows an ingress interface of the end node in the pipeline according to the present invention.
a shows an egress interface with pipelining with a separate buffer for arriving MP fragments according to the present invention.
b shows an ingress interface of an intermediate node with pipelining according to the present invention.
c shows an ingress interface of the end node in the pipeline according to the present invention.
Preferred embodiments of the present invention will now be described with references in particular to
The solution to the objective problem according to the present invention is achieved by sending a fragment belonging to a first data packet, preferably with low priority, from a first intermediate router to a second intermediate router independent of if other fragments of the first IP packet have arrived to the first intermediate router. That is provided that the fragment comprising the header portion of the data packet is arriving to the second router before the other fragments belonging to the same data packet. This process is called pipelining. The transmission delay of low priority data packets is thereby minimized.
The present invention is exemplified herein by employing MP with MCE in an IP network. However it is possible to employ another link layer protocol with a per-hop fragmenting capability (e.g. ATM or Frame Relay), wherein a data packet is fragmented into fragments framed with a header portion comprising an identification of said data packet.
The MP MCE may indicate the priority of the packets as described above. According to the present invention, it can also be used for indicating from which router the fragments origin in order to reassemble the MP fragments into IP packets correctly. E.g., half of the classes may be used for priority purposes and the other half may be used to indicate from which router a packet origin, or all classes may be used for origin indication. All MP MCE fragments belonging to the same IP packet are required to belong to the same class and preferably, all fragments from a certain MP MCE queue shall belong to the same class. During defragmenting, the fragments are sorted into the correct queue according to their multi-class.
The fragmentation is shown in
MP MCE fragments in the MP MCE buffer on the egress side of an intermediate router may either have arrived to the router as MP MCE fragments sent from a previous router, referred to as arrived MP MCE fragments, or as IP packets which were MP MCE fragmented at the egress side of the router, referred to as locally generated MP MCE fragments. When an IP packet is MP MCE fragmented, each MP MCE fragment is stored in the MP MCE buffer one after each other. When the same MP MCE buffer is used for locally generated MP MCE fragments and arriving MP MCE fragments, arriving MP MCE fragments can not be stored in between the locally generated MP fragments. In the worst case, an arriving MP MCE fragment belonging to a first IP packet have to wait for all the MP MCE fragments belonging to said first IP packet. This waiting procedure consumes some of the delay-profit obtained by the use of MP pipelining. This is illustrated in
That is solved according to the present invention by having several MP MCE queues 404, one for locally generated MP MCE fragments and one or more for arriving MP fragments fragmented in a previous router as in
At the end of the last link, in the direction of the aggregation, i.e. the uplink direction where MP MCE fragmentation is used, each MP MCE class is required to have its own reassemble queue. Basically, this means that every MP MCE queue further down in the network must have one corresponding reassembling queue in end of the link. As shown in
In the opposite direction, i.e. the downlink direction, it is enough with one queue per priority level at the starting point of the pipeline. The MP fragments 606 must there be classified, by using the MP multi-class extension bits 100, in order to ensure that the fragments 606 belonging to the same IP packet are filtered and reassembled at the correct router 602. As shown in
The reassembling (i.e. defragmentation) is performed similar to the one hop case and similar to how the packets with lost MP MCE fragments should be treated. If the MP MCE fragment comprising the header portion is lost, the remaining fragments belonging to the same IP packet may be discarded in one of the subsequent routers, i.e. if there is no index corresponding to the MP multi-class extension of the MP MCE fragment. A solution is to store MP MCE fragments which do not have a corresponding entry in the L2 forwarding table until the MP MCE fragment comprising the header portion arrives, or until a predetermined time period has passed or until the end MP MCE fragment shows up and then discard all the other MP MCE fragments belonging to the same IP packet. If the MP MCE fragments are sent to the end of the pipeline, the fragments will there be discarded as they would have been done for the one hop case (if an intermediate or the last fragment is lost).
If an MP MCE fragment, comprising a marked E-bit, arrives, the entry in the L2 forwarding table may be discarded when the fragment is sent. This should only be performed if all the fragments are transmitted in order. That may be ensured by checking the sequence number in the MP header. The above mentioned methods are only examples how this fault handling may be managed. Naturally, other solutions are also possible.
In a preferred embodiment according to the present invention shown in
b shows the ingress interface of an intermediate router with pipelining in accordance with the present invention. MP MCE/PPP frames 729 and MP MCE fragments 728 are demultiplexed by a demultiplexer 714 into separated flows. The MP MCE/PPP frames are MP/MCE or PPP deframed by a deframer 716 located in the intermediate router. The MP MCE fragments 728 are transferred further to a subsequent router according to their MCE that is associated with an output port number of the intermediate router. The MP MCE fragments 728 are thus either deframed or defragmented.
c shows an ingress interface of an end node in the pipeline. MP MCE/PPP frames 729 and MP MCE fragments 728 are demultiplexed by a demultiplexer 714 into separated flows according to the multi-class, i.e. the MP MCE/PPP frames 729, MP MCE fragments 728(2) belonging to the first IP packet and MP MCE fragments 728(3) belonging to the second IP packet. The MP MCE/PPP frames are sent to a MP MCE/PPP deframing unit 716 and the respective flow of MP MCE fragments is sent to a respective buffer 710 prior to the respective MP MCE defragmenting/deframing unit 718. This procedure results in a flow of IP packets 702,704, comprising a header portion and a payload portion.
In a second preferred embodiment according to the present invention shown in
b shows the ingress interface of an intermediate router with pipelining in accordance with the present invention. MP MCE/PPP frames 729 and MP MCE fragments 728 are demultiplexed by a demultiplexer 714 into separated flows. The MP MCE/PPP frames 729 are MP MCE/PPP deframed by a deframer 716 located in the intermediate router. The MP MCE fragments 728 are further transferred to a subsequent node according to their MCE that is associated with an output port number of the router. The MP MCE fragments 728 are thus not deframed or defragmented.
Referring to
The method is implemented in a router according to the present invention and is further described by
Locally Generated MP MCE Fragments at the Router:
1000: A first low priority IP packet is fragmented into a plurality of MP MCE fragments at a first router, wherein the first MP MCE fragment includes the header portion of the IP packet. The fragments are forwarded after the fragmentation to a subsequent intermediate router.
Arriving MP MCE Fragments at the Router, Previous Fragmented at Another Router:
1001: A first MP MCE fragment belonging to a second IP packet comprising the header portion of a second IP packet is received at the router, wherein the fragment has a marked B-bit.
1002: Perform routing look-up, i.e. determine an output port number at the router in order to route the first MP MCE fragment to a subsequent router. (Each output port of the router is associated with a subsequent router.) It is important that the MP MCE fragment comprising the header portion is arriving to the router first relative to the other fragments belonging to the same IP packet.
If one of the other fragments belonging to the same IP packet arrives first has an unmarked B-bit, implying that the fragments of the IP packets have not arrived in the correct order, different methods may be used. One alternative is to discard all fragments with the same multi-class extension as the arrived fragment until a fragment is received comprising a marked E-bit. A second alternative is to store the arrived fragment in e.g. a buffer a predetermined time or until a fragment with a marked E-bit arrives or based upon another parameter.
1003: Associate an output port number with the MP multi-class extension of the first MP fragment comprising the header portion of the second IP packet in an L2 routing table. The IP packet the fragment belongs to is identified by the multi-class extension.
1004: Send the first MP fragment from the router to a subsequent router independent of if other MP fragments, belonging to the same second IP packet, have arrived to the first router.
1005: Send other MP fragments with the identical multi-class, as the first MP fragment, to the port number stored in the L2 table at the router.
1006: Erase the entry in the L2 routing table when a fragment with a marked E-bit has been sent, if it is guaranteed that all fragments are sent in order, which implies that the rest of the fragments belonging to the same IP packet already have been sent.
1007: Discard the rest of the fragments, if the first fragment is lost, i.e. if there is no index corresponding to the multi-class of the MP fragment.
Repeat steps 1001b-1008 until the end of the pipeline is reached and the MP MCE fragments having identical multi-class extension are reassembled.
1008: Discard the MP fragments provided that they are sent to the end of the pipeline and the IP packet is reassembled.
The method may be implemented by means of a computer program product comprising the software code means for performing the steps of the method. The computer program product is run on processing means in a router in an IP network.
The present invention is not limited to the above-described preferred embodiments. Various alternatives, modifications and equivalents may be used. Therefore, the above embodiments should not be taken as limiting the scope of the invention, which is defined by the appending claims.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/SE02/00571 | 3/22/2002 | WO | 00 | 9/20/2004 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO03/081855 | 10/2/2003 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4958341 | Hemmady et al. | Sep 1990 | A |
4977582 | Nichols et al. | Dec 1990 | A |
5440545 | Buchholz et al. | Aug 1995 | A |
5497371 | Ellis et al. | Mar 1996 | A |
6073176 | Baindur et al. | Jun 2000 | A |
6130880 | Naudus et al. | Oct 2000 | A |
6535486 | Naudus et al. | Mar 2003 | B1 |
6577596 | Olsson et al. | Jun 2003 | B1 |
6671758 | Cam et al. | Dec 2003 | B1 |
6714985 | Malagrino et al. | Mar 2004 | B1 |
6724747 | Arango et al. | Apr 2004 | B1 |
6744927 | Kato | Jun 2004 | B1 |
6795435 | Jouppi et al. | Sep 2004 | B1 |
6876669 | Shalom | Apr 2005 | B2 |
6891855 | Bruckman | May 2005 | B2 |
6944672 | Crow et al. | Sep 2005 | B2 |
7089486 | Marleux et al. | Aug 2006 | B1 |
7184402 | Sharma et al. | Feb 2007 | B1 |
7224703 | Antal et al. | May 2007 | B2 |
7283527 | Brown et al. | Oct 2007 | B2 |
7292530 | Christensen | Nov 2007 | B2 |
7302491 | Brown et al. | Nov 2007 | B2 |
7315900 | Ofelt et al. | Jan 2008 | B1 |
7403999 | Corl et al. | Jul 2008 | B2 |
20020010866 | McCullough et al. | Jan 2002 | A1 |
20020051466 | Bruckman | May 2002 | A1 |
20020131425 | Shalom | Sep 2002 | A1 |
20030058836 | Even | Mar 2003 | A1 |
20030081605 | Egevang | May 2003 | A1 |
20030110286 | Antal et al. | Jun 2003 | A1 |
20040093521 | Hamadeh et al. | May 2004 | A1 |
20050025185 | Brown et al. | Feb 2005 | A1 |
20050129020 | Doyle et al. | Jun 2005 | A1 |
20050152355 | Henriques | Jul 2005 | A1 |
20060039379 | Abe et al. | Feb 2006 | A1 |
20060098684 | Bozionek et al. | May 2006 | A1 |
20070263542 | Bammesreiter et al. | Nov 2007 | A1 |
Number | Date | Country |
---|---|---|
WO 0228048 | Apr 2002 | WO |
WO 0228048 | Apr 2002 | WO |
Number | Date | Country | |
---|---|---|---|
20050152355 A1 | Jul 2005 | US |