The present invention relates generally to the field of audio/video data packet transmission over digital networks; more specifically, to mechanisms for minimizing or eliminating fragmentation in real-time data traffic over a network.
In the past few decades, advances in the related fields of video compression and video transmission systems have lead to the widespread availability of digital video programs transmitted over a variety of communication systems and networks. Most recently, new technologies have been developed that have allowed audio and video programs to be transmitted as multicast digital bitstreams of multiplexed video and audio signals delivered to users or client subscribers over packet switched Internet Protocol (IP).networks.
IP multicasting is defined as the transmission of an IP datagram (i.e., a data packet formatted according to the Internet Protocol) to a “host group”, which is a set of hosts identified by a single IP destination address. A multicast datagram is asymmetrically delivered to all members of its destination host group. The Internet Group Management Protocol (IGMP), which is defined in RFC792, is used between IP hosts and their immediate neighbor multicast agents to support the creation of transient groups, the addition and deletion of members of a group, and the periodic confirmation of group membership. Multicast data streams are typically sent using the User Datagram Protocol (UDP), which is implemented in the transport layer and provides a connectionless datagram service for the delivery of packets.
By way of further background, U.S. Pat. No. 6,771,644 teaches a system and method for supporting audio/video program insertion in real-time IP multicasts. Scheduling and control traffic occurs through a new protocol wherein smooth transitions occur by manipulation of the Real-Time Protocol (RTP) header in the packets and the associated RTP Control Protocol (RTCP) stream. RTP is a known protocol for transmitting real-time data such as audio or video streams. While it does not guarantee real-time delivery of data, RTP does provide mechanisms for sending and receiving applications to support data streaming. RTCP, on the other hand, relies on the periodic transmission of control packets from the endpoints to the originator of the data (media) stream using the same distribution mechanism—but not necessarily the same path—as the data packets. A method and system for providing media services in voice over IP (VoIP) telephony in which audio is transmitted in packet streams such as RTP/RTCP packets is disclosed in U.S. Pat. No. 6,947,417. U.S. Pat. No. 6,044,081 teaches a communications system and multimedia system that allows private network signaling to be routed over a packet network.
In video streaming applications, MPEG video streams comprise different types of frames that do not include all of the data to be displayed at any given time. For instance, Inter-frames, or I-frames, are the only type of frame that is not coded with reference to any other frame; P-frames are coded predicatively from a previous I-frame or P-frame; B-frames are coded predicatively from I-frames and P-frames. In order to be properly decoded, a B-frame associated with a group of pictures (“GOPs”) may need to reference the I-frame of a next GOP. Due to their comprehensive nature, I-frames are generally much larger (e.g., 5 Kbytes or more) than either P-frames or B-frames. (It should be understood that a GOP is an optional structure of an elementary stream. Also, in the context of the present application, the term “I-frame” is intended to broadly refer to an Inter-frame and its equivalents, e.g., an IDR frame in the case of H.264.)
For audio/video solutions, RTP streams are typically created between the media server and the endpoints. In the case of video, the RTP packet sizes that include I-frames are often larger than the maximum transmission unit (MTU) size that the intermediate nodes in the network can process. When the RTP packet size exceeds the MTU of a node, the packets usually are fragmented into smaller sized packets. Fragmentation of packets, however, adds latency that slows down video data transmissions and other time-sensitive applications. Therefore, fragmentation of RTP packets should be avoided as much as possible.
In the case where the packet header has the “Don't Fragment” (DF) bit marked, the packet is simply dropped. In accordance with RFC1191, an ICMP notification that the packet has been dropped may be sent back to the source node. Although another attempt may be made to re-send the dropped packet, the RFC1191 mechanism provides no assurance that the resent packet will successfully traverse the network without being fragmented. For instance a large video packet containing an I-frame may be repeatedly re-sent by the source with the DF bit set, but never arrive at the destination. Obviously, this is a serious problem for video streaming traffic which usually must be delivered without delay or interruption. Furthermore, marking video packets with the DF bit will result in loss of packets for certain specifications, such as the H.26x codec standard, which lack a resend mechanism.
In many cases, fragmentation occurs when data packets are passed from one networking layer to another. By way of example,
What is needed then is for a solution to the problem of fragmentation of large, time-sensitive data packets, such as audio and video data packets.
The present invention will be understood more fully from the detailed description that follows and from the accompanying drawings, which however, should not be taken to limit the invention to the specific embodiments shown, but are for explanation and understanding only.
A mechanism for optimizing the size of data packets in a stream so that fragmentation of packets does not occur is described. In the following description, numerous specific details are set forth, such as device types, protocols, data sizes, formats, etc., in order to provide a thorough understanding of the present invention. However, persons having ordinary skill in the networking arts will appreciate that these specific details may not be needed to practice the present invention.
A computer network is a geographically distributed collection of interconnected subnetworks for transporting data between nodes, such as intermediate nodes and end nodes. A local area network (LAN) is an example of such a subnetwork; a plurality of LANs may be further interconnected by an intermediate network node, such as a router, bridge, or switch, to extend the effective “size” of the computer network and increase the number of communicating nodes. Examples of the end nodes may include servers and personal computers. The nodes typically communicate by exchanging discrete frames or packets of data according to predefined protocols. In this context, a protocol consists of a set of rules defining how the nodes interact with each other.
Each node typically comprises a number of basic subsystems including a processor, a main memory and an input/output (I/O) subsystem. Data is transferred between the main memory (“system memory”) and processor subsystem over a memory bus, and between the processor and I/O subsystems over a system bus. Examples of the system bus may include the conventional lightning data transport (or hyper transport) bus and the conventional peripheral component interconnect (PCI) bus. The processor subsystem may comprise a single-chip processor and system controller device that incorporates a set of functions including a system memory controller, support for one or more system buses and direct memory access (DMA) engines. In general, the single-chip device is designed for general-purpose use and is not heavily optimized for networking applications.
As shown in
In a typical networking application, packets are received from a framer, such as an Ethernet media access control (MAC) controller, of the I/O subsystem attached to the system bus. A DMA engine in the MAC controller is provided a list of addresses (e.g., in the form of a descriptor ring in a system memory) for buffers it may access in the system memory. As each packet is received at the MAC controller, the DMA engine obtains ownership of (“masters”) the system bus to access a next descriptor ring to obtain a next buffer address in the system memory at which it may, e.g., store (“write”) data contained in the packet. The DMA engine may need to issue many write operations over the system bus to transfer all of the packet data.
According to one embodiment of the present invention, an extension to the RTCP is utilized to signal back to the RTP originating node the number of fragments received at the destination node (i.e., endpoint) and also the minimum size of the first fragmented packets (the first fragment is marked in the IP header of a fragment received). The minimum size of the received packet (typically greater than 576 bytes) signifies the general preference of the network at that particular point in time to ensure that subsequently sent RTP packets in the data stream are not fragmented. In other words, the information conveyed in the RTCP report is used by the RTP originating or source node to adjust/control the size of the RTP packets subsequently sent across the network. Once the source node has adjusted the packet size and then receives a RTCP report back from the destination node indicating no fragmentation occurred in the previous transmission, the source node continues sending RTP packets of that size.
In an alternative embodiment, the node that originates the RTP stream incrementally increases the packet size after a predetermined time or after receipt of a certain number of RTCP reports indicating no fragmentation. For example, in one implementation, after three RTCP reports are received with no packet fragmentation the sending node incrementally sends packets of a slightly larger size (e.g., ˜10% larger) until a RTCP report is received indicating fragmentation. At that point, the sender again reduces the size of the packets being sent until RTCP reports are again received indicating no fragmentation. This latter approach permits optimization or dynamic MTU tuning of the network to maximize the size of packets being sent over the network as conditions change (e.g., new routing paths) for the better.
With reference now to
In one implementation, both nodes 11 & 12 run software (or firmware) which implements an extension to RTCP that permits communication of the number of fragments received by the destination, as well as the minimum size of the first fragment greater than a predetermined size (e.g., 576 bytes). In the example of
After receiving the three packet fragments, destination node 12 utilizes an extension to RTCP to send a report back to sending node 11 along path 19 indicating that three fragments were received, with the minimum size of the first fragment greater than 576 bytes being equal to 850 bytes (block 23). It should be understood that the 576 byte number simply represents a predetermined threshold or minimum MTU that the network ordinarily should support. In other words, the threshold or minimum MTU number may vary substantially based on network type and/or configuration details. In an alternative embodiment, the RTCP report may simply contain the size of the largest packet fragment received.
RTP originating node 11 responds to the RTCP report by reducing the size of the data packets it sends over the network. In this example, originating node 11 responds to the RTCP report by subsequently sending data packets of size=850 bytes (block 24). Since the 850 byte data packet size is less than or equal to the smallest MTU of intermediate nodes 15-17, no fragmentation occurs for this network path. Consequently, RTP destination node 12 sends RTCP reports back to RTP originating node 11 indicating no fragmentation (block 25). In other words, the mechanism of the present invention eliminates fragmentation in an RTP data stream (e.g., audio and/or video data packets).
The process continues with the destination node continuing to send RTCP reports back indicating fragmentation or no fragmentation (block 34). If there is no fragmentation, the sending node continues with the packet size previously sent. In the embodiment shown, the sending node incrementally increases the size of the data packets over time until a RTCP report is received indicating fragmentation. For example, the originating node may increase the data packet size by a factor of ˜10% after three consecutive RTCP reports have been received indicating no fragmentation (block 35). In the case of streaming video, RTCP reports may be sent every five seconds, meaning that the originating node may adjust the RTP video stream approximately every 10 to 15 seconds.
The sending or originating node continues to increment the data packet size until an RTCP report is received from the destination node that indicates fragmentation has occurred (block 36). At that point, the sending node reduces the size of the packets being sent to eliminate fragmentation. That is, the datagram size is reduced to the largest size that does not require fragmentation anywhere along the current path across the network from source to destination. This entire process continues for the duration of the data stream. In other words, dynamic optimization of the packet size takes place on a per stream basis to reduce latency in the delivery of real time traffic regardless of the path the RTP traffic might traverse.
Practitioners in the networking and multicasting arts will appreciate that the embodiments described above are transparent to the client devices. That is, no changes are required to existing client STBs in order to implement the system and methods of the present invention. The present invention is not limited to video applications but may also be useful to other communication technologies.
It should be understood that elements of the present invention may also be provided as a computer program product which may include a “machine-readable medium” having stored thereon instructions which may be used to program a computer (e.g., a processor or other electronic device) to perform a sequence of operations. A machine-readable medium” may include any computer program product, apparatus and/or device (e.g., magnetic discs, optical disks, memory, Programmable Logic Devices (PLDs) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine readable signal. Alternatively, the operations may be performed by a combination of hardware and software. The machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, magnet or optical cards, or other type of media/machine-readable medium suitable for storing electronic instructions. Elements of the present invention may be downloaded as a computer program product, wherein the program may be transferred from a remote computer (e.g., a server) to a requesting computer by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection).
Furthermore, although the present invention has been described with reference to specific exemplary embodiments, it should be understood that numerous changes in the disclosed embodiments can be made in accordance with the disclosure herein without departing from the spirit and scope of the invention. The preceding description, therefore, is not meant to limit the scope of the invention. Rather, the scope of the invention is to be determined only by the appended claims and their equivalents.
Number | Name | Date | Kind |
---|---|---|---|
5483587 | Hogan et al. | Jan 1996 | A |
5600366 | Schulman | Feb 1997 | A |
5673253 | Shaffer | Sep 1997 | A |
5729687 | Rothrock et al. | Mar 1998 | A |
5917830 | Chen et al. | Jun 1999 | A |
5963217 | Grayson et al. | Oct 1999 | A |
6137834 | Wine et al. | Oct 2000 | A |
6141324 | Abbott et al. | Oct 2000 | A |
6236854 | Bradshaw | May 2001 | B1 |
6269107 | Jong | Jul 2001 | B1 |
6332153 | Cohen | Dec 2001 | B1 |
6501739 | Cohen | Dec 2002 | B1 |
6505169 | Bhagavath et al. | Jan 2003 | B1 |
6570926 | Agrawal et al. | May 2003 | B1 |
6608820 | Bradshaw | Aug 2003 | B1 |
6624841 | Buchner | Sep 2003 | B1 |
6643496 | Shimoyama et al. | Nov 2003 | B1 |
6650652 | Valencia | Nov 2003 | B1 |
6671262 | Kung et al. | Dec 2003 | B1 |
6675216 | Quatrano et al. | Jan 2004 | B1 |
6718553 | Kenworthy | Apr 2004 | B2 |
6735572 | Landesmann | May 2004 | B2 |
6744785 | Robinett et al. | Jun 2004 | B2 |
6771657 | Elstermann | Aug 2004 | B1 |
6775247 | Shaffer et al. | Aug 2004 | B1 |
6816469 | Kung et al. | Nov 2004 | B1 |
6876734 | Summers et al. | Apr 2005 | B1 |
6865540 | Faber et al. | Aug 2005 | B1 |
6925068 | Stanwood et al. | Aug 2005 | B1 |
6931001 | Deng | Aug 2005 | B2 |
6931113 | Ortel | Aug 2005 | B2 |
6937569 | Sarkar et al. | Aug 2005 | B1 |
6956600 | Gaylord | Oct 2005 | B1 |
6956828 | Simard et al. | Oct 2005 | B2 |
6959075 | Cutaia et al. | Oct 2005 | B2 |
6976055 | Shaffer et al. | Dec 2005 | B1 |
6989856 | Firestone et al. | Jan 2006 | B2 |
7003086 | Shaffer et al. | Feb 2006 | B1 |
7007098 | Smyth et al. | Feb 2006 | B1 |
7084898 | Firestone et al. | Aug 2006 | B1 |
7127487 | Wang et al. | Oct 2006 | B1 |
7209763 | Martin et al. | Apr 2007 | B2 |
7379653 | Yap et al. | May 2008 | B2 |
20010000540 | Cooper et al. | Apr 2001 | A1 |
20020004841 | Sawatari | Jan 2002 | A1 |
20020006165 | Kato | Jan 2002 | A1 |
20020014282 | Andersson et al. | Feb 2002 | A1 |
20020052214 | Maggenti et al. | May 2002 | A1 |
20020087976 | Kaplan et al. | Jul 2002 | A1 |
20020163918 | Cline | Nov 2002 | A1 |
20030016632 | Refai et al. | Jan 2003 | A1 |
20030025786 | Norsworthy | Feb 2003 | A1 |
20030076850 | Jason, Jr. | Apr 2003 | A1 |
20030198195 | Li | Oct 2003 | A1 |
20040057449 | Black | Mar 2004 | A1 |
20040165527 | Gu et al. | Aug 2004 | A1 |
20040165710 | DelHoyo et al. | Aug 2004 | A1 |
20040199659 | Ishikawa et al. | Oct 2004 | A1 |
20040213152 | Matuoka et al. | Oct 2004 | A1 |
20040250201 | Caspi | Dec 2004 | A1 |
20040255328 | Baldwin et al. | Dec 2004 | A1 |
20050069102 | Chang | Mar 2005 | A1 |
20050078171 | Firestone et al. | Apr 2005 | A1 |
20050081244 | Barett et al. | Apr 2005 | A1 |
20050138372 | Kajihara et al. | Jun 2005 | A1 |
20050259803 | Khartabil | Nov 2005 | A1 |
20060020995 | Opie et al. | Jan 2006 | A1 |
20060072672 | Holcomb et al. | Apr 2006 | A1 |
20060104458 | Kenoyer | May 2006 | A1 |
20060146734 | Wenger et al. | Jul 2006 | A1 |
20060189337 | Farrill et al. | Aug 2006 | A1 |
20060206617 | Rey et al. | Sep 2006 | A1 |
20060259755 | Kenoyer | Nov 2006 | A1 |
20070110029 | Gilmore, II et al. | May 2007 | A1 |
20070123284 | Schliwa-Bertling et al. | May 2007 | A1 |
20070133435 | Eneroth et al. | Jun 2007 | A1 |
20080043140 | Herpel et al. | Feb 2008 | A1 |
Number | Date | Country |
---|---|---|
1 553 735 | Jul 2005 | EP |
WO 0019693 | Apr 2000 | WO |
WO 2005093661 | Oct 2005 | WO |
Number | Date | Country | |
---|---|---|---|
20070115963 A1 | May 2007 | US |