1. Field of the Invention
The invention relates to network communications and more specifically to the management of data transfer rates under TCP/IP protocols.
2. Background of the Invention
As illustrated in prior art
Communications between the elements of computer network 100 are typically managed using a layered series of software and hardware systems. Prior art
TCP 220 includes software running on both sender and receiver 110 devices. TCP 220 hides the details pertaining to the network and its characteristics from FTP/HTTP 210 and communicates with FTP/HTTP 210 through a simple TCP application programming interface (API). TCP 220 has three roles: (a) it prevents congestion at the receiver, (b) it prevents congestion in the network along the path used from source to destination, and (c) it guarantees a reliable data transfer.
At sender 130, buffers in TCP 220 receive data to be transferred from applications at FTP/HTTP 210. These buffers are referred to as the “TCP Send Buffers.” Likewise, at receiver 110, there are “TCP Receive Buffers” associated with each sender 130, from which applications on receiver 110 read data transferred over network 120. A pair of send and receive buffers exists for each TCP connection. Data are written to and read from the send and receive buffers in blocks of various sizes.
To prevent congestion along a communications path, TCP 220 follows a congestion control algorithm that limits the amount of outstanding data. The congestion control algorithm contends with the fact that no information is available regarding the available bandwidth of the path from sender 130 to receiver 110. The congestion control algorithm starts by allowing a small amount of outstanding data and then increases the amount of outstanding data allowed as acknowledgements are received from receiver 110. The amount of outstanding data allowed by the congestion control algorithm is referred to as the congestion window (Wc). The actual amount of data that can be sent is the send window (Ws), which is equal to the minimum of Wr 330 and Wc.
To illustrate the dynamics of the congestion window Wc, one may momentarily assume that there is an abundance of data to send and that receive window Wr 330 is very large, and therefore does not place any limitation on the size of the send window Ws. The congestion control algorithm starts with a small congestion window (Wc) equal to the maximum size of a single TCP segment. The first transmission includes only one TCP segment, carrying at most an amount of data equal to the maximum segment size (MSS). When an acknowledgement of the first segment is returned from receiver 110, the congestion window Wc is increased by another MSS to two times MSS. Since, for this illustration, Wr 330 is assumed to be very large, the send window Ws also becomes two times MSS. The maximum amount of data that can be sent in the next transmission (Ws) is now two times MSS. When two segments are sent, each is acknowledged independently by receiver 110 and as each acknowledgement is received the congestion window Wc is increased by MSS. After both acknowledgements are received from a two-segment transmission, Wc (and Ws) will equal four times MSS. By increasing the size of the congestion window by one MSS for each acknowledgement received, the number of segments in each transmission doubles as each previous transmission is fully acknowledged. This increase continues until the congestion window Wc reaches a certain value called the “threshold.” Beyond the threshold, the congestion window Wc is incremented by only one MSS for each set of transmissions. This phase of the congestion control algorithm is referred to as the congestion avoidance phase. The congestion window Wc keeps increasing, albeit at a smaller (linear) rate instead of an exponential rate, until the size of the congestion window Wc reaches a preset maximum. This maximum is a predetermined system parameter. Since the congestion window Wc increases over time, the likelihood of congestion and, therefore, loss of a packet on the network also increases over time. If a TCP segment is lost in the network, the lack of acknowledgement from receiver 110 within a timeout period triggers the retransmission of the lost segment and all segments following it and also resets the size of congestion window Wc (and Ws) to MSS. The process described above is then repeated. TCP 220, therefore, does not maintain a constant transmission rate, but rather transmits data using a congestion window that increases according to first an exponential rule and then a linear rule until a transmission loss occurs and Wc is reset.
When Wr 330 is finite the send window, Ws, is determined by Wr 330 if Wr 330 is less than Wc. Ws is always the minimum of the current value of Wc and the most recently declared value of Wr 330.
A method of controlling the rate of data transfer is to limit the rate at which the application at sender 130 writes data to the send buffer. This is accomplished by limiting the size and frequency of data blocks written to the send buffer. Writing blocks of small size and high frequency can represent a high overhead at sender 130. Since sender 130 is typically a server serving a large number of clients, the preference of sender 130 is to use large blocks, and to control the rate by controlling the frequency of writes. However, writing large size blocks gives TCP 220 an opportunity to send large batches. Large batches generate traffic of a bursting nature and thus are more likely than traffic of a steady nature to cause congestion in network 120.
The prior art systems described above have a number of disadvantages. The variable send window Ws implies that an optimal transmission rate is never maintained. The average transmission rate also varies as a function of total traffic on the network. This variability makes scheduling of large data transfers difficult and allows large transfers to have a significant impact on other data. To be practical any solutions to these problems must be made utilizing the current standardized protocols.
Two prior art methods of regulating transfer rates involve intercepting the acknowledgement sent from receiver 110 to sender 130. In the first method the acknowledgement is delayed for a period of time and in the second the value of Wr reported within the acknowledgement is modified by inserting a new value. Both of these approaches have significant disadvantages, including the necessity of intercepting packets in the network.
Further information about TCP/IP and the state of the art in data transfer methods is found in the following references:
W. Richard Stevens, “TCP/IP Illustrated. Vol I—The Protocols,” Addison-Wesley, 1994;
Comer, Douglas, “Internetworking with TCP/IP. Vol. I.,” Prentice Hall, 1991;
Comer, Douglas, and Stevens, David, “Internetworking with TCP/IP. Vol. II.,” Design, Implementation, and Internals. Prentice Hall, 1991; and
Packer, Robert L., “Method for Explicit Data Rate Control in a Packet Communication Environment Without Data Rate Supervision”, U.S. Pat. No. 6,038,216.
The invention includes methods of regulating the rate of transmission on a computer network employing TCP. The methods involve moderating the amount of available space in a receive buffer by either reducing the rate at which data is removed from the receive buffer or changing the size of the receive buffer. By moderating the available space in the receive buffer, the size of data blocks sent by a sender is controlled and the average transmission rate is regulated.
Embodiments of the system for regulating an average rate of transmission on a computer network include a rate control software module, running on the receiver, that monitors the actual transmission rate, the number of lost packets, and the size of the receiver's receive window (Wr). In an embodiment of the invention the rate control module regulates the rate at which data is removed from the receive buffer. In another embodiment the rate control module adjusts the size of the receive buffer. In both of these embodiments, the rate control module moderates the size of the receiver's receive window, which controls the rate at which the sender transmits data.
The rate control module optionally operates in response to a priority or schedule associated with a data transmission. For example, the receiver attempts to achieve a transmission rate that is determined by a priority assigned to the data. Computer code operating on either the sender or the receiver optionally manages a schedule for a plurality of transmissions.
The invention allows the transfer of large data files in the “background” with controlled effects on other network traffic.
Rate control module 510 controls data transmission rates by manipulating the reported size, Wr 330, of the receive window. The send window and thereby the transmission rate is limited by changing the size of the receive window as reported to sender 540 by TCP 220. Limiting the transmission rate of low priority data reduces the impact that low priority transmissions have on higher priority transmissions.
In one embodiment of the invention, rate control module 510 manipulates the size of receive window WR 330 by controlling the rate at which data are read from receive buffer 310. Manipulating this rate does not require modification of standard protocols and is, therefore; non-intrusive. When data is removed from receive buffer 310 in increments of a relatively small size, the declared available space Wr 330 sent back with each acknowledgement remains small, and thus limits send window Ws and the rate of data transmitted by sender 540.
Locating rate control module 510 on receiver 520 provides a number of advantages. Software on receiver 520 is capable of measuring an actual transfer rate more accurately than software located elsewhere. Rate control module 510 interrogates the TCP API and obtains the number of total bytes and free bytes in receive buffer 310. Rate control module 510 is thus able to monitor the absolute rates at which data are delivered to and read out of receive buffer 310. Rate control module 510 actively adapts the rate at which data are read from receive buffer 310, so as to control Wr 330 and thereby the rate at which sender 540 transmits data. Rate control module 510 can adapt the rate at a variety of times, e.g., just before a read operation. Through the two steps of (1) reading data from receive buffer 310 in increments of a certain size and frequency, and (2) examining the amount of data in receive buffer 310, many further methods of dynamic rate control, through the parameter Wr, are possible. This embodiment of rate control module 510 does not require modification of standard protocols such as TCP 220 or IP 230.
In an alternative embodiment, rate control module 510 manipulates the size of receive window Wr 330 by modifying the size of receive buffer 310. Since TCP 220 uses the size of receive buffer 310 to calculate Wr 330, changing the size of receive buffer 310 influences the size of the send window Ws. The size of receive buffer 310 is changed using standard techniques. Modifying the size of receive buffer 310 has an advantage in that the rate that data is read from receive buffer 310 by FTP/HTTP 210 is not reduced. Controlling the transmission rate by modifying the size of receive buffer 310 is also used to prevent an initial burst of traffic during an initial phase of a transmission, when the congestion window Wc is increasing exponentially.
Schedule/priority manager 610 manages scheduling of data transfers and allows users to designate a level of priority for a data transfer. For example, data is scheduled to be transferred to receiver 520 before a specific deadline, or alternatively, the transfer is given a priority such that it does not interfere with other traffic on network 530. In one embodiment of transfer manager 550, a user selects a priority level for the data transfer. The selected priority level results in an absolute transfer rate or is responsive to other factors such as the total amount of traffic receiver 520 is receiving or the number of dropped packets experienced in the transfer. The rate of data transfer to receiver 520, for data transferred using the methods of the invention, is varied as a function of the amount of other traffic on network 530 or traffic receiver 520 is receiving. For example, receiver 520 begins a real-time video conference and as a result the transfer rate of a continuing low priority data transfer is reduced to regulate the effect that the low priority transfer has on the video conference. When the video conference is complete the transfer rate of the low priority data transfer is again adjusted.
Priority data 620 includes data provided by a user or supervisor that allows schedule/priority manager 610 to determine the relative priority of data transfers. Priority data 620 is used for determining transfer rates and transfer schedules. Transfer history data 630 is used to monitor the performance of transfer manager 550 and to estimate total transfer times.
In one embodiment, transfer manager 550 and receiver 520 first exchange information such as target transfer rates, transmission priority, historic transmission rates, reception rates, Wc, Ws, buffer information, or the like. Rate control module 510 at receiver 520 then uses this information to control the transmission such that it meets priority and schedule requirements while also having a regulated impact on network 530. Other shared data and factors, such as time of day, network load, and the like, are also optionally considered.
In another embodiment, transfer manager 550 regulates communications to a plurality of receiver 520. In this embodiment, transfer manager 550 is located on a router acting as an intermediary between the Internet and a local area network that includes a plurality of receiver 520. Transfer manager 550 assigns different priorities to data transfers as a function of the identity of receiver 520 or optionally assures that a data transfer to one receiver 520 does not significantly affect data transfers to a second receiver 520.
As shown in
In alternative embodiments, rate control module 510 is present at receiver 520 and sender 540, and rate control module 510 adjusts the size of the TCP buffers of both sender 540 and receiver 520. In one embodiment, the receive window Wr 330 on receiver 520 is dynamically controlled by changing the size of receive buffer 310. A large initial burst of traffic is avoided during the initial period 710 (
Reducing the number of lost data packets in a transmission reduces the quantity of segments lost by other traffic on the network and the impact that the transmission has on that traffic. This is important when a goal is to reconcile the effect the transmission has on other network traffic with the priority of the transmission. For example, in some instances it is desirable to send a large file in the “background” without affecting other traffic on the network. In these instances, transfer manager 550 initially sets a low priority for a transmission and later adjusts the priority as a function of a delivery schedule. Rate control module 510 enables the delivery of large files without causing a significant increase in lost data packets.
In step 830, TCP 220 on receiver 520 receives a requested data segment from sender 540 and places the data segment in receive buffer 310. In step 840, TCP 220 calculates the receive window Wr 330 and places the result within an acknowledgement of the last data transmission. The acknowledgement is then sent to sender 540 in step 850. In a step 860, rate control module 510 running on receiver 520 decides how much data, if any, should be removed from receive buffer 310. The decision is based, in part, on a comparison of an actual transmission rate with the target transfer rate. The decision is also optionally based on other data such as historical transmission rates, a frequency of dropped data packets, or a time since data was last removed from receive buffer 310. For example, if the actual transmission rate is higher than the target transfer rate, removal of data from receive buffer 310 is sometimes delayed such that the receive window size Wr 330 is decreased and the average transmission rate is thereby reduced. If data is not removed, after a delay step 870, then the method returns to step 860. If in step 860 rate control module 510 decides that data should be removed from receive buffer 310, a determined amount is read out in a step 880. In step 890, rate control module 510 determines whether there is more data to be removed from receive buffer 310. If there is more data, the method returns to step 860. If there is not more data, the method returns to step 830.
In another embodiment of the method of the invention, the removal of data in step 860 is replaced by or combined with a step of adjusting the size of receive buffer 310. Both of these actions are used to regulate the value of Wr 330 and thereby control the rate at which data is sent from sender 540 to receiver 520.
Embodiments of rate control module 510 adapted to function in accordance with communication protocols other than TCP that use buffered transmission, for example, those used in systems such as advanced digital telephone networks, video distribution networks, or for the delivery of streaming media, are within the scope of the invention. In these systems a much higher priority is optionally applied to live voice data transmissions than to other data. The transmission of non-voice data is controlled using methods of the invention in a manner that minimizes losses of the higher priority voice data. The invention provides significant advantages when real-time and non-real-time data share a network by providing a method that is able to reduce the impact of the non-real-time data on the transmission of the real-time data.
The invention is optionally used to optimize the scheduling of data transmissions among computer systems. For example, if many large files are to be transmitted, their transmission rate and sequence are optionally adjusted in response to priority criteria and schedules. Files with a lower priority are sent at a lower transmission rate so that files with a higher priority can utilize a higher proportion of available bandwidth. An embodiment of the invention includes a scheduling utility that delivers files to a plurality of receivers according to required delivery dates. The scheduling utility sequences and manages deliveries to have minimum impact on a receiver's other activities.
This application claims the priority benefit of U.S. Provisional Patent Application No. 60/257,651, entitled “System and Method For Rate Controlled Data Transfer Over A Network,” filed Dec. 22, 2000; and U.S. Provisional Patent Application No. 60/257,654, entitled “System and Method for Scheduling Data Transfers Through a Network,” filed Dec. 22, 2000, the disclosures of which are hereby incorporated by reference. This application is related to co-pending U.S. patent application Ser. No. 09/935,016, entitled “System and Method for Scheduling Data Transfers Through a Network,” filed Aug. 21, 2001 and co-pending U.S. patent application Ser. No. 09/852,464 entitled “System and Method for Automated and Optimized File Transfers Among Devices in a Network,” filed May 9, 2001.
| Number | Name | Date | Kind |
|---|---|---|---|
| 4642758 | Teng | Feb 1987 | A |
| 4644463 | Hotchkin et al. | Feb 1987 | A |
| 5523854 | Hornsby | Jun 1996 | A |
| 5557320 | Krebs | Sep 1996 | A |
| 5708960 | Kamisaka et al. | Jan 1998 | A |
| 5802292 | Mogul | Sep 1998 | A |
| 5886793 | Satou | Mar 1999 | A |
| 5920700 | Gordon et al. | Jul 1999 | A |
| 5920701 | Miller et al. | Jul 1999 | A |
| 5933413 | Merchant et al. | Aug 1999 | A |
| 5933603 | Vahalia et al. | Aug 1999 | A |
| 5978363 | Dimitrijevic et al. | Nov 1999 | A |
| 6003082 | Gampper et al. | Dec 1999 | A |
| 6026097 | Voois et al. | Feb 2000 | A |
| 6038216 | Packer | Mar 2000 | A |
| 6052710 | Saliba et al. | Apr 2000 | A |
| 6052734 | Ito et al. | Apr 2000 | A |
| 6070191 | Narendran et al. | May 2000 | A |
| 6085193 | Malkin et al. | Jul 2000 | A |
| 6098064 | Pirolli et al. | Aug 2000 | A |
| 6098096 | Tsirigotis et al. | Aug 2000 | A |
| 6134584 | Chang et al. | Oct 2000 | A |
| 6134596 | Bolosky et al. | Oct 2000 | A |
| 6141754 | Choy | Oct 2000 | A |
| 6208661 | Marshall | Mar 2001 | B1 |
| 6240460 | Mitsutake et al. | May 2001 | B1 |
| 6324570 | Tonchev et al. | Nov 2001 | B1 |
| 6332023 | Porter et al. | Dec 2001 | B1 |
| 6337850 | Nakano et al. | Jan 2002 | B1 |
| 6374336 | Peters et al. | Apr 2002 | B1 |
| 6397251 | Graf | May 2002 | B1 |
| 6427140 | Ginter et al. | Jul 2002 | B1 |
| 6560651 | Katz et al. | May 2003 | B1 |
| 6657954 | Bird et al. | Dec 2003 | B1 |
| 6715007 | Williams et al. | Mar 2004 | B1 |
| 6741563 | Packer | May 2004 | B1 |
| 6748447 | Basani et al. | Jun 2004 | B1 |
| 6754200 | Nishimura et al. | Jun 2004 | B1 |
| 6771603 | Yi et al. | Aug 2004 | B1 |
| 6775665 | Piersol | Aug 2004 | B1 |
| 6789108 | McMillan | Sep 2004 | B1 |
| 6798787 | Kikuchi et al. | Sep 2004 | B1 |
| 6813244 | He et al. | Nov 2004 | B1 |
| 6816798 | Pena-Nieves et al. | Nov 2004 | B1 |
| 6959327 | Vogl et al. | Oct 2005 | B1 |
| 6959451 | Ramanathan | Oct 2005 | B1 |
| 6975648 | Kikuchi et al. | Dec 2005 | B1 |
| 20020002618 | Vange | Jan 2002 | A1 |
| 20020083185 | Ruttenberg et al. | Jun 2002 | A1 |
| 20040022191 | Bernet et al. | Feb 2004 | A1 |
| 20040071085 | Shaham et al. | Apr 2004 | A1 |
| 20040114607 | Shay et al. | Jun 2004 | A1 |
| Number | Date | Country | |
|---|---|---|---|
| 20020080721 A1 | Jun 2002 | US |
| Number | Date | Country | |
|---|---|---|---|
| 60257651 | Dec 2000 | US | |
| 60257654 | Dec 2000 | US |