METHOD AND APPARATUS FOR MINIMIZING CONGESTION IN GATEWAYS

Description

FIELD OF THE INVENTION

The present invention generally relates to communication systems, and more particularly relates to gateways operating in communication systems and even more specifically toward congestion control systems for use in such gateways.

BACKGROUND OF THE INVENTION

Contemporary communication systems are designed in a tiered or layered arrangement typically rising from a physical layer to a link layer, then to a network layer above which is a transport layer then a middleware layer to ultimately an application layer; which is the layer users interface with when communicating through the network. Both wireline and wireless communication systems commonly move data in packets to minimize retransmission for dropped or corrupted data packets. A widely deployed wireline network utilizes the Transport Control Protocol (TCP)/Internet Protocol (IP) suite. TCP was originally designed for wireline networks, where packet losses are mostly caused by network congestions. The current TCP algorithm uses either a retransmission timer timing out, or receipt of three duplicate acknowledgements (ACKs) sent by receivers, to implicitly indicate data packet loss events.

However, networks with lossy links, such as radio frequency (RF) wireless networks, have a number of characteristics inherently different from wireline networks, for which the TCP/IP suite was originally designed. Notable among these differences is the transmission error measured by bit error rate (BER). Few errors per packet may be corrected by network layer encoding schemes, however, the network layer must also manage the congestion window (or buffer) of gateways at the physical layer to reduce dropped packets.

Since the original TCP protocol utilizes a packet loss as an indication of network congestion it can work against efficient data packet throughput when wireless networks are involved in a data packet flow. In a wireless network with lossy links, packet losses due to link errors are not caused by network congestions. Unfortunately, the current TCP protocol treats these losses as congestion losses, and in turn reduces the transmission speed, thus reducing communication throughput.

Unlike wireline TCP/IP networks, wireless links are characterized by high error rates. In most cases, packet losses due to corruption are more significant than congestion losses when a wireless link is involved in a TCP connection. In such a case, TCP may not be able to transmit or receive at the full available bandwidth, because the TCP algorithm will be unnecessarily reducing transmission speed in an attempt to avoid perceived congestion assumed to have been triggered by link errors. Consequently, the current congestion control algorithms in TCP result in very poor performance over wireless links.

Moreover, it is increasingly common for wireless networks to bridge onto classical wireline TCP/IP networks. The resulting patchwork of wireline and wireless networks may never operate at full performance if the TCP/IP algorithm operates to manage congestion over the wirelessly extended network. Also, wireline networks damaged by natural disasters or man made attacks can be rendered lossy and the legacy TCP algorithm will not be able to effectively control congestion.

Accordingly, it is desirable to have a congestion control algorithm for gateways operating in wireless, wireline or a combination wireless/wireline communication system that is able to differentiate and respond appropriately in the presence of congestion and corruption losses. The congestion control architecture should more optimally set the congestion window (or buffer) for gateways and be backward compatible with existing legacy networks. Furthermore, other desirable features and characteristics of the present invention will become apparent from the subsequent detailed description of the invention and the appended claims, taken in conjunction with the accompanying drawings and this background of the invention.

BRIEF SUMMARY OF THE INVENTION

An apparatus is provided for congestion management in gateways operating in a wireless, wireline or a combination wireless and wireline communication system. The apparatus can support legacy assumption based congestion systems or speculation based congestion systems (denoted SpecTCP) or the alternative combination thereof (for legacy backward compatibility). The congestion management system optimally resizes, or not, congestion window (or buffer) sizing for the communication gateways. Application of the inventive congestion management system optimizes data recovery and throughput in communication networks, particularly those networks having lossy data links.

A method is provided for congestion management in gateways operating in a wireless, wireline or a combination wireless and wireline communication system. The method can support legacy assumption based congestion systems or speculation based congestion systems (denoted SpecTCP) or the alternative combination thereof (for legacy backward compatibility). The congestion management method optimally resizes, or not, congestion window (or buffer) sizing for the communication gateways. Application of the inventive congestion management method optimizes data recovery and throughput in communication networks, particularly those networks having lossy data links.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will hereinafter be described in conjunction with the following drawing figures, wherein like numerals denote like elements, and

FIG. 1 is a block diagram of a communication node arranged within a communication system in accordance with the present invention;

FIG. 2 is a block diagram of the conditional Bernoulli predictor suitable for use with the present invention; and

FIG. 3 is a flow diagram of the SpecTCP algorithm suitable for use with the present invention.

FIG. 4 is a block diagram of RED gateways operating with data congestion.

DETAILED DESCRIPTION OF THE INVENTION

The following detailed description of the invention is merely exemplary in nature and is not intended to limit the invention or the application and uses of the invention. Furthermore, there is no intention to be bound by any theory presented in the preceding background of the invention or the following detailed description of the invention.

To overcome the detriments of legacy congestion control systems when applied to lossy (e.g., wireless) links characterized by high error rates, the present invention has modeled congestion window size for gateways to optimally size the congestion window to improve data throughput. The congestion management algorithum an system of the preferred embodiment is particularly useful when applied to speculative techniques (i.e., speculating on the outcome of branch predictions) for throughput improvements when lossy links are involved in TCP/IP connections. Thus, the present invention helps to eliminates the waste of bandwidth responding to link errors.

Referring to FIG. 1, a single communication node 10 in accordance with an embodiment of the present invention is illustrated in block diagram form. As can be seen, the communication node 10 of FIG. 1 is arranged in layers (three layers shown); a network layer 12 that interfaces with a link layer (not shown) interfaced with gateways, transmitters, receivers or transceivers, a transport layer 14 and a middleware layer 16. Those of ordinary skill in the art will appreciate that above the middleware layer 16, one or more applications would be running as desired by users of the communication system. The communication node 10 of FIG. 1 is particularly useful in wireless communication systems (e.g., radio frequency, infrared and wireless LAN) 11 alone or in combination with a wireline communication system 13, but also finds utility in wireline networks, particularly if a wireline network suffers damage by natural disasters or man made attacks. Such damaged wireline networds can be rendered lossy and the legacy TCP algorithm will not be able to effectively control congestion. Such communication systems may be terrestrial, maritime, avionic or space based communications systems.

In accordance with the preferred embodiment of the present invention, enhancements to the network layer operate to optimally control the congestion window size for gateways at the physical layer (not shown). In one preferred embodiment of the present invention, the middleware layer 16 and the network layer 12 provide control functions and supporting parameters to the transport layer 14. At the transport layer 14, a conditional Bernoulli loss predictor 18 provides inputs to a congestion control and loss recovery module 20. Congestion control and loss recovery module 20 utilizes a speculation based algorithm according to a preferred embodiment of the present invention. Together, the conditional Bernoulli loss predictor 18 and the congestion control and loss recovery module 20 combine to provide SpecTCP congestion control. Alternatively, a preferred embodiment of the present invention could operate in a wireline communication system or a combination wirelss/wireline communication system.

At the network layer 12, a condition engine 22 provides required information for the conditional Bernoulli predictor 18. The condition engine 22 includes mathematical models of explicit congestion notification (ECN) capable random early drop (RED) gateways that produce networking parameters to minimize congestion losses at RED gateways, thus maximizing the accuracy of the conditional Bernoulli loss predictor 18 at the transport layer. To most effectively adjust RED gateway parameters so that congestion losses can be minimized, the present invention utilized mathematical models to dimension the buffer size inside a RED gateway. As dimensioned by the present invention, the RED gateway buffer sizes are much smaller than previously suggested values thus providing a significant contribution to the network performance improvement.

Assuming that both assumption based and speculation based congestion control are used, the middleware layer 16 has a congestion control manager 24 to select and control the execution of congestion control schemes at the transport layer 14. Based on the global knowledge of a network, the congestion control manager 24 makes and executes the decision whether the communication system should run an assumption based congestion control algorithm of the legacy assumption based congestion controller 26 (i.e., TCP/IP) or SpecTPC (the speculation based congestion control algorithm of the conditional Bernoulli loss predictor 18 and congestion control and loss recovery module 20). Moreover, the congestion control manager 24 at the middleware layer 16 functions as a bridge to ensure that end users with the communication system of the present invention can communicate with users of legacy networks. This is achieved by the ability of switching between the legacy assumption based congestion controller 26 and SpecTCP based upon the global ECN compatibility information obtained through the congestion control manager 24. Alternately, if only one type of congestion control system is used, the present invention within the network layer 12 would communicate with the congestion control system employed.

FIG. 2 illustrates the operation of the conditional Bernoulli predictor 18 within the communication node 10 in block diagram form. In order to improve the TCP throughput over lossy links, a conditional Bernoulli loss predictor 18 can be used to predict the type of loss event, (i.e., congestion loss or link corruption loss), and make the TCP congestion window (or buffer) respond accordingly. Predictive speculation techniques have been used in computer architecture to eliminate potential clock-cycle penalties caused by branch hazards. However, unlike the speculation techniques used in processor design, if the speculations are incorrect, there is no way to undo or flush execution results for the case of TCP congestion control. This is because execution results of instructions in a processor are values of pre-designed calculations, while execution results of the TCP congestion control algorithm are changes (increments or decrements) of the TCP transmission speed. To improve the speculation accuracy the present invention minimizes the probability of congestion losses by optimally dimensioning the buffer of the ECN capable RED gateways in the network. When a RED buffer is optimally dimensioned and the thresholds appropriately set, the probability of congestion losses can be minimized by appropriately adjusting the sender's congestion window (or buffer) size based on feedback from ECN signals.

As illustrated in FIG. 2, the condition engine 22 at the network layer 12 uses congestion loss minimization models at RED gateways to minimize network congestion losses. The output of condition engine 22 is employed so that congestion loss events are minimized. Based on the accuracy of this result, the conditional Bernoulli loss predictor 18 at the transport layer 14 produces results by setting the Bernoulli probability P for predicting link corruption losses, and accordingly the probability of predicting congestion losses is 1−P. If the condition engine 22 is optimized according to the present invention (i.e., congestion losses are minimized), the conditional Bernoulli loss predictor 18 at the protocol layer 14 sets P=1, resulting in predicting incoming loss events as link corruption losses.

To develop the congestion loss minimization models at RED gateways, the inventors derived expressions for the maximum buffer size and the maximum threshold of a RED gateway to minimize congestion packet losses. The minimization of congestion losses significantly improves the accuracy of speculating that loss events are due to link corruptions. This indicates that the condition engine within conditional Bernoulli predictor is optimized. Therefore, it is reasonable to set P=1 (i.e., to predict all incoming loss events are caused by link errors). As is known, ECN-capable RED gateways use an exponential weighted moving average to calculate an average queue size from the instantaneous queue size, and two thresholds (minimum and maximum), to determine whether an arriving packet should be dropped. If the average queue size is greater than the maximum threshold, the packet is dropped. If the average queue size is between the minimum and the maximum thresholds, the packet is marked with a probability as a Congestion Experienced (CE) packet. Packet losses due to the average queue size exceeding the maximum threshold at a RED gateway degrade TCP performance.

Those skilled in the art will be able to consider a typical model consisting of two RED gateways fed by multiple sources. As is known, the link connecting two RED gateways is the bottleneck link which causes congestion. The sources, destinations and the RED gateways use ECN for end-to-end congestion control.

The following notations will be used in the discussion of the inventive model in accordance with the present invention:

Q(t);Q(t)_max: Instantaneous and maximum instantaneous queue sizes respectively at the RED gateway at time t.

Q, Q_max: Average and maximum average queue sizes respectively at the RED gateway.

w: Weighting factor for calculating Q.

(t): Marking probability at the RED gateway at time t.

min_th, max_th: Minimum and maximum thresholds respectively of a RED gateway.

m: total number of TCP flows.

Wi(t): Window size of the i^thTCP flow at time t, t, ≧0, i=1; . . . , m.

SSthresh_i: Slow Start threshold for the i^thTCP flow, i=1. . . , m.

ri: Round Trip Time (RTT) for the i^thTCP flow, i=1; . . . , m. ri is replaced by r when all the RTTs are same.

μ: Average share of bottleneck link bandwidth of the i^thTCP flow, i=1; . . . , m.

μ: Bandwidth of bottleneck link which is given by μ=Σ_i=1^mμ_i.

T[1]: Waiting time for the first marking event after the average queue size exceeds min_th.

β_i: Number of window size increases during time T[1] for the i_thTCP flow, i=1; . . . , m.

τ: Propagation delay from source i to the RED gateway, i=1; . . . , m.

t₀: Time when the first packet is marked at the RED gateway.

t₁: Time when the last packet, which was sent just before the first window size reduction, arrives at the RED gateway.

Packet drops at an ECN-capable RED gateway are either due to buffer overflows (Q(t) is equal to the buffer size) or Q>max_th. The congestion window (or buffer) size during the slow start phase increases very quickly. The average queue size (being the output of a low pass filter) of a RED gateway can not follow the quick change of Q(t); as a result Q stays less than min_th. Therefore, Q(t) reaches the maximum value when the packet leaving the source at t−τ_ireaches the RED buffer. When this packet left the source, W_i(t−τ_i)=SSthresh_ifor i=1; 2 . . . , m; the queue size is smaller when the sources are in congestion avoidance. For m TCP flows, Q(t)_maxcan be expressed as the output of a system with processing capacity of Σ_i=1^mr_iμ_iand the maximum input rate when sources reach their slow start threshold.

Thus:

$\begin{matrix} {Q (t)}_{\max} = \sum_{i = 1}^{m} (Wi (t - τ_{i}) - r_{i} μ_{i}) = \sum_{i = 1}^{m} ({SSth}_{i} - r_{i} μ_{i}) . & (Eq 1) \end{matrix}$

According to one embodiment of the present invention, this is the buffer size used to minimize packet loss at the RED gateway.

Turning now to the derivation of the maximum average queue size, it is known that the recommended max_th=3×min_th. When the average queue size is in the steady-state condition (during which the sources are in the congestion avoidance phase), the instantaneous queue size at time to is: Q(t₀)=min_th+Σ_i=1^mβ_i. Since the difference between t₀and t₁is one RTT, and the window size of a source is increased by one per RTT during the congestion avoidance phase, the instantaneous queue size at time t₁can be expressed as: Q(t₁)=min_th+Σ_i=1^m(β_i1). The average queue size is estimated using an exponential weighted moving average as shown in Eq 1 above. If time is discretized into time slots with each slot being equal to one RTT, the RED's average queue size estimation algorithm at the k-th slot can be expressed as: Q[k+1]=(1−w)Q[k]+Q[k]w. In practice, w is very small, and the congestion window (or buffer) size increases by one every RTT during the congestion avoidance phase. Therefore, before the first marking event happens (i.e., no congestion control) it is reasonable to consider both the instantaneous queue size and the average queue size to be constant within a very short time period. Thus, by using Q(t₁) (slot k is equal to t₁in time) the derivation above and assuming that the average queue sizes during the two previous consecutive time slots are the same, the average queue size estimated at time t, can be solved iteratively, which is: Q_max=Q=min_th+Σ_i−1^m(β_i+w) (Eq 2). The first marking event is followed by many random ECN marking events, which make TCP sources adjust their congestion window sizes. The average queue size stays at a certain level smaller than the average queue size at time t₁. Therefore, Eq 2 gives the maximum average queue size for minimizing packet losses and represents the value of max_thaccording to the present invention.

With the buffer size and maximum average queue size determined, consider again FIG. 2. The conditional Bernoulli loss predictor 18 is enabled or disabled by the congestion control manager 24 at the middleware layer 16. For network backward compatibility issues (i.e., some network components may not be ECN capable), the congestion control manager 24 may choose to disable the conditional Bernoulli loss prediction function and switch to the legacy assumption based congestion controller 26. This function is especially useful for integrating networking equipment in accordance with the present invention with existing network infrastructure.

FIG. 3 illustrates the algorithm utilized of SpecTCP (the combination of the conditional Bernoulli loss predictor 18 and congestion control and loss recovery module 20). The algorithm 30 treats the retransmission timer timing out and/or three duplicate acknowledgements as an indication of link errors 32. This provides speculation that the losses are due to link corruption. Most often, this is the case in a network with wireless links (data packet losses due to link errors). In this case, a preferred embodiment of the present invention does not decrease (either not substantially or at all) the congestion window (or buffer) of the RED gateways 34. This assumes that no ECN ECHO packets were received 36 which would indicate that, potentially, there were no congestion data packet losses. If an incorrect speculation/prediction was made (i.e., an ECN ECHO packet was received) then a fast recovery scheme 38 is executed as is known in contemporary TCP algorithms.

In the algorithm 30 in SpecTCP, the congestion window size is appropriately controlled in the presence of either network congestion or corruption. In the preferred embodiment of the present invention, the congestion window is halved with the fast recovery algorithm when there is network congestion (as evidenced by ECN EHCO packets 36) and the congestion window size persists (remains substantially unchanged) at the previous value in the presence of corruption. Alternately, adjusting the congestion window (or buffer) size in the range of 40 percent to 80 percent may also be done, however, about a 50 percent reduction is one preferred embodiment. As will be appreciated by those skilled in the art, the ECN mechanism will be most effective if it is used with active queue management such as that found in contemporary RED gateways. In active queue management, when a buffer reaches a certain threshold, the RED gateway will send a CE packet to the TCP receiver before the buffer overflows. Therefore, packet drops due to congestion happen only after the RED gateway has sent CE packets. By optimally dimensioning the congestion window (or buffer) size per the present invention, and using the maximum threshold for the active queue management for ECN capable RED gateways, the accuracy of the conditional Bernoulli loss predictor 18 can be optimized.

In FIG. 4 illustrates a RED gateway 46 fed by multiple TCP sources. Source 1 (S1) 43 and source 2 (S2) 44 both feed 10 Mb of data in router 1 (R1) 46, while router 2 (R2) is capable of delivering 10 Mb of data to both destionations (D1) 50 and (D2) 52. However, the link 54 between R1 and R2 is a bottleneck only moving data at 1.5 Mb. According to a preferred embodiment of the present invention, setting the maximum buffer size 56 for the congestion window 58 of R1 is set according to:

${Q (t)}_{\max} = \sum_{i = 1}^{m} (Wi (t - τ_{i}) - r_{i} μ_{i}) = \sum_{i = 1}^{m} ({SSth}_{i} - r_{i} μ_{i}),$

while the maximum threshold size 60 is set according to: Q_max=Q=min_th+Σ_i−1^m(β_i+w) will minimize congestion as well as result in better prediction should the present invention be utilized in prediction based congestion management systems.

Returning again to FIG. 1, the overall operation of the present invention can be described. First, a loss is detected 40 and the congestion control manager 24 is notified. Loss events are detected by either retransmission timer timing out or three duplicated acknowledgements at the transport layer. Using the global knowledge of the network, such as whether or not the receiver is ECN compatible, the congestion control manager will issue commands about which loss treatment algorithm should be executed. If both assumption based and speculation based congestion control methods are used, and if the receiver and other nodes along the way to the receiver are ECN capable, then the SpecTCP (speculative) algorithm and the conditional Bernoulli loss predictor 18 is applied. Otherwise the legacy assumption based congestion controller 26 is executed. After the congestion control manager 24 chooses the appropriate loss treatment scheme, the selected loss treatment algorithm generates and supplies appropriate congestion window sizing and recovery commands 42 to the congestion control and loss recovery module 20. The congestion control and loss recover module 20 then executes congestion window resizing (or substantially maintaining the current size) in accordance with the loss treatment selected by the congestion control manager 24. In this way, lost or dropped data packets can be most effectively retransmitted. Alternatively, if only one type of congestion control is employed, the present invention would communicate with the sole congestion control algorithm.

As will be appreciated by those skilled in the art, the present invention is effective in improving network throughput over lossy links. The inventive communication system is able to optimally adjusting congestion window (or buffer) sizes in gateways. Also, when used in combination with a speculation based congestion system, the message sender does not have to waste time and bandwidth (i.e., congestion window size backoff) waiting for implicit network information about the losses. Therefore, network throughput effectively is enhanced.

Another benefit of the present invention is that it does not starve other competing data flows. Under the normal working condition, no matter which congestion control algorithm is applied, all users are controlled by congestion window evolutions, which was designed to reduce unfairness. Unfairness in data flow is more likely to happen during failure modes of the system. For example, if the communication system incorrectly speculated a congestion loss as a link corruption loss, and ECN packets used to indicate congestions were lost, the system would not decrease the sender's congestion window size (but it should), which could result in starvation of other competing legacy TCP data flow. To improve the ECN algorithm reliability in one embodiment of the present invention, ECN packets are transmitted continuously until the sender acknowledges that ECN packets are received.

While at least one exemplary embodiment has been presented in the foregoing detailed description of the invention, it should be appreciated that a vast number of variations exist. It should also be appreciated that the exemplary embodiment or exemplary embodiments are only examples, and are not intended to limit the scope, applicability, or configuration of the invention in any way. Rather, the foregoing detailed description will provide those skilled in the art with a convenient road map for implementing an exemplary embodiment of the invention. It being understood that various changes may be made in the function and arrangement of elements described in an exemplary embodiment without departing from the scope of the invention as set forth in the appended claims.

Claims

1. A method for controlling congestion in a communication system, comprising: setting congestion buffer parameters of at least one gateways including at least setting the maximum congestion buffer size and maximum threshold based upon mathematical models of the gateways in the presence of network congestion.
2. The method of claim 1 wherein the step of setting the maximum congestion buffer size comprises:
3. The method of claim 1 wherein the step of setting the maximum threshold comprises: Qmax=Q=minth+Σi=1m (βi+w).
4. The method of claim 1 which includes the step of the gateway transmitting data in a wireline network.
5. The method of claim 1 which includes the step of the gateway transmitting data in a wireless network
6. The method of claim 1 which includes the step of the gateway transmitting data in a combination wireless/wireline network
7. The method of claim 1 wherein the steps are performed within an ECN compatible RED gateway.
8. A congestion control system for use in a communication system, comprising: a network layer having a condition engine for generating network parameters including maximum congestion buffer size and maximum threshold; andmeans for adjusting a maximum threshold in the presence of network congestion in accordance with: Qmax=Q=minth+Σi=1m (βi+w), and for adjusting a maximum congestion buffer size in the presence of network congestion in accordance with:
9. The system of claim 8, including a middleware layer for controlling whether a speculation based congestion algorithum will be inactivated in favor of using an assumption based congestion algorithm.
10. The system of claim 8, wherein the adjusting means supports a speculation based congestion algorithm.
11. The system of claim 8, including a conditional Bernoulli loss predictor for predicting whether a lost data packet was due to network congestion or data corruption.
12. The system of claim 8, which includes an assumption based congestion controller utilizes a TCP congestion management algorithm.
13. The system of claim 8, which includes a network layer in communication with wireless communication equipment.
14. The system of claim 8, which includes a network layer in communication with wireline communication equipment.
15. The system of claim 8, which includes a network layer in communication with wireless and wireline communication equipment.
16. A method for optimizing data throughput in a communication system having ECN compatible RED gateways, comprising the steps of: (a) detecting a data packet loss;(b) determining whether to utilize a legacy congestion controller or SpecTCP to generate instructions for recovery of the lost data packet including generating instructions for sizing a congestion buffer in the ECN compatible RED gateways;(c) responsive to step (b), controlling whether and to what extent the congestion buffer size is changed.
17. The method of claim 16, wherein the determination of step (c) includes reducing the congestion buffer sizing by one-half when a conditional Bernoulli loss predictor determines that the data packet was lost due to network congestion.
18. The method of claim 16, wherein the determination of step (c) includes not changing the congestion buffer size when a conditional Bernoulli loss predictor determines that the data packet was lost due to data corruption.

METHOD AND APPARATUS FOR MINIMIZING CONGESTION IN GATEWAYS

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

US Classifications

International Classifications

Abstract

Description

Claims