System and method for router queue and congestion management

Information

  • Patent Grant
  • 7813365
  • Patent Number
    7,813,365
  • Date Filed
    Monday, November 14, 2005
    19 years ago
  • Date Issued
    Tuesday, October 12, 2010
    14 years ago
Abstract
In a multi-QOS level queuing structure, packet payload pointers are stored in multiple queues and packet payloads in a common memory pool. Algorithms control the drop probability of packets entering the queuing structure. Instantaneous drop probabilities are obtained by comparing measured instantaneous queue size with calculated minimum and maximum queue sizes. Non-utilized common memory space is allocated simultaneously to all queues. Time averaged drop probabilities follow a traditional Weighted Random Early Discard mechanism. Algorithms are adapted to a multi-level QOS structure, floating point format, and hardware implementation. Packet flow from a router egress queuing structure into a single egress port tributary is controlled by an arbitration algorithm using a rate metering mechanism. The queuing structure is replicated for each egress tributary in the router system.
Description
TECHNICAL FIELD

This application relates to the field of optical communication networks, and particularly to large-scale routers for optical communication networks.


BACKGROUND

A router system has a number of ingress ports that feed information, e.g., data packets to a switching fabric. The switching fabric then routes the information to egress routing ports. In such a system, typically the switching fabric on a per port basis has more bandwidth going into and out of the switch than is actually needed by the ports, such that typically there is more bandwidth capability feeding into an egress port than is feeding out of the egress port. Under these circumstances, queuing within that egress port can become very congested. The queues can fill up, and an intelligent mechanism is needed in order to manage those queues such that traffic is not dropped indiscriminately. In a system having multiple Quality of Service (QOS) levels, particular attention must be paid such that each level has its dedicated queuing space. Yet when the overall queuing space is not heavily utilized, it is desirable that the remaining queues can grow and use more space than they normally would be allotted.


SUMMARY OF THE INVENTION

The present invention is directed to a system and method for managing both instantaneous and time averaged queue congestion in a multi-QOS level queuing structure having a common shared memory pool. Packet payload pointers are stored in multiple queues, each of which represents a distinct QOS priority level, and packet information payloads are stored in the common memory pool. Instantaneously, if an incoming packet causes one of the QOS levels to reach its allocated memory space limitation, then it must be determined whether to discard the packet or not. Algorithms control the drop probability of packets entering the queuing structure.


The instantaneous management algorithm determines the total amount of shared memory space in bytes and monitors the instantaneous actual sizes of the individual queues. The algorithm dynamically calculates a minimum and a maximum queue size for each queue, using the percent utilization of the common memory pool. Any non-utilized common memory space is allocated simultaneously to all of the queues sharing the common memory pool, advantageously providing all queues more than their allocated memory space. A drop probability is calculated from a comparison of actual instantaneous queue size with minimum and maximum queue sizes.


The time averaged congestion management follows a traditional Weighted Random Early Discard (WRED) mechanism. However, both instantaneous and weighted time averaged congestion management are adapted to a multi-level QOS structure as well as to floating point calculations as opposed to a complex multi-dimensional table driven mechanism. In some embodiments the algorithms are implemented in hardware.


Particularly, the present invention is directed to an egress queuing system in a router, such that packets flow out of the queuing structure into a single tributary of a router egress port. The packet flow out of the queuing system is controlled on a QOS priority basis by an egress tributary arbitration algorithm using a rate metering mechanism. Packets are received into the queuing structure from an egress reassembly logic. The queuing structure is replicated for each egress tributary in the router system.


When the queues fill up near their capacity, the present mechanism manages those queues, such that lower priority traffic is dropped more frequently than higher priority traffic, but at the same time lower priority traffic is not completely dropped but rather is only relegated to a smaller percentage of the overall space within that memory system.


Various aspects of the invention are described in co-pending and commonly assigned U.S. application Ser. No. 09/703,057, entitled “System And Method For IP Router With an Optical Core,” co-pending and commonly assigned U.S. application Ser. No. entitled “System and Method for Router Central Arbitration,” co-pending and commonly assigned U.S. application Ser. No. 09/703,038, entitled “System and Method for Router Data Aggregation and Delivery,” co-pending and commonly assigned U.S. application Ser. No. 09/702,958, entitled “Timing and Synchronization for an IP Router Using an Optical Switch,” co-pending and commonly assigned U.S. application Ser. No. 09/703,027, entitled “Router Network Protection Using Multiple Facility Interfaces,” co-pending and commonly assigned U.S. application Ser. No. 09/703,043, entitled “Router Line Card Protection Using One-for-N Redundancy” and co-pending and commonly assigned U.S. application Ser. No. 09/703,064, entitled “Router Switch Fabric Protection Using Forward Error Correction,” all filed Oct. 31, 2000, the disclosures of which are incorporated herein by reference.


The foregoing has outlined rather broadly the features and technical advantages of the present invention in order that the detailed description of the invention that follows may be better understood. Additional features and advantages of the invention will be described hereinafter which form the subject of the claims of the invention. It should be appreciated by those skilled in the art that the conception and specific embodiment disclosed may be readily utilized as a basis for modifying or designing other structures for carrying out the same purposes of the present invention. It should also be realized by those skilled in the art that such equivalent constructions do not depart from the spirit and scope of the invention as set forth in the appended claims. The novel features which are believed to be characteristic of the invention, both as to its organization and method of operation, together with further objects and advantages will be better understood from the following description when considered in connection with the accompanying figures. It is to be expressly understood, however, that each of the figures is provided for the purpose of illustration and description only and is not intended as a definition of the limits of the present invention.





BRIEF DESCRIPTION OF THE DRAWING

For a more complete understanding of the present invention, reference is now made to the following descriptions taken in conjunction with the accompanying drawing, in which:



FIG. 1 is a schematic diagram illustrating an egress queuing structure;



FIGS. 2A-2D are graphic representations of drop probability curves for the instantaneous management function of the Queue Congestion Management Block of FIG. 1;



FIGS. 3A-3D are graphic representations showing how the drop probabilities are modified when the total of the four QOS levels now consume only 80% of the common memory space;



FIGS. 4A-4D are graphic representations of calculated drop probability when total common memory space of 50% is utilized; and



FIGS. 5A-5D are graphic representations of weighted random early discard probability curves for each of four QOS levels.





DETAILED DESCRIPTION


FIG. 1 is a schematic diagram illustrating an egress queuing structure 10. Packets arrive from egress reassembly logic (not shown) within an egress ASIC through link 101. Prior to the arrival of these packets, a router switching fabric has supplied chunks of information to the egress ASIC, which has separated the individual packets out of the chunks and reassembled the larger packets, such that the packets received through link 101 are reassembled complete packets. Those packets enter Queue Congestion Manager and Weighted Random Early Discard (WRED) Block 12.


Queue Congestion Manager Block 12 determines whether packets should be forwarded to the actual queuing in the egress structure or whether packets should be dropped due to congestion. There are instantaneous congestion issues as well as time averaged congestion issues. Packets emerge out of Queue Congestion Manager Block 12 through four separate links 102-0 through 102-3, each of which carries packets destined for a particular quality of service (QOS) level queue 14-0 through 14-3. Each of queues 14-0 through 14-3 stores individual packet payload pointers; however, the actual packet data for all four queues is stored in a common shared memory pool 13.


An egress tributary arbitration block 16, responsible for determining which packet is next to go out through a link 104, gives selection information to a selection MUX 15, which selects one of the four egress queues 14-0 through 14-3 from which to remove information through links 103-0 through 103-3.


Each egress port can have one or multiple egress tributaries associated with it. For example an OC192c port, which is a 10 gigabit per second (Gb/s) port, can have one tributary feeding that egress port, which would consume all 10 Gb/s of bandwidth. On the other hand, smaller bandwidth tributaries feeding that egress port, for example, four tributaries each having 2.5 Gb/s of bandwidth, would then equal the overall port bandwidth of 10 Gb/s.


In FIG. 1, link 104 represents the output of a single tributary going out of queuing structure 10. In an output port structure, if there are multiple tributaries, then entire queuing structure 10 would be replicated on each of the tributaries. On link 101 packets would come from a common reassembly logic, and packets designated to each individual tributary would be directed toward the appropriate queuing structure for that tributary. Thus queuing structure 10 would be replicated as many times as the number of tributaries for that particular port. However, the queues from all tributaries share a common memory pool.


A certain bandwidth is pre-allocated to each of the four QOS levels that exist in egress queuing structure 10. An arbitration algorithm controls the flow of data from each QOS level QOS-0 through QOS-3, such that each QOS level receives its fair share of the bandwidth going out of the egress port. However, if there is underutilized bandwidth from any of the QOS levels, which will result in excess bandwidth capacity for that port, then if there are any queues that have packets available, even though these queues have exhausted their current bandwidth allocation, then those queues are allowed to send additional packets out through the egress port.


A rate metering mechanism operates such that periodically, tokens are put into a counter that can for example be 16 or 32 bits in size. Every so many clock periods, a certain number of tokens or count is added to the rate meter. Then, as packets are selected from the queue associated with that rate meter, a certain number of tokens are removed from the rate meter by deducting a quantity associated with the length of the packet, such that the tokens are then associated with a certain amount of bytes out of the packet, or a certain bandwidth through the tributary. Thus, periodically the QOS rate meter is replenished, and then, as packets are extracted from the queue, tokens are deducted from the rate meter.


For example, if a token is equivalent to a byte, then for a 40 byte packet, 40 tokens are deducted from that rate meter. Similarly, the number of tokens added to the rate meter on a periodic basis is equivalent to the amount of bandwidth in bytes per time unit, allocated for that particular QOS level. The rate meter has a maximum number of tokens as well as a minimum number. For example, the minimum number may be 0, in which case if packets are removed, such that the rate meter would go below 0, it may truncate to keep the token level at a value 0. Similarly a rate meter is not allowed to continue to increase its number of tokens indefinitely if no packets are going out.


In the present implementation an egress tributary arbitration algorithm applies a set of code, namely


(1.0) // find next packet to send, first check token buckets which are positive

















for (qos=0, pkt=0; !pkt&&qos<MAX_QOS_LEVELS; qos+=1)



if (qosMeter[qos].tokenCnt (trib) >0 & &



(pkt=txQueue.unlink_first(trib,qos)))



{qosMeter[qos].decTokenCnt (trib, pkt->posSize( ))}










(1.1) // next check token buckets which are negative

















for (qos=0; !pkt&&qos < MAX_QOS_LEVELS; qos+=1)



if (qosMeter [qos].tokenCnt(trib)<=



0&&(pkt=txQueue.unlink_first(trib,qos)))



{qosMeter[qos].decTokenCnt(trib, pkt->posSize( ))}










Section 1.0 of the arbitration algorithm indexes through each of the four QOS levels and determines if there is a packet in that level and if so, if there are positive rate metering tokens for that QOS level. Thus, if there is a packet available and there are positive QOS tokens available, a packet is extracted from that queue. If there is no packet ready, or if the token meter is negative, then the algorithm proceeds to the next QOS level and repeats the same procedure. This procedure repeats sequentially through all four QOS levels.


Next in Section 1.1 the algorithm cycles through all four QOS levels again to determine whether there are packets ready to go on meters that are negative, i.e., that have already exhausted their quota of bandwidth. Again the algorithm indexes from the highest priority QOS level 0 to lowest priority QOS level 3. The same algorithm could be applied to any number of QOS levels.



FIGS. 2A-2D are graphic representations of drop probability curves for the instantaneous management function of Queue Congestion Management Block 12 of FIG. 1. When a packet is ready to be assigned to a queue, these drop probabilities are examined to determine if the packet should be placed in the queue or dropped. Drop probability curves FIGS. 2A-2D show drop probabilities calculated when the common memory pool is completely or very nearly full. FIG. 2A shows that for QOS level 0 queue 14-0 on Curve 201, the drop probability starts out at 0 and stays at 0 until a threshold utilization 202, which occurs at 10% queue utilization. The drop probability then increases to unity at roughly 15% queue utilization and remains there for all higher values of queue utilization. This limits the amount of the common memory space that is available for QOS level 0 to 10% before any packet drops start occurring. The slope between point 202 and 203 is such that a small amount of traffic will be dropped initially, if the amount of space is used in excess of what is allocated to it, but if utilization increases beyond an additional 5%, then all subsequent packets will be dropped.


In FIG. 2B, curve 204 for QOS 1 queue 14-1 starts at a threshold of 20% under an assumption that the QOS level 1 is allotted twice as much bandwidth as is QOS level 0. Similarly QOS levels 2 and 3 are allotted 30% and 40% respectively, as illustrated in FIGS. 2C and 2D. Software can configure how much bandwidth each of the QOS levels can consume at the egress port of the router by allocating the number of tokens that are put into the rate meter. The respective bandwidth allocations are reflected in the drop probability curves of FIGS. 2A-2D to allocate space on a per QOS level that is consistent with the amount of bandwidth that is able to be consumed by that particular QOS level.


When a queue is full, it is important that each of the QOS levels are allocated their memory space, such that the total of the four memory space utilizations equal 100%. Thus the threshold utilization of 10% on curve 201 of FIG. 2A plus the 20% on FIG. 2B, 30% on FIG. 2C, and 40% on FIG. 2D, add to 100%, such that the entire common memory space is utilized fully, but no individual QOS level consumes more than its allocation.



FIGS. 3A-3D are graphic representations showing how the drop probabilities are modified when the total of the four QOS levels now consume only 80% of the common memory space. The 20% excess bandwidth can then be used by any QOS level that needs it. In FIGS. 3A-3D, it is shown that 20% overall excess bandwidth capacity is added to each of QOS levels 0, 1, 2, and 3 illustrated respectively in FIGS. 3A, 3B, 3C and 3D.


For example in FIG. 3A for QOS level 0, the packet drop threshold 302 occurs at 302 utilization, whereas in FIG. 2A the threshold occurs at 10% utilization, which is a 20% lower utilization than in FIG. 3A. The 20% excess available bandwidth is given simultaneously to each of the four QOS levels, such that QOS level 0, instead of being 10% is now 30%, QOS 1 in FIG. 3B, instead of being 20% is 40%, QOS 2 in FIG. 3C instead of being 30% is 50%, and QOS 3 in FIG. 3D instead of being 40% is now 60%. Thus, any of the four QOS levels could consume 20% more than its allocated memory space, which is acceptable because the common memory pool is only 80% full. As the common memory pool becomes less full, each of the four QOS levels is given a steadily increasing allocation. If at any point in time in FIGS. 3A-3D overall utilization were to exceed the 80% full point, then each of these curves would move back to the left, gradually allocating a correspondingly smaller memory space for each individual queue, preventing any single queue from excessively consuming the shared memory space. Thus each of the queues can have its full memory space allocation, but if any queue does not use all of its allocation, then any of the other queues can use that memory space. This is an example of the flexible algorithm described below that actually implements this function in hardware.



FIGS. 4A-4D are graphic representations of calculated drop probabilities when the total common memory space is 50% utilized. In FIG. 4A, instead of 10% bandwidth allocated to QOS level 0, 50% is added, resulting in 60% available for QOS level 0. Similarly for QOS levels 1, 2, and 3, 50% is added to each of their respective allocated bandwidths. This implies that overall only 50% of the common memory space is utilized and that in each of these curves the actual individual queue utilization is less than the initial threshold, for example point 402 in FIG. 4A, such that no queue at this point is being overconsumed and such that no packets are being dropped. However, if QOS level 0 actually started to consume 65% queue utilization, some percentage of the packets would be dropped even though overall common memory utilization is only 50%.


A Queue Congestion Management algorithm implemented in hardware in Queue Congestion Manager and WRED block 12 controls instantaneous queue space congestion management for one particular tributary, as follows:














// Physical queue limits








(2.1)
memByteCnt = txQueue.totalByteCnt( );


(2.2)
queByteCnt = txQueue.queByte.Cnt(pkt->dstTrib( ),pkt->qos( ));


(2.3)
minSize = config.egressMinSize(pkt->qos( )) − memByteCnt*



  config.egressMinSlope(pkt-<qos( ));


(2.4)
maxSize = config.egressMaxSize(pkt->qos( )) − memByteCnt*



  config.egressMaxSlope(pkt->qos( ));


(2.5)
if (queByteCnt < minSize)



  dropProb=0;


(2.6)
else if (queByteCnt < maxSize)



  dropProb = (queByteCnt − minSize)/(maxSize − minSize);


(2.7)
else dropProb = 1;









The hardware monitors the instantaneous actual sizes of the queue utilizations to make these calculations. The total amount of memory space in bytes is stored in a hardware register memByteCnt, as shown in Equation 2.1. Every time a packet is assigned to any of the queues or extracted from a queue, a queue byte count hardware register queByteCnt is updated to reflect the total amount of bytes in each of the four individual QOS queues for that tributary, as shown in Equation 2.2. The threshold intercept point, for example point 202 in FIG. 2A, is represented by a configuration value config.egressMinSize, which is stored in a hardware CSR and is indexed by QOS level. Point 202 is the allocated minimum threshold of space allocation for QOS level 0. Regardless of whether the queue is full, that amount of space is always available to that particular QOS level. However, as the total common memory utilization is decreased, point 202 moves to the right, as illustrated in FIGS. 3A and 4A, to allocate more space for an individual queue.


In Equation 2.3 the minimum queue size MinSize is calculated by representing the slope in curve 201 by another configuration value config.egressMinSlope, which is then multiplied by the memory byte count memByteCnt to modify the config.egressMinSize value. Equation 2.4 calculates the maximum size maxSize starting with a configured value config.egressMaxSize, which is the value to be used when the queue is fully consumed and is indexed by the appropriate QOS level. That value is then modified by config.egressMaxSlope multiplied by the memory byte count memByteCnt to displace that value to the right as the common memory utilization decreases.


Next a comparison of the actual queue size for that queue is performed to compute the drop probability. In Equation 2.5 if the actual queue size is less than the calculated minimum size, then the drop probability is set to zero. If the actual queue size is between the maximum and minimum size according for example to curve 201, then the drop probability is calculated by using the slope of the curve, namely queByteCnt less minSize divided by the quantity maxSize minus minSize, as shown in Equation 2.6. This calculates a drop probability along curve 201 for example from point 202 to point 203. Finally in Equation 2.7, if the queue size is greater than the value at point 203, then the drop probability is set to unity. Queue Congestion Manager and WRED block 12 of FIG. 1 calculates this drop probability and also calculates a random number between the value of 0 and 1.0, to determine if the individual packet should be dropped or passed on through and then queued in the appropriate QOS level queue.


The quantities that are used to calculate the drop probability can extend over a wide dynamic range, including fractional numbers as well as very large numbers, depending upon how they are used in these equations and the actual size of the queues and packets. Instead of using integer values typically used in hardware design for these types of equations, a small floating point format is actually used. This floating point format uses a four-bit normalized mantissa, which implies that the most significant bit of the mantissa always has the value 1. The four-bit mantissa is normalized such that the value always is between the values of 0.5 and 1.0. The floating point format has a 6-bit exponent biased such that it is between 0 and a maximum value of 63. This allows the exponent to have a range of −32 to +31, obtained from taking the 6-bit value and subtracting off a bias value of 32. In the equations, only the positive numbers are required, such that no sign is needed for the floating point numbers. All of the algorithms require these numbers greater than or equal to zero. With such a small mantissa, multiply and divide operations are accomplished with simple look up tables, for example two 3-bit inputs result in one 4-bit output. Since the most significant bit of the mantissa always has a value of one, it does not need to be explicitly represented, but is rather implied, and does not need to be part of the two 3-bit inputs in table lookups.



FIGS. 5A-5D are graphic representations of weighted random early discard (WRED) probability curves for each of four QOS levels QOS-0 through QOS-3. In FIG. 5A, assuming a 10% overall bandwidth allocated to QOS level 0 as in FIGS. 2A-2D, curve 501 has a threshold 502 which is less than 10% utilization. Threshold 502 is the value of the average common memory space that is being used by QOS level 0 at which a small number of packets begin to be dropped. At a point 503 the amount of memory space consumed by QOS level 0 has reached a maximum, such that the drop probability at point 503 immediately goes vertically to unity, as indicated by point 504. These curves are implemented by the WRED algorithm, which is well known in the literature. In the present embodiment, however, the WRED algorithm is applied concurrently to four different QOS levels all sharing a common memory space, such that a certain portion of the overall memory space is allocated to each of the QOS levels. For example, in FIGS. 5A-5D the maximum memory space allocated to QOS level 0 is 10%, whereas QOS levels 1, 2, and 3 receive respectively 20%, 30%, and 40%. These respective allocations of shared memory pool space are the same as the corresponding queue bandwidth allocations of each QOS level queue, as described above in connection with FIGS. 2A-4D.


In the present embodiment, the WRED algorithm applied to egress queuing is:














// WRED for egress queueing









(3.1)

avgByteCnt = txQueue.avgByteCnt(pkt->dstTrib( ));


(3.2)

prob1 = avgByteCnt *




config.egressWredConst1(pkt->qos( )) −




 config.egressWredConst2(pkt->qos( ));


(3.3)

dropProb = pkt->size( ) * prob1;


(3.4)

wredCnt = txQueue.wredCnt(pkt->dstTrib( ), pkt->qos( ));


(3.5)

if (!pkt->wredDisable ( ) & &




 (avgByteCnt > config.egressQueMaxThresh(pkt->qos( )) | |




 avgByteCnt >




 config.egressQueMinThresh (pkt->qos( )) & &




 rndDrop − rndDrop * wredCnt * dropProb <dropProb( ) {







// drop packet









(3.6)

txQueue.wredCnt (pkt->dstTrib( ), pkt->qos( ), 0);




deletepkt;




} else {




txQueue.append(pkt);


(3.7)

txQueue.wredCnt(pkt->dstTrib( ), pkt->qos( ), wredCnt+1);




}



}









In Equation 3.1 of the WRED algorithm, an average byte count avgByteCnt is obtained for the particular queue into which a packet is being placed, that is indexed by both the destination tributary and the QOS level of the incoming packet. The average byte count is calculated by determining the time weighted average for the actual byte count for that particular queue. In Equation 3.2 a probability probl is calculated by multiplying the average byte count avgByteCnt by a constant config.egress WredConst1 and then subtracting a different constant config egress WredConst2. These constants are a base and a slope corresponding to points 502 and 503 in FIG. 5A. In Equation 3.3 the drop probability dropProb is calculated by multiplying probl by the size of the packet, such that the smaller the packet the less likely it will be dropped. This is desirable to minimize the likelihood that a TCP/IP acknowledge packet will be dropped. In equation 3.4, the wredCnt value for the target queue is obtained. The wredCnt is used to make the interval between dropped packets uniform in delay. The wredCnt is incremented by one each time a packet is not dropped, and set to zero when a packet is dropped. The conditional expression of equation 3.5 determines if a packet is a candidate to be dropped by examining the WRED disable bit associated with the packet, and be determining if the average byte count, avgByteCnt, is within the region where packets are dropped. Average byte counts less than egressQueueMin Thresh are not dropped. Average byte counts in the range from egressQueueMinThresh and egressQueueMaxThresh are dropped using the specified probability function.


Equation 3.6 is performed when a packet is to be dropped. The equation sets the wredCnt for the specific target queue to the value zero. Likewise, equation 3.7 is performed when a packet is appended to a queue. Equation 3.7 is used to increment the value of wredCnt by one. Once a packet is dropped, it is deleted entirely from the router. The entire algorithm is implemented in hardware using the floating point format as described above. Additionally, Queue Congestion Manager Block 12 checks common shared memory space 13, and if shared memory space has been fully consumed, then drops the packet independently of the instantaneous congestion management as well as random early discard management.


Although the present invention and its advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims. Moreover, the scope of the present application is not intended to be limited to the particular embodiments of the process, machine, manufacture, composition of matter, means, methods and steps described in the specification. As one of ordinary skill in the art will readily appreciate from the disclosure of the present invention, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized according to the present invention. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps.

Claims
  • 1. A congestion control method comprising: receiving packets into parallel queues which share a common memory;allocating memory to the parallel quesues based at least on each queue's priority;determing utilization of the common memory;over-subscribing the determined unused common memory space; andallocating over-subscribed unused space of the common memory simultaneously to the parallel queues and dynamically adjusting packet drop probabilities of all or non-empty queues based at least on a division of the over-subscribed unused commom memory space among the queues.
  • 2. The method of claim 1, further comprising: assigning said packets to said plurality of queues in accordance with their quality of service (QOS) priority levels.
  • 3. The method of claim 1, further comprising: storing packet payload pointers of the received packets in said queues;storing packet payloads of the received packets in said common memory; andreleasing said packets from said queues into a common egress tributary using a rate metering mechanism.
  • 4. The method of claim 1, further comprising dropping received packets based at least on each queue's packet drop probability prior to assigning said packets to said queues.
  • 5. The method of claim 4, wherein the packet dropping is performed by hardware.
  • 6. The method of claim 4, wherein dropping the packets applying a weighted random early discard (WRED) method with the packet drop probabilities.
  • 7. The method of claim 6, wherein the weighted random early discard (WRED) method is performed by hardware.
  • 8. The method of claim 1, further comprising: periodically adding tokens to a counter associated with each queue at a time averaged rate substantially proportional to the allocated memory associated with each queue; andapplying a rate metering algorithm to said queues.
  • 9. The method of claim 8, wherein said rate metering algorithm is implemented in hardware.
  • 10. The method of claim 1, wherein the packets are received at an egress of a switch.
  • 11. An apparatus for congestion control, comprising: a receiving mechanism configured to receive packets into parallel queues which share a common memory;a memory allocation mechanism configured to: allocate memory to the parallel queues based at least on each queue's priority;determine utilization of the common memory;over-subscribe the determined unused common memory space; andallocate the over-subscribed unused space of the common memory simultaneously to the parallel queues and dynamically adjust packet drop probabilities of all or non-empty queues based at least on a division of the over-subscribed unused common memory space among the queues.
  • 12. A apparatus as recited in claim 11, further comprising: a packet assignment mechanism configured to assign said packets to said plurality of queues in accordance with their quality of service (QOS) priority levels.
  • 13. An apparatus as recited in claim 11, further comprising: a packet releasing mechanism configured to release said packets from said queues into a common egress tributary using a rate metering mechanism.
  • 14. An apparatus as recited in claim 11, wherein the packet receiving mechanism is configure to receive the packets at an egress of a switch.
RELATED APPLICATIONS

This application is a continuation of and commonly-assigned U.S. patent application Ser. No. 09/740,923, filed Dec. 19, 2000 now U.S. Pat. No. 7,002,980, entitled, “System and Method for Router Queue and Congestion Management.” the disclosure of which is hereby incorporated herein by reference. This application is related to commonly assigned U.S. application Ser. No. 09/703,057, entitled “System And Method For IP Router With an Optical Core,” now abandoned, to and commonly assigned U.S. application Ser. No. 09/703,056, entitled “System and Method for Router Central Arbitration,” to and commonly assigned U.S. application Ser. No. 09/703,038, entitled “System and Method for Router Data Aggregation and Delivery,” to commonly assigned U.S. application Ser. No. 09/702,958, entitled “Timing and Synchronization for an IP Router Using an Optical Switch,” issued as U.S. Pat. No. 6,711,357 on Mar. 23, 2004, to and commonly assigned U.S. application Ser. No. 09/703,027, entitled “Router Network Protection Using Multiple Facility Interfaces,” to commonly assigned U.S. Application Ser. No. 09/703,043, entitled “Router Line Card Protection Using One-for-N Redundancy” issued as U.S. Pat. 6,879,559 on Apr. 12, 2005, and to commonly assigned U.S. application Ser. No. 09/703,064, entitled “Router Switch Fabric Protection Using Forward Error Correction,” issued as U.S. Pat. No. 6,894,970 on May 17, 2005, all filed Oct. 31, 2000, the disclosures of which are incorporated herein by reference.

US Referenced Citations (239)
Number Name Date Kind
4683564 Young et al. Jul 1987 A
4791629 Burns et al. Dec 1988 A
4794629 Pastyr et al. Dec 1988 A
4807280 Posner et al. Feb 1989 A
4876681 Hagiwara et al. Oct 1989 A
4985889 Frankish et al. Jan 1991 A
5101404 Kunimoto et al. Mar 1992 A
5195181 Bryant et al. Mar 1993 A
5224108 McDysan et al. Jun 1993 A
5301192 Henrion Apr 1994 A
5307345 Lozowick et al. Apr 1994 A
5323386 Wiher et al. Jun 1994 A
5365512 Combs et al. Nov 1994 A
5390173 Spinney et al. Feb 1995 A
5392279 Taniguchi Feb 1995 A
5406643 Burke et al. Apr 1995 A
5408469 Opher et al. Apr 1995 A
5430442 Kaiser et al. Jul 1995 A
5436893 Barnett Jul 1995 A
5461615 Henrion Oct 1995 A
5463620 Sriram Oct 1995 A
5506840 Pauwels et al. Apr 1996 A
5521923 Willmann et al. May 1996 A
5546385 Caspi et al. Aug 1996 A
5550816 Hardwick et al. Aug 1996 A
5566170 Bakke et al. Oct 1996 A
5598410 Stone Jan 1997 A
5600795 Du Feb 1997 A
5619497 Gallagher et al. Apr 1997 A
5640504 Johnson et al. Jun 1997 A
5649110 Ben-Nun Jul 1997 A
5663952 Gentry, Jr. Sep 1997 A
5663959 Nakagawa et al. Sep 1997 A
5666353 Klausmeier et al. Sep 1997 A
5721819 Galles et al. Feb 1998 A
5732080 Ferguson et al. Mar 1998 A
5740176 Gupta et al. Apr 1998 A
5751710 Crowther et al. May 1998 A
5815146 Youden et al. Sep 1998 A
5835496 Yeung et al. Nov 1998 A
5838684 Wicki et al. Nov 1998 A
5862350 Coulson Jan 1999 A
5867675 Lomelino et al. Feb 1999 A
5870538 Manning et al. Feb 1999 A
5872783 Chin Feb 1999 A
5907566 Benson et al. May 1999 A
5907660 Inoue et al. May 1999 A
5909686 Muller et al. Jun 1999 A
5915094 Kouloheris et al. Jun 1999 A
5920566 Hendel et al. Jul 1999 A
5920886 Feldmeier Jul 1999 A
5936939 Des Jardins et al. Aug 1999 A
5936966 Ogawa et al. Aug 1999 A
5936967 Baldwin et al. Aug 1999 A
5978359 Caldara et al. Nov 1999 A
5999528 Chow et al. Dec 1999 A
6000016 Curtis et al. Dec 1999 A
6023471 Haddock et al. Feb 2000 A
6035414 Okazawa et al. Mar 2000 A
6038288 Thomas et al. Mar 2000 A
6067301 Aatresh May 2000 A
6076115 Sambamurthy et al. Jun 2000 A
6081522 Hendel et al. Jun 2000 A
6088356 Hendel et al. Jul 2000 A
6094434 Kotzur et al. Jul 2000 A
6104700 Haddock et al. Aug 2000 A
6108306 Kalkunte et al. Aug 2000 A
6118787 Kalkunte et al. Sep 2000 A
6125417 Bailis et al. Sep 2000 A
6128666 Muller et al. Oct 2000 A
6144668 Bass et al. Nov 2000 A
6151301 Holden Nov 2000 A
6151615 Brooks et al. Nov 2000 A
6154446 Kadambi et al. Nov 2000 A
6157643 Ma Dec 2000 A
6160809 Adiletta et al. Dec 2000 A
6172990 Deb et al. Jan 2001 B1
6188698 Galand et al. Feb 2001 B1
6222845 Shue et al. Apr 2001 B1
6233236 Nelson May 2001 B1
6243667 Kerr et al. Jun 2001 B1
6272144 Berenbaum et al. Aug 2001 B1
6320859 Momirov Nov 2001 B1
6333929 Drottar et al. Dec 2001 B1
6335932 Kadambi et al. Jan 2002 B2
6335935 Kadambi et al. Jan 2002 B2
6343072 Bechtolsheim et al. Jan 2002 B1
6349097 Smith Feb 2002 B1
6351143 Guccione et al. Feb 2002 B1
6356550 Williams Mar 2002 B1
6356942 Bengtsson et al. Mar 2002 B1
6369855 Chauvel et al. Apr 2002 B1
6421352 Manaka et al. Jul 2002 B1
6424658 Mather Jul 2002 B1
6424659 Viswanadham et al. Jul 2002 B2
6427185 Ryals et al. Jul 2002 B1
6463063 Bianchini, Jr. et al. Oct 2002 B1
6470436 Croft et al. Oct 2002 B1
6473433 Bianchini, Jr. et al. Oct 2002 B1
6477167 Wu Nov 2002 B1
6477174 Dooley et al. Nov 2002 B1
6490280 Leung Dec 2002 B1
6493347 Sindhu et al. Dec 2002 B2
6522656 Gridley Feb 2003 B1
6532234 Yoshikawa Mar 2003 B1
6549519 Michels et al. Apr 2003 B1
6549541 Sainio Apr 2003 B1
6553370 Andreev et al. Apr 2003 B1
6567404 Wilford May 2003 B1
6580721 Beshai Jun 2003 B1
6606300 Blanc et al. Aug 2003 B1
6608819 Mitchem Aug 2003 B1
6611522 Zheng et al. Aug 2003 B1
6643269 Fan et al. Nov 2003 B1
6654342 Dittia et al. Nov 2003 B1
6654370 Quirke et al. Nov 2003 B1
6654373 Maher, III et al. Nov 2003 B1
6658002 Ross et al. Dec 2003 B1
6671275 Wong et al. Dec 2003 B1
6674754 Ofek Jan 2004 B1
6678248 Haddock et al. Jan 2004 B1
6681332 Byrne et al. Jan 2004 B1
6687247 Wilford et al. Feb 2004 B1
6691202 Vasquez et al. Feb 2004 B2
6696917 Heitner et al. Feb 2004 B1
6697359 George Feb 2004 B1
6697368 Chang et al. Feb 2004 B2
6700894 Shung Mar 2004 B1
6707818 Kadambi et al. Mar 2004 B1
6711357 Brewer et al. Mar 2004 B1
6721229 Cole Apr 2004 B1
6721268 Ohira et al. Apr 2004 B1
6721313 Van Duyne Apr 2004 B1
6735218 Chang et al. May 2004 B2
6741597 Jeong May 2004 B1
6751224 Parruck et al. Jun 2004 B1
6754881 Kuhlmann et al. Jun 2004 B2
6775706 Fukumoto et al. Aug 2004 B1
6778546 Epps et al. Aug 2004 B1
6781990 Puri et al. Aug 2004 B1
6792484 Hook Sep 2004 B1
6792502 Pandya et al. Sep 2004 B1
6798740 Senevirathne et al. Sep 2004 B1
6804731 Chang et al. Oct 2004 B1
6807179 Kanuri et al. Oct 2004 B1
6807363 Abiko et al. Oct 2004 B1
6810046 Abbas et al. Oct 2004 B2
6813243 Epps et al. Nov 2004 B1
6816467 Muller et al. Nov 2004 B1
6822940 Zavalkovsky et al. Nov 2004 B1
6831923 Laor et al. Dec 2004 B1
6831932 Boyle et al. Dec 2004 B1
6839346 Kametani Jan 2005 B1
6842422 Bainchini, Jr. Jan 2005 B1
6854117 Roberts Feb 2005 B1
6859438 Haddock Feb 2005 B2
8859438 Haddock at at Feb 2005
6865153 Hill et al. Mar 2005 B1
6879559 Blackmon Apr 2005 B1
6894970 McDermott, III et al. May 2005 B1
6898182 Cloonan May 2005 B1
6901072 Wong May 2005 B1
6910148 Ho Jun 2005 B1
6920154 Achler Jul 2005 B1
6925516 Struhsaker et al. Aug 2005 B2
6937606 Basso et al. Aug 2005 B2
6938179 Iyer Aug 2005 B2
6946948 McCormack et al. Sep 2005 B2
6957258 Maher, III et al. Oct 2005 B2
6959007 Vogel et al. Oct 2005 B1
6975639 Hill Dec 2005 B1
6977943 Zboril Dec 2005 B2
7009968 Ambe et al. Mar 2006 B2
7012919 So et al. Mar 2006 B1
7050430 Kalkunte et al. May 2006 B2
7073005 Basu et al. Jul 2006 B1
7093160 Lau Aug 2006 B2
7126956 Scholten Oct 2006 B2
7133399 Brewer et al. Nov 2006 B1
7151797 Limberg Dec 2006 B2
7161948 Sampath et al. Jan 2007 B2
7167471 Calvignac et al. Jan 2007 B2
7185141 James et al. Feb 2007 B1
7185266 Blightman et al. Feb 2007 B2
7187687 Davis et al. Mar 2007 B1
7191277 Broyles Mar 2007 B2
7191468 Hanner Mar 2007 B2
7203194 Chang et al. Apr 2007 B2
7206283 Chang et al. Apr 2007 B2
7212536 MacKiewich et al. May 2007 B2
7224670 Jeffries et al. May 2007 B2
7236490 Chang et al. Jun 2007 B2
7266117 Davis Sep 2007 B1
7283470 Sindhu et al. Oct 2007 B1
7324500 Blackmon Jan 2008 B1
7356030 Chang et al. Apr 2008 B2
7382728 Chen et al. Jun 2008 B2
7382736 Mitchem Jun 2008 B2
20010007560 Masuda et al. Jul 2001 A1
20010048785 Steinberg Dec 2001 A1
20010053150 Clear et al. Dec 2001 A1
20020012585 Kalkunte et al. Jan 2002 A1
20020054594 Hoof et al. May 2002 A1
20020069294 Herkersdorf et al. Jun 2002 A1
20020073073 Cheng Jun 2002 A1
20020089977 Chang et al. Jul 2002 A1
20020091884 Chang et al. Jul 2002 A1
20020097713 Chang et al. Jul 2002 A1
20020105966 Patel et al. Aug 2002 A1
20020126672 Chow et al. Sep 2002 A1
20020146013 Karlsson et al. Oct 2002 A1
20030009466 Ta et al. Jan 2003 A1
20030033435 Haner Feb 2003 A1
20030043800 Sonksen et al. Mar 2003 A1
20030048785 Calvignac et al. Mar 2003 A1
20030103499 Davis et al. Jun 2003 A1
20030110180 Calvignac et al. Jun 2003 A1
20030152084 Lee et al. Aug 2003 A1
20030165160 Minami et al. Sep 2003 A1
20030174719 Srinivas et al. Sep 2003 A1
20030177221 Ould-Brahim et al. Sep 2003 A1
20030215029 Limberg Nov 2003 A1
20040022263 Zhao et al. Feb 2004 A1
20040062246 Boucher et al. Apr 2004 A1
20040128434 Khanna et al. Jul 2004 A1
20040179548 Chang et al. Sep 2004 A1
20050041684 Reynolds et al. Feb 2005 A1
20050089049 Chang et al. Apr 2005 A1
20050132132 Rosenbluth et al. Jun 2005 A1
20050175018 Wong Aug 2005 A1
20060114876 Kalkunte Jun 2006 A1
20070179909 Channasagara Aug 2007 A1
20070208876 Davis Sep 2007 A1
20070253420 Chang Nov 2007 A1
20070258475 Chinn et al. Nov 2007 A1
20070288690 Shingyu et al. Dec 2007 A1
20080002707 Davis Jan 2008 A1
20080049742 Bansal Feb 2008 A1
20080225859 Mitchem Sep 2008 A1
Foreign Referenced Citations (4)
Number Date Country
2273291 Nov 2000 CA
0987861 Mar 2000 EP
2003-289359 Oct 2003 JP
WO 0184728 Nov 2001 WO
Related Publications (1)
Number Date Country
20060062233 A1 Mar 2006 US
Continuations (1)
Number Date Country
Parent 09740923 Dec 2000 US
Child 11272998 US