The present invention is concerned with data and storage communication systems and is more particularly concerned with a network processor that includes a scheduler component.
Data and storage communication networks are in widespread use. In many data and storage communication networks, data packet switching is employed to route data packets or frames from point to point between source and destination, and network processors are employed to handle transmission of data into and out of data switches.
The network processor 10 includes data flow chips 12 and 14. The first data flow chip 12 is connected to a data switch 15 (shown in phantom) via first switch ports 16, and is connected to a data network 17 (shown in phantom) via first network ports 18. The first data flow chip 12 is positioned on the ingress side of the switch 15 and handles data frames that are inbound to the switch 15.
The second data flow chip 14 is connected to the switch 15 via second switch ports 20 and is connected to the data network 17 via second network ports 22. The second data flow chip 14 is positioned on the egress side of the switch 15 and handles data frames that are outbound from the switch 15.
As shown in
The network processor 10 also includes a first processor chip 28 coupled to the first data flow chip 12. The first processor chip 28 supervises operation of the first data flow chip 12 and may include multiple processors. A second processor chip 30 is coupled to the second data flow chip 14, supervises operation of the second data flow chip 14 and may include multiple processors.
A control signal path 32 couples an output terminal of second data flow chip 14 to an input terminal of first data flow chip 12 (e.g., to allow transmission of data frames therebetween).
The network processor 10 further includes a first scheduler chip 34 coupled to the first data flow chip 12. The first scheduler chip 34 manages the sequence in which inbound data frames are transmitted to the switch 15 via first switch ports 16. A first memory 36 such as a fast SRAM is coupled to the first scheduler chip 34 and stores data frame pointers (in the form of flow queues) and flow control information (in the form of flow queue control blocks (“FQCBs”) 37). Flow queues are discussed further below. The first memory 36 may be, for example, a QDR (quad data rate) SRAM.
A second scheduler chip 38 is coupled to the second data flow chip 14. The second scheduler chip 38 manages the sequence in which data frames are output from the second network ports 22 of the second data flow chip 14. Coupled to the second scheduler chip 38 are at least one and possibly two memories (e.g., fast SRAMs 40) for storing data frame pointers and flow control information. The memories 40 may, like the first memory 36, be QDRs. The additional memory 40 on the egress side of the network processor 10 may be needed because of a larger number of flows output through the second network ports 22 than through the first switch ports 16.
Flows with which the incoming data frames are associated are enqueued in (“attached to”) a scheduling queue 42 maintained in the first scheduler chip 34. The scheduling queue 42 defines a sequence in which the flows attached thereto are to be serviced. The particular scheduling queue 42 of interest in connection with the present invention is a weighted fair queue which arbitrates among flows entitled to a “best effort” or “available bandwidth” Quality of Service (QoS).
As shown in
Although not indicated in
The memory 36 associated with the first scheduler chip 34 holds pointers (“frame pointers”) to locations in the first data buffer 24 corresponding to data frames associated with the flows enqueued in the scheduling queue 42. The frame pointers are listed in flow queues (not separately shown), each of which corresponds to a respective flow that is or may be attached to the scheduling queue 42. The flow queue indicates an order in which frames associated with the flow were received and are to be dispatched.
The memory 36 also stores flow control information, such as information indicative of the QoS to which flows are entitled. The flow control information is stored in flow queue control blocks (“FQCBs”), each of which corresponds to a respective one of the flow queues.
When the scheduling queue 42 indicates that a particular flow attached thereto is the next to be serviced, reference is made to the first frame pointer in the corresponding flow queue in the memory 36 and the corresponding frame data is transferred from the first data buffer 24 to an output queue 46 associated with the output port 44. At the same time, the flow is detached from the scheduling queue 42, and, assuming that at least one more frame pointer remains in the corresponding flow queue, is reattached to the scheduling queue in accordance with a procedure that is described below.
A more detailed representation of the scheduling queue 42 is shown in
More specifically, the queue slot in which a flow is placed upon reattachment is calculated according to the formula CP+((WF×FS)/SF), where CP is a pointer (“current pointer”) that indicates a current position (the slot currently being serviced) in the scheduling queue 42; WF is a weighting factor associated with the flow to be enqueued, the weighting factor having been determined on the basis of the QoS to which the flow is entitled; FS is the size of the frame currently being dispatched for the flow to be reattached; and SF is a scaling factor chosen to scale the product (WF×FS) so that the resulting quotient falls within the range defined by the scheduling queue 42. (In accordance with conventional practice, the scaling factor SF is conveniently defined as an integral power of 2—i.e., SF=2n, with n being a positive integer—so that scaling the product (WF×FS) is performed by right shifting.) With this known weighted fair queuing technique, the weighting factors assigned to the various flows in accordance with the QoS assigned to each flow govern how close to the current pointer of the queue each flow is enqueued. In addition, flows which exhibit larger frame sizes are reattached farther from the current pointer of the queue, to prevent such flows from appropriating an undue proportion of the available bandwidth of the queue. Upon reattachment, data that identifies a flow (the “Flow ID”) is stored in the appropriate queue slot 48.
In addition to the “reattachment” situation described above, there are two other cases in which flows are attached to the scheduling queue 42. The first of these two cases is concerned with attachment to the scheduling queue 42 upon arrival of the first frame for a new flow. The second of the two cases is concerned with attachment of a flow to the scheduling queue 42 upon arrival of the first frame after the flow queue for the flow in question has been emptied (i.e., after the last frame pointed to by the flow queue is dispatched). In both of these cases, there is no frame currently being dispatched, and accordingly, there is no size information available for such a currently dispatched frame. It has therefore been proposed in both cases to attach the flow to the scheduling queue 42 at a predetermined fixed distance from the current pointer CP for the scheduling queue 42. However, the present inventors have recognized that this proposed practice may undermine the desired weighted fair queuing in certain situations that may be encountered in the second case, namely attachment of the flow to the scheduling queue 42 after the corresponding flow queue has been emptied. In particular, if a given flow is made up of large but relatively infrequent frames, the predetermined fixed enqueuement distance may be too short to limit the flow in question to the Quality of Service to which it is entitled. Furthermore, where a flow is made up of relatively infrequent short frames, the predetermined fixed enqueuement distance may work to “short change” the flow, i.e., to prevent it from receiving the Quality of Service to which it is entitled.
It is an object of the present invention to assure that a contracted-for QoS is maintained for a flow upon attachment of the flow to a weighted fair queue in a case where a new frame is received for the flow after the corresponding flow queue has emptied.
According to a first aspect of the invention, a method of operating a network processor is provided. The method includes dispatching a last frame from a flow queue maintained in the network processor, thereby emptying the flow queue, and storing data indicative of a size of the dispatched last frame.
In at least one embodiment, the inventive method may further include receiving a new frame corresponding to the emptied flow queue, and attaching to a scheduling queue a flow corresponding to the emptied flow queue. The flow may be attached to the scheduling queue a distance D from a current pointer for the scheduling queue, where the distance D is determined based at least in part on the stored data indicative of the size of the dispatched last frame.
According to a second aspect of the invention, a network processor is provided, including a scheduler which includes a scheduling queue. The scheduling queue has flows attached thereto and defines a sequence in which the attached flows are to be serviced. The network processor according to this aspect of the invention further includes a storage device that is associated with the scheduler, and maintains a flow queue corresponding to each flow attached to the scheduling queue. Further in accordance with the first aspect of the invention, the storage device stores, for each flow queue that has been emptied, data indicative of a size of a last frame dispatched from the respective flow queue.
In at least one embodiment, when a new frame is received that corresponds to a flow queue that has been emptied, the flow corresponding to the new frame may be attached to the scheduling queue at a distance D from a current pointer for the scheduling queue. The distance D is determined based at least in part on the stored data indicative of the size of the last frame dispatched from the flow queue that has been emptied.
Numerous other aspects are provided, as are computer program products. Each inventive computer program product may be carried by a medium readable by a computer (e.g., a carrier wave signal, a floppy disk, a hard drive, a random access memory, etc.).
With the apparatus and method of the present invention, a flow may be attached to the scheduling queue, after emptying of the corresponding flow queue and receipt of a new frame for the flow, on the basis of the size of the last frame dispatched upon emptying of the flow queue. Consequently, flows that attempt to “misbehave” by sending very large but infrequent frames, are nevertheless accorded their appropriate Quality of Service. Furthermore, flows made up of relatively infrequent short frames will not be penalized due to the small size of the frames in the flow.
Other objects, features and advantages of the present invention will become more fully apparent from the followed detailed description of exemplary embodiments, the appended claims and the accompanying drawings.
Attachment of a flow to the scheduling queue 42 in accordance with the invention will now be described, with reference to
The process of
Following, or in conjunction with, block 52 is block 54. At block 54 data indicative of the size of the frame dispatched at block 52 is stored. For example, this data may be stored in the flow queue control block (FQCB) corresponding to the flow queue which was emptied at block 52.
Following block 54 is a decision block 56, at which it is determined whether the next frame has arrived for the flow corresponding to the emptied flow queue. Until the next frame arrives, the process of
Following block 58 is block 60. At block 60, the flow in question is attached to the scheduling queue 42 at the slot determined at block 58. The process then ends, at 62.
With the method and apparatus of the present invention, flows that “misbehave” by sending very large frames infrequently can be prevented from misappropriating a quantity of bandwidth to which such flows are not entitled. At the same time, the inventive method and apparatus prevent flows exhibiting infrequent, small frames from being “short changed”.
The process of
The foregoing description discloses only exemplary embodiments of the invention; modifications of the above disclosed apparatus and method which fall within the scope of the invention will be readily apparent to those of ordinary skill in the art. According to one alternative embodiment, a scheduling queue may have plural subqueues of different ranges and resolutions, according to an invention disclosed in above-referenced co-pending patent application Ser. No. 10/016,518, filed Nov. 1, 2001 (Attorney Docket No. ROC920010199US1).
Moreover, in the above description, the invention has been implemented in connection with a separate scheduler chip associated with a network processor. However, it is also contemplated to implement the invention in a scheduler circuit that is implemented as part of a data flow chip or as part of a processor chip.
Accordingly, while the present invention has been disclosed in connection with exemplary embodiments thereof, it should be understood that other embodiments may fall within the spirit and scope of the invention, as defined by the following claims.
The present application is a continuation of and claims priority to U.S. patent application Ser. No. 10/102,166, filed Mar. 20, 2002, which is hereby incorporated by reference herein in its entirety. The present application is related to the following U.S. Patent Applications, each of which is hereby incorporated by reference herein in its entirety: U.S. patent application Ser. No. 10/016,518, filed Nov. 1, 2001, titled “WEIGHTED FAIR QUEUE HAVING EXTENDED EFFECTIVE RANGE” (IBM Docket No. ROC920010199US1); U.S. patent application Ser. No. 10/015,994, filed Nov. 1, 2001, titled “WEIGHTED FAIR QUEUE SERVING PLURAL OUTPUT PORTS” (IBM Docket No. ROC920010200US1); U.S. patent application Ser. No. 10/015,760, filed Nov. 1, 2001, titled “WEIGHTED FAIR QUEUE HAVING ADJUSTABLE SCALING FACTOR” (IBM Docket No. ROC920010201US1); U.S. patent application Ser. No. 10/002,085, filed Nov. 1, 2001, titled “EMPTY INDICATORS FOR WEIGHTED FAIR QUEUES” (IBM Docket No. ROC920010202US1); U.S. patent application Ser. No. 10/004,373, filed Nov. 1, 2001, titled “QoS SCHEDULER AND METHOD FOR IMPLEMENTING PEAK SERVICE DISTANCE USING NEXT PEAK SERVICE TIME VIOLATED INDICATION” (IBM Docket No. ROC920010203US1); U.S. patent application Ser. No. 10/002,416, filed Nov. 1, 2001, titled “QoS SCHEDULER AND METHOD FOR IMPLEMENTING QUALITY OF SERVICE WITH AGING STAMPS” (IBM Docket No. ROC920010204US1); U.S. patent application Ser. No. 10/004,440, filed Nov. 1, 2001, titled “QoS SCHEDULER AND METHOD FOR IMPLEMENTING QUALITY OF SERVICE WITH CACHED STATUS ARRAY” (IBM Docket No. ROC920010205US1); and U.S. patent application Ser. No. 10/004,217, filed Nov. 1, 2001, titled “QoS SCHEDULER AND METHOD FOR IMPLEMENTING QUALITY OF SERVICE ANTICIPATING THE END OF A CHAIN OF FLOWS” (IBM Docket No. ROC920010206US1).
Number | Date | Country | |
---|---|---|---|
Parent | 10102166 | Mar 2002 | US |
Child | 11838152 | Aug 2007 | US |