A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever.
The present invention relates to the field of switching in a computer network. More particularly, the present invention relates to the controlling the flow of packets through Media Access Control (MAC) layer switching, while honoring the priority levels of the packets.
A switch is a device that provides a switching function (i.e., determines a physical path) in a data communications network. Switching may often involve transferring information, such as digital data packets or frames, among entities of the network. Switching is accomplished by examining data on one or more of the network layers. One such type of switching is media access control (MAC) address-based switching, which involves switching in the data link layer. The data link layer is also commonly referred to as layer 2 of the OSI Reference Model. Specifically, switching may occur through Ethernet and/or Gigabit Ethernet on full duplex ports for layer 2 networks.
During switching, there is often a need for flow control of packets, in case of network outages or if a network device does not have enough resources to handle the received traffic. In a full duplex network, a receiver can signal to the transmitter to stop sending the traffic if it does not have enough resources to handle the traffic. The IEEE 802.3x Specification (now part of 802.3, Annex 31B), uses PAUSE frames for a device to signal another device. The specialized MAC control PAUSE frames according to IEEE 802.3x are depicted in
The PAUSE frame causes any device receiving it to stop forwarding traffic to the requested device for the specified period of time. The hope is that when that period of time is up, the device has resources available for the traffic.
This flow control mechanism, however, does not discriminate among the incoming frames—it applies to all incoming frames to a device. In certain systems, however, frames may be prioritized. For example, voice data may have a high priority level as it is extremely time sensitive, whereas text data may have a low priority level. Furthermore, certain subscriber's traffic may be afforded higher priority than others. The prior art flow control mechanism, however, violates these priorities by simply ceasing all incoming transmissions. This can even defeat the purpose of flow control in the first place, by deteriorating network throughput, causing more transmissions, and a compounding of the problem.
What is needed is a mechanism wherein the MAC can take the action of the flow control and apply it in a way that takes into account the priority of the frames.
Furthermore, currently PAUSE frames are sent out as untagged and only have significance on a single link.
However, in the metro Ethernet environment, clients and servers may not be directly connected, but rather connected over several hops.
Solutions are provided that allow a network device to apply flow control on the MAC layer while taking into account the priority of the frame of traffic. This may be accomplished by generating a frame indicating that traffic flow should be paused, while utilizing a new opcode value, or alternatively by utilizing a new type/length value (possibly combined with a new opcode value). A receiving device may then examine the fields of the frame to determine whether it should it should use priority-based pausing, and then examine other fields to determine which priority-levels to pause and for how long. This allows for improved efficiency in flow control on the MAC layer.
The accompanying drawings, which are incorporated into and constitute a part of this specification, illustrate one or more embodiments of the present invention and, together with the detailed description, serve to explain the principles and implementations of the invention.
In the drawings:
Embodiments of the present invention are described herein in the context of a system of computers, servers, and software. Those of ordinary skill in the art will realize that the following detailed description of the present invention is illustrative only and is not intended to be in any way limiting. Other embodiments of the present invention will readily suggest themselves to such skilled persons having the benefit of this disclosure. Reference will now be made in detail to implementations of the present invention as illustrated in the accompanying drawings. The same reference indicators will be used throughout the drawings and the following detailed description to refer to the same or like parts.
In the interest of clarity, not all of the routine features of the implementations described herein are shown and described. It will, of course, be appreciated that in the development of any such actual implementation, numerous implementation-specific decisions must be made in order to achieve the developer's specific goals, such as compliance with application- and business-related constraints, and that these specific goals will vary from one implementation to another and from one developer to another. Moreover, it will be appreciated that such a development effort might be complex and time-consuming, but would nevertheless be a routine undertaking of engineering for those of ordinary skill in the art having the benefit of this disclosure.
In accordance with the present invention, the components, process steps, and/or data structures may be implemented using various types of operating systems, computing platforms, computer programs, and/or general purpose machines. The computer programs may be stored on a program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform a method as described herein. In addition, those of ordinary skill in the art will recognize that devices of a less general purpose nature, such as hardwired devices, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), or the like, may also be used without departing from the scope and spirit of the inventive concepts disclosed herein. Furthermore, the present invention is described in the context of a switch. However, one of ordinary skill in the art will recognize that the term switch should be read broadly, so as to include any device that directs packets, including a router and a gateway.
The present invention provides mechanisms to allow a network device to apply flow control on the MAC layer while taking into account the priority of the frames of traffic.
Two mechanisms are described herein for applying flow control on a MAC layer for packets having a priority value. One of ordinary skill in the art will recognize that the specifics of these mechanisms are merely examples and should not be read as limiting. In one embodiment of the present invention, a different opcode value along with a new field may be utilized in the PAUSE frame in order to indicate how to handle frames of varying priorities. The new field may be termed a priority mask, and may be used to identify to which priority to apply the PAUSE command. Thus,
The new priority mask field 408 may be two bytes long, however the first byte may be unused in systems having 8 or fewer possible priority levels. In this embodiment, each bit of the second byte may correspond to a different priority level. Therefore, bit 0 might correspond to a priority level of 0, bit 1 to a priority level of 1, etc. The presence of any bit signals the traffic with the specific priority to be paused.
In an embodiment of the present invention, the pause time field 410 may be extended to 16 bytes, to allow for each priority level to have a different pause time. This may be utilized only when it is desired to have varying pause times—if it is more desirable in a specific instance to have a single pause time for all paused traffic, the only pause time field may be used. The new pause time field allows for 8 2-byte values for pause time. For example, as depicted in
Typically, the PAUSE frame utilizes an opcode value of 1. In an embodiment of the present invention, an opcode value of 2 may indicate the presence of the priority mask field—thus the receiving device would pause traffic with a priority value indicated by the priority mask. The pausing in this instance would be for a set time for all priorities, thus using only a single value in the pause time field.
An opcode value of 3, then, may indicate the presence of both the priority mask field and the new pause time field, thus indicating to the receiving device that it should pause traffic with a priority value indicated by the priority mask, for time periods as specified in the new pause time field.
In another embodiment of the present invention, a new type/length value may be used. This embodiment is beneficial when encountering devices utilizing older MAC standards, which may not be able to understand the new opcode values described above. Typically, the value “8808” is utilized in the type/length field to indicate a PAUSE frame. In this embodiment, the value “8809” may be used, for example, to indicate that this is a PAUSE frame that handles priority. The opcode field may then be used to indicate whether or not all the traffic priority levels utilize the same pause time—rather than values of 2 and 3 they may be, for example, 1 and 2. Otherwise, the frame format described in
While embodiments and applications of this invention have been shown and described, it would be apparent to those skilled in the art having the benefit of this disclosure that many more modifications than mentioned above are possible without departing from the inventive concepts herein. The invention, therefore, is not to be restricted except in the spirit of the appended claims.
This application is a continuation of prior U.S. patent application Ser. No. 12/615,142, filed on Nov. 9, 2009, which is a continuation of prior U.S. patent application Ser. No. 10/693,037, entitled “Priority Aware Mac Flow Control,” filed on Oct. 23, 2003, now U.S. Pat. No. 7,639,608.
Number | Name | Date | Kind |
---|---|---|---|
4412285 | Neches et al. | Oct 1983 | A |
4727537 | Nichols | Feb 1988 | A |
5305321 | Crayford | Apr 1994 | A |
5355375 | Christensen | Oct 1994 | A |
5436617 | Adams et al. | Jul 1995 | A |
5517520 | Chi | May 1996 | A |
5535211 | Yano | Jul 1996 | A |
5673254 | Crayford | Sep 1997 | A |
6021446 | Gentry, Jr. | Feb 2000 | A |
6084856 | Simmons et al. | Jul 2000 | A |
6084879 | Berl et al. | Jul 2000 | A |
6115356 | Kalkunte et al. | Sep 2000 | A |
6128665 | Iturrald | Oct 2000 | A |
6147995 | Dobbins et al. | Nov 2000 | A |
6167029 | Ramakrishnan | Dec 2000 | A |
6170022 | Linville et al. | Jan 2001 | B1 |
6181699 | Crinion et al. | Jan 2001 | B1 |
6405258 | Erimli et al. | Jun 2002 | B1 |
6539022 | Virgile | Mar 2003 | B1 |
6560236 | Marghese et al. | May 2003 | B1 |
6633585 | Ghanwani et al. | Oct 2003 | B1 |
6667985 | Drummond-Murray | Dec 2003 | B1 |
6704280 | Mangin et al. | Mar 2004 | B1 |
6707817 | Kadambi et al. | Mar 2004 | B1 |
6754179 | Lin | Jun 2004 | B1 |
6850542 | Tzeng | Feb 2005 | B2 |
6957269 | Williams et al. | Oct 2005 | B2 |
6957270 | Erimli et al. | Oct 2005 | B1 |
6981054 | Krishna | Dec 2005 | B1 |
7009968 | Ambe et al. | Mar 2006 | B2 |
7020139 | Kalkunte et al. | Mar 2006 | B2 |
7035255 | Tzeng | Apr 2006 | B2 |
7035286 | Tzeng | Apr 2006 | B2 |
7062568 | Senevirathne et al. | Jun 2006 | B1 |
7065050 | Herbst | Jun 2006 | B1 |
7142512 | Kobayashi et al. | Nov 2006 | B1 |
7145869 | Kadambi et al. | Dec 2006 | B1 |
7161948 | Sampath et al. | Jan 2007 | B2 |
7180857 | Kawakami et al. | Feb 2007 | B2 |
7212534 | Kadambi et | May 2007 | B2 |
7292572 | Liu et al. | Nov 2007 | B2 |
7379422 | Nation | May 2008 | B2 |
7423967 | Tzeng et al. | Sep 2008 | B2 |
7539132 | Werner et al. | May 2009 | B2 |
7539134 | Bowes | May 2009 | B1 |
7639608 | Jain et al. | Dec 2009 | B1 |
7640359 | Gilmartin | Dec 2009 | B1 |
7746778 | Chilukoor et al. | Jun 2010 | B2 |
7761589 | Jain | Jul 2010 | B1 |
7948880 | Kwan et al. | May 2011 | B2 |
7990857 | Jain et al. | Aug 2011 | B2 |
8514713 | Jain | Aug 2013 | B2 |
20020080444 | Phillips et al. | Jun 2002 | A1 |
20020085551 | Tzeng | Jul 2002 | A1 |
20020085585 | Tzeng | Jul 2002 | A1 |
20020087723 | Williams et al. | Jul 2002 | A1 |
20020093973 | Tzeng | Jul 2002 | A1 |
20020133534 | Forslow | Sep 2002 | A1 |
20020161914 | Belenki | Oct 2002 | A1 |
20030016628 | Kadambi | Jan 2003 | A1 |
20030037163 | Kitada et al. | Feb 2003 | A1 |
20030055900 | Glas et al. | Mar 2003 | A1 |
20030120759 | Ogawa | Jun 2003 | A1 |
20030123393 | Feuerstraeter et al. | Jul 2003 | A1 |
20030123446 | Muirhead et al. | Jul 2003 | A1 |
20030185249 | Davies et al. | Oct 2003 | A1 |
20030212898 | Steele et al. | Nov 2003 | A1 |
20030218977 | Pan et al. | Nov 2003 | A1 |
20040032868 | Oda et al. | Feb 2004 | A1 |
20040081090 | Hara et al. | Apr 2004 | A1 |
20040095882 | Hamzah et al. | May 2004 | A1 |
20040120334 | Nation | Jun 2004 | A1 |
20040179476 | Kim et al. | Sep 2004 | A1 |
20040205206 | Naik et al. | Oct 2004 | A1 |
20050002375 | Gokhale et al. | Jan 2005 | A1 |
20050021846 | Tzeng et al. | Jan 2005 | A1 |
20050111446 | Greaves et al. | May 2005 | A1 |
20060092837 | Kwan et al. | May 2006 | A1 |
20060164978 | Werner et al. | Jul 2006 | A1 |
20080137534 | Chilukoor et al. | Jun 2008 | A1 |
20100020689 | Tang | Jan 2010 | A1 |
20100046556 | Jain et al. | Feb 2010 | A1 |
20100238804 | Jain | Sep 2010 | A1 |
20110286335 | Dubey | Nov 2011 | A1 |
20120147747 | Jain et al. | Jun 2012 | A1 |
20130301409 | Jain | Nov 2013 | A1 |
Number | Date | Country |
---|---|---|
0529774 | Mar 1993 | EP |
0648034 | Apr 1995 | EP |
0948853 | Aug 2004 | EP |
Entry |
---|
Office Action in U.S. Appl. No. 12/791,828, mailed Sep. 7, 2012. |
“Annex B Baseband System Guidelines and Concepts, 10 Mb/s,” System Guidelines, IEEE, Std. 802-3, Mar. 2002, Section One, pp. 489-511. |
“Virtual Bridged Local Area Networks—Amendment: Priority-based Flow Control,” IEEE P802.1Qbb/D2.2, Apr. 15, 2010, pp. 1-41. |
DeSanti, C., “IEEE 802.1 Liaison Report to T11,” T11/08-335v0, Jun. 2008, pp. 1-4. |
“NetApp—Ethernet Storage Guy,” http://blogs.netapp.com/ethernet/8021qbb, Jun. 6, 2010, pp. 1-3. |
Smith, G. et al., “Converged Enhanced Ethernet—Good for iSCSI SANs,” NetApp White Paper, Blade Network Technologies, Oct. 2008, pp. 1-7. |
Hagen, M., “Data Center Bridging Tutorial,” University of New Hampshire—InterOperability Laboratory, Feb. 2009, pp. 1-3. |
“Brocade Fabric OS v6.12—ceel, Release Notes v3.0,” Brocade, Sep. 2, 2009, pp. 1-15. |
“iSCSI Primer,” Ethernet Alliance SC09, Ethernet: The Converged Network Presentation, Nov. 14-20, 2009, Oregon Convention Center, pp. 1-17. |
Thaler, P., “myProjectTM—P802.1Qbb PAR Detail,” Amendment to IEEE Standard, Feb. 13, 2008, 2 pages. |
Barrass, H. et al., “Proposal for Priority Based Flow Control, V.2,” Cisco, May 27, 2008, pp. 1-9. |
Barrass, H. et al., “Proposal for Priority Based Flow Control, V.1,” Cisco, Apr. 2008, pp. 1-9. |
Barrass, H. et al., “Proposal for Priority Based Flow Control, V.2,” Cisco, Apr. 2008, pp. 1-9. |
Barrass, H. et al., “Proposal for Priority Based Flow Control, V.3,” Cisco, Apr. 2008, pp. 1-9. |
Barrass, H. et al., “Proposal for Priority Based Flow Control, V.1,” Cisco, May 2008, pp. 1-9. |
Thaler, P., “BB Frame Issues,” IEEE 802, pthaler@broadcom.com, Jul. 2008, pp. 1-3. |
Pelissier, J., “Proposed Relation Between PFC and the MAC Control Sublayer,” Cisco, bb-pelissier-pfc-mac-conrol-0708, Sep. 2008, pp. 1-11. |
Pelissier, J., “Convergence of 802.1Q, PFC, AVB, and ETS,” Cisco, bb-pelissier-convergence-proposal-1108, Nov. 2008, pp. 1-23. |
Pelissier, J., “PFC Defense Mode Proposal,” Cisco, bb-pelissier-pfc-defense-0409, Apr. 2009, pp. 1-5. |
Ghanwani, A. et al., “PFC and Untagged Frames,” Brocade, May 2009, pp. 1-11. |
DeSanti, C., “PFC State Diagrams,” vol. 1, Sep. 2009, pp. 1-12. |
DeSanti, C., “PFC State Diagrams,” vol. 2, Sep. 2009, pp. 1-7. |
“Data Center Bridging (aka CEE)—It is not just for PCoE,” © 2009 Blade Network Technologies, Feb. 25, 2010, pp. 1-22. |
DeSanti, C., “802.1Qbb Status,” Cisco, bb-cds-draft-0-2-status-1108, Nov. 2008, pp. 1-4. |
Lippitt, M. et al., “Fibre Channel over Ethernet (FCOE),” Version 4.1, EMC Techbooks, 2010, pp. 1-401. |
IEEE Standard for Information technology—Telecommunications and information exchange between systems—Local and metropolitan area networks—Specific requirements, Part 3: Carrier Sense Multiple Access with Collision Detection (CSMA/CD) Access Method and Physical Layer Specifications, Amendment 2: Ethernet Organizationally Specific Type, Length, Value (TLVs), Amendment to IEEE Std. 802.3-2008, CSMA/CD, 2009, pp. 1-38. |
IEEE Standard for Information technology—Telecommunications and information exchange between systems—Local and metropolitan area networks—Specific requirements, Part 3: Carrier Sense Multiple Access with Collision Detection (CSMA/CD) Access Method and Physical Layer Specifications, IEEE Std. 802.3-2008/Cor Jan. 2009, Feb. 1, 2010, pp. 1-12. |
IEEE Standard for Information technology—Telecommunications and information exchange between systems—Local and metropolitan area networks—Specific requirements, “Part 3: Carrier Sense Multiple Access with Collision Detection (CSMA/CD) access method and Physical Layer specifications”, Section Two, Revision of IEEE 802.3, 2008, pp. 1-790. |
Office Action in U.S. Appl. No. 10/693,037, dated Jan. 25, 2008. |
Final Office Action in U.S. Appl. No. 10/693,037, dated Nov. 12, 2008. |
Office Action in U.S. Appl. No. 10/693,037, dated Mar. 17, 2009. |
Notice of Allowance in U.S. Appl. No. 10/693,037, dated Jun. 29, 2009. |
Notice of Allowance in U.S. Appl. No. 10/693,037, dated Sep. 21, 2009. |
Office Action in U.S. Appl. No. 10/702,387, dated Jun. 14, 2007. |
Office Action in U.S. Appl. No. 10/702,387, dated May 30, 2008. |
Office Action in U.S. Appl. No. 10/702,387, dated Dec. 8, 2008. |
Office Action in U.S. Appl. No. 10/702,387, dated Mar. 17, 2009. |
Office Action in U.S. Appl. No. 10/702,387, dated Aug. 20, 2009. |
Office Action in U.S. Appl. No. 10/702,387, dated Dec. 28, 2009. |
Notice of Allowance in U.S. Appl. No. 10/702,387, dated Apr. 29, 2010. |
Notice of Allowance in U.S. Appl. No. 10/702,387, dated May 27, 2010. |
Office Action in U.S. Appl. No. 12/615,142, dated Dec. 14, 2010. |
Notice of Allowance in U.S. Appl. No. 12/615,142, dated Jun. 13, 2011. |
Office Action in U.S. Appl. No. 10/702,387, dated Nov. 27, 2007. |
Office Action in U.S. Appl. No. 12/791,828, mailed Nov. 9, 2012. |
Notice of Allowance in U.S. Appl. No. 12/791,828, mailed Apr. 18, 2013. |
NonFinal Office Action in U.S. Appl. No. 13/944,876 mailed Jan. 14, 2014; 7 pages. |
Number | Date | Country | |
---|---|---|---|
20120147747 A1 | Jun 2012 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 12615142 | Nov 2009 | US |
Child | 13161439 | US | |
Parent | 10693037 | Oct 2003 | US |
Child | 12615142 | US |