In-band management relates to the local management of a network. In-band management may be achieved, for example, via a virtual connection to a communication device in the network or by employing a Simple Network Management Protocol (SNMP). These connections consume bandwidth and traditionally require a complex management layer. In addition, management of a remote device in a network generally requires a processor sub-system with a dedicated Ethernet port which is often complex and costly.
Bit-interleaved parity (BIP) is a method of error detection employed in networks to determine whether a data transmission comprises an error. In BIP, a parity byte is calculated bit-wise across a number of bytes in a frame for data transmission, such as an Ethernet frame or data packet, to force character bit patterns into even parity.
Many aspects of the present disclosure can be better understood with reference to the following drawings. The components in the drawings are not necessarily to scale, with emphasis instead being placed upon clearly illustrating the principles of the disclosure. Moreover, in the drawings, like reference numerals designate corresponding parts throughout the several views.
The present disclosure relates to in-band management of Ethernet links utilizing a bit-interleaved parity (BIP) block in a transmission frame. In-band management relates to the local management of a network. In-band management may be achieved, for example, via a virtual connection to a communication device within the network or by employing a Simple Network Management Protocol (SNMP). These connections consume bandwidth and traditionally require a complex management layer. Additionally, management of remote devices in a network generally requires a processor sub-system with a dedicated Ethernet port which is often complex and costly.
For example, an administrator of a network may monitor, maintain, or otherwise interact with communication devices (e.g., Ethernet switches, routers, servers) installed in one or more racks in a network center, wherein each of the one or more racks may be managed as an individual and segmented unit within the network center. A “top of the rack” (TOR) device (e.g., a server, a switch, etc.) located in a rack may connect to one or more Ethernet switches installed within the rack and may be used as a central device for monitoring or configuring the devices installed within the rack. Similarly, an “end of row” (EOR) device (e.g., a server, a switch, etc.) located in a rack may connect to one or more Ethernet switches installed within one or more racks in the network center and may be used as a central device for monitoring or configuring the devices within the one or more racks. To monitor, maintain, or otherwise interact with the Ethernet switches in the rack, the administrator may make a physically connection to the TOR device or EOR device via an Ethernet management cable and/or a console to transmit network management information used in the maintenance, configuration, reconfiguration, and/or management of the network.
According to various embodiments of the present disclosure, network management information may be encoded in existing transmission frames that otherwise would have been sent without network management information. A BIP error code may be generated for a monitored portion of network data for transmission in a first BIP block. Network management data may be encoded in a plurality of bits for transmission in a second BIP block according to a predefined block code, wherein the predefined block code generates the plurality of bits to maintain a direct current (DC) balance between the bit-interleaved parity error code and the plurality of bits, as will be discussed in greater detail below.
By transmitting network management data in existing transmission frames, maintenance and/or control of TOR and/or EOR devices may be facilitated without the use of an Ethernet management cable or console and will not increase the bandwidth used in transmitting separate network management data. Alternative in-band management implementations may require a designated internet protocol (IP) address, may be complex, and may consume network bandwidth. In the following discussion, a general description of the system and its components is provided, followed by a discussion of the operation of the same.
An equipment rack 101 is an example of a component assembly or component housing that may be used in a network center. The equipment rack 101 may contain multiple individual components 104 and/or other sub-components. The components 104 may be interconnected and connected to the equipment rack 101 using various cables in order to support network connectivity, console connectivity, electrical power, and/or other services.
The network center may be staffed by one or more administrators 108, or other support personnel. Generally, to configured one or more components 104 of the equipment rack 101, the administrators 108 may use a console 110 to assist in performing various diagnostic, informational, and administrative tasks. The console 110 may be a general-purpose device, such as a laptop computer, personal digital assistant (PDA), tablet computer, slate computer, smartphone, or a non-portable computing device (for example, a workstation or purpose-built stationary appliance located near the data center loading dock) upon which applications may be executed. The console 110 may comprise a display 112, a printer, and/or other type of interface device for communicating with the administrator 108.
The console 110 may be capable of configured the various components 104 of the equipment rack 101 by interacting with a TOR device 103 (or an EOR device) via an Ethernet cable 115 or like component.
Referring next to
The source device 203 is representative of multiple source devices 203 that may be in communication with the network 212. According to various embodiments, the source device 203 may comprise a processor-based system, such as a server computer, a switch, a router, or any other suitable communication device. The source device 203 is operable to transmit and receive data to and from the destination device 206 and/or other devices using the network 212. To this end, the source device 203 comprises source device processing circuitry 215 and potentially other components and/or functionality. The source device processing circuitry 215 may, for example, generate data, format data, transmit data, and/or perform other processing functionality. As such, the source device processing circuitry 215 may comprise, for example, a network interface card (NIC), a transceiver physical layer (PHY), and/or other types of components.
The source device 203 generates and transmits transmission frames 218 (e.g., Ethernet frames or data packets) to the destination device 206 and/or other devices using the network 212. The destination device 206 and potentially other devices may generate transmission frames 218 as well. A transmission frame 218 comprises framed data that is suitable for transmission in the network 212. The transmission frame 218, for example, may comprise control data and payload data, wherein the control data facilitates the routing and transmission of the payload data in the transmission frame 218. The transmission frame 218 may also comprise network management data 220 that is located in, for example, portions of the transmission frame 218, as will be discussed in greater detail below.
The intermediary devices 209 are representative of one or more intermediary devices 209a, 209b, and/or 209c (and so forth) that may be present in the network 212. In the non-limiting example of
In the example of
The destination device 206 is representative of multiple destination devices 206 that may be in communication with the network 212. The destination device 206 is operable to receive transmission frames 218 from the network 212. To this end, the destination device 206 comprises destination device processing circuitry 224 and potentially other components and/or functionality. According to various embodiments, the destination device 206 may comprise a processor-based or processor-less system, such as a server computer, a switch, a network controller, a router, or any other suitable communication device. The destination device processing circuitry 224 may, for example, receive a transmission frame 218, remove the network management data 220 in the transmission frame 218, and process the network management data 220 in the transmission frame 218. As such, the destination device processing circuitry 224 may comprise, for example, a network interface card (NIC), a transceiver physical layer (PHY), and/or other types of components.
Next, a discussion of an example of the operation of the networked environment 200 is provided. In the following discussion, it is assumed that the source device 203 and the destination device 206 are in communication with the network 212.
The source device 203 may prepare to transmit a transmission frame 218 that comprises network management data 220 that is to be transmitted as at least a portion of the payload. Such network management data 220 may be generated by and originate from the source device 203. Alternatively, another communication device may generate the network management data 220, and the source device 203 may receive the network management data 220 from the other device and then transmit the network management data 220 in the transmission frame 218 on behalf of the other device.
As may be appreciated, transmission frames 218 may comprise error detection codes (error codes) that facilitate reliable delivery of data over communication channels. Accordingly, in various embodiments, the source device processing circuitry 215 may encode an error code for a monitored portion of network data for transmission in a first block within the transmission frame 218. Further, the source device processing circuitry 215 may encode network management data 220 in a second block of the transmission frame 218 without modification of the payload data and without the use of additional transmission frames 218. According to various embodiments, the network management data 220 may be encoded according to a predefined block code, wherein the predefined block code generates a plurality of bits comprising the network management 220 to maintain a DC balance between the error code and the network management data 220. The source device processing circuit 215 may generate or otherwise configure the transmission frame 218 comprising at least the first block and the second block for transmission to the destination device 206. According to various embodiments, the destination device 206 may comprise a TOR device 113 (or an EOR device) that may be configured to process the network management data 220 within the transmission frame 218 to configure or reconfigure various components within an equipment rack 101 according to the network management data 220. Generation of the transmission packet 218 comprising network management data 220 will be discussed in greater detail below.
Turning now to
A first portion 306 of the alignment marker 300 may comprise one or more “words,” or a plurality of bits, that represent at least a portion of a payload of the alignment marker 300. In the non-limiting example of
Bit-interleaved parity (BIP) code is a method of monitoring error rates of network links (e.g., Ethernet links). An X-bit BIP error code (BIP-X) may be generated for a monitored portion (e.g., monitored payload blocks) having even parity for transmission over a specific portion, or block, of a transmission frame 218. A first bit-interleaved parity block 312 may comprise, for example, an error detection code in a first BIP format. In the non-limiting example of
Similarly, a second bit-interleaved parity block 315 may comprise, for example, an error detection code in a second BIP format. In the non-limiting example of
BitsBIP-7=˜(BitsBIP-3) (eq. 2).
As can be appreciated, a primary use of a BIP-7 block 315 is to maintain DC parity with the BIP-3 block 312. However, all or at least a portion of the bits in the second BIP block 315 may be dedicated for encoding network management data 220 while maintaining the DC balance. Accordingly, network management data 220 may be transmitted in otherwise existing transmission frames 218 while maintaining DC balance in the alignment marker 300.
As a non-limiting example, all 8 bits in an 8-bit second BIP block 315 may be used to encode network management data 220. The 8 bits representing network management data 220 may be encoded according to a block code so that the resulting bits maintain DC balance in the transmission frame 218, while comprising network management data 200. In another example, 5 bits in the 8-bit second BIP block 315, or any other subset of bits, may be employed to encode network management data 220. For example, the 5 bits may be used to encode network management data 220. The remaining 3 bits may be determined such that DC balance is maintained in the transmission frame 218. Thus, the plurality of remaining bits in the second bit-interleaved parity block is to generate a uniform block comprising both the encoded network management data (e.g., the 5 bits) and the resulting bits (e.g., the 3 bits) encoded to maintain DC balance between the uniform block and the first BIP block 312. Accordingly, network management data 220 may be encoded in existing transmission frames 218 without necessitating separate transmission of the network management data 220 as a payload in a plurality of transmission frames 218, thereby avoiding an increase in bandwidth of a network 212.
As may be appreciated, the transmission frames 218 may be employed in various technologies for sending transmission frames 218 such as 40 Gigabit Ethernet (40 GbE), 100 Gigabit Ethernet (100 GbE), and/or other technologies. Tables 1 and 2 below show examples of 40 GbE and 100 GbE distinct personal communication service (PCS) lane markers, respectively, which may include the first BIP block 312 (e.g., a BIP-3 block) and the second BIP block 315 (e.g., a BIP-7 block).
According to various embodiments, a source device 203, a destination device 206, or any other communication device may comprise, for example, a gearbox PHY. The structure of multi-link gearbox (MLG) 1.0 PCS lane numbers are similar to Ethernet multi-link device (MLD) PCS lane numbers, however, with new distinct PCS lane numbers. For example, MLG 1.0 PCS lane numbers also include a BIP-3 block and a BIP-7 block. Table 3 below shows an example of 4×25 MLG lane alignment marker values which may include a BIP-3 block and a BIP-7 block.
MLG 2.0, as opposed to MLG 1.0, comprises two modes of operation: 4×25G and 8×25G. Tables 4 and 5 below show examples of the PCS lane numbers of 4×25G and 8×25G modes, respectively.
Further, 40 GbE Ethernet and 100 GbE Ethernet links, as well as Optical Internetworking Forum (OIF) MLG definition of 4×25G and 8×25G, embed BIP-3 and BIP-7 blocks into respective PCS lane numbers. As discussed above, BIP-3 is an 8 parity bit word that may be used to quickly estimate a BER for a network link. IEEE 802.3 clause 82 defines a parity definition for BIP-3. An example of BIP-3 value assignment is shown below in Table 6.
For example, BIP-3, bit 0, is an exclusive or of bits 2, 10, 18, 26, 34, 42, 50, 58 from a 66-bit word. Because BIP-7 may be a bitwise inversion of BIP-3, it is capable of being used to carry network management data 220 between two communication devices, such as host and a client, without the need of a separate Ethernet link or console 110. The BIP-3 or BIP-7 block in MLD or MLG traffic can be used to carry point-to-point or multi-point management information between a host and a client or between two hosts. The generation of a DC balanced transmission frame 218 will be discussed in greater detail below.
Moving on to
Thus, the Ethernet switch 403 may comprise processing circuitry that may, for example, generate data, format data, transmit data, and/or perform other processing functionality. As such, the Ethernet switch 403 may comprise, for example, a network interface card (NIC), a transceiver physical layer (PHY) 409, a field-programmable gate array (FPGA), and/or other types of components. Accordingly, a transmission frame 218 comprising network management data 220 may be read from or written to by a PHY 409, a chip, an FPGA, or any other circuitry within or otherwise in data communication with the Ethernet switch 403.
According to various embodiments, the original BIP-3 data is preserved and the network management data 220 is added to the BIP-7 block 315 while maintaining DC balance. The gearbox PHY 406, upon receipt of the transmission frame 218, reads and monitors the BIP-3 block 312 (
According to various embodiments, the Ethernet switch 403 and the gearbox PHY 406 may communicate as a host-client arrangement or as a client-client arrangement and may communicate directly or indirectly (i.e., over a network 212) with a network management device 410. Region 412 and region 415 illustrate that the gearbox PHY 406 may communicate with downstream devices or, alternatively, may communicate with the network 212. According to various embodiments, a gearbox PHY 406 may comprise a processor-less device, a multi-link distribution (MLD) device, and/or a multi-link gearbox (MLG). For example, the gearbox PHY 406 may comprise the Broadcom® BCM84790 gearbox PHY or the BCM84793 gearbox PHY. In various embodiments, the gearbox PHY 406 may comprise a 10 GbE, 40 GbE, or 100 GbE physical layer device while offering a multi-rate interface for 4-lane (4×25 Gbps) and 10-lane (10×10 Gbps) bi-directional transmissions.
Referring next to
Accordingly, the first Ethernet switch 503 and/or the second Ethernet switch 506 may comprise processing circuitry that may, for example, generate data, format data, transmit data, and/or perform other processing functionality. As such, the Ethernet switch 403 may comprise, for example, a network interface card (NIC), a transceiver physical layer (PHY), a field-programmable gate array (FPGA), and/or other types of components. Accordingly, a transmission frame 218 may be read or write network management data 220 by a NIC, a PHY, a chip, an FPGA, or any other circuitry within or otherwise in data communication with the first Ethernet switch 503 or the second Ethernet switch 506. Although the non-limiting example of
Turning now to
Beginning with 603, the source device processing circuitry 215 is operable to encode a BIP error code for monitored data for transmission in a first BIP block 312 (
In 606, the source device processing circuitry 215 may encode network management data 220 according to a predefined block code for transmission in a second BIP block 315 (
Accordingly, a primary use of a BIP-7 block 315 is to maintain DC parity with the BIP-3 block 312. However, all or at least a portion of the bits in the second BIP block 315 may be dedicated for encoding network management data 220 while maintaining the DC balance. Accordingly, network management data 220 may be transmitted in otherwise existing transmission frames 218 while maintaining DC balance in the alignment marker 300.
As a non-limiting example, all 8 bits in an 8-bit second BIP block 315 may be used to encode network management data 220. The 8 bits representing network management data 220 may be encoded according to a block code so that the resulting bits maintain DC balance in the transmission frame 218, while comprising network management data 200. A block code may comprise, for example, 3B/4B block code, 4B/5B block code, 8B/10B block code, and/or any other block code encoding technique that may encode network management data 220 while maintaining DC balance.
In another example, a portion of bits in the second BIP block 315 (as opposed to all the bits in the second BIP block 315) may be employed to encode network management data 220. As a non-limiting example, 4 bits of the 8-bit second BIP block 315, or any other subset of bits, may be employed to encode network management data 220. The remaining bits in the second BIP block 315 may be determined such that DC balance is maintained in the transmission frame 218. Accordingly, network management data 220 may be encoded in existing transmission frames 218 without necessitating separate transmission of the network management data 220 as a payload in a plurality of transmission frames 218, thereby avoiding an increase in bandwidth of a network 212.
In 609, a transmission frame 218 may be generated or otherwise reconfigured to include both the first BIP block 312 and the second BIP block 315, wherein the transmission frame 218 has DC balance. In 612, the transmission frame 218, comprising the network management data 220, may be transmitted from the source device 203 to a destination device 206. To restate, the transmission frame 218, comprising the network management data 220, may be transmitted from a first communication device to a second communication device.
Turning now to
Beginning with 703, a first communication chip is operable to encode a BIP error code for monitored data for transmission in a first BIP block 312 (
In 706, the first communication chip is operate to encode management data (as opposed to network management data 220) according to a predefined block code for transmission in a second BIP block 315 (
Accordingly, a primary use of a BIP-7 block 315 is to maintain DC parity with the BIP-3 block 312. However, all or at least a portion of the bits in the second BIP block 315 may be dedicated for encoding management data while maintaining the DC balance. Accordingly, management data may be transmitted in otherwise existing transmission frames 218 while maintaining DC balance in the alignment marker 300.
As a non-limiting example, all 8 bits in an 8-bit second BIP block 315 may be used to encode network management data 220. The 8 bits representing network management data 220 may be encoded according to a block code so that the resulting bits maintain DC balance in the transmission frame 218, while comprising network management data 200. A block code may comprise, for example, 3B/4B block code, 4B/5B block code, 8B/10B block code, and/or any other block code that may encode network management data 220 while maintaining DC balance.
In another example, a portion of bits in the second BIP block 315 (as opposed to all the bits in the second BIP block 315) may be employed to encode network management data 220. As a non-limiting example, 4 bits of the 8-bit second BIP block 315, or any other subset of bits, may be employed to encode network management data 220. The remaining bits in the second BIP block 315 may be determined such that DC balance is maintained in the transmission frame 218. Accordingly, management data may be encoded in existing transmission frames 218 without necessitating separate transmission of the management data as a payload in a plurality of transmission frames 218, thereby avoiding an increase of data on the bus.
In 709, a transmission frame 218 may be generated or otherwise reconfigured to include both the first BIP block 312 and the second BIP block 315, wherein the transmission frame 218 has a DC balance. In 712, the transmission frame 218, comprising the management data, may be transmitted from the first chip to a second chip.
With reference to
Stored in the memory 806 are both data and several components that are executable by the processor 803. In particular, stored in the memory 806 and executable by the processor 803 is an encoding application 812 that performs the operations depicted in
It is understood that there may be other applications that are stored in the memory 806 and are executable by the processor 803 as can be appreciated. Where any component discussed herein is implemented in the form of software, any one of a number of programming languages may be employed such as, for example, C, C++, C#, Objective C, Java®, JavaScript®, Perl, PHP, Visual Basic®, Python®, Ruby, Flash®, or other programming languages.
A number of software components are stored in the memory 806 and are executable by the processor 803. In this respect, the term “executable” means a program file that is in a form that can ultimately be run by the processor 803. Examples of executable programs may be, for example, a compiled program that can be translated into machine code in a format that can be loaded into a random access portion of the memory 806 and run by the processor 803, source code that may be expressed in proper format such as object code that is capable of being loaded into a random access portion of the memory 806 and executed by the processor 803, or source code that may be interpreted by another executable program to generate instructions in a random access portion of the memory 806 to be executed by the processor 803, etc. An executable program may be stored in any portion or component of the memory 806 including, for example, random access memory (RAM), read-only memory (ROM), hard drive, solid-state drive, USB flash drive, memory card, optical disc such as compact disc (CD) or digital versatile disc (DVD), floppy disk, magnetic tape, or other memory components.
The memory 806 is defined herein as including both volatile and nonvolatile memory and data storage components. Volatile components are those that do not retain data values upon loss of power. Nonvolatile components are those that retain data upon a loss of power. Thus, the memory 806 may comprise, for example, random access memory (RAM), read-only memory (ROM), hard disk drives, solid-state drives, USB flash drives, memory cards accessed via a memory card reader, floppy disks accessed via an associated floppy disk drive, optical discs accessed via an optical disc drive, magnetic tapes accessed via an appropriate tape drive, and/or other memory components, or a combination of any two or more of these memory components. In addition, the RAM may comprise, for example, static random access memory (SRAM), dynamic random access memory (DRAM), or magnetic random access memory (MRAM) and other such devices. The ROM may comprise, for example, a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), or other like memory device.
Also, the processor 803 may represent multiple processors 803 and/or multiple processor cores and the memory 806 may represent multiple memories 806 that operate in parallel processing circuits, respectively. In such a case, the local interface 809 may be an appropriate network that facilitates communication between any two of the multiple processors 803, between any processor 803 and any of the memories 806, or between any two of the memories 806, etc. The local interface 809 may comprise additional systems designed to coordinate this communication, including, for example, performing load balancing. The processor 803 may be of electrical or of some other available construction.
Although the encoding application 812, and other various systems described herein may be embodied in software or code executed by general purpose hardware as discussed above, as an alternative the same may also be embodied in dedicated hardware or a combination of software/general purpose hardware and dedicated hardware. If embodied in dedicated hardware, each can be implemented as a circuit or state machine that employs any one of or a combination of a number of technologies. These technologies may include, but are not limited to, discrete logic circuits having logic gates for implementing various logic functions upon an application of one or more data signals, application specific integrated circuits (ASICs) having appropriate logic gates, field-programmable gate arrays (FPGAs), or other components, etc. Such technologies are generally well known by those skilled in the art and, consequently, are not described in detail herein.
The flowcharts of
Although the flowcharts of
Also, any logic or application described herein, including the encoding application 812, that comprises software or code can be embodied in any non-transitory computer-readable medium for use by or in connection with an instruction execution system such as, for example, a processor 803 in a computer system or other system. In this sense, the logic may comprise, for example, statements including instructions and declarations that can be fetched from the computer-readable medium and executed by the instruction execution system. In the context of the present disclosure, a “computer-readable medium” can be any medium that can contain, store, or maintain the logic or application described herein for use by or in connection with the instruction execution system.
The computer-readable medium can comprise any one of many physical media such as, for example, magnetic, optical, or semiconductor media. More specific examples of a suitable computer-readable medium would include, but are not limited to, magnetic tapes, magnetic floppy diskettes, magnetic hard drives, memory cards, solid-state drives, USB flash drives, or optical discs. Also, the computer-readable medium may be a random access memory (RAM) including, for example, static random access memory (SRAM) and dynamic random access memory (DRAM), or magnetic random access memory (MRAM). In addition, the computer-readable medium may be a read-only memory (ROM), a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), or other type of memory device.
Further, any logic or application described herein, including the encoding application 812, may be implemented and structured in a variety of ways. For example, one or more applications described may be implemented as modules or components of a single application. Further, one or more applications described herein may be executed in shared or separate computing devices or a combination thereof. For example, a plurality of the applications described herein may execute in the same source device 203, or in multiple computing devices in the same computing environment. Additionally, it is understood that terms such as “application,” “service,” “system,” “engine,” “module,” and so on may be interchangeable and are not intended to be limiting.
Disjunctive language such as the phrase “at least one of X, Y, or Z,” unless specifically stated otherwise, is otherwise understood with the context as used in general to present that an item, term, etc., may be either X, Y, or Z, or any combination thereof (e.g., X, Y, and/or Z). Thus, such disjunctive language is not generally intended to, and should not, imply that certain embodiments require at least one of X, at least one of Y, or at least one of Z to each be present.
It should be emphasized that the above-described embodiments of the present disclosure are merely possible examples of implementations set forth for a clear understanding of the principles of the disclosure. Many variations and modifications may be made to the above-described embodiment(s) without departing substantially from the spirit and principles of the disclosure. All such modifications and variations are intended to be included herein within the scope of this disclosure and protected by the following claims.
This application claims the benefit of and priority to U.S. Provisional Application Ser. No. 61/881,673, filed Sep. 24, 2013, and entitled “INBAND MANAGEMENT OF ETHERNET LINKS” which is incorporated by reference herein in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
6493342 | Breslow et al. | Dec 2002 | B1 |
6711704 | Tezuka | Mar 2004 | B1 |
6876315 | Widmer | Apr 2005 | B1 |
8514634 | Wu | Aug 2013 | B1 |
20040135710 | Widmer | Jul 2004 | A1 |
20120102210 | Kudo | Apr 2012 | A1 |
20130251012 | Fujimori | Sep 2013 | A1 |
Entry |
---|
Roy Bynum, Recommendations for Operational Maintenance Functions Within the 10GbE WAN Compatible PHY, May 2000, MCI Broadcom, P802.3ae, Interim Meeting. |
Number | Date | Country | |
---|---|---|---|
20150089319 A1 | Mar 2015 | US |
Number | Date | Country | |
---|---|---|---|
61881673 | Sep 2013 | US |