Network devices are commonly used to facilitate the transfer of data among devices on a computer network. Different network devices may perform different functions. For instance, layer 1 network devices are typically configured to transfer data at the physical layer (layer 1). However, they do not manage any of the traffic coming through them. Layer 2 network devices may be configured to process and forward data at the data link layer (layer 2) based on hardware addresses (e.g., media access control (MAC) addresses). As another example, layer-3 devices can be configured to process and forwards data at the network layer (layer 3) based on routing protocols (e.g., an Internet Protocol (IP) routing protocol).
In the following description, for purposes of explanation, numerous examples and specific details are set forth in order to provide a thorough understanding of the present invention. It will be evident, however, to one skilled in the art that the present invention as defined by the claims may include some or all of the features in these examples alone or in combination with other features described below, and may further include modifications and equivalents of the features and concepts described herein.
A network device can have several network interfaces, such as Ethernet ports, through which data is received from and transmitted to other network devices. In some cases, the network interfaces may conform to a network port standard (e.g., a 40G-Base-CR4 network port standard). At the physical layer, each network interface may have four physical connections. A receiving network device can receive packets from a transmitting network device. The transmitting network device may divide Ethernet packets into 64/66 blocks, allocate the blocks one at a time to each physical layer interface (also referred to as a lane), and transmit the blocks to the receiving network device. The order of writing to lanes is always lane 0, lane 1, lane 2, lane 3, lane 0, lane 1, lane 2, etc. However, the physical layer may re-order the lanes. That is, the lanes can be swapped. The receiving network device needs to reorder the lanes before it can correctly decode a packet. The transmitting network device periodically sends alignment markers to allow the receiving network device to uniquely identify each physical connection (i.e., lane) and then reorder the lanes according to a defined order.
When each of the transceivers 120-123 receives a block of encoded and scrambled data, it performs some processing and sends it to lane deskew and alignment searcher 125. Lane deskew and alignment searcher 125 searches the blocks of data received from the transceivers 120-123 for alignment data indicating which logical lane is associated with which physical connection. As shown in
While
Described herein are techniques for using a cross point switch of a network device to reorder lanes for network interfaces of the network device. In some embodiments, a network device includes a set of network interfaces. Each network interface may include several physical connections. When the network interface of the network device receives data from another network device, the data is received in chunks (also referred to as blocks) that are allocated across the different physical connections in a defined order of logical lanes. In order to determine the logical lane associated with each physical connection, the network interface of the network device may receive alignment information at each of the physical connections that uniquely identifies the logical lane with which the physical connection is associated. Based on the alignment information, the network device can configure a cross point switch of the network device to reorder the physical connections so that the logical lanes associated with the physical connections are ordered according to the defined order of logical lanes. This way, the network device is able to process the data received at the physical connections in the correct order, which allows the network device to correctly reconstruct the data received from the other network device.
Using the techniques described herein, the latency through prior art network devices, such as the ones illustrated in
The techniques described in the present application provide a number of benefits and advantages over conventional methods for reordering logical lanes for network interfaces of network devices. For instance, in some embodiments, a network device may include a cross point switch that is part of the data path used to process data received from another network device. By leveraging the cross point switch of the network device, which already part of the data path used to process data, to reorder the physical connections of a network interface receiving the data from the other network device, the network device is able to avoid using particular components (e.g., a set of multiplexors) that would otherwise be used in certain implementations of conventional network devices (e.g., network device 100) to reorder the physical connections. Hence, the technical effect provided by the techniques described herein is reducing the latency through the network device by the amount of latency incurred by the processing of data by the particular components. Reducing and/or removing latency in a network device is particularly advantageous in applications where providing the lowest possible latency is desirable (e.g., trading infrastructures of trading companies).
Encoder 210 is configured to encode packets of data intended to be transmitted to network device 235 into blocks of encoded data. For example, encoder 210 may take an Ethernet packet and divide it into blocks of data having a defined bit-length (e.g., 32 bits, 64 bits, etc.). Once the Ethernet packet is divided into such blocks of data, encoder 210 encodes each of the blocks of data. In some embodiments, encoder 210 uses a 64b/66b encoding scheme to encode the blocks of data. In some such embodiments, encoder 210 divides the Ethernet packet into blocks of 64 bit data. Then, the 64b/66b encoding scheme that encoder 210 employs transforms a 64 bit block of data to a 66 bit line code. A line code can be a pattern of voltage, current, or photons used to represent digital data transmitted down a transmission line. After encoding the block of data, encoder 210 sends the block of encoded data to scrambler 215 for further processing. Encoder 210 may continue to process remaining blocks of data in the same manner.
Scrambler 215 is responsible for processing blocks of encoded data. For instance, scrambler 215 can receive a block of encoded data from encoder 210. In response to receiving the block of encoded data, scrambler 215 may scramble the block of encoded data. In some embodiments, scrambler 215 utilizes a self-synchronizing scrambling technique to scramble the block of encoded data. Once scrambler 215 scrambles the block of encoded data, scrambler 215 sends it to block distributor 220 for further processing.
Block distributor 220 is configured to process encoded and scrambled blocks of data. For example, block distributor 220 may receive a block of scrambled and encoded data from scrambler 215. Upon receiving the block of data, block distributor 220 distributes it to a logical lane associated with one of the physical connection 231-234. In some embodiments, block distributor 220 distributes sequential blocks of data to logical lanes in a round robin manner. As an example, assume that physical connection 231 is associated with logical lane 0, physical connection 232 is associated with logical lane 1, physical connection 233 is associated with logical lane 2, and physical connection 234 is associated with logical lane 3. In addition, assume block distributor 220 receives a first block of data first, a second block of data second, a third block of data third, etc. In such an example, block distributor 220 may distribute the first block of data to logical lane 0 (i.e., physical connection 231), the second block of data to logical lane 1 (i.e., physical connection 232), the third block of data to logical lane 2 (i.e., physical connection 233), the fourth block of data to logical lane 3 (i.e., physical connection 234), the fifth block of data to logical lane 0 (i.e., physical connection 231), the sixth block of data to logical lane 1 (i.e., physical connection 232), and so on and so forth. After distributing a block of data to a logical lane, block distributor 220 sends it to alignment inserter 225.
Alignment inserter 225 handles the insertion of alignment data to the data streams transmitted through physical connections 231-234. For instance, alignment inserter 225 may receive blocks of data from block distributor 220 and forward them to the physical connections 231-234 determined by block distributor 220. Periodically, alignment inserter 225 can insert alignment data into data stream of each physical connection 231-234. In some embodiments, alignment inserter 225 inserts alignment data into data stream of each physical connection 231-234 at intervals of a defined number of blocks (e.g., 1,000 blocks, 5,000 blocks, 16,383 blocks, 20,000 blocks, etc.) transmitted through each physical connection 231-234. For example, after the defined number of blocks have been transmitted through each physical connection 231-234, alignment inserter 225 inserts alignment data into the data stream of each physical connection 231-234. Then after another defined number of blocks have been transmitted through each physical connection 231-234, alignment inserter 225 again inserts alignment data into the data stream of each physical connection 231-234. In some embodiments, the alignment data that alignment inserter 225 inserts into the data stream of one of the physical connections 231-234 includes data indicating a logical lane with which the physical connection is associated. Using the example above where physical connection 231 is associated with logical lane 0, physical connection 232 is associated with logical lane 1, physical connection 233 is associated with logical lane 2, and physical connection 234 is associated with logical lane 3, the alignment data that alignment inserter 225 inserts into the data stream of physical connection 231 indicates logical lane 0 is associated with physical connection 231. Similarly, the alignment data that alignment inserter 225 inserts into the data stream of physical connection 232 indicates logical lane 1 is associated with physical connection 231, the alignment data that alignment inserter 225 inserts into the data stream of physical connection 233 indicates logical lane 2 is associated with physical connection 231, and the alignment data that alignment inserter 225 inserts into the data stream of physical connection 234 indicates logical lane 3 is associated with physical connection 231. In some embodiments, the data used to indicate a logical lane may be a predefined unique value for identifying a unique logical lane associated with a physical connection. For instance, continuing with the aforementioned example, the data used to indicate logical lane 0 can be a first predefined unique value, the data used to indicate logical lane 1 can be a second predefined unique value, the data used to indicate logical lane 2 can be a third predefined unique value, and the data used to indicate logical lane 3 can be a fourth predefined unique value.
Operating system 230 is a software program executable on network device 205 that performs a variety of functions for facilitating the operation of network device 205. For example, operating system 230 may provide a command line interface (CLI) through which network device 205 (e.g., components of network device 205) can be configured and/or programmed. Operating system 130 can also configure the mode of the ports 231-234. For example, in some embodiments, ports 231-234 can be configured as four independent links (e.g., four 10 Gigabit links). In some such embodiments, each independent link may have its own encoder and scrambler. In addition, operating system 230 can receive status information from ports 231-234. Examples of such status information include whether an SFP transceiver is plugged in the ports 231-234 or not, the transmit power (tx) level, the receive (rx) light level, etc.
As shown, cross point switch 240 includes four inputs coupled to physical connections 236-239 and four outputs coupled to transceivers 241-244. Cross point switch 240 may be configured so that any input may be connected to any output. In some instances, cross point switch 240 receives configuration information from operating system 260 that specifies which input is to be connected to which output. In response to receiving such configuration information, cross point switch 240 configures itself accordingly.
Transceivers 241-244 is configured to process blocks of encoded and scrambled data received from physical connections 236-239 via cross point switch 240. For example, when one of the transceivers 241-244 receives a block of encoded and scrambled data from one of the physical connections 236-239 via cross point switch 240, the transceiver converts the serial string of bits of the block of data into a parallel string of bits. The transceiver can also perform some clock recovering operations based on the block of encoded and scrambled data. After processing the block of data, the transceiver sends it to lane deskew and alignment searcher 245 for further processing.
Lane deskew and alignment searcher 245 can be configured to search for alignment data in the blocks of data received from transceivers 241-244. As mentioned above, in some embodiments, alignment data inserted into a data stream of a physical connection may be a predefined unique value associated with a logical lane in order to indicate the logical lane is associated with the physical connection. In some such embodiments, lane deskew and alignment searcher 245 checks the blocks of data received from transceivers 241-244 for such predefined values. When lane deskew and alignment searcher 245 detects alignment data from each of the transceivers 241-244, lane deskew and alignment searcher 245 determines, for each of the transceivers 241-244, the logical lane associated with the transceiver based on the alignment data. Next, lane deskew and alignment searcher 245 sends those determinations to operating system 260.
Lane deskew and alignment searcher 245 may also be configured to perform data deskew operations on blocks of encoded and scrambled data based on the detected alignment data. For instance, there may be instances where block of data transmitted from network device 205 at the same time may arrive at network device 235 at different times. This may be due to the physical links between network device 205 and network device 235 being different lengths. Therefore, when lane deskew and alignment searcher 245 receives alignment data from transceivers 241-244, lane deskew and alignment searcher 245 may adjust the skew of the blocks of data received from transceivers 241-244 so that the alignment data received at each of the transceivers 241-244 line up. Lane deskew and alignment searcher 245 may accomplish this by locking the lane of a transceiver when lane deskew and alignment searcher 245 detects alignment data in the lane of the transceiver and buffering blocks of data received at the transceiver. When the lanes of the transceivers 241-244 are locked (i.e., lane deskew and alignment searcher 245 has detected alignment data in the lane of each of the transceivers 241-244), lane deskew and alignment searcher 245 can removes the alignment data and unlocks the lanes. Lane deskew and alignment searcher 245 may perform the deskew operations described above every time it detects alignment data received from transceivers 241-244. After the blocks of data buffered from each of the transceivers 241-244 are aligned based on the alignment data, lane deskew and alignment searcher 245 may send descrambler 250 blocks of data from each of the transceivers 241-244 in a round robin manner. For instance, lane deskew and alignment searcher 245 can first send a block of data received from transceiver 241 to descrambler 250, then send a block of data received from transceiver 242 to descrambler 250, then send a block of data received from transceiver 243 to descrambler 250, then send a block of data received from transceiver 244 to descrambler 250, then send a block of data received from transceiver 241 to descrambler 250, then send a block of data received from transceiver 242 to descrambler 250, and so on and so forth.
Descrambler 250 is responsible for blocks of encoded and scrambled data. For example, descrambler 250 may receive a block of encoded and scrambled data from lane deskew and alignment searcher 245. In response, descrambler 250 can descramble the block of encoded and scrambled data. In some embodiments, descrambler 250 employs a self-synchronizing descrambling technique to descramble the block of encoded and scrambled data. After descrambling the block of data, descrambler 250 sends it to decoder 255 for further processing.
Decoder 255 handles the decoding of blocks of encoded data into packets of data. For instance, decoder 255 can receive blocks of encoded data from descrambler 250, decode them, and then construct an Ethernet packet from the blocks of decoded data. In some embodiments, decoder 255 employs a 64b/66b decoding schemed to decode the blocks of data.
Operating system 260 is a software program executable on network device 235 that performs a variety of functions for facilitating the operation of network device 205. For example, operating system 260 may be responsible for configuring cross point switch 240. When operating system 260 receives alignment data from lane deskew and alignment searcher 245 indicating the logical lane associated with each of the transceivers 241-244, operating system 260 configures cross point switch 240 to connect the inputs and outputs of cross point switch 240 so that the order of the logical lanes outputted from cross point switch 240 is the same as a defined order of logical lanes. Operating system 260 can provide a command line interface (CLI) through which network device 235 (e.g., components of network device 235) can be configured and/or programmed. Additionally, operating system 260 may configure the mode of the ports 236-239. For instance, in some embodiments, ports 236-239 may be configured as four independent links (e.g., four 10 Gigabit links). In some such embodiments, each independent link may have its own descrambler and decoder. Operating system 260 can also receive status information from ports 236-239. Examples of such status information include whether an SFP transceiver is plugged in the ports 236-239 or not, the transmit power (tx) level, the receive (rx) light level, etc. Operating system 260 may also report the status of lane deskew and decoder error counts for the purpose of diagnostics. Operating system 260 can additionally report whether the individual links are up or down, the physical layer, and whether the link formed collectively by ports 236-239 is up or down.
An example operation will now be described by reference to
The example operations starts by lane deskew and alignment searcher 245 detecting alignment data received from one of the transceiver 241-244. For example, lane deskew and alignment searcher 245 detects one of the predefined unique values associated with a logical lane from one of the transceivers 241-244. In response, lane deskew and alignment searcher 245 locks the lane of the transceiver. Lane deskew and alignment searcher 245 continues to lock the lane of remaining transceivers upon detecting alignment data from the transceiver. Once lane deskew and alignment searcher 245 detects alignment data from all of the transceivers 241-244, lane deskew and alignment searcher 245 determines the logical lane associated with each of the transceivers 241-244 based on the received alignment data. In this example, lane deskew and alignment searcher 245 determines that logical lane 0 is associated with transceiver 241, logical lane 2 is associated with transceiver 242, logical lane 3 is associated with transceiver 243, and logical lane 1 is associated with transceiver 244. Lane deskew and alignment searcher 245 sends, at operation 310, the determinations to operating system 260.
Upon receiving the associations between logical lanes and transceivers 241-244, operating system 260 generates configuration information for configuring cross point switch 240. For this example, operating system 260 stores information (e.g., a mapping between inputs and outputs of cross point switch 240) indicating that cross point switch 240 is currently configured so that the input coupled to physical connection 236 is connected to transceiver 241, the input coupled to physical connection 237 is connected to transceiver 242, the input coupled to physical connection 238 is connected to transceiver 243, and the input coupled to physical connection 239 is connected to transceiver 244. In this example, the defined order of logical lanes is such that data for logical lane 0 is received at transceiver 241, data for logical lane 1 is received at transceiver 242, data for logical lane 2 is received at transceiver 243, and data for logical lane 3 is received at transceiver 244. Therefore, operating system 260 generates configuration information that would connect the input coupled to physical connection 236 to transceiver 241, connect the input coupled to physical connection 237 to transceiver 242, connect the input coupled to physical connection 238 to transceiver 243, and connect the input coupled to physical connection 239 to transceiver 244. Once the configuration information is generated, operating system 260 uses, at 315, the configuration information to reconfigure cross point switch 240.
The examples and embodiments described above by reference to
Next, process 500 determines, at 520, a first order of the plurality of unique logical lanes associated with the plurality of physical connections based on the received alignment information. Referring to
Finally, process 500 configures, at 530, a cross point switch of the network device to reorder the physical connections from the first order of the plurality of unique logical lanes to a second order of the plurality of the unique logical lanes. Referring to
The following are some example embodiments of the present disclosure. In some embodiments, a method receives, by a network device, alignment information at each physical connection of a plurality of physical connections of a network interface of the network device. Each alignment information comprises a unique value for identifying a unique logical lane associated with the physical connection. The method further determines a first order of the plurality of unique logical lanes associated with the plurality of physical connections based on the received alignment information. The method also configures a cross point switch of the network device to reorder the physical connections from the first order of the plurality of unique logical lanes to a second order of the plurality of the unique logical lanes.
In some embodiments, the method may further send the determination of the first order of the plurality of unique logical lanes to an operating system executing on the network device. Configuring the cross point switch of the network device may be performed by the operating system. The network device may be a first network device. The alignment information may be received from a second network device. The method may further receive a plurality of blocks of encoded data from the second network device, descramble the plurality of blocks of encoded data, and decode the plurality of blocks of encoded data into a packet of data. The second network device may have generated the plurality of blocks of encoded data from a sequential stream of data. Descrambling the plurality of blocks of encoded data may include iteratively retrieving each block of encoded data from the plurality of blocks of encoded data in a round robin manner and constructing a sequential stream of blocks of encoded data based on the retrieved blocks of encoded data from the plurality of blocks of encoded data. The method may further convert each block of encoded data in the plurality of blocks of encoded data from a serial string of bits into a parallel string of bits.
In some embodiments, a network device includes a cross point switch. The network device further includes a network interface comprising a plurality of physical connections. Each physical connection of the plurality of physical connections is configured to receive alignment information. Each alignment information comprises a unique value for identifying a unique logical lane associated with the physical connection. The network device also includes an alignment searcher configured to determine a first order of the plurality of unique logical lanes associated with the plurality of physical connections based on the received alignment information. The network device further includes an operating system configured to configure the cross point switch of the network device to reorder the physical connections from the first order of the plurality of unique logical lanes to a second order of the plurality of the unique logical lanes.
In some embodiments, the alignment searcher may be further configured to send the determination of the first order of the plurality of unique logical lanes to the operating system. The network device may be a first network device. The alignment information may be received from a second network device. The plurality of physical connections of the network interface may be configured to receive a plurality of blocks of encoded data from the second network device. The network device may further include a descrambler configured to descramble the plurality of blocks of encoded data and a decoder configured to decode the plurality of blocks of encoded data into a packet of data. The second network device may have generated the plurality of blocks of encoded data from a sequential stream of data. The descrambler may be further configured to descramble the plurality of blocks of encoded data by iteratively retrieving each block of encoded data from the plurality of blocks of encoded data in a round robin manner and constructing a sequential stream of blocks of encoded data based on the retrieved blocks of encoded data from the plurality of blocks of encoded data. The network device may further include a plurality of transceivers configured to converting each block of encoded data in the plurality of blocks of encoded data from a serial string of bits into a parallel string of bits.
In some embodiments, a non-transitory machine-readable medium stores a program executable by at least one processing unit of a network device. The program receives alignment information at each physical connection of a plurality of physical connections of a network interface of the network device. Each alignment information comprises a unique value for identifying a unique logical lane associated with the physical connection. The program further determines a first order of the plurality of unique logical lanes associated with the plurality of physical connections based on the received alignment information. The program also configures a cross point switch of the network device to reorder the physical connections from the first order of the plurality of unique logical lanes to a second order of the plurality of the unique logical lanes.
In some embodiments, the program may further send the determination of the first order of the plurality of unique logical lanes to an operating system executing on the network device. Configuring the cross point switch of the network device may be performed by the operating system. The network device may be a first network device. The alignment information may be received from a second network device. The program may further receive a plurality of blocks of encoded data from the second network device, descramble the plurality of blocks of encoded data, and decode the plurality of blocks of encoded data into a packet of data. The second network device may have generated the plurality of blocks of encoded data from a sequential stream of data. Descrambling the plurality of blocks of encoded data may include iteratively retrieving each block of encoded data from the plurality of blocks of encoded data in a round robin manner and constructing a sequential stream of blocks of encoded data based on the retrieved blocks of encoded data from the plurality of blocks of encoded data.
The above description illustrates various embodiments of the present invention along with examples of how aspects of the present invention may be implemented. The above examples and embodiments should not be deemed to be the only embodiments, and are presented to illustrate the flexibility and advantages of the present invention as defined by the following claims. Based on the above disclosure and the following claims, other arrangements, embodiments, implementations and equivalents will be evident to those skilled in the art and may be employed without departing from the spirit and scope of the invention as defined by the claims.
Number | Name | Date | Kind |
---|---|---|---|
8488729 | Mendel | Jul 2013 | B1 |
8913705 | Chang | Dec 2014 | B2 |
8949493 | Wortman | Feb 2015 | B1 |
9246617 | Caggioni | Jan 2016 | B2 |
9395744 | Benjamini | Jul 2016 | B2 |
20100284486 | Kuwata | Nov 2010 | A1 |
20140269769 | Gresham | Sep 2014 | A1 |
20150003477 | Baeckler | Jan 2015 | A1 |
Number | Date | Country | |
---|---|---|---|
20210112018 A1 | Apr 2021 | US |