This patent application claims priority to India patent application No. 202341064503, filed on Sep. 26, 2023, and entitled “SYSTEMS AND METHODS FOR PROVIDING ENERGY EFFICIENT NETWORKS.” The disclosure of the prior application is considered part of and is incorporated by reference into this patent application.
A network planner designs a network for worst-case traffic conditions while also catering to various potential resiliency scenarios. A capacity of a network may be underutilized due to traffic conditions, and actual traffic patterns may exhibit periodicity with regards to overall network utilization.
Some implementations described herein relate to a method. The method may include generating, by a first network device, a first protocol data unit (PDU) identifying a request to transition from a power on state to a power sleep state, and providing the first PDU to a second network device. Power sleep is a state of a link where consumption of power by the link is lesser than a power on state. Implementations can power off the link entirely or partially to put the link in a power sleep state. The method may include receiving, by a control channel link, a second PDU identifying a link of the second network device to transition from the power on state to the power sleep state, and generating a third PDU identifying a link of the first network device to transition from the power on state to the power sleep state. The method may include providing the third PDU to the second network device, and transitioning the link of the first network device and the link of the second network device from the power on state to the power sleep state based on the second PDU and the third PDU.
Some implementations described herein relate to a first network device. The first network device may include one or more memories and one or more processors. The one or more processors may be configured to generate a first PDU identifying a request to transition from a power on state to a power sleep state, and provide the first PDU to a second network device. The one or more processors may be configured to receive a second PDU identifying a link of the second network device to transition from the power on state to the power sleep state, and generate a third PDU identifying a link of the first network device to transition from the power on state to the power sleep state. The one or more processors may be configured to provide the third PDU to the second network device, and determine whether a sleep timer has expired based on the second PDU and the third PDU. The one or more processors may be configured to transition the link of the first network device and the link of the second network device from the power on state to the power sleep state based on determining that the sleep timer has expired.
Some implementations described herein relate to a non-transitory computer-readable medium that stores a set of instructions. The set of instructions, when executed by one or more processors of a first network device, may cause the first network device to generate a first PDU identifying a request to transition from a power on state to a power sleep state, and provide the first PDU to a second network device. The set of instructions, when executed by one or more processors of the first network device, may cause the first network device to receive an acknowledgement of the first PDU from the second network device, and receive a second PDU identifying a link of the second network device to transition from the power on state to the power sleep state. The set of instructions, when executed by one or more processors of the first network device, may cause the first network device to generate a third PDU identifying a link of the first network device to transition from the power on state to the power sleep state, and provide the third PDU to the second network device. The set of instructions, when executed by one or more processors of the first network device, may cause the first network device to transition the link of the first network device and the link of the second network device from the power on state to the power sleep state based on the second PDU and the third PDU.
The following detailed description of example implementations refers to the accompanying drawings. The same reference numbers in different drawings may identify the same or similar elements.
In many networks, there is a large difference between peak network utilization and off-peak network utilization, and traffic patterns exhibit periodicity. Controlling a power life cycle of a network link (e.g., disabling a link during off-peak network utilization and reenabling the link during peak network utilization) may exploit power saving opportunities in a network. However, abruptly powering off and powering on a link may result in loss of traffic due to removal of an active link from a set of forwarding next-hop network devices. For example, links in a link aggregation group (LAG) must be gracefully added and removed from the LAG to minimize any loss of traffic during changes to link states. Current techniques for disabling a link (e.g., for energy efficiency) fail to entirely power off the link, which prevents powering off various components in a network device hierarchy (e.g., optical transceivers, lasers, and/or the like). Thus, current techniques for disabling a link consume computing resources (e.g., processing resources, memory resources, communication resources, and/or the like), networking resources, and/or the like, associated with failing to entirely power off the link, failing to power off components of a network device associated with the partially powered off link, losing traffic due to abruptly disabling the link, and/or the like.
Some implementations described herein relate to a network device that manages power states of aggregated links of the network device. For example, a first network device may generate a first protocol data unit (PDU) identifying a request to transition from a power on state to a power sleep state, and may provide the first PDU to a second network device. The first network device may receive a second PDU identifying a link of the second network device to transition from the power on state to the power sleep state, and may generate a third PDU identifying a link of the first network device to transition from the power on state to the power sleep state. The first network device may provide the third PDU to the second network device, and may transition the link of the first network device and the link of the second network device from the power on state to the power sleep state based on the second PDU and the third PDU.
In this way, the network device manages power states of aggregated links of the network device. For example, the network device may utilize enhancements to a link aggregation control protocol (LACP) to enable graceful transitions of power states of links in a LAG. The network device may also process actual traffic (e.g., in-system telemetry data) and predicted traffic (e.g., predicted telemetry data), with a model, to determine power state transitions of the links in the LAG, to manage the power states transitions of the links in the LAG, and/or the like. Thus, the network device conserves computing resources, networking resources, and/or the like that would otherwise have been consumed by failing to entirely power off the link, failing to power off components of a network device associated with the partially powered off link, losing traffic due to abruptly disabling the link, and/or the like.
As shown in
As further shown in
As further shown in
In some implementations, various applications, such as routing protocols, telemetry settings, configuration settings, and/or the like may interact with the LACP to transition a link to a desired link state. The LACP may interact with platform management software (e.g., of a network device) to power off and/or power on the links. In some implementations, two types of PM-TLVs may be encoded in an LACP PDU: a PM-TLV Type 1 that may be exchanged on links which are eligible for power state transition, and a PM-TLV control channel type. The PM-TLV control channel type may be associated with at least one member link (e.g., a control channel link) that will not participate in power state transition. Current power states of all links in a LAG may be periodically exchanged by peer network devices via the control channel link of the LAG. The control channel link may not be powered down.
The PM-TLV may enable a link to participate in dynamic power management. When the PM-TLV is not present in the LACP PDU, a link may not participate in the dynamic power management. As further shown in
In some implementations, the LACP PDU may be associated with a power sleep action timer element. After negotiating with a peer network device to power off a link, each peer network device may wait for the power sleep action timer to expire before powering off a side of the link. The power sleep action timer may be derived based on various parameters, such as, for example, a time required to empty a queue. In some implementations, the power sleep action timer may be implementation specific and not exchanged via the LACP. In some implementations, information associated with peer network devices (e.g., discovered by the LACP) may be stored in a local database of a network device. The network device may utilize such information to manage a power state of a link, and may retain the information even when a link enters a power sleep state. The network device may revalidate the information when the link is powered back on.
The network device 1 may initiate the transition from the power on state to the power sleep state. Thus, at time T4, the network device 1 may transmit a first LACP PDU (e.g., with a DS set to power sleep) to the network device 2. The network device 2 may receive the first LACP PDU at time T5. At time T6, the network device 2 may determine that the network device 2 is ready to transition the link (e.g., link Y) from the power on state to the power sleep state, and may generate a second LACP PDU (e.g., with a DS set to power sleep and an Ack set equal to one). The network device 2 may transmit the second LACP PDU towards the network device 1 at time T6. At time T7, the network device 2 may generate a third LACP PDU identifying a TLV type of the control channel, link Y, a partner desired state of power sleep, and an actor state of power on. The network device 2 may transmit the third LACP PDU towards the network device 1 via the control channel and at time T7. The network device 1 may receive the second LACP PDU at time T8, and may receive the third LACP PDU via the control channel and at time T9.
At time T10, the network device 1 may generate a fourth LACP PDU (e.g., with a DS set to power sleep and an Ack set equal to one). The network device 1 may transmit the fourth LACP PDU towards the network device 1 at time T10, completing a three-way handshake between the network device 1 and the network device 2. The three-way handshake may enable the network device 1 and/or the network device 2 to generate link alarms, to differentiate between a graceful link power sleep and a physical link failure event when transitioning a link, and/or the like. At time T11, the network device 1 may generate a fifth LACP PDU identifying a TLV type of the control channel, link X, a partner desired state of power sleep, and an actor state of power on. The network device 1 may transmit the fifth LACP PDU towards the network device 1 via the control channel and at time T11. The network device 2 may receive the fourth LACP PDU at time T12, and may receive the fifth LACP PDU via the control channel and at time T13. At times T10 and T12, the network device 1 and the network device 2 may empty link level queues and may cease transmitting traffic on links X and Y. Both the network device 1 and the network device 2 may continue to accept frames.
The network device 1 and the network device 2 may wait for expiration of a power sleep timer before transitioning links X and Y from the power on state to the power sleep state. Upon expiration of the power sleep timer (e.g., at time T14), the network device 1 may power off a port associated with link X. The network device 2 may detect a link down, and may power off a port associated with link Y based on detecting the link down. Once the link is powered off at both network devices, the network device 1 and the network device 2 may utilize the control channel to exchange LACP PDUs with an actor State and a partner desired state set to power sleep.
In some implementations, since both network devices independently determine whether to change a link state to a power sleep state or maintain the link state in a power on state, the network device 2 may not agree with a request for entering the power sleep state. In such implementations, the network device 2 may acknowledge the request by setting an acknowledge bit to zero and a desired state to a power on state. On receiving, the acknowledgement set to zero, the network device 1 may not take any further action.
The network device 1 may initiate the transition from the power sleep state to the power on state. Thus, at time T4, the network device 1 may transmit a first LACP PDU (e.g., with an actor port associated with link X, an actor port state set to the power on state, and a partner port desired state set to the power on state) to the network device 2 via the control channel. The network device 2 may receive the first LACP PDU at time T5. The network device 1 may power on link X and the network device 2 may power on link Y based on the first LACP PDU. At time T6, the network device 2 may transmit a second LACP PDU (e.g., with an actor port associated with link Y, an actor port state set to the power on state, and a partner port desired state set to the power on state) to the network device 1 via the control channel. The network device 1 may receive the second LACP PDU at time T7.
As further shown in
As further shown in
As shown in
The action model may determine to transition the link to the power sleep state based on the following procedure. At the beginning of every binning interval, if (Pt<Lbw) and (Pc<Pt), the action model may determine whether any of the member links can be turned off, while still ensuring that Pt<(Lbw−member link bandwidth). If a member link can be turned off, the action model may transition the member link to the power sleep state. The action model may select an eligible member link based on configuration power priority settings or an iterative logic across eligible links. The action model may update the Lbw (e.g., Lbw=Lbw−power sleep link bandwidth). When the LAG includes unequal member links, it is possible to power sleep various combinations of links. For example, a combination of links may be selected such that maximum a power gain may be obtained, while ensuring an available LAG bandwidth is greater than a predicted value.
The action model may determine to transition the link to the power on state based on the following procedure. The action model may continuously monitor the instantaneous traffic pattern (Pc). When the instantaneous traffic Pc≥Pm, an additional link may be transitioned to the power on state from the power sleep state. The action model may recompute and/or update Pm (e.g., Pm=Pm+new link bandwidth) and may update Lbw (e.g., Lbw=Lbw+power sleep link bandwidth). In some implementations, a dampening model may be utilized when traffic is significantly less than Pt during the binning interval. The action modem may retain a link in a power on state until a next reactive interval (e.g., a binning interval) to avoid further link transitions. In some implementations, at the beginning of the reactive interval, if Pt>Lbw, the action model may power on an additional minimal set of links such that Low is greater than Pt.
Since a LAG may include multiple member links, a network device initiating a transition to a power sleep state may identify a link to transition based on link power priorities of eligible links. The link power priorities may match for both peer network devices. Also, where link power priorities do not exist, the network device may reject a request or accept the request based on an estimated bandwidth. In some implementations, the binning interval may be similar at both peer network devices.
In some implementations, if a link is in a power sleep state for a long duration (e.g., multiple hours in a day), a periodic power on and power sleep mechanism may be enhanced to check for link sanity and to generate an alarm if required.
In this way, the network device manages power states of aggregated links of the network device. For example, the network device may utilize enhancements to an LACP to enable graceful transitions of power states of links in a LAG. The network device may also process actual traffic (e.g., in-system telemetry data) and predicted traffic (e.g., predicted telemetry data), with a model, to determine power state transitions of the links in the LAG, to manage the power states transitions of the links in the LAG, and/or the like. Thus, the network device conserves computing resources, networking resources, and/or the like that would otherwise have been consumed by failing to entirely power off the link, failing to power off components of a network device associated with the partially powered off link, losing traffic due to abruptly disabling the link, and/or the like.
As indicated above,
The endpoint device 210 includes one or more devices capable of receiving, generating, storing, processing, and/or providing information, such as information described herein. For example, the endpoint device 210 may include a mobile phone (e.g., a smart phone or a radiotelephone), a laptop computer, a tablet computer, a desktop computer, a handheld computer, a gaming device, a wearable communication device (e.g., a smart watch, a pair of smart glasses, a heart rate monitor, a fitness tracker, smart clothing, smart jewelry, or a head mounted display), a network device, or a similar type of device. In some implementations, the endpoint device 210 may receive network traffic from and/or may provide network traffic to other endpoint devices 210 and/or the server device 230, via the network 240 (e.g., by routing packets using the network devices 220 as intermediaries).
The network device 220 includes one or more devices capable of receiving, processing, storing, routing, and/or providing traffic (e.g., a packet or other information or metadata) in a manner described herein. For example, the network device 220 may include a router, such as a label switching router (LSR), a label edge router (LER), an ingress router, an egress router, a provider router (e.g., a provider edge router or a provider core router), a virtual router, or another type of router. Additionally, or alternatively, the network device 220 may include a gateway, a switch, a firewall, a hub, a bridge, a reverse proxy, a server (e.g., a proxy server, a cloud server, or a data center server), a load balancer, and/or a similar device. In some implementations, the network device 220 may be a physical device implemented within a housing, such as a chassis. In some implementations, the network device 220 may be a virtual device implemented by one or more computer devices of a cloud computing environment or a data center. In some implementations, a group of network devices 220 may be a group of data center nodes that are used to route traffic flow through the network 240.
The server device 230 includes one or more devices capable of receiving, generating, storing, processing, and/or providing information, such as information described herein. For example, the server device 230 may include a laptop computer, a tablet computer, a desktop computer, a group of server devices, or a similar type of device, associated with multicast traffic. In some implementations, the server device 230 may receive information from and/or transmit information (e.g., multicast traffic) to the endpoint device 210, via the network 240 (e.g., by routing packets using the network devices 220 as intermediaries).
The network 240 includes one or more wired and/or wireless networks. For example, the network 240 may include a packet switched network, a cellular network (e.g., a fifth generation (5G) network, a fourth generation (4G) network, such as a long-term evolution (LTE) network, or a third generation (3G) network), a code division multiple access (CDMA) network, a public land mobile network (PLMN), a local area network (LAN), a wide area network (WAN), a metropolitan area network (MAN), a telephone network (e.g., the Public Switched Telephone Network (PSTN)), a private network, an ad hoc network, an intranet, the Internet, a fiber optic-based network, a cloud computing network, or the like, and/or a combination of these or other types of networks.
The number and arrangement of devices and networks shown in
The bus 310 includes one or more components that enable wired and/or wireless communication among the components of the device 300. The bus 310 may couple together two or more components of
The memory 330 includes volatile and/or nonvolatile memory. For example, the memory 330 may include random access memory (RAM), read only memory (ROM), a hard disk drive, and/or another type of memory (e.g., a flash memory, a magnetic memory, and/or an optical memory). The memory 330 may include internal memory (e.g., RAM, ROM, or a hard disk drive) and/or removable memory (e.g., removable via a universal serial bus connection). The memory 330 may be a non-transitory computer-readable medium. The memory 330 stores information, instructions, and/or software (e.g., one or more software applications) related to the operation of the device 300. In some implementations, the memory 330 includes one or more memories that are coupled to one or more processors (e.g., the processor 320), such as via the bus 310.
The input component 340 enables the device 300 to receive input, such as user input and/or sensed input. For example, the input component 340 may include a touch screen, a keyboard, a keypad, a mouse, a button, a microphone, a switch, a sensor, a global positioning system sensor, an accelerometer, a gyroscope, and/or an actuator. The output component 350 enables the device 300 to provide output, such as via a display, a speaker, and/or a light-emitting diode. The communication interface 360 enables the device 300 to communicate with other devices via a wired connection and/or a wireless connection. For example, the communication interface 360 may include a receiver, a transmitter, a transceiver, a modem, a network interface card, and/or an antenna.
The device 300 may perform one or more operations or processes described herein. For example, a non-transitory computer-readable medium (e.g., the memory 330) may store a set of instructions (e.g., one or more instructions or code) for execution by the processor 320. The processor 320 may execute the set of instructions to perform one or more operations or processes described herein. In some implementations, execution of the set of instructions, by one or more processors 320, causes the one or more processors 320 and/or the device 300 to perform one or more operations or processes described herein. In some implementations, hardwired circuitry may be used instead of or in combination with the instructions to perform one or more operations or processes described herein. Additionally, or alternatively, the processor 320 may be configured to perform one or more operations or processes described herein. Thus, implementations described herein are not limited to any specific combination of hardware circuitry and software.
The number and arrangement of components shown in
The input component 410 may be one or more points of attachment for physical links and may be one or more points of entry for incoming traffic, such as packets. The input component 410 may process incoming traffic, such as by performing data link layer encapsulation or decapsulation. In some implementations, the input component 410 may transmit and/or receive packets. In some implementations, the input component 410 may include an input line card that includes one or more packet processing components (e.g., in the form of integrated circuits), such as one or more interface cards (IFCs), packet forwarding components, line card controller components, input ports, processors, memories, and/or input queues. In some implementations, the device 400 may include one or more input components 410.
The switching component 420 may interconnect the input components 410 with the output components 430. In some implementations, the switching component 420 may be implemented via one or more crossbars, via busses, and/or with shared memories. The shared memories may act as temporary buffers to store packets from the input components 410 before the packets are eventually scheduled for delivery to the output components 430. In some implementations, the switching component 420 may enable the input components 410, the output components 430, and/or the controller 440 to communicate with one another.
The output component 430 may store packets and may schedule packets for transmission on output physical links. The output component 430 may support data link layer encapsulation or decapsulation, and/or a variety of higher-level protocols. In some implementations, the output component 430 may transmit packets and/or receive packets. In some implementations, the output component 430 may include an output line card that includes one or more packet processing components (e.g., in the form of integrated circuits), such as one or more IFCs, packet forwarding components, line card controller components, output ports, processors, memories, and/or output queues. In some implementations, the device 400 may include one or more output components 430. In some implementations, the input component 410 and the output component 430 may be implemented by the same set of components (e.g., and input/output component may be a combination of the input component 410 and the output component 430).
The controller 440 includes a processor in the form of, for example, a CPU, a GPU, an accelerated processing unit (APU), a microprocessor, a microcontroller, a DSP, an FPGA, an ASIC, and/or another type of processor. The processor is implemented in hardware, firmware, or a combination of hardware and software. In some implementations, the controller 440 may include one or more processors that can be programmed to perform a function.
In some implementations, the controller 440 may include a RAM, a ROM, and/or another type of dynamic or static storage device (e.g., a flash memory, a magnetic memory, an optical memory, etc.) that stores information and/or instructions for use by the controller 440.
In some implementations, the controller 440 may communicate with other devices, networks, and/or systems connected to the device 400 to exchange information regarding network topology. The controller 440 may create routing tables based on the network topology information, may create forwarding tables based on the routing tables, and may forward the forwarding tables to the input components 410 and/or output components 430. The input components 410 and/or the output components 430 may use the forwarding tables to perform route lookups for incoming and/or outgoing packets.
The controller 440 may perform one or more processes described herein. The controller 440 may perform these processes in response to executing software instructions stored by a non-transitory computer-readable medium. A computer-readable medium is defined herein as a non-transitory memory device. A memory device includes memory space within a single physical storage device or memory space spread across multiple physical storage devices.
Software instructions may be read into a memory and/or storage component associated with the controller 440 from another computer-readable medium or from another device via a communication interface. When executed, software instructions stored in a memory and/or storage component associated with the controller 440 may cause the controller 440 to perform one or more processes described herein. Additionally, or alternatively, hardwired circuitry may be used in place of or in combination with software instructions to perform one or more processes described herein. Thus, implementations described herein are not limited to any specific combination of hardware circuitry and software.
The number and arrangement of components shown in
As shown in
As further shown in
As further shown in
As further shown in
As further shown in
As further shown in
In some implementations, process 500 includes exchanging one or more periodic link aggregation control protocol PDUs with the second network device. In some implementations, process 500 includes receiving an acknowledgement of the first PDU from the second network device. In some implementations, process 500 includes providing an acknowledgement of the second PDU to the second network device.
In some implementations, process 500 includes providing, to the second network device, a fourth PDU identifying a retransition of the link of the first network device from the power sleep state to the power on state, and receiving, from the second network device and in response to the fourth PDU, a fifth PDU identifying a retransition of the link of the second network device from the power sleep state to the power on state. In some implementations, process 500 includes verifying that the link of the first network device and the link of the second network device are available based on the fourth PDU and the fifth PDU, and transitioning the link of the first network device and the link of the second network device from the power sleep state to the power on state based on verification that the link of the first network device and the link of the second network device are available. In some implementations, process 500 includes determining that one of the link of the first network device or the link of the second network device is not available based on the fourth PDU and the fifth PDU, and generating an alarm based on determination that one of the link of the first network device or the link of the second network device is not available.
In some implementations, process 500 includes receiving actual traffic time period data, and determining predicted traffic time period data based on the actual traffic time period data. In some implementations, process 500 includes processing, the predicted traffic time period data and the actual traffic time period data, with an action model, to determine whether to transition the link of the first network device to a power sleep state, whether to transition the link of the first network device to a power on state, or whether to perform no link action on the link of the first network device. In some implementations, process 500 includes performing no link action on the link of the first network device based on determination that no link action is to be performed on the link of the first network device.
Although
The foregoing disclosure provides illustration and description but is not intended to be exhaustive or to limit the implementations to the precise form disclosed. Modifications may be made in light of the above disclosure or may be acquired from practice of the implementations.
As used herein, the term “component” is intended to be broadly construed as hardware, firmware, or a combination of hardware and software. It will be apparent that systems and/or methods described herein may be implemented in different forms of hardware, firmware, and/or a combination of hardware and software. The actual specialized control hardware or software code used to implement these systems and/or methods is not limiting of the implementations. Thus, the operation and behavior of the systems and/or methods are described herein without reference to specific software code-it being understood that software and hardware can be used to implement the systems and/or methods based on the description herein.
Although particular combinations of features are recited in the claims and/or disclosed in the specification, these combinations are not intended to limit the disclosure of various implementations. In fact, many of these features may be combined in ways not specifically recited in the claims and/or disclosed in the specification. Although each dependent claim listed below may directly depend on only one claim, the disclosure of various implementations includes each dependent claim in combination with every other claim in the claim set.
No element, act, or instruction used herein should be construed as critical or essential unless explicitly described as such. Also, as used herein, the articles “a” and “an” are intended to include one or more items and may be used interchangeably with “one or more.” Further, as used herein, the article “the” is intended to include one or more items referenced in connection with the article “the” and may be used interchangeably with “the one or more.” Furthermore, as used herein, the term “set” is intended to include one or more items (e.g., related items, unrelated items, a combination of related and unrelated items, and/or the like), and may be used interchangeably with “one or more.” Where only one item is intended, the phrase “only one” or similar language is used. Also, as used herein, the terms “has,” “have,” “having,” or the like are intended to be open-ended terms. Further, the phrase “based on” is intended to mean “based, at least in part, on” unless explicitly stated otherwise. Also, as used herein, the term “or” is intended to be inclusive when used in a series and may be used interchangeably with “and/or,” unless explicitly stated otherwise (e.g., if used in combination with “either” or “only one of”).
In the preceding specification, various example embodiments have been described with reference to the accompanying drawings. It will, however, be evident that various modifications and changes may be made thereto, and additional embodiments may be implemented, without departing from the broader scope of the invention as set forth in the claims that follow. The specification and drawings are accordingly to be regarded in an illustrative rather than restrictive sense.
Number | Date | Country | Kind |
---|---|---|---|
202341064503 | Sep 2023 | IN | national |