A routing algorithm is a set of step-by-step operations used to direct data traffic through a network efficiently. When a packet of data leaves its source, there are many different paths it can take to its destination. The routing algorithm may be used to determine the best path to take. Different routing algorithms may use different methods to determine the best path. For example, a distance vector algorithm may calculate a graph of all available routes by having each routing point (called a node in the network) determine the “cost” or “weight” of travelling to each of its immediate neighbors. This information may be collected for every node to create a distance table which may be used to determine the best path to take from any one node to another.
Routing algorithms may use a weighted decision based on congestion and hop count (e.g., Universal Globally Adaptive Load-Balancing (UGAL)) to route data packets through nodes in a network. These routing algorithms may estimate the expected remaining delay in routing the data packets by multiplying the hop count and congestion of the network and choose the route that yields the lowest weight, which approximates the lowest delay between the source node and the destination node. The source node can be considered to be the node where the packet is created or the node where the packet currently exists and routing is being performed. The routing algorithms may generally assume that channel lengths of all possible routes between the source node and the destination node in a network are equal. However, this assumption may be untrue, especially in large scale networks in which channel lengths are high enough to introduce significant propagation delays, and may result in inefficient routing choices.
The congestion of the network may refer to the delay sending a packet from a source node to a destination node due to existing data traffic in the route followed by the packet. For example, the congestion may comprise the queue occupancy of the output or egress ports of network devices in the network, for example a network device such as a router or a switch, through which packets are going to be forwarded to the destination node. In such examples, the egress ports may comprise a buffer, for example a First-In-First-Out (FIFO) buffer or a ring buffer, to temporarily store data to be forwarded to a neighbor node. In turn, the hop count may refer to the number of intermediate network devices (nodes) through which data is to pass between the source network device and destination network device (source node and destination node) in the network. Each network device along a data path (route) for packets between the source node and the destination node may constitute a hop, as the data is moved from one network device to another. Hop count may be therefore a measurement of a logical distance in a network.
As generally described herein, propagation delay may refer to the amount of time it takes for a packet to travel from the source node to the destination node in a network. In some examples, the propagation delay may be computed as the ratio between the path length (route length) and the propagation speed over a specific channel. The propagation delay may depend on various parameters, including, for example, characteristics of the network channel or link between the source and the destination nodes such as channel latency, channel length, channel bandwidth, and so on. In general, propagation delay may include any latency that is a fixed delay through a particular route.
Weighted decision routing algorithms do not consider the propagation delay in their calculations. This can lead to unnecessary high latencies delivering packets from a source node to a destination node because packets may cross long latency routes when other routes with low latency may be available. For example, considering a case where a routing algorithm is deciding between two routes with 2 hops each and the same congestion in the respective routes, the routing algorithm may equally choose between these two routes. However, one of the routes may have two short channels and the other route may have two very long channels. The choice of the route having the two very long channels may yield to undesirable high latency transmissions through the network.
To address these issues, some examples disclosed herein describe methods for routing packets considering the propagation delay of routes. These methods may comprise receiving a packet at a network device in a network, for example in a network device such as a router or a switch. Then a routing engine of the network device may determine a plurality of candidate routes for the packet to a destination device. As generally described herein, a routing engine may represent a combination of hardware and software logic in a network device for determining candidate routes for a packet to take from a source device to a destination device. Each of these candidate routes may be associated to a weight that is based on a propagation delay of the candidate route. Thus, the weight associated to a particular candidate route considers the amount of time it takes for the received packet to travel from the network device to the destination device through the particular candidate route. Then, the routing engine may select an optimal route among the plurality of weighted candidate routes and route the packet to the destination device using the selected optimal route.
In some examples, the routing engine may determine all possible candidate routes between the source network device and the destination device and calculate a weight associated to each candidate route. The routing engine may determine all possible candidate routes by, for example, using a pre-existing routing table in the network device or by performing arithmetic operations, for example using fixed function logic in the network device. Further, in some other examples, the weight associated to each candidate route may be calculated, by the routing engine, based on a combination of a number of hops of the candidate route, a congestion along the candidate route and the propagation delay associated to the candidate route. In such examples, the candidate routes are associated to a weight that considers the latency introduced by the network into the packet transmission between the source network device and the destination network device, said latency being due to the links and the network devices in the routes through which a packet passes until reaching the destination network device.
In some examples, the weight of a particular candidate route may be calculated by multiplying the number of hops of the particular candidate route and the congestion along the particular candidate route and adding the propagation delay associated to the particular candidate route. In some other examples, the weight associated to a particular candidate route may be calculated by multiplying the number of hops of the particular candidate route and the congestion along the particular candidate route and adding a bias and the propagation delay associated to the particular candidate route. In such example the bias is a heuristically determined value added to the weight of non-minimal candidate routes to create prejudge against taking routes with higher hop counts. Biasing the weights of non-minimal candidate routes has been used to overcome the inaccuracy of the UGAL latency approximation assumptions.
In some examples, selecting the optimal route among the plurality of weighted candidate routes comprises selecting, by the routing engine, the optimal route as the weighted candidate route having the lowest estimated latency. This optimal route may minimize the delay of delivering the packet from the source network device to the destination network device considering the latency due to the links and the network devices between the source and the destination network devices in the route. Thus, the route with the minimum weight may be the route having the minimum latency to deliver packets from the source network device to the destination network device.
In some examples, each of the network devices in the network may store a routing table that lists the routes to particular network destinations, and in some cases, metrics (distances) associated with those routes. The routing table of a particular network device may further contain information about the topology of the network immediately around the particular network device. These routing tables may be automatically and dynamically generated by each network device by “discovering” procedures, or may be periodically provided by a network controller to each one of the network devices of the network. In both cases, any change in the network topology may automatically trigger an updating process in the routing tables in all network devices of the network. The routing tables may further store the propagation delays and weights associated to the routes.
In some other examples, the routing tables may be indexed with an identifier of the destination network device to determine which output port the packet should take. In some other examples, these routing tables may store different information fields such as the destination network device identifier, the cost or metric of the path through which the packet is to be sent and the next hop that is the address of the next node to which the packet is to be sent on the way to its final destination. In some other examples, the routing table may further store quality of service associated with the route links to filtering criteria/access lists associated with the route and an interface for each route.
In some other examples, the network devices may avoid the routing tables by arithmetically calculating the routes, metrics, propagation delays and weights associated to the routes when a packet is received in the destination network device.
In some other examples, the method may calculate the propagation delay associated to each channel based on channel parameters selected from channel latency, channel length, channel bandwidth or a combination thereof. For example, high performance low diameter networks have channel lengths that can vary by multiple orders of magnitude which may introduce different propagation delays.
In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present systems and methods. It will be apparent, however, to one skilled in the art that the present apparatus, systems, and methods may be practiced without these specific details. Reference in the specification to “an example” or similar language means that a particular feature, structure, or characteristic described in connection with that example is included as described, but may not be included in other examples.
Turning now to the figures,
At step 101 of the method 100, a particular packet of data is received in a particular network device, for example a network device such as a switch or router, in a particular network.
At step 102 of the method 100, a routing engine of the particular network device determines a plurality of candidate routes for the packet to be received at a destination device. Each candidate route is associated to a weight that is calculated based on a propagation delay of the candidate route. This particular destination network device may be another network device from the same particular network, an edge device of another network to which the particular network is connected or any other device external to the network. In some examples, the routing engine may calculate all the possible candidate routes between the network device and the destination device and then calculate a weight associated to each candidate route. In some other examples, the routing engine may calculate a subset of candidate routes among all the possible candidate routes between the network device and the destination device. These candidate routes may be any possible routes connecting the current network device with the destination network device.
For example, the routing engine may calculate the propagation delay of the candidate routes based on channel parameters selected from channel latency, channel length, channel bandwidth or a combination thereof. In an example, a Serializer/Deserializer (SerDes) device may train links between network devices in the network and a round trip latency could be known from this process. From the round trip latency a one-direction channel latency can be obtained. The routing engine may then obtain channel latencies from a SerDes device. In some other examples, the routing engine may obtain the channel latency, channel length and channel bandwidth from known network information like the cables that got installed in the network.
At step 103 of the method 100, the routing engine selects the optimal route among the plurality of weighted candidate routes. In some examples, the routing engine may select the optimal route as the weighted candidate route having the lowest estimated latency to the destination device considering the latency due to the links and the network devices in the route.
Lastly, at step 104 of the method 100, the routing engine routes the packet to the destination network device using the selected optimal route. In this way, the expected latency computation of the optimal route includes the propagation delay such that the delay estimation includes the delay due to queues in the egress ports of network devices in the route and the delay due to channels of the route.
At step 201 of the method 200, a particular packet of data is received in a particular network device, for example a network device such as a switch or router, in a particular network.
At step 202 of the method 200, a routing engine of the particular network device determines a plurality of candidate routes for the packet to be received at a destination device. These candidate routes may be any possible route connecting the network device receiving the packet with the destination network device. Each candidate route is associated to a weight that is calculated by multiplying the congestion in the candidate route with the hop count of the candidate route and then adding the propagation delay from the current network device to the destination network device along the candidate route. The propagation delay associated to a route is the amount of time it takes for a packet to travel from the current network device to the destination network device. This propagation delay may be associated with the speed of data transmission on the channels and the fixed latency devices within the network devices through which the packet passes. The congestion along a candidate route may be obtained by multiple ways, for example, the routing engine may obtain the congestion based on queue occupancy along the candidate route, by comparing ingoing and outgoing rates of a particular route, be explicitly told by a specific software, etc.
For example, a packet arrives at a particular network device and there are two candidate routes, candidate route A and candidate route B, for the packet to the destination device. Then, the routing engine may calculate a corresponding weight for the routes. Candidate route A may have 4 hops, a 50% congestion and 100 ns propagation delay, such that the routing engine may calculate a weight associated to candidate route A as WA=450+100=300. Candidate route B may have 3 hops, a 40% congestion and 10 ns propagation delay, such that the routing engine may calculate a weight associated to candidate route B as WB=3*40+120=240. WA and WB incorporate latency due to the links and the network devices in the respective candidate routes A and B.
At step 203 of the method 200, the routing engine selects the optimal route among the plurality of weighted candidate routes. In some examples, the routing engine may select the optimal route as the weighted candidate route having a lowest weight. In the above cited example, the routing engine may select route B because WB<WA and thus, the total latency of candidate route B is less than the total latency of candidate route A.
At step 204 of the method 200, the routing engine routes the packet to the destination network device using the selected optimal route. In the above cited example, the routing engine routes the packet through candidate route B. This candidate route B is the route, among the possible candidate routes between the source network device and the destination network device, having the minimum latency delivering packets to the destination node.
In some other examples, the weight associated to the candidate routes may further include a bias for non-minimal routes. In such cases, the weight may be calculated by multiplying the congestion in the candidate route with the hop count of the candidate route and then adding the bias (if the route is non-minimal) and the propagation delay from the source network device to the destination network device along the candidate route. For example the bias may be 7 for non-minimal routes. In such example, the weight associated to candidate route A is WA=4*50+7+100=307 (+7 because it is a non-minimal route) and the weight associated to candidate route B is WB=3′40+120=240 (lack of +7 because it is a minimal route). Then, the routing engine may select the optimal route as the weighted candidate route having a lowest weight. In the cited example, the routing engine may select route B because WB<WA and thus, the total latency of candidate route B is less than the total latency of candidate route A.
The network 300 comprises a network device 301 in which packets 310 of data are received. The network 300 further comprises network devices 302-307 that can be a destination for the packet 310 if the packet is directed to the network device 302-307, or a routing network device if the packet 310 is directed to another network device. Network devices 301-307 may be for example routers or switches. While
In the example of
The packet 310 may be decoded by the routing engine 308 to extract the identifier of the destination network device. In such example the destination network device is network device 307 so the routing engine 308 selects a subset of candidate routes from the routing table 309 as shown in
In addition, each candidate route has a weight associated with it. Route1 has weight1 (W1) that corresponds to the result of multiplying the number of hops of route1 (hops 302,303,305) and the congestion along route1 (due existing data traffic in route1) and adding the propagation delay associated to the route1 (due to links 313,316,317,320 and network devices 302,303,305). Route2 has weight2 (W2) that corresponds to the result of multiplying the number of hops of route2 (hops 302,303,306) and the congestion along route2 (due existing data traffic in route2) and adding the propagation delay associated to the route2 (due to links 313,316,318,321 and network devices 302,303,306). Route3 has weight3 (W3) that corresponds to the result of multiplying the number of hops of route3 (hops 303,305) and the congestion along route3 (due existing data traffic in route3) and adding the propagation delay associated to the route3 (due to links 314,317,320 and network devices 303,305). Route4 has weight4 (W4) that corresponds to the result of multiplying the number of hops of route4 (hops 303,306) and the congestion along route4 (due existing data traffic in route4) and adding the propagation delay associated to the route4 (due to links 314,318,321 and network devices 303,306). Route5 has weight5 (W5) that corresponds to the result of multiplying the number of hops of route5 (hops 304,306) and the congestion along route5 (due existing data traffic in route5) and adding the propagation delay associated to the route5 (due to links 315,319,321 and network devices 304,306).
The routing engine 308 determines that the route5 322 has the minimum weight among the 5 candidate routes and selects it to forward the packet to network device 307. Therefore, the routing engine 308 forwards the received packet 310 to the network device through egress port 4 and along route5.
In examples including high performance networks within a single administrative domain, the propagation delay from any node to another node may correspond to cable lengths that in turn may be known explicitly or implicitly. For example, when a SerDes device trains a link, a round trip latency could be known from this process. The routing engine may obtain channel latencies from a SerDes device or from known system information like the cables that got installed. In examples including low diameter networks such as HyperX, Dragonfly, and Slimfly the propagation delays along different paths can vary by multiple orders of magnitude, for example between 1 ns to 500 ns.
The network device 400 is depicted as including a plurality of ingress ports 408 through which packets 404 of data are received, a plurality of egress ports 409 through which packets 404 are forwarded towards their destination network devices, a machine readable storage medium 403, a routing engine 401 and a routing table 402. The routing engine 401 may include hardware and software logic to execute instructions, such as the instructions 405-407 stored in the machine-readable storage medium 403. While
The routing engine 401, in response to reception of a packet 402 via the ingress ports 408 of the network device 400, determines at 405 a plurality of candidate routes for the packet 404 to a destination device, wherein each candidate route is associated to a weight based on a combination of a number of hops of the candidate route, a congestion along the candidate route and a propagation delay of the candidate route.
In examples where the routing table 402 stores all the possible routes between the current network device 400 and the rest of network devices in the network, the routing engine 401 may decode the received packet 404 to extract an identifier of the destination network device to which the packet is to be routed. Then, the routing engine 401 may select, based on the identifier of the destination network, the routes between the current network device 400 and the destination device among all the routes stored in the routing table 402.
The routing engine 401 further selects at 406 the optimal route among the plurality of weighted candidate routes. Then, the routing engine 401 further routes at 407 the packet 404 to the destination device using the optimal route. For example, the routing engine 401 may select the optimal route as the candidate route having a lowest weight.
In some other examples, the routing engine 401 may also calculate a weight associated to each candidate route by multiplying a number of hops of the weighted candidate route and a congestion along the weighted candidate route and adding the propagation delay associated to the weighted candidate route. The route having the minimum weight is the route that introduces a minimum delay in the transmission of packet 404. The weights and the propagation delays associated to each weighted candidate route may be further stored in the routing table 402.
The routing engine 401 may include hardware and software logic to perform the functionalities described above in relation to instructions 405-407. The machine-readable storage medium 403 may be located either in the computing device executing the machine-readable instructions, or remote from but accessible to the computing device (e.g., via a computer network) for execution.
As used herein, a “machine-readable storage medium” may be any electronic, magnetic, optical, or other physical storage apparatus to contain or store information such as executable instructions, data, and the like. For example, any machine-readable storage medium described herein may be any of Random Access Memory (RAM), volatile memory, non-volatile memory, flash memory, a storage drive (e.g., a hard drive), a solid state drive, any type of storage disc (e.g., a compact disc, a DVD, etc.), and the like, or a combination thereof. Further, any machine-readable storage medium described herein may be non-transitory. In examples described herein, a machine-readable storage medium or media may be part of an article (or article of manufacture). An article or article of manufacture may refer to any manufactured single component or multiple components.
Routing packets considering propagation delay of candidate routes as described herein achieves many technical effects, including providing a more accurate packet delivery process that yields a significant improvement of network services, reducing average packet delivery latency by choosing between paths with respect to propagation delay. In addition, the solution herein described also may be useful to overcome problems experienced by many systems where choosing between minimal and non-minimal routes on an unloaded system may result in falsely choosing the non-minimal routes because the congestion is reported as zero and thus the weight becomes zero. In those cases, since the calculated weights also consider the propagation delay, the solution herein proposed allows to select the optimal route as the route having a lowest propagation delay, and thus the lowest estimated latency.