Cross-reference is made to U.S. patent application Ser. Nos. 10/127,888, filed Apr. 22, 2002, entitled “Topology Discovery by Partitioning Multiple Discovery Techniques”, to Goringe, et al., and Ser. No. 10/127,967, filed Apr. 22, 2002, entitled “Using Link State Information to Discover IP Network Topology”, to Goringe, et al., each of which contains subject matter related to the subject matter of the present application and is incorporated herein by this reference.
The present invention relates generally to network topology and specifically to ascertaining network topology in load balanced networks.
The topology of a distributed processing network, such as an Internet Protocol network, is information with many potential uses in troubleshooting, administration, planning, and other tasks. “Topology” refers generally to the patterns of connection between the various machines in a network, and “topology information” refers to the body of information associated therewith. Complete and accurate network topology information can predict the path of an individual packet through the network and hence identify the set of network devices, such as routers and their associated interfaces, involved. Incorrect or inaccurate information can lead to poor planning and/or administrative decisions, an inability to accurately measure network performance, and/or consumption of excessive resources in troubleshooting problems, particularly in IP telephony where jitter and packet round trip time can be crucial considerations.
Current solutions to the problem of topology discovery can be separated into two broad categories, namely Simple Network Management Protocol or SNMP-based and traceroute-based. The majority of current commercial products, such as Avaya ExpertNet™ and Hewlett Packard Open View™, use an SNMP-discovery mechanism to construct topology information. In this approach, topology information is obtained from the Management Information Base or MIB of one or more routers in the network segment or subnetwork of interest. The topology information in the MIB can, however, be incomplete and/or inaccurate. Neither of the two standardized routing tables available through SNMP, namely IpRouteTable (RFC1213) and IpCidrRouteTable (RFC 2096), are capable of containing the multiple routes, or redundant links, having the same metric or cost to a selected destination. Traceroute-based techniques are available on a number of major operating systems, such as Windows™, Unix, and Linux. Traceroute sends a series of Internet Control Message Protocol or ICMP packets, each with an increasing Time-To-Live or TTL, to a particular destination device. It then uses error messages that are returned from each router on-route when the TTL expires to construct the path to that destination. Although the traceroute technique can have an advantage over SNMP-based techniques, namely that traceroute takes into account the actual routing decisions that are made on packets traveling across the network, traceroute can return an incorrect path that is made up of some combination of two or more physically separate paths, particularly when load balancing is in effect.
Both techniques are generally unable to detect the existence of load balancing, let alone the type of load balancing, in effect along a route. With reference to
These and other needs are addressed by the various embodiments and configurations of the present invention. The present invention is directed generally to a system and method for detecting load balancing in a distributed processing network.
In one embodiment, a method for detecting load balancing in a distributed processing network is provided that includes the steps of:
(a) providing a baseline topology;
(b) selecting, from the baseline topology, first and second addresses associated with first and second routers, respectively, such that the first router has an associated first hop count relative to a selected node and the second router an associated second hop count relative to the selected node, with the first hop count being less than the second hop count;
(c) transmitting one or more (typically at least two) test packets, with each one test packet having a time to live equal to or greater than the second hop count;
(d) receiving one or more responses (e.g., TTL-expired error messages) associated with the test packets; and
(e) determining, based on the responses, whether load balancing is in effect at the first router. These steps are performed iteratively preferably on subnetwork-by-subnetwork and router-by-router bases.
The step of selecting the first and second routers typically includes the additional steps of:
(i) selecting a (first) subnetwork;
(ii) identifying a first set of unique addresses within the selected (first) subnetwork; and
(iii) creating a second set of unique addresses.
The second set of addresses is the union of the first set and a third set of router interface addresses associated with routers between the selected node and the selected subnetwork. The first and second addresses are included within the third set.
In configuring the test packets, the time to live is preferably equal to the second hop count and the second hop count preferably exceeds the first hop count by one hop. The test packets are typically transmitted from a common source node while the destination address in the packet headers is held constant for per-packet load balancing detection and varied for per-destination and per-source/destination load balancing detection. Load balancing is in effect when at least two different routers respond to the test packets.
In a typical application, the test packets for detecting per-packet load balancing are sent first, and the test packets for detecting per-destination or per-source/destination load balancing thereafter. The detection of per-packet load balancing before per-destination and per-source/destination load balancing can be important as otherwise it would be difficult to know what type of load balancing is detected by the first set of test packets.
The method can have a number of advantages over conventional topology discovery algorithms. For example, the present invention can not only generate an accurate and complete topology of a desired network segment or subnetwork but can also identify the existence of load balancing and determine the type of load balancing in existence. This knowledge can facilitate troubleshooting, administration, planning and other network related tasks. In particular in Voice Over IP or IP telephony, this knowledge can lead to substantial time and cost savings in post-installation in IP telephony troubleshooting.
These and other advantages will be apparent from the disclosure of the invention(s) contained herein.
The above-described embodiments and configurations are neither complete nor exhaustive. As will be appreciated, other embodiments of the invention are possible utilizing, alone or in combination, one or more of the features set forth above or described in detail below.
Before discussing the configuration and operation of the present invention, it is important to understand certain features of many routing protocols. A router can be identified by a unique router ID in the case of certain protocols, and associated with a unique area ID. A router itself typically (but not always) has no IP address. Rather, the interfaces associated with the router generally have IP addresses. An interface is a logical device belonging to a host such as a router than can be the attachment point of a link. Typically, an interface will have zero or one IP address and belong to a network. The interface will normally have an interface number and a network mask. A link contains two or more bindings of a source interface and a metric or cost. It is templated by the metric representation which is specific to the routing protocol and represents the cost for a packet to leave an interface. A link is typically associated with a cost metric and a routing protocol identifier. A network object represents a data network or subnetwork. It has an address and a mask and represents an address space in which a set of hosts is contained. A network object may derive its address and/or its mask from its member interfaces.
With this in mind,
The memory 204 comprises a topology discovery agent 212 configured to determine not only network topology generally but also detect the existence of load balancing and the type of load balancing, a baseline topology 216 containing topology information to be used as the starting point in topology discovery and a discovered topology 220 containing topology information in the baseline topology and discovered during topology discovery.
The topology discovery agent 212 preferably, given the baseline topology, can construct a path to each of the network edges (or edge subnets) and represents the paths as a tree structure. Such trees are conceptually similar to the spanning tree structure for ethernet bridges or to IP multicast group trees. The output is a modified form of the output illustrated in
The baseline topology 216 is topology information collected and configured in any suitable manner. In one configuration, the baseline topology is obtained by accessing network device routing tables using SNMP IpRouteTable and/or IpCidrRouteTable entries. In this configuration, the topology information is possibly incomplete.
The discovered topology 220 is preferably in the form of a network tree that describes the set of paths accessible from the system 200. By way of illustration,
The system 200 can include other modules (which are not shown). For example, the system 200 can include a metric measuring agent configured to measure delays, such as jitter and packet loss rate, experienced by packets traveling to and from each of the identified edge hosts in the network. When load balancing is in effect or otherwise present, the destination address of the directly connected upstream router is typically not varied when attempting to measure delays to/from intermediate routers. In other words, an intermediate router may not be pinged directly, as the path taken to that router may not be the expected one. A TTL-based method, similar to that used by traceroute techniques, is typically used to ping the intermediate routers for metric measurement. This approach will elicit an ICMP TTL-expired message when the ping packet reaches the intermediate router. As will be appreciated, the Uniform Datagram Protocol or UDP can also be used instead of or in addition to ICMP not only for metric measurement but also for load balancing detection. UDP packets are treated similarly to voice telephony or VoIP packets by most routers. By picking a UDP port which is typically not open on the edge host, a response can be elicited from the edge host, in the form of a port-unreachable ICMP error message.
Referring to
In step 300, the agent 212 reads the baseline topology 216 file, and generates a set S of subnet work addresses in the network topology. This can be effected based on the baseline topology. Typically, the baseline topology file includes a list of routers and, for each listed router, a table similar to the table of
In step 302, a next subnet Si in the set S of subnets is selected, and in step 304 a (first) set E of device addresses inside Si are generated. The device addresses in set E can be determined based on the baseline topology and/or other topology discovery techniques. Additionally, the addresses can be generated by known techniques based on the selected subnet address as discussed in detail below. At minimum, the set E will include the interface address of the router upstream from the selected subnet. With reference to
In step 306, a (second) set D of addresses is created. The set D is the union of the device addresses in set E and the router interface addresses between the testing node or system 200 and the selected subnet Si (or a third set of unique addresses). With reference to
In step 308, a next router interface address from set D is selected. The router interface address selected is preferably associated with the router immediately downstream from system 200.
In decision diamond 310, the router interface address is pinged by known techniques (e.g., by SNMP or ICMP) to determine if the interface address is valid, i.e., contactable. When the address is invalid, the agent 212 returns to step 302 above. To enable router utilization monitoring and baseline topology determination, SNMP and ping contactability are preferred for each router in the tree. If a router were not SNMP and ping contactable, the tree may be pruned at that point. SNMP contactability is normally not required for load balanced link detection alone. When the address is valid, the agent 212 proceeds to step 312.
In step 312, the agent 212 removes the selected and validated interface address and any associated interface address (for the selected router) from set D. Associated interface addresses exist where a given router has more than one contactable interface identified in the baseline topology.
In step 314, the agent 212 initializes and sets the following parameters:
(a) “h”, or the hop count, is set equal to the hop count from the system 200 (or selected node) to the selected router interface address. For example, in
(b) “Ru”, or an address associated with the immediately upstream router from the selected router, is set to an interface address of the immediately upstream router.
In decision diamond 316, it is determined if there is an address to which Ru can be set. For example, if the initially selected router is router R1, there is no router upstream of R1 (as R1 is the immediately downstream router from the system 200). In contrast if the selected router is router R2 or R3, the upstream router is R1. In the event that there is no address corresponding to Ru, the agent 212 returns to step 308, selects the next downstream router, which in the configuration of
In step 320, the agent 212 tests for per-packet load balancing on Ru by pinging the selected router a selected number (“Np”) times with the Time To Live or TTL set to “h”. Thus, for either router R2 or R3 as Ru “h” is set to two. A higher probability that load balancing has been detected is associated with a higher value of Np. Typically, Np is set to ten, or ten test packets are sent, to provide a greater than 90% probability that load balancing will be detected on the selected router. The destination for the packet can be any destination downstream of the selected router as well as the address of the interface of the selected router itself. Referring to
In decision diamond 324, it is determined whether a response is received to any of the test packets from the selected router. If not, the baseline topology file 216 is deemed to be incorrect and the agent 212 proceeds to step 396 and terminates operation. Although it seems excessive to quit upon discovering inaccurate topology particularly if the router in question is attached to a relatively unimportant edge subnet, it is left to the user to sort out the problem as the network topology is probably fairly unstable or has changed since the baseline topology was acquired. If a response is received only from a router other than the selected router, the topology is nonetheless deemed to be incorrect. If a response is received from the selected router, the agent 212, proceeds to decision diamond 328.
In decision diamond 328, it is determined whether a response is received from a router other than the selected router. During transmission of the test packets a table similar to that of
In decision diamond 336, the agent 212 determines whether the size or membership of set D is equal to (or less than) one. If only one member remains in set D, the agent 212 is typically unable to test for per-destination and per-source/destination load balancing; that is, the agent is unable to test for per-destination and per-source/destination load balancing when set D contains all known or predictable destination addresses downstream of the selected router. In that event, the agent in step 340 instantiates a link between Ru and the selected router, with a warning indicating that per-destination and per-source/destination load balancing could not be tested for the associated link, and proceeds to step 374 discussed below. Whenever a link is instantiated, the agent 212 determines the input and output interfaces for each router. These are recorded in the tree for the purpose of router interface monitoring. If more than one member remains in set D, the agent 212 proceeds to step 344.
In step 344, the agent 212 tests for per-destination and per-source/destination load balancing on the upstream router Ru. This is typically effected by pinging all addresses in set D with the TTL equal to “h”. Because the source address of the system 200 is generally constant among the test packets, per-destination and per-source/destination load balancing are effectively identical for purposes of detection and corrective action. In one configuration, other destination addresses are generated and used for test packets. Such addresses can be generated by selecting destination addresses off of the subnet address of subnet Si. For example, as shown in
During transmission of the test packets a table similar to that of
In decision diamond 348, it is determined whether a response is received to any of the test packets from the selected router. If not, the baseline topology file is deemed to be incorrect and the agent 212 proceeds to step 396 and terminates operation. If a response is received only from a router other than the selected router, the topology is nonetheless deemed to be incorrect. If a response is received from the selected router, the agent 212, proceeds to decision diamond 352.
In decision diamond 352, it is determined whether a response is received from a router other than the selected router. If all responses are received from the selected router, the agent 212 assumes no load balancing is in effect and, in step 356, instantiates a link between Ru and the selected router and proceeds to step 374 below. If a response is received from a router other than the selected router, a decision must be made as to which of the load-balanced links to follow downstream.
In step 360, the decision as to which link to follow is effected. Although any technique can be used to effect the selection, a preferred technique is to set the downstream router Rd to the interface address of the router generating the most responses to pings in the load balancing test of step 344. In
In decision diamond 364, the agent 212 determines whether or not Rd is in the baseline topology 216. If not, the baseline topology is incorrect and the agent proceeds to step 396 and terminates operation. If so, the agent instantiates a link between Ru and Rd in step 366 with a suitable warning indicating that per-destination or per-source/destination load balancing is in effect.
In step 368, all (destination) addresses that failed to return a response from Rd are removed from the set D. In other words with reference to
In step 370, the agent 212 sets the next router interface address to Rd and checks for asymmetry in step 374. As will be appreciated, asymmetric paths can arise in some routing protocols, such as Open Shortest Path First or OSPF, when the metric for one direction of a link is not the same for the reverse direction. This results in packets sent from node A to node B being forwarded through a different set of routers than packets sent from node B to node A. Such paths are undesirable in general for real-time communications, as the delay/packet loss/jitter characteristics for the return path of a packet may be substantially different to that for the outbound path. Detection of asymmetric paths can be performed given access to each router's routing tables. If an outbound path from node A contains a segment between first and second routers, the routing table of the second router is examined to ensure that for packets destined to node A, the next hop is the first router. Accordingly, the agent 212 accesses the baseline topology 216 and finds the entry for the address of agent 212 in the routing table of the selected router.
In decision diamond 378, the agent determines if the next hop address in the entry is the selected router. If not, a warning is added in step 380 to the discovered topology for the selected router indicating that the associated link may be asymmetrical. The agent 212 then proceeds to step 388. If so, the agent proceeds to decision diamond 382.
In decision diamond 382, the agent determines whether the incoming link Ru to the selected router has a warning for load balancing. If so, the agent 212 in step 384 adds a warning indicating that the associated link may be asymmetrical. It is assumed that, when there is load balancing in the downstream direction, it is likely that there will be load balancing in the upstream direction. This gives rise to the possibility of asymmetry, hence the extra warning. Thereafter or if the incoming link has no warning, the agent 212 proceeds to decision diamond 388.
In decision diamond 388, the agent 212 determines whether there is a next router interface address in set D. If so, the agent 212 returns to step 308 and sets the next router address to the entry in set D. If not, the agent proceeds to decision diamond 392.
In decision diamond 392, the agent 212 determines whether there is a next (edge) subnet that has not yet been the subject of load balancing testing. When a next untested subnet exists, the agent 212 returns to step 302 and sets the next subnet S to the untested subnet. When no next untested subnet exists, the agent proceeds to step 394.
In step 394, the agent 212 writes the discovered topology tree to the discovered topology 220 output file(s) and terminates operation in step 396.
A number of variations and modifications of the invention can be used. It would be possible to provide for some features of the invention without providing others.
For example in one alternative embodiment, only a subset of set D is pinged in step 344. Although this reduces network traffic, it can also reduce the chances of detecting subsequent downstream load balancing.
In another alternative embodiment, instead of quitting in decision diamond 364 when Rd is not found in the baseline topology, another downstream router that is included in the baseline topology could be picked for Rd.
In yet another alternative embodiment, step 360 could be changed so that instead of following a single downstream link all of the downstream links are followed. This modification would increase coverage of the network but also the complexity of the software.
In a further alternative embodiment, in step 332 instead of instantiating a cloud the topology is “pruned” at the selected router using per-packet load balancing and no tests are performed on components downstream of the “pruned” router. “Pruning” refers to removal of the branch(es) of the tree downstream from a selected point.
In another embodiment, any of the software modules discussed above can be implemented, in whole or part, as an application specific integrated circuit or any other type of logic circuit.
The present invention, in various embodiments, includes components, methods, processes, systems and/or apparatus substantially as depicted and described herein, including various embodiments, subcombinations, and subsets thereof. Those of skill in the art will understand how to make and use the present invention after understanding the present disclosure. The present invention, in various embodiments, includes providing devices and processes in the absence of items not depicted and/or described herein or in various embodiments hereof, including in the absence of such items as may have been used in previous devices or processes, e.g., for improving performance, achieving ease and\or reducing cost of implementation.
The foregoing discussion of the invention has been presented for purposes of illustration and description. The foregoing is not intended to limit the invention to the form or forms disclosed herein. In the foregoing Detailed Description for example, various features of the invention are grouped together in one or more embodiments for the purpose of streaming the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the following claims are hereby incorporated into this Detailed Description, with each claim standing on its own as a separate preferred embodiment of the invention.
Moreover though the description of the invention has included description of one or more embodiments and certain variations and modifications, other variations and modifications are within the scope of the invention, e.g., as may be within the skill and knowledge of those in the art, after understanding the present disclosure. It is intended to obtain rights which include alternative embodiments to the extent permitted, including alternate, interchangeable and/or equivalent structures, functions, ranges or steps to those claimed, whether or not such alternate, interchangeable and/or equivalent structures, functions, ranges or steps are disclosed herein, and without intending to publicly dedicate any patentable subject matter.
Number | Name | Date | Kind |
---|---|---|---|
4556972 | Chan et al. | Dec 1985 | A |
4644532 | George et al. | Feb 1987 | A |
5136690 | Becker et al. | Aug 1992 | A |
5185860 | Wu | Feb 1993 | A |
5226120 | Brown et al. | Jul 1993 | A |
5450408 | Phaal | Sep 1995 | A |
5557745 | Perlman et al. | Sep 1996 | A |
5564048 | Eick et al. | Oct 1996 | A |
5572650 | Antis et al. | Nov 1996 | A |
5581797 | Baker et al. | Dec 1996 | A |
5596703 | Eick et al. | Jan 1997 | A |
5623590 | Becker et al. | Apr 1997 | A |
5636350 | Eick et al. | Jun 1997 | A |
5644692 | Eick | Jul 1997 | A |
5734824 | Choi | Mar 1998 | A |
5737526 | Periasamy et al. | Apr 1998 | A |
5751971 | Dobbins et al. | May 1998 | A |
5805593 | Busche | Sep 1998 | A |
5812763 | Teng | Sep 1998 | A |
5850397 | Raab et al. | Dec 1998 | A |
5881051 | Arrowood et al. | Mar 1999 | A |
5881246 | Crawley et al. | Mar 1999 | A |
5943317 | Brabson et al. | Aug 1999 | A |
5966513 | Horikawa et al. | Oct 1999 | A |
6047330 | Stracke, Jr. | Apr 2000 | A |
6088451 | He et al. | Jul 2000 | A |
6108702 | Wood | Aug 2000 | A |
6119171 | Alkhatib | Sep 2000 | A |
6122639 | Babu et al. | Sep 2000 | A |
6131117 | Clark et al. | Oct 2000 | A |
6249820 | Dobbins et al. | Jun 2001 | B1 |
6252856 | Zhang | Jun 2001 | B1 |
6256675 | Rabinovich | Jul 2001 | B1 |
6269398 | Leong et al. | Jul 2001 | B1 |
6269400 | Douglas et al. | Jul 2001 | B1 |
6275492 | Zhang | Aug 2001 | B1 |
6282404 | Linton | Aug 2001 | B1 |
6298381 | Shah et al. | Oct 2001 | B1 |
6360255 | McCormack et al. | Mar 2002 | B1 |
6377987 | Kracht | Apr 2002 | B1 |
6405248 | Wood | Jun 2002 | B1 |
6418476 | Luciani | Jul 2002 | B1 |
6430612 | Iizuka | Aug 2002 | B1 |
6442144 | Hansen et al. | Aug 2002 | B1 |
6456306 | Chin et al. | Sep 2002 | B1 |
6466121 | Shah | Oct 2002 | B1 |
6550012 | Villa et al. | Apr 2003 | B1 |
6744739 | Martin | Jun 2004 | B2 |
6859878 | Kerr et al. | Feb 2005 | B1 |
6895436 | Caillau et al. | May 2005 | B1 |
6952779 | Cohen et al. | Oct 2005 | B1 |
7131140 | O'Rourke et al. | Oct 2006 | B1 |
7133929 | Shah | Nov 2006 | B1 |
7143184 | Shah et al. | Nov 2006 | B1 |
7185100 | Shah | Feb 2007 | B2 |
7200673 | Augart | Apr 2007 | B1 |
20010034837 | Kausik et al. | Oct 2001 | A1 |
20010049786 | Harrison et al. | Dec 2001 | A1 |
20020087704 | Chesnais et al. | Jul 2002 | A1 |
20020112062 | Brown et al. | Aug 2002 | A1 |
20020116647 | Mont et al. | Aug 2002 | A1 |
20020128885 | Evans | Sep 2002 | A1 |
20020141593 | Kurn et al. | Oct 2002 | A1 |
20020144149 | Hanna et al. | Oct 2002 | A1 |
20020161591 | Danneels et al. | Oct 2002 | A1 |
20020188708 | Takahashi et al. | Dec 2002 | A1 |
20030004840 | Gharavy | Jan 2003 | A1 |
20030043820 | Goringe et al. | Mar 2003 | A1 |
20030065626 | Allen | Apr 2003 | A1 |
20030065940 | Brezak et al. | Apr 2003 | A1 |
20030065944 | Mao et al. | Apr 2003 | A1 |
20030084176 | Tewari et al. | May 2003 | A1 |
20030163686 | Ward et al. | Aug 2003 | A1 |
20050071469 | McCollom et al. | Mar 2005 | A1 |
Number | Date | Country |
---|---|---|
0 455 402 | Nov 1991 | EP |
7-334445 | Dec 1995 | JP |
11-340995 | Dec 1999 | JP |
2000-32132 | Jan 2000 | JP |
2000-101631 | Apr 2000 | JP |
2000-83057 | Sep 2000 | JP |
2001-94560 | Apr 2001 | JP |
2001-144761 | May 2001 | JP |
Number | Date | Country | |
---|---|---|---|
20040260755 A1 | Dec 2004 | US |