Performing simplified troubleshooting procedures to isolate connectivity problems

Information

  • Patent Grant
  • 7774447
  • Patent Number
    7,774,447
  • Date Filed
    Wednesday, April 26, 2006
    18 years ago
  • Date Issued
    Tuesday, August 10, 2010
    13 years ago
Abstract
A method and apparatus for implementing troubleshooting of a network connectivity problem between a client computer coupled to a local switch and an end point on the network utilizes a client_proxy module instantiated on the local switch. The module automatically runs a series of tests utilizing the IP and MAC addresses of the client computer source addresses and reports the results of the tests.
Description
BACKGROUND OF THE INVENTION

Switches and routers provide a broad set of troubleshooting tools and utilities such as, for example, ping, Layer 3 traceroute, Layer 2 traceroute, etc., that can be combined with the output of various commands to debug network connectivity problems.


However, debugging can become quite challenging for users who are not network specialists. Even for the most basic connectivity problems, it is necessary to go through a step by step process to validate the connectivity checks and isolate the problem.


A typical example of a connectivity problem is depicted in FIG. 1 where a client computer 10, coupled to a port of a Local Switch 12, is unable to connect to an end station host server 14 located on the network 16. Debugging the problem involves running utilities such as ping and traceroute from the client computer.


Ping is a utility to determine whether a specific Internet Protocol (IP) address is accessible. It works by sending a packet to the specified address and waiting for a reply. Ping is used primarily to troubleshoot network connections. Traceroute utilities work by sending packets with low time-to-live (TTL) fields. The TTL value specifies how many hops the packet is allowed before it is returned. When a packet can not reach its destination because the TTL value is too low, the last host returns the packet and identifies itself. By sending a series of packets and incrementing the TTL value with each successive packet, traceroute finds out who all the intermediary hosts are.


These troubleshooting tools and utilities must be initiated at the client's computer because the connectivity problem occurs somewhere along the path taken by packets between the client and end station host server. This requires that the network administrator (the “Admin”) be physically present at the client computer to run the tests or remotely connect with the user to guide her through performing the steps on the client computer.


Thus, either the user is diverted from other tasks in order to assist the Admin or the Admin must move from computer to computer to debug connectivity problems.


The challenges in the field of network administration continue to increase with demands for more and better techniques having greater flexibility and adaptability. Therefore, a need has arisen for a new system and method for debugging connectivity problems between a client computer and an end station host server connected to a network.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a view of a system suitable for implementing an embodiment of the invention;



FIG. 2 is a flow chart depicting tests and analysis required to debug a connectivity problem;



FIG. 3 is a flow chart depicting steps performed by an embodiment of the invention; and



FIG. 4 is a block diagram of a network device configured to implement an embodiment of the invention.





DETAILED DESCRIPTION OF THE INVENTION

Reference will now be made in detail to various embodiments of the invention. Examples of these embodiments are illustrated in the accompanying drawings. While the invention will be described in conjunction with these embodiments, it will be understood that it is not intended to limit the invention to any embodiment. On the contrary, it is intended to cover alternatives, modifications, and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the various embodiments. However, the present invention may be practiced without some or all of these specific details. In other instances, well known process operations have not been described in detail in order not to unnecessarily obscure the present invention.


One embodiment of the invention is a simplified interface that helps in troubleshooting connectivity problems. The interface does not necessarily point to the root cause of the problem, but helps isolate the problem. In computer networks without well-qualified networking administration personnel to support the network, as is common in small medium businesses, this embodiment makes it easier to troubleshoot connectivity problems.


Referring again to FIG. 1, client computer A 10 has problems connecting to an endpoint host server 14 located somewhere in the network 16. The client computer A 10 is directly connected to Ethernet switch 12 marked as “Local Switch” in FIG. 1. In typical networks, the user of client computer A would seek the help of the Admin to help troubleshoot the connectivity problem. The Admin would have to walk user through a series of steps that debug the problem as shown in the flowchart of FIG. 2.


In existing networks the Admin must perform the various tests in the decision blocks of the flowchart from the client computer A. Alternatively, the Admin can guide the user of client computer A through the tests.



FIG. 3 shows a high-level, system block diagram of a Local Switch that may be used to execute software of an embodiment of the invention. The Local Switch includes a memory 120 which can be utilized to store and retrieve software and data for use by the software. Exemplary computer readable storage media include CD-ROM, floppy disk, tape, flash memory, system memory, and hard drive. Additionally, a data signal embodied in a carrier wave may be the computer readable storage medium. The local switch further includes subsystems such as a central processor 122, one or more network ports 124. In FIG. 3 the network ports are shown grouped into Virtual LANs (VLANs) 126. Other switches, routers, or network devices suitable for use with the invention may include additional or fewer subsystems.



FIG. 3 depicts some of the functionality included in the switch. The processor executes a management interface module 128, for example an http interface that permits management of the switch from remote locations connected to the network such as Admin computer 18 coupled to the network at a location remote from Local Switch 12. In the presently described embodiment, the Admin can log on to the switch from any workstation on the network and initiate a proxy_client module 130 that will perform the tests depicted in FIG. 2 as a proxy for the client computer A. Also, in this embodiment the switch provides this set of steps in an integrated manner, so that the Admin can perform the checks easily through the switch management interface. The switch may optionally include a switch_analysis module 132 having functions described below.



FIG. 4 is a flow chart depicting the steps required to initiate the simplified troubleshooting feature of the switch and the events that happen at the switch. The Admin logs into the Local Switch, and selects the option for troubleshooting. The switch is aware of where the clients are connected, their IP addresses and Media Access Control (MAC) addresses. The Admin selects the client having the problem, and selects option for connectivity tests, and enters the server's host name that the client computer cannot reach.


In order to ensure that the results are exactly those that would be encountered by the client PC, the switch must disable the client port, so that traffic from the client does not interfere with the tests. When the administrator requests this troubleshooting functionality, the following things happen on the switch;

    • A proxy_client module is initialized with an IP address and a MAC address of the client, and its associated VLAN as well as IP subnet information. The switch also has knowledge of the DNS server through its own configuration, while the rest of the information is gathered from snooping DHCP packets.
    • The proxy_client module interacts with the IP stack through internal Layer 3 interface, and performs the ping, L3 traceroute, and L2 traceroute as described in the flowchart. The L2 traceroute is performed with the source MAC address of the client, and the destination MAC address of the router.
    • Based on the results of the tests, the proxy_client module interfaces with switch_analysis module to perform the tasks in block H.


The switch Instantiates the proxy_client module that proxies for the client PC. The proxy_client module has a logical interface on the VLAN on which the client is connected, and assigns the interface the same IP address and MAC address as the client PC. The IP address and MAC addresses are learned through snooping of DHCP or ARP packets involving the client PC. The switch must use the client IP and MAC addresses, and the logical interface in the same VLAN as the client. This will ensure that the packets originating from the switch will traverse the same path as if they had originated from the client, which is necessary to ensure that the test results will point out the problems encountered by the client.


In the flow chart of FIG. 2, the steps in the decision blocks are performed automatically by the proxy_client module when it is instantiated. The steps listed in rectangle B are also performed by the proxy_client module as part of the IP connectivity block. The diagnostic analysis listed in rectangles A and C-G are performed by the Admin or some other diagnostic software not resident in the switch. The analysis of block H can be performed by switch_analysis module as described below.


The results of the tests can be put into can be put into three categories:

    • 1. the problem is on some other device in the network where the device is identified by tests like ping/traceroute etc. OR;
    • 2. the problem is on the switch OR;
    • 3. the problem is on a link on the switch.


Accordingly, the switch can provide additional information for each of the following conditions:

    • Block A—the Admin can use the switch to capture the packets being generated from the Client. This can be used to do further troubleshooting.
    • Block B—the switch can perform the necessary set of tests to check the connectivity to the DNS server using the same algorithms as in the flow chart.
    • Block D, E & F—It is possible for network management application to use the information reported by the switch and perform further diagnosis on the exact device in the network where the problem is occurring.
    • Block H—the switch_analysis module can perform extensive checks and report the exact problem in most cases. Link level packet error statistics, and cable diagnostic tests can be used by the switch to determine if the problem is a cabling error or network adapter problem. Analysis of the switch hardware state and the state of different features can also help determine if problem is due to issues on the switch itself.


In one embodiment, the switch_analysis module performs the following functions:

    • Disable the proxy_client, so that the switch can initiate tests with the client PC to check for problems on the client.
    • The port on which the router is to be reached is checked for errors. If there are no errors, then the client side port is checked.
    • The port on which the client is connected is checked for errors.
    • Cable diagnostics tests are run to see if the cable has any problems.
    • The switch pings the client to see if the IP stack on the client is responsive or not.
    • If no problems are found, then features on the switch (such as access control lists) are checked to report all the types of traffic that the switch would not forward from the client PC.


This capability of the switch_analysis module added to the proxy_client module not only performs tasks that the user would otherwise have been required to do at the client station, it also integrates the results of the tests with knowledge of the network present within the switch, and as seen by the switch, to help get to the root cause of the connectivity problems quicker.


The invention has now been described with reference to the preferred embodiments. Alternatives and substitutions will now be apparent to persons of skill in the art. Accordingly, it is not intended to limit the invention except as provided by the appended claims.

Claims
  • 1. A method comprising: initializing a proxy_client module with a client IP address and a client MAC address of a client computer having a connectivity problem with an end station included in a network, with the proxy_client module stored on memory of a switch and with the switch having first, second, and third ports, where the client computer is coupled to the first port of the switch and where the network is coupled to the second port of the switch, and also initializing the proxy_client module with the identity of a VLAN on which the client computer is connected and with an IP address of the end station in the network, with the client IP address and client MAC address, the identity of the VLAN and the IP address of the end station provided by a remote computer coupled to the third port of the switch;disabling the first port of the switch;
  • 2. An apparatus comprising: a first switch port configured to couple to a client computer, with the client computer having an Internet Protocol (IP) address and a Media Access Control (MAC) address; a second switch port configured to couple to a network;a third switch port configured to couple to a remote computer;a memory storing computer program code including a management interface module and a proxy_client module;a processor, coupled to the memory, configured to execute the management interface module to:allow the remote computer to be used to select a client computer, connected to the first switch port and having a connectivity problem with an end point device on the network;
  • 3. A system comprising: a processor configured to:assign a client IP address and a client MAC address of a client computer to a logical interface with a VLAN, where the VLAN includes at least a first port of a switch, where the client computer is connected to the first port of the switch and is assigned to the VLAN and where the client computer is having connectivity problems with an end station included in a network coupled to a second port of the switch;run a sequence of connectivity tests with the end station when the first port of the switch is disabled, utilizing the client IP address and the client MAC address as source addresses;and report the results of the sequence of connectivity tests to facilitate identification of the reasons packets are being dropped;identify the reason for the packets being dropped when if packets are dropped at the switch;identify the feature responsible for dropping the packets, such as failure to respond to Address Resolution Protocol (ARP) queries;test a link where the packets are dropped or showing errors; anddetermine whether the problem is on the switch, or on a device in the network other than the network device, or on a link on the switch.
  • 4. A computer program product comprising: a computer-readable storage medium having computer readable program code physically embodied therein, said computer program product further comprising:
US Referenced Citations (36)
Number Name Date Kind
5825772 Dobbins et al. Oct 1998 A
5920699 Bare Jul 1999 A
5982753 Pendleton et al. Nov 1999 A
6188691 Barkai et al. Feb 2001 B1
6515969 Smith Feb 2003 B1
6625146 Merchant et al. Sep 2003 B1
6678241 Gai et al. Jan 2004 B1
6775290 Merchant et al. Aug 2004 B1
7062595 Lindsay et al. Jun 2006 B2
7286491 Smith Oct 2007 B1
7428237 Gai et al. Sep 2008 B1
7436832 Gallatin et al. Oct 2008 B2
7451204 Shiga et al. Nov 2008 B2
7463588 Tanaka et al. Dec 2008 B1
7554997 Schlichter et al. Jun 2009 B1
20020080800 Lee et al. Jun 2002 A1
20020104039 DeRolf et al. Aug 2002 A1
20020112076 Rueda et al. Aug 2002 A1
20020118692 Oberman et al. Aug 2002 A1
20040184401 Nguyen et al. Sep 2004 A1
20040199627 Frietsch Oct 2004 A1
20050108444 Flauaus et al. May 2005 A1
20050240799 Manfredi et al. Oct 2005 A1
20060031446 Hamedi Feb 2006 A1
20060031488 Swales Feb 2006 A1
20060098670 Voit et al. May 2006 A1
20060133368 Tolliver Jun 2006 A1
20060143344 Lindsay et al. Jun 2006 A1
20060146823 Ding Jul 2006 A1
20060168648 Vank et al. Jul 2006 A1
20060248229 Saunderson et al. Nov 2006 A1
20060251085 Kalkunte et al. Nov 2006 A1
20070081541 Umekage et al. Apr 2007 A1
20080001765 Nguyen et al. Jan 2008 A1
20080089323 Elias et al. Apr 2008 A1
20090052336 Nguyen et al. Feb 2009 A1
Related Publications (1)
Number Date Country
20070255733 A1 Nov 2007 US