The present disclosure relates generally to wireless networks.
Market adoption of wireless LAN (WLAN) technology has exploded, as users from a wide range of backgrounds and vertical industries have brought this technology into their homes, offices, and increasingly into the public air space. This inflection point has highlighted not only the limitations of earlier-generation systems, but also the changing role that WLAN technology now plays in people's work and lifestyles across the globe. Indeed, WLANs are rapidly changing from convenience networks to business-critical networks. Increasingly users are depending on WLANs to improve the timeliness and productivity of their communications and applications, and in doing so, require greater visibility, security, management, and performance from their network.
When a wireless client has a connectivity problem in a wireless network, the end-user of the wireless client may generate and send a trouble ticket to a network administrator who can then troubleshoot the problem. The network administrator, however, typically has very little information available to debug the problem remotely. The network administrator may have access to the wireless client's current information and historical graphs on association history. However, this information may be limited. Some solutions provide indexing functions that index system logs from different network systems, but such solutions have no intelligence to identify problems.
A. Overview
Particular implementations facilitate troubleshooting of wireless client connectivity problems in a wireless network. In one implementation, the present invention provides a data gathering function that automatically identifies the network associations and attempted associations of a wireless client, and based on those associations, identifies and polls potential sources of data relevant to troubleshooting the connectivity of the wireless client. According to particular implementations, the present invention allows live troubleshooting where various elements of the wireless network log data associated with the connection states of a given wireless client that is having connection problems. Such wireless network nodes may include wireless access points, central controllers, routers, authentication authorization accounting (AAA) servers, location servers, dynamic host configuration protocol (DHCP) servers, etc. A WLAN management server determines the current connection state of the wireless client and analyzes various current and historical log data provided by the various wireless network nodes, and correlates the log data with particular security protocols to determine possible causes of the connection problems. In one implementation, the WLAN management server reports the data logs, possible causes of the connectivity problem, and possible solutions for the problems to a network administrator for corrective action. A variety of services can be constructed that utilize or take advantage of this data gathering function. The WLAN management server may provide this troubleshooting functionality live (e.g., on-demand) or according to a schedule.
In one implementation, the WLAN management server provides troubleshooting functionality that is integrated with functionalities of an external trouble ticket system to perform live troubleshooting or to perform scheduled troubleshooting. In particular implementations, the trouble ticket may be user invoked, or the WLAN management server may automatically initiate the troubleshooting process when the WLAN management server becomes aware of a connectivity problem. The WLAN management server then applies an appropriate troubleshoot policy depending on whether the cause of the connection problem is associated with the wireless client or associated with the wireless network infrastructure. For example, if the problem is associated with the wireless client side, the WLAN management server may send an asynchronous report to a trouble ticket system when the troubleshoot analysis and identified problem is available, automatically close the trouble ticket once the WLAN management server identifies the problem, and send an email to the end user with suggestions for corrective action. If the problem is associated with the wireless network infrastructure, the WLAN management server may send an asynchronous report to the trouble ticket system. In one implementation, the report may include the identified problem and suggested corrective actions for a network administrator. If the problem is not identified, the report may include a new trouble ticket and troubleshooting information to facilitate a network administrator in identifying the problem.
B. Example Wireless Network System Architecture
B.1. Network Topology
As
The wireless access points 50 are operative to wirelessly communicate with remote wireless client devices 60a, 60b, 60c, and 60d. In one implementation, the wireless access points 50 implement the wireless network protocol specified in the IEEE 802.11 WLAN specification; of course, other wireless network protocols may be used. The wireless access points 50 may be autonomous or so-called “fat” wireless access points or light-weight wireless access points operating in connection with a wireless switch (not illustrated). In addition, the network infrastructure may also include a Wireless LAN Solution Engine (WLSE) offered by Cisco Systems, Inc. of San Jose, Calif. or another wireless network management system. In some implementations, the network infrastructure may also include one or more Wireless Control System (WCS) nodes operative to manage one or more wireless switches and access points.
B.2. WLAN Management Server
The elements of hardware system 200 are described in greater detail below. In particular, network interface 216 provides communication between hardware system 200 and any of a wide range of networks, such as an Ethernet (e.g., IEEE 802.3) network, etc. Mass storage 218 provides permanent storage for the data and programming instructions to perform the above described functions implemented in the system controller, whereas system memory 214 (e.g., DRAM) provides temporary storage for the data and programming instructions when executed by processor 202. I/O ports 220 are one or more serial and/or parallel communication ports that provide communication between additional peripheral devices, which may be coupled to hardware system 200.
Hardware system 200 may include a variety of system architectures; and various components of hardware system 200 may be rearranged. For example, cache 204 may be on-chip with processor 202. Alternatively, cache 204 and processor 202 may be packed together as a “processor module,” with processor 202 being referred to as the “processor core.” Furthermore, certain implementations of the present invention may not require nor include all of the above components. For example, the peripheral devices shown coupled to standard I/O bus 208 may couple to high performance I/O bus 206. In addition, in some implementations only a single bus may exist, with the components of hardware system 200 being coupled to the single bus. Furthermore, hardware system 200 may include additional components, such as additional processors, storage devices, or memories.
As discussed above, in one embodiment, the operations of the WLAN management server 20 described herein are implemented as a series of software routines run by hardware system 200. These software routines comprise a plurality or series of instructions to be executed by a processor in a hardware system, such as processor 202. Initially, the series of instructions are stored on a storage device, such as mass storage 218. However, the series of instructions can be stored on any suitable storage medium, such as a diskette, CD-ROM, ROM, EEPROM, etc. Furthermore, the series of instructions need not be stored locally, and could be received from a remote storage device, such as a server on a network, via network/communication interface 216. The instructions are copied from the storage device, such as mass storage 218, into memory 214 and then accessed and executed by processor 202.
An operating system manages and controls the operation of hardware system 200, including the input and output of data to and from software applications (not shown). The operating system provides an interface between the software applications being executed on the system and the hardware components of the system. According to one embodiment of the present invention, the operating system is the Windows® 95/98/NT/XP/Vista operating system, available from Microsoft Corporation of Redmond, Wash. However, the present invention may be used with other suitable operating systems, such as the Apple Macintosh Operating System, available from Apple Computer Inc. of Cupertino, Calif., UNIX operating systems, LINUX operating systems, and the like.
B.3. Wireless Client
In one embodiment, hardware system 400 includes a processor 402 and a cache memory 404 coupled to each other as shown. Additionally, hardware system 400 includes a high performance input/output (I/O) bus 406 and a standard I/O bus 408. A host bridge 410 couples processor 402 to high performance I/O bus 406, whereas an I/O bus bridge 412 couples the two buses 406 and 408 to each other. Hardware system 400 also includes a wireless network interface 424, a system memory 414, and a video memory 416 couple to bus 406. In turn, a display device 418 couples to video memory 416. A mass storage 420, a keyboard and pointing device 422, and I/O ports 426 couple to bus 408. Collectively, these elements are intended to represent a broad category of computer hardware systems, including but not limited to general purpose computer systems based on the Pentium® processor manufactured by Intel Corporation of Santa Clara, Calif., as well as any other suitable processor.
The remaining elements of hardware system 400 are described below. In particular, wireless network interface 424 provides communication between hardware system 400 and any of a wide range of wireless networks, such as a WLAN (i.e., IEEE 802.11), WiMax (i.e., IEEE 802.16), Cellular (e.g., GSMA), etc. Mass storage 420 provides permanent storage for the data and programming instructions to perform the above described functions implemented in the system controller, whereas system memory 414 (e.g., DRAM) is used to provide temporary storage for the data and programming instructions when executed by processor 402. I/O ports 426 are one or more serial and/or parallel communication ports that provide communication between additional peripheral devices, which may couple to hardware system 400.
Hardware system 400 may include a variety of system architectures; and various components of hardware system 400 may be rearranged. For example, cache 404 may be on-chip with processor 402. Alternatively, cache 404 and processor 402 may be packed together as a “processor module,” with processor 402 being referred to as the “processor core.” Furthermore, certain implementations of the present invention may not require nor include all of the above components. For example, the peripheral devices shown coupled to standard I/O bus 408 may couple to high performance I/O bus 406. In addition, in some implementations only a single bus may exist, with the components of hardware system 400 being coupled to the single bus. Furthermore, hardware system 400 may include additional components, such as additional processors, storage devices, or memories.
In one embodiment, the operations of wireless client-side functionality are implemented as a series of software routines run by hardware system 400. These software routines, which can be embodied in a wireless network interface driver, comprise a plurality or series of instructions to be executed by a processor in a hardware system, such as processor 402. Initially, the series of instructions are stored on a storage device, such as mass storage 420. However, the series of instructions can be stored on any suitable storage medium, such as a diskette, CD-ROM, ROM, etc. Furthermore, the series of instructions need not be stored locally, and could be received from a remote storage device, such as a server on a network, via network/communication interface 424. The instructions are copied from the storage device, such as mass storage 420, into memory 414 and then accessed and executed by processor 402. In alternate embodiments, the present invention is implemented in hardware or firmware.
While
C. Live Troubleshooting
As
The WLAN management server 20 then identifies sources of log data relevant to the wireless client's connectivity issue based on the WLANs and the central controller(s) identified in the polling step (506). For example, WLAN management server 20 may identify network elements, such as AAA servers, location servers, DHCP servers, etc. In one implementation, WLAN management server 20 may determine from the central controller 42 other network elements that may play a role in client connection to the network, For example, in one implementation, the DHCP configuration for this WLAN may identify DHCP server. Also, a security scheme and authentication configuration may help to identify AAA servers that play a role in authenticating the wireless client. Also, the central controller 42 may contact the location server 22 to determine if the wireless client is permitted to join the network at a specific location. In particular implementations, WLAN management server 20 determines this information either directly through the central controller 42 or through a database of enterprise network information based on the network elements involved. In particular implementations, such log sources may include the wireless client 60, one or more wireless access points 50, central controllers 42, AAA servers 21, location servers 22, DHCP servers 23, routers 32, switches, and other network elements that may interact with the wireless client (such as during a protocol exchange) or observe network traffic of the wireless client 60. In one implementation, the central controller 42 may also be polled for identifying information for these log sources. For example, WLAN management server 20 may probe a central controller 42 for its primary and backup AAA servers.
In one implementation, the WLAN management server 20 invokes troubleshoot logging for the wireless client 60 on all central controllers 42 to cause the central controllers 42 to perform an augmented logging function (508). In particular implementations, the central controllers 42 collect and maintain log data associated with their own interactions with the wireless client 60. If the wireless client 60 leaves the WLAN with which a given central controller 42 is associated, the central controller 42 may preserve the log data and resume logging if the wireless client 60 returns. In one implementation, the central controller 42 may cause the client utility, if available, of the wireless client 60 to perform logging functions such as polling other network elements for log data, initiating logging of transmitted and received wireless frames, etc. In connection with the troubleshooting function, the WLAN management server 42 may also transmit a set of commands to a central controller 42, which forwards the commands to the client utility of the wireless client 60. The commands may cause the wireless client 60 to step through a set of internal diagnostics and attempt association with an identified WLAN on a specified operating channel. During this process, the client utility may log all transmitted and received frames, which may be sent (in addition to security logs, system logs, and the like) to the WLAN management server 20 for analysis.
The WLAN management server 20 also polls the identified log sources (e.g., AAA servers, DHCP servers, etc.) for log data (510). In particular implementations, the WLAN management server 20 analyzes the collected data to determine connection state transitions and thereby isolate and determine potential sources of the connectivity problems encountered by the wireless client 60. For example, the WLAN management server 20 may correlate the connection state information returned by the wireless client 60 and/or the central controller 42 with the log data of different sources to diagnose the connectivity problem. The WLAN management server 20 determines the current connection state (e.g., PEM state) of the wireless client based on analysis of the (e.g., PEM state and security type) provided by the central controller 42. Connection state transitions may include, for example, when the wireless client 60 probes, roams, associates, authenticates, receives an IP address, etc. In one implementation, the WLAN management server 20 may perform diagnostic functions associated with the wireless client 60 in order to generate more information that may be used for troubleshooting.
In one implementation, the WLAN management server 20 may also probe the wireless client 60 or other wireless network nodes for information associated with the wireless client 60 for other reasons. For example, the WLAN management server 20 may probe the location server 22 to determine the location of the wireless client 60. The WLAN management server 20 may probe the central controller 42 for a wireless access point association history within a predefined period of time (e.g., in the last 24 hours) or for wireless client-related notifications transmitted with a predefined period of time (e.g., in the last 24 hours). Such wireless client notifications may indicate any blacklist events, Wireless Encryption Protocol (WEP) decryption errors, or other security related events. The WLAN management server 20 may probe the wireless client 60 for association and/or authentication related information. During the polling process (510), the WLAN management server 20 may reference any such information that the WLAN management server 20 has already collected.
The WLAN management server 20 then correlates and time adjusts the log data from the different sources based on an expected sequence of events inherent to the wireless network and security protocols employed by the wireless client and the network infrastructure (512). The WLAN management server 20 then identifies a possible cause or causes of the connectivity problem (514). In particular implementations, when a wireless client 60 connects to the wireless network, the wireless client 60 and other wireless network elements (e.g., wireless access point, etc.) perform an expected ordered set of operations (e.g., security protocol) based a security type. In one implementation, a given sequence of events ensures that a given wireless client 60 is properly connected to the wireless network. In one implementation, the sequence of events may include multiple events and connection state transitions. For example, if the wireless client 60 connects to a wireless network using 802.1X authentication, the sequence of events may include the following: 802.11 authentication, 802.11 association, IP address assignment, etc. If the wireless client 60 connects to a wireless network using WEB authentication, the states may include 802.11 association, IP address assignment, WEB authentication, successful association, etc.
The WLAN management server 20 correlates the connection state and the log data to the expected ordered sequence of events to isolate and identify the possible causes of the connectivity problem. Because the log data is consolidated and ordered by time stamp, the WLAN management server 20 may ascertain the possible causes of the connectivity problem based on discrepancies between the log data and the expected sequence of events.
In one implementation, the WLAN management server 20 may analyze the last N events associated with the current connection state and compare the last N events against an expert database or lookup table to identify the possible causes of the connectivity problem. In one implementation, the WLAN management server 20 may analyze predefined codes to interpret log messages from the log data. For example, in one implementation, log messages from the log data associated with the DHCP server 23 may indicate particular events and/or problems:
Severity: Normal
Message Code: 101
Message: Controller association request message received
Severity: Normal
Message Code: 104
Message: Controller client moved to DHCP required state.
Severity: Major
Message Code: 106
Message: DHCP failed. DHCP Server not up.
In one implementation, the severity indication may indicate whether an event is normal and expected event (e.g., severity=normal) or a problem (e.g., severity=major); and the message code correlates to a message in the log data.
The following is a list of other messages that the log data may provide:
The WLAN management server 20 then generates a report (516). In one implementation, if the cause of the problem is on the wireless client side, the WLAN management server 20 sends the report to the wireless client (e.g., an e-mail to the end-user) with suggested corrective actions. If the cause of the problem is on the wireless network infrastructure side, the WLAN management server 20 sends the report to a network administrator with suggested corrective actions.
In one implementation, the WLAN management server 20 may display the report in a graphical user interface (GUI) for the network administrator to view. For example, with the example describe above, the WLAN management server 20 may display a summary screen indicating that the cause of the connectivity problem is related to the DHCP server 23 (e.g., the DHCP server 23 is down or not reachable via the central controller 42). The summary page may also include a link to another screen that displays raw log data associated with the DHCP server 23. Implementations of the GUI are described below in connection with
D. Scheduled Troubleshooting
The troubleshooting functions discussed above can be accessed on-demand, as described above. The troubleshooting functionality discussed above can also be aggregated or accessed as a service available to other processes and systems, such as a trouble ticket system. In particular scenarios, the WLAN management server 20 may require additional information from the wireless client 60 or may require the wireless client 60 to perform particular actions in order for WLAN management server 20 to complete its troubleshooting process. However, the wireless client 60 may no longer be available (offline) for troubleshooting. For example, the end-user of the wireless client may open a trouble ticket indicating the connectivity problem but then go offline. A network administrator who receives the trouble ticket may then contact the end-user (e.g., by phone call or e-mail) to instruct the end-user to go online for the troubleshooting process to continue. As such, the WLAN management server 20 may invoke a scheduled trouble shooting process, as described below, where the wireless client is added to a watch list and a troubleshooting process is invoked if a connectivity problem is detected.
E. Integration with External Trouble-ticket System
In one implementation, the WLAN management server 20 provides troubleshooting functionality that is integrated with the functionalities of an external trouble ticket system to perform live troubleshooting or scheduled troubleshooting at a future time when a trouble ticket is opened. In particular implementations, the external trouble ticket system may be a web-based interface. As described in more detail below, in particular implementations, the trouble ticket may be user-invoked or the WLAN management server may automatically initiate the troubleshooting process.
E.1. End-User Invoked Trouble Tickets
As
The WLAN management server 20 then generates a report (708). In one implementation, the WLAN management server 20 sends the report to the trouble ticket system to indicate the results of the troubleshoot process. The WLAN management server 20 then analyzes the report (710) to determine whether the cause of the connectivity problem was identified, and, if identified, whether the cause of the connectivity problem was on the wireless client side or on the wireless network infrastructure side.
The WLAN management server 20 then applies a troubleshoot policy based on the analysis (712). For example, as indicated above in one implementation, if the cause of the problem is on the wireless client side, the WLAN management server 20 may send an e-mail to the wireless client with suggested corrective actions. In one implementation, the WLAN management server 20 may then close the trouble ticket and remove the wireless client from the watch list. If the cause of the problem is on the wireless network infrastructure side, the WLAN management server 20 may send a report to a network administrator with suggested corrective actions. For example, if the WEP key is incorrect, the WLAN management server 20 may suggest that the network administrator confirm the WEP Key and re-attempt connection to the wireless network. The WLAN management server 20 may also suggest that the network administrator reconfigure a particular aspect of the wireless network.
In one implementation, the WLAN management server 20 may display the report in a GUI for the network administrator to view. If the WLAN management server 20 did not identify the cause of the problem, the WLAN management server 20 may instruct the wireless client to perform one or more particular actions in order to collect more information for the troubleshooting processing. In one implementation, the network administrator may interact with the WLAN management server 20 via the GUI (described in more detail below) in order to complete the troubleshooting process. In one implementation, the WLAN management server 20 may add the wireless client to a watch list to monitor the wireless client and to send periodic asynchronous status reports to the trouble ticket system.
E.2. Automated Trouble Tickets
In one implementation, the WLAN management server 20 may monitor wireless clients to automatically determine which wireless clients have connectivity problems when such problems occur. Upon detection of a connectivity problem, the WLAN management server 20 may automatically determine possible causes of the connectivity problem according to one or more of the process flows described above. As such, the WLAN management server 20 may proactively detect problems and provide correction actions to a network administrator. In particular implementations, the WLAN management server 20 may also automatically open a trouble ticket that could get assigned to a network administrator without having an end-user have to initiate a trouble ticket.
F. Graphical User Interface
The present invention has been explained with reference to specific embodiments. For example, while embodiments of the present invention have been described as operating in connection with IEEE 802.11 networks, the present invention can be used in connection with any suitable wireless network environment. Other embodiments will be evident to those of ordinary skill in the art. It is therefore not intended that the present invention be limited, except as indicated by the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
7324499 | Borella et al. | Jan 2008 | B1 |
7373508 | Meier et al. | May 2008 | B1 |
7451316 | Halasz et al. | Nov 2008 | B2 |
20060037063 | Clemmons et al. | Feb 2006 | A1 |
20060236383 | Cam-Winset et al. | Oct 2006 | A1 |
20070083470 | Bonner et al. | Apr 2007 | A1 |
20070105546 | Linkert et al. | May 2007 | A1 |
20080086634 | Salowey et al. | Apr 2008 | A1 |
Number | Date | Country |
---|---|---|
WO 2006102505 | Sep 2006 | WO |
Number | Date | Country | |
---|---|---|---|
20090113052 A1 | Apr 2009 | US |