The present invention relates generally to computer systems, and more particularly, to methods, systems and computer program products that manage resources in computer systems.
A computer system can include physical and logical system resources (e.g., computers, servers and networks), and these physical and logical resources may in turn host many software resources, which in turn may host more dependent software resources. Resource management systems can monitor some or all of these resources for situations that require corrective action.
A resource management system may include a collection of rules that determines under what situations corrective action should be initiated and the type of corrective action to apply in a particular situation. Some resource management systems extract metrics (data) at specified time intervals from resources that are being monitored, process those metrics, and use processing logic to identify the occurrence of a condition (an “alert condition”) that may require corrective action by a user and/or by self-corrective procedures.
The effectiveness of a resource management system can depend on its ability to properly identify alert conditions and notify a user and/or take self-corrective actions.
Various embodiments of the present invention provide methods, systems, and/or computer program products for selecting among a plurality of alert conditions for processing by a resource management system. A priority indication is associated with at least some resources in a computer system. Resources that are associated with the alert conditions are identified. An alert condition is selected from among the alert conditions based on the priority indication that is associated with the identified resources.
In some further embodiments of the present invention, the resources are prioritized based on their importance to operation of a business, based on an affect of their failure on other resources in the computer system, and/or based on whether a resource is a server of information for predetermined software applications in the computer system. A user may be notified of the selected alert condition and/or corrective action may be initiated for the selected alert condition. The alert conditions may be displayed to a user and/or corrective actions may be taken based on the priority indication associated with the identified resources.
The present invention will now be described more fully with reference to the accompanying drawings, in which embodiments of the invention are shown. This invention, however, may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art. Like reference numerals refer to like elements throughout.
As will be appreciated by one of skill in the art, the present invention may be embodied as methods, computer systems, or computer program products. Accordingly, the present invention may take the form of entirely software embodiments or embodiments combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product on a computer-usable storage medium having computer-usable program code embodied in the medium. Any suitable computer readable medium may be utilized including hard disks, CD-ROMs, optical storage devices, a transmission media such as those supporting the Internet or an intranet, or magnetic storage devices.
Computer program code for carrying out operations of the present invention may be written in an object oriented programming language such as Java®, Smalltalk or C++ and/or using a conventional procedural programming languages, such as the “C” programming language. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer. In the latter scenario, the remote computer may be connected to the user's computer through a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
The present invention is described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. In some alternate embodiments of the invention, the functions/acts noted in the blocks may occur out of the order noted in the operational illustrations. For example, two blocks shown in succession may in fact be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/acts involved.
It will be understood that blocks of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
According to some embodiments of the present invention, the computer 110a includes a resource management system 130 that manages at least some resources in the computers 110a-d and/or the network 120. The resources are monitored based on, for example, their performance, availability, and/or operation, to identify conditions for which corrective action may be needed by a user and/or by the resource management system 130. When such conditions are identified, an alert condition is communicated to the resource management system 130 in the computer 110a. Exemplary alert condition communications 140a-c are illustrated in
Although only four computers 110a-d and one network 120 are shown in
According to some embodiments of the present invention, the resource management system 130 associates a priority indication with at least some resources in the computer system 100. The resources that are associated with the alert conditions in the queue 150 are identified. For example, the resource may be directly identified from information that is included in the alert condition communications 140a-c which identifies their sources, and/or may be indirectly identified based on characteristics of the alert condition. For example, a resource may be identified based on an analysis of which resources in the computer system 100 can cause the alert condition (e.g., based on notice that the network 120 has a low effective bandwidth or high error rate or that one of the computers 110b-d or application(s) thereon is non-responsive). The resource management system 130 then selects an alert condition from among the alert conditions in the queue 150 based on the priority indication that is associated with the identified resources. The selected alert condition may then be prioritized over at least some of the other alert conditions for response. The alert conditions in the queue 150 may thereby be prioritized relative to each other when the resource management system 130 is fetching the next alert to process.
The priority indication for a resource may be based on, for example, its relative importance to certain business operations, the affect of its failure on other resources in the computer system 100, whether it is associated with a server of information for other resources (e.g., a server for predefined computers and/or software applications), and/or based on other attributes of the resource. The priority indication may be predefined by a user and/or it may be assigned by the resource management system 130. Thus, for example, software/hardware resources that are needed for receiving new product orders or handling customer inquiries can have a priority indication that causes the resource management system 130 to prioritize alert conditions associated with those resources higher than other alert conditions. For example, when hardware and/or software resources are dynamically provisioned to enable on-demand balancing to achieve business' goals, the priorities of the re-provisioned resources can be reset based upon the priorities of the business goal.
The memory 206 is representative of the overall hierarchy of memory devices containing the software and data used to implement the functionality of the computer 200. The memory 206 can include, but is not limited to, the following types of devices: cache, ROM, PROM, EPROM, EEPROM, flash memory, SRAM, and DRAM. As shown in
As is further seen in
The data portion 226 of memory 206, as shown in the embodiments of
While the resource management system application 228 and the components of the data portion 226 of memory 206 are illustrated in
At Block 310, the resources that are associated with the alert conditions are identified. As explained above, the resources may be identified based on, for example, information that is included in the alert condition and/or based on characteristics of the alert condition. At Block 320, an alert condition from among the alert conditions is selected based on the priority indication that is associated with the identified resources. For example, the alert queue can be examined to locate the first alert associated with a high priority resource. If multiple high priority resources have alerts in the queue, the alerts belonging to the highest priority resource is selected. When the resources have equal priority, the first alert in the queue from among the resources is selected.
At Block 330, a response is initiated for the selected alert condition. For example, the alert condition is examined to determine the error condition reported within the alert. The alert can be compared against other processed alerts in the system to detect if the current alert is a symptom of an already reported condition or if the alert is a root cause for other reported alerts. Responses may then be initiated locally on the resource management system or remotely on a computer or other resource identified within the alert to reconfigure, terminate, reinitialize or otherwise cause operations to be taken with respect to a resource(s) that is associated with the alert condition to remedy the problem. Regardless of whether the response is initiated locally or remotely, the alert may be displayed to a user, and may be prioritized relative to other alerts for display to a user and/or emphasized (e.g., highlighted, placing in an upper portion of a list, or otherwise displayed in a predetermined fashion) relative to other alerts that are displayed to a user. Alternatively or additionally, the initiated response can include prioritizing the selected alert condition relative to other alert conditions for processing for corrective action.
The operation of Blocks 300-330 is the same as that described with regard to the same numbered blocks in
While this invention has been particularly shown and described with reference to particular embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims and equivalents.
Number | Name | Date | Kind |
---|---|---|---|
4937743 | Rassman et al. | Jun 1990 | A |
5109486 | Seymour | Apr 1992 | A |
5367670 | Ward et al. | Nov 1994 | A |
5369570 | Parad | Nov 1994 | A |
5528759 | Moore | Jun 1996 | A |
5666486 | Alfieri et al. | Sep 1997 | A |
5729688 | Kim et al. | Mar 1998 | A |
5862333 | Graf | Jan 1999 | A |
5935218 | Beyda et al. | Aug 1999 | A |
6046980 | Packer | Apr 2000 | A |
6047322 | Vaid et al. | Apr 2000 | A |
6052722 | Taghadoss | Apr 2000 | A |
6125390 | Touboul | Sep 2000 | A |
6154787 | Urevig et al. | Nov 2000 | A |
6233645 | Chrysos et al. | May 2001 | B1 |
6263359 | Fong et al. | Jul 2001 | B1 |
6272544 | Mullen | Aug 2001 | B1 |
6327630 | Carroll et al. | Dec 2001 | B1 |
6341303 | Rhee et al. | Jan 2002 | B1 |
6353616 | Elwalid et al. | Mar 2002 | B1 |
6363411 | Dugan et al. | Mar 2002 | B1 |
6408277 | Nelken | Jun 2002 | B1 |
6425006 | Chari et al. | Jul 2002 | B1 |
6430592 | Davison | Aug 2002 | B1 |
6446134 | Nakamura | Sep 2002 | B1 |
6643367 | White-Hauser | Nov 2003 | B1 |
7047292 | Stewart et al. | May 2006 | B1 |
7216163 | Sinn | May 2007 | B2 |
7225250 | Harrop | May 2007 | B1 |
20020138571 | Trinon et al. | Sep 2002 | A1 |
20020144147 | Basson et al. | Oct 2002 | A1 |
20030061265 | Maso et al. | Mar 2003 | A1 |
20030079030 | Cocotis et al. | Apr 2003 | A1 |
20030177176 | Hirschfeld et al. | Sep 2003 | A1 |
20030191989 | O'Sullivan | Oct 2003 | A1 |
Number | Date | Country | |
---|---|---|---|
20050198640 A1 | Sep 2005 | US |