The present application claims priority from Japanese application JP 2005-341777 filed on Nov. 28, 2005, the content of which is hereby incorporated by reference into this application.
The present invention relates to an information monitoring technique for handling, when monitoring information of a plurality of computers, units and names of performance information and capacity information in a unified manner.
There exists a method to manage, in a unified manner, management items to be managed in an operation managing system. For example, JP-A-2002-7231 describes a technique for an information managing system, the technique being applied to operation management in a computer system and/or a network system in a firm and/or a factory. A large number of computer terminals and network devices exist on such a network. To manage the computer terminals and the network devices, there are required managing ledgers including an information device managing ledger, a software managing ledger, an address managing ledger, and a vendor managing ledger. Although these managing ledgers include, depending on cases, data items which have mutually different item names and which are equal in the contents thereof to each other, the item names are not integrally handled in all managing ledgers. To solve the problem, the item names indicating the same contents are related to each other to refer to the same value to thereby manage the data in a unified manner.
For the operation management, there also exist items having units such as performance information and capacity information of a computer system. In the operation management only for the same vendor and the same product, it is not required to pay attention to the unit. However, in an environment in which the operation management is conducted for a plurality of vendors or in which the operation management is conducted for the same vendor and mutually different operation management products, the unit differs between the same item name in some cases. In this case, an administrator of Intelligent Technology (IT) resources or a job manager needs to conduct the operation by paying attention to the units. For the performance information and the capacity information, there possibly exists a case in which mutually different item names indicate the same item and the units thereof differ from each other. This situation cannot be coped with the prior art.
It is therefore an object of the present invention to provide an information monitoring method capable of conducting, in an environment in which a plurality of operation management schemes for different vendors exist on a network, the operation management schemes in a unified manner without paying attention to units and item names of data items unique to the operation management schemes.
According to the present invention, there is provided an information monitoring system to monitor information of a plurality of computers in which item identification information (an item identifier (ID) or name) and a unit in an acquisition request for performance information, performance information, or policy information to be communicated between devices during the operation to monitor the information of a plurality of computers are converted into item identification information and a unit of a device of a transmission destination before the item identification information and the unit are transmitted thereto.
In the information monitoring system of the present invention, during the operation to monitor information of a plurality of computers to be monitored such as job computers and a storage managing computer, when a system managing client or a job managing client sends an acquisition request for performance information of the job computers and the storage managing computer to a managing computer, a manager processing part of the managing computer refers to an information table by operation management having stored common performance identifiers, common units, and common performance item names by operation management corresponding to the operation management schemes of the job computers and the storage managing computer. The manager processing part then converts a common performance item identifier in the acquisition request for performance information into a performance item name by operation management of the job computers and the storage managing computer to send the converted acquisition request to the job computers and the storage managing computer.
Each of the agent processing parts of the job computers and the storage managing computer obtains, for an item indicated by the converted performance item name by operation management in the acquisition request, values of the job computer and the storage managing computer and then sends the values as performance information to the managing computer.
When the performance information is received from one of the job computers and the storage managing computer, the managing computer refers to the information table by operation management and a unit conversion table including each unit and a converted value of the unit to convert a performance item name by operation management and a unit in the received performance information into a common performance item identifier and a common unit of the system managing client and the job managing client on the acquisition request side and then sends the common performance item identifier and the common unit to the acquisition request side.
In the information monitoring system of the present invention, during the operation to set policy information indicating processing to be executed when a value of a predetermined item satisfies a threshold value condition to the job computers and the storage managing computer, when the system managing client or the job managing client sends policy information of the job computers and the storage computer to the managing computer, the manager processing part of the managing computer refers to the information table by operation management and the unit conversion table to convert a common performance item identifier and a common unit in the policy information into a performance item identifier and a unit of each of the job computers and the storage managing computer to send the converted policy information to the job computers and the storage managing computer.
Each of the agent processing parts of the job computers and the storage managing computer obtains, for an item indicated by the converted performance item name by operation management in the policy information, a value of each of the job computers and the storage managing computer to execute, if the value satisfies a threshold value condition corresponding to the item name, predetermined processing corresponding to the threshold value condition, for example, terminates an application or transmits a notification indicating that the threshold value condition is satisfied.
When the notification is received from one of the agent processing parts of the job computers and the storage managing computer, the managing computer refers to the information table by operation management and the unit conversion table to convert a performance item name by operation management and a unit in the received notification into a common performance item identifier and a common unit of the system managing client and the job managing client as the notification destinations and then sends the common performance item identifier and the common unit to the system managing client and the job managing client.
According to the present invention, in an environment including a plurality of different operation management schemes on a network, the operation management schemes can be conducted in a unified manner without paying attention to item identification information and units of performance information unique to the operation management schemes.
Other objects, features and advantages of the invention will become apparent from the following description of the embodiments of the invention taken in conjunction with the accompanying drawings.
Next, description will be given of an embodiment of an information monitoring system to monitor information of a plurality of computers.
A system managing client 101 conducts operation, for example, obtains performance information of the storage managing computer 105 and the job computers 106 to 108 and sets a policy via the managing computer 103. A job managing client 102 conducts operation, for example, controls jobs and services, obtains performance information and capacity information of the job computers 106 to 108 and job applications (AP), and sets a policy via the managing computer 103. These clients may be a Graphic User Interface (GUI) which a manager operates via a screen and/or a GUI to monitor a service process and a business process using, for example, an Information Technology Infrastructure Library (ITIL) and a Service Oriented Architecture (SOA). Although the system managing client 101 and the job managing client 102 operate on the client computer, these clients 101 and 102 may operate also on the managing computer 103.
The system managing client 101 is operated by a system manager. The system manager is a manager of IT resources such as a computer, a storage, and a network and maintains and manages, for example, an operating system (OS) required to operate a job application and software programs (ITIL and SOA) which are a premise and/or a basis of the job application. The job managing client 102 is operated by a job manager. The job manager is a manager to maintain and to manage the job application and may also manage the job application developed and/or combined on the software programs such as ITIL and SOA used as a premise and/or a basis of the job application.
It is assumed that a program which makes the managing computer 103 function as the manager processing part 201 is recorded in a recording medium such as a Compact Disk Read Only Memory (CD-ROM) to be stored in, for example, a magnetic disk. The program is thereafter loaded from the magnetic disk in a memory to be executed. The recording medium to record the program may be a recording medium other than a CD-ROM. The user may install the program from the recording medium in an information processing apparatus to thereby use the program. Or, the user may access the recording medium via a network to use the program.
The storage managing computer 105 includes a storage agent processing part 220. For an item indicated by the converted performance item name by operation management in the acquisition request, the storage agent processing part 220 obtains performance information of the storage managing computer 105 and sends the performance information by a communication device. Also, for an item indicated by the converted performance item name by operation management in the acquisition request, the storage agent processing part 220 obtains a value of the storage managing computer 105 and executes, if the value satisfies a threshold value condition corresponding to the item name, predetermined processing corresponding to the threshold value condition by a processing unit.
It is assumed that a program which makes the storage managing computer 105 function as the storage agent processing part 220 is recorded in a recording medium such as a CD-ROM to be stored in, for example, a magnetic disk. The program is thereafter loaded from the magnetic disk in a memory to be executed. The recording medium to record the program may be a recording medium other than a CD-ROM. The user may install the program from the recording medium in an information processing apparatus to thereby use the program. Or, the user may access the recording medium via a network to use the program.
The job computer 106 includes an agent processing part 230. For an item indicated by the converted performance item name by operation management in the acquisition request, the agent processing part 230 obtains performance information of the job computer 106 and sends the performance information by a communication device. Also, for an item indicated by the converted performance item name by operation management in the policy information, the agent processing part 230 obtains a value of the job computer 106 and executes, if the value satisfies a threshold value condition corresponding to the item name, predetermined processing corresponding to the threshold value condition by a processing unit.
It is assumed that a program which makes the job computer 106 function as the agent processing part 230 is recorded in a recording medium such as a CD-ROM to be stored in, for example, a magnetic disk. The program is thereafter loaded from the magnetic disk in a memory to be executed. The recording medium to record the program may be a recording medium other than a CD-ROM. The user may install the program from the recording medium in an information processing apparatus to thereby use the program. Or, the user may access the recording medium via a network to use the program.
The job computer 107 includes an agent processing part 234. For an item indicated by the converted performance item name by operation management in the acquisition request, the agent processing part 234 obtains performance information of the job computer 107 and sends the performance information by a communication device. Also, for an item indicated by the converted performance item name by operation management in the policy information, the agent processing part 234 obtains a value of the job computer 107 and executes, if the value satisfies a threshold value condition corresponding to the item name, predetermined processing corresponding to the threshold value condition by a processing unit.
It is assumed that a program which makes the job computer 107 function as the agent processing part 234 is recorded in a recording medium such as a CD-ROM to be stored in, for example, a magnetic disk. The program is thereafter loaded from the magnetic disk in a memory to be executed. The recording medium to record the program may be a recording medium other than a CD-ROM. The user may install the program from the recording medium in an information processing apparatus to thereby use the program. Or, the user may access the recording medium via a network to use the program.
The managing computer 103 includes the manager processing part 201. The manager processing part 201 includes a job-service management processing part 202, a configuration management processing part 203, a name/unit setting processing part 204, a policy control processing part 205, a name/unit conversion processing part 206, and communication processing part 207. The managing computer 103 includes name/unit information 210, configuration information 211, policy information 212, and client managing information 213.
The job computers 106, 107 include operating systems 232, 236, agent processing parts 230, 234, and job applications 231, 235, respectively. The other job computers are configured in almost the same way as for the job computer 106. The agent processing part 230 conducts operation, for example, collects performance information and capacity information of the operating system and the job application and monitors a threshold value for the performance information and the capacity information.
The storage managing computer 105 includes a storage agent processing part 230 and an operating system 221. The storage agent processing part 230 conducts operation, for example, collects performance information and capacity information of the storage 110 and monitors a threshold value for the performance information and the capacity information. The storage 110 includes Logical VoluMes (LVM) such as LVM1240 and LVM2241 which store therein data to be used by the job. The performance information and the capacity information may be collected in association with the storage 110 and may also be collected in association with the logical volumes.
Although not shown in the drawings, it is assumed that each of the computers and the clients includes programs which make the computer or the client function as each processing part, a Central Processing Unit (CPU) to execute programs such as a job application and an operating system, a memory in which the programs are to be loaded and in which data is temporarily stored at execution of a program, a magnetic disk device to store the programs and the information described above, an input device which conducts operation, for example, an operation to input an indication of operation and data; an output device which conducts operation, for example, an operation to output various information items, and a hardware resource such as a communication device to communicate with other computers and other clients.
The configuration management processing part 203 may be virtualized. In such a case, the virtual configuration may be implemented such that when viewed from the job manager, the job application cannot identify the job computers 106 and 107 in an operating state.
FIGS. 3 to 6 show details of the configuration information 211.
Depending on cases, one job uses one job computer or plural computers, and two or more jobs share one job computer. In
For example, No. 1 of
Depending on cases, a plurality of operation management schemes are operating on one computer. For example, the case of No. 1 of
FIGS. 7 to 9 show details of the name/unit information 210.
To each performance item name, a performance item ID 702 is assigned. When the performance item ID 702 is determined, one performance item name 704 is uniquely determined. By an operation management type 703, it is possible to identify operation management associated with the performance item. For example, in the case of a performance item indicated as No. 1 for which the performance item ID is OS_CPU1, the operation management type 703 is OS, and hence it is known that the performance item is used for OS management. It is also known that the performance item name 704 is “CPU usage ratio”. Depending on operation management schemes, there may exist entries which have the same performance item and which have different names such as “CPU usage ratio” and “CPU utilization” in some cases. By using the common performance item name of
Each entry of the table includes an operation management ID 801, a performance item ID 802, a unit 803, a performance item name by operation management 804, and a performance calculation expression 805. By establishing a correspondence between the performance item ID 802 and the performance item ID 702 of
The table includes data items according to categories 901 such as a byte unit 904 indicating size, a unit of percentage 905 of a ratio, and a time unit 906 representing a unit of time, e.g., a second or a minute. In each category, a unit is determined as a reference, and a magnification factor to convert a value in the units into a value in the reference units is kept in the converted value 910-927.
The policy information of the job threshold value table is valid for job computer(s) (1006-1009) designated by the target 1005. When the job computer as the target satisfies the threshold value condition 1003, the job computer executes the processing 1004 and notifies threshold value exceeded indicating that the threshold value is exceeded to the job managing client name 1002. As in the threshold value condition 1003 for which the threshold value ID 1001 has a value of TH4, the threshold value condition 1003 may include a plurality of performance items using an AND condition and an OR condition.
It is assumed that the policy information of the system management threshold value table is valid for one of the job computers 106 to 108 (1105, 1106) and the storage managing computer 105 (1107) designated in the target 1104. If ALL is specified, the policy information is valid for all computers managed by the managing computer 103. If the computer designated in the target 1104 satisfies the threshold condition 1102, the computer executes the processing 1103 and notifies “threshold value exceeded” to the system managing client 101.
A job managing client abc1 (1301) manages a job 1 and requests to notify a memory usage (OS_MEM1) in kilobyte units (kByte), a disk usage (OS_DISK1) in kilobyte units (kByte), and database usage size (DB_SIZE1) in kilobyte units (kByte; 1). A job managing client def1 (1302) manages the job 1 and a job 2 and requests a memory usage (OS_MEM1) in Megabyte units (MByte) and database usage size (DB_SIZE1) in Gigabyte units (GByte; 2). A job managing client abc2 (1303) manages the job 2, but is not connected to the managing computer 103 in the current state.
According to the job table of
Next, description will be given of the conversion of the performance item name and the unit when performance information is received from the job managing client. For the request (1) from the job managing client abc1, there is used a common performance item name (performance item ID) as stored in the performance item table of
According to the computer operation managing table of
Description will be given to the conversion of the performance item name and the unit when the performance information sent from the agent is notified to the client. As opposed to the operation to request performance information, the item name and the unit of each operation management are converted into a common item name (item ID) and a unit designated by the client. For example, as a result of the request in 1-1-1, a memory usage and a disk usage are returned in units of bytes (Byte) and kbytes (kByte), respectively. According to
Through the above procedure, the manager processing part can obtain, according to a request from at least one client, performance information having a name and a unit requested by the client. That is, for a name and a unit requested by a job manager operating the job managing client, the name and unit possessed by the operation management may be virtualized.
Next, the manager processing part 201 checks the contents of the termination flag in step 1407 and repeatedly executes request processing as follows. First, before the request processing, the manager processing part 201 obtains, by a communication device, configuration information of the storage managing computer 105 and the job computers 106 to 108 from the agent processing parts respectively of the computers 105 to 108 (1413). The manager processing part 201 compares the obtained configuration information with the previous configuration information, and updates the computer managing table (
Next, the manager processing part 201 receives by a communication device an input of a request from one of the system managing client 101, the job managing client 102, the storage managing computer 105, and the job computers 106 to 108 (1416) and executes processing according to the request.
If the received request is a client connection request from the system managing client 101 or the job managing client 102 (1417), the manager processing part 201 executes client request processing (1418). If the request is a policy information setting request (1419), the manager processing part 201 executes policy information setting processing (1420). If the request is a name/unit setting request (1421), the manager processing part 201 executes name/unit setting processing (1422). If the request is a monitoring request (1423), the manager processing part 201 executes monitoring processing (1424).
If the received request is an event receiving request from one of the storage managing compute 105 and the job computers 106 to 108 (1425), the manager processing part 201 executes event receiving processing (1426). If the received request is a termination request (1427), the manager processing part 201 stores “1” in the termination flag in the memory (1428).
If the termination flag is “1” as a result of the check of the contents of the termination flag in step 1407, the manager processing part 201 executes termination processing as follows. That is, the manager processing part 201 terminates the communication processing part 207 (1408), the name/unit conversion processing part 206 (1409), the name/unit setting processing part 204 (1410), the policy control processing part 205 (1411), and the configuration management processing part 203 (1412).
In step 1501, the configuration management processing part 203 of the managing computer 103 checks the contents of the client request received by a communication device from the system managing client 101 or the job managing client 102 and receives as an input, if the client request is a client connection request, a client name from the client request (1502). The configuration management processing part 203 then makes a check to determine whether or not the inputted client name exists in the client managing table (
On the other hand, if the client request is a client disconnection request as a result of the check of the contents of the client request in step 1506, the configuration management processing part 203 receives as an input a client name from the client request (1507) and sets the state of the inputted client name in the client managing table (
If the client request is a client deletion request as a result of the check of the contents of the client request in step 1509, the configuration management processing part 203 receives as an input a client name from the client request (1510) and then deletes the row of the inputted client name from the client managing table (1511).
First, the policy control processing part 205 of the managing computer 103 makes a check to determine whether or not a client having issued a policy setting request is the job managing client 102 (1601). In the processing, the policy control processing part 205 checks a name of the client having issued the policy setting request. If the client name includes a character string of “system managing client”, it is determined that the client is the system managing client 101. If the client name does not include the character string, it is determined that the client is the job managing client 102.
If the client having issued the policy setting request is the job managing client 102, the policy control processing part 205 updates the job threshold value table (
Next, the policy control processing part 205 stores “0” in variable I in the memory to initialize variable I (1606), retrieves a performance item ID 802 matching the performance item ID in the policy information using the information table by operation management, and obtains an operation management ID 801 from an associated row to store the operation management ID 801 in variable MID in the memory (1607).
The policy control processing part 205 then executes processing as many times as there are elements of computer list L as below. That is, the policy control processing part 205 makes a check to determine whether or not there exists an operation management ID corresponding to job computer L[I] in the computer operation managing table, the operation management ID matching MID (1608). If there exists such an operation management ID, the policy control processing part 205 executes the performance item name conversion processing 1 (1609) and the unit conversion processing 1 (1610). Through the processing, the common item name (item ID) can be converted into an item name of job computer L[I] and the unit can be converted into a unit unique to job computer L[I]. Using the converted data, the policy control processing part 205 sends threshold value information to job computer L[I] (1611). After adding “1” to variable I, the policy control processing part 205 executes similar processing also for the next L[I] (1608 to 1613).
The name/unit setting processing 204 of the managing computer 103 first generates and stores an operation management ID in MID (1701). Next, the name/unit setting processing 204 receives inputs of a performance item ID list, a list of performance item names, a list of units for the performance item names, and a performance calculation expression list to store the lists in lists L, M, N, and O, respectively (1702 to 1705). The name/unit setting processing part 204 adds (MID, L[I], M[I], N[I], O[I]) to the operation management information table (
The manager processing part 201 of the managing computer 103 receives, when there exists a performance information acquisition request from a client (1801), an input of a client name (1802) and inputs of a list of performance item names and a list of units for the performance item names to store the lists in lists L and M (1803 and 1804). The manager processing part 201 executes the performance item name conversion processing 1 (1804) for list L to store a conversion result in list N (1805). Next, the manager processing part 201 obtains entire performance information of performance item list L (1806) to convert each unit of the obtained performance information (1807). The manager processing part 201 stores the converted performance information in list O (1808) and returns a combination of item name L and performance information O (1809).
The manager processing part 201 of the managing computer 103 converts, when there exists a “threshold value exceeded” notification from an agent (1901), item names regarding performance information as information associated with the “threshold value exceeded” notification into common item names (1902). The manager processing part 201 stores a threshold value ID associated with the “threshold value exceeded” notification in variable T (1903). When T is a job threshold value (1904), the manager processing part 201 obtains from the job threshold value table (
On the other hand, when T is a system managing threshold value in step 1904, the manager processing part 201 converts the unit of performance information associated with the “threshold value exceeded” notification into a unit designated by the system managing client (1909) and sends the “threshold value exceeded” notification including the converted performance information to the system managing client (1910).
The storage agent processing part 220 of the storage managing computer 105 or each agent processing part of the job computers 106 to 108 notifies, when a configuration information acquisition request is received (2001), current configuration information at the current point of time to the managing computer 103 (2002).
When a performance information acquisition request is received (2003), the agent processing part notifies performance information to the managing computer 103 (2004). After “0” is set to variable I in step 2005, steps 2006 to 2011 include processing to check a threshold value, specifically, to check all threshold value conditions in the job threshold value table (
First, the name/unit conversion processing part 206 of the managing computer 103 receives as an input a list of common performance item names (item IDs) to store the list in list L (2101). Next, the name/unit conversion processing part 206 receives an operation management ID to store the ID in MID (2102). The name/unit conversion processing part 206 sets “0” to I (2103) and converts, for each of the item names of list L, a common item name (item ID) into a performance item by operation management. For this purpose, the name/unit conversion processing part 206 retrieves a row for which L[I] matches with the performance item name 704 in the performance item table (
Next, the name/unit conversion processing part 206 retrieves a row of the information table by operation management (
First, the name/unit conversion processing part 206 of the managing computer 103 receives as an input a list of performance item names by operation management to store the list in list L (2201). Next, the name/unit conversion processing part 206 receives an operation management ID to store the ID in MID (2201). The name/unit conversion processing part 206 sets “0” to I (2203) and converts, for each of the item names of list L, a performance item name by operation management into a common item name (item ID). For this purpose, the name/unit conversion processing part 206 retrieves from the information table by operation management (
First, the name/unit conversion processing part 206 of the managing computer 103 receives as an input a list of performance information to store the list in list P (2301). Next, the name/unit conversion processing part 206 receives a unit designated by the manager and a unit of the performance information to store the units in U and V, respectively (2302 and 2303). If the unit of U matches that of V (2304), the name/unit conversion processing part 206 returns the performance information (2312). If the unit of U does not match that of V, the name/unit conversion processing part 206 obtains converted values of U and V from the unit conversion table (
It is assumed in the description that the unit designated by the manager indicates a unit of each client beforehand designated by the manager. It is assumed that at request of performance information from each client or at activation by each client, the unit is beforehand sent from the client to the managing computer 103 and is stored in, for example, a memory.
First, the name/unit conversion processing part 206 of the managing computer 103 receives as an input a list of performance information to store the list in list P (2401). Next, in consideration of a situation in which a conditional expression set to a threshold value of a policy includes a plurality of performance items, the name/unit conversion processing part 206 receives the performance information in the form of a list. Next, the name/unit conversion processing part 206 receives a list of units designated by the manager and a list of units of the performance information to store the lists in U and V, respectively (2402 and 2403). For all elements of performance information list P, the name/unit conversion processing part 206 converts the unit. The name/unit conversion processing part 206 assign “0” to I (2404). If U[I] matches V[I] (2405), the name/unit conversion processing part 206 stores P[I] in Q[I] (2409). If U[I] does not match V[I], the name/unit conversion processing part 206 obtains converted values of U[I] and V[I] from the unit conversion table (
According to the information monitoring system of the embodiment described above, during the operation to monitor the information of a plurality of computers, item identification information and a unit in an acquisition request for performance information, performance information, or policy information to be communicated between devices are converted into item identification information and a unit of a device of a transmission destination before the item identification information and the unit are transmitted thereto. Therefore, in an environment in which a plurality of mutually different operation management schemes exist on a network, the operation management schemes can be conducted in a unified manner without paying attention to item names and units of performance information unique to the operation management schemes.
It should be further understood by those skilled in the art that although the foregoing description has been made on embodiments of the invention, the invention is not limited thereto and various changes and modifications may be made without departing from the spirit of the invention and the scope of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
2005-341777 | Nov 2005 | JP | national |