The present application claims priority from Japanese application P2004-208501 filed on Jul. 15, 2004, the content of which is hereby incorporated by reference into this application.
This invention relates to a storage system management software cooperation method, and more particularly to a method of having plural management programs automatically execute management jobs, in particular, management jobs that involve a change in configuration.
There are various tasks that utilize an information processing device. For instance, in management of storage systems on SAN, a policy regulating what action (management job) is to be executed when what subject fulfils what condition is set to those disk drives. Then information on the busy or idle condition of the storage systems is collected and the value of the policy subject is obtained based on the collected information to judge whether or not the value of the subject satisfies the policy condition. When it is judged as a result that the policy condition is met, the policy action is executed. A technique has been proposed which automatically manages storage systems on SAN based on the thus predicted busy or idle condition of the storage systems and other factors (refer to JP2003-345632A).
In order to operate a system automatically by applying a combination of management programs designed for different purposes to such management jobs, there should be no contradiction between the management jobs. This is not always achieved since each management program only covers its own management scope and the management function of one management program may cause inconsistency in information possessed by another program, resulting in failure in correct management of a management target.
This invention has been made in view of the above, and it is therefore an object of this invention to prevent, when plural management programs which manage and refer the same configuration automatically execute management jobs that involve a change in configuration, simultaneous execution of the management jobs from creating a contradiction in configuration.
It is another object of this invention to stop, when, for example, a part of the configuration that is managed by one management program is changed by a management job of another management program making continuation of automatic execution of management programs impossible, execution of this management program, so that the system is prevented from continuing while there is a contradiction between management programs and the configuration in their management scopes.
An embodiment of this invention provides a computer system composed of a storage subsystem (system) that has plural disks and a disk array control units, and a management computer that has a control unit for managing the storage subsystem. The control unit holds the configuration of a management target as a management map, which ensures that it is no more than one management program that refers, or changes, one configuration at a time.
An embodiment of this invention is capable of avoiding creating a contradiction in configuration from simultaneous execution of management programs which consult the same configuration or are allowed to change the configuration.
Also, an embodiment of this invention is capable of stopping, when, for example, a part of the configuration that is managed by one management program is changed by a management job of another management program making continuation of automatic execution of management programs impossible, execution of this management program, so that the system is prevented from continuing while there is a contradiction between management programs and the configuration in their management scopes.
The present invention can be appreciated by the description which follows in conjunction with the following figures, wherein:
Embodiment of this invention will be described below with reference to the accompanying drawings.
The computer system of this embodiment is composed of an business server 1, a management server 2, a backup device 3, a storage subsystem 5, and networks 4 and 6 which connect these components to one another.
The business server 1 is a computer device equipped with a CPU 11, a memory 12, a disk drive 13, a network interface (NIC) 14, a host bus adapter (HBA) 15, and an input/output device. Management programs 1 to 3 stored in the disk drive 13 are loaded to the memory 12 to be executed by the CPU 11 for management of the computer system.
The management programs include a capacity management program for managing the percentage of a disk used to dynamically allocate the disk to an application, a data protection program for backing up and restoring data used by an application, and a device management program for managing the operation of switches provided in the storage subsystem and in a SAN 6. How these management programs are executed differs from one to another; one may be executed by an administrator on demand and another may be executed automatically following a policy set by an administrator.
The management server 2 is a computer device equipped with a CPU 21, a memory 22, a disk drive 23, a network interface (NIC) 24, a host bus adapter (HBA) 25, and an input/output device. A management program stored in the disk drive 23 is loaded to the memory 22 to be executed by the CPU 21 for management of the computer system. In the management server 2, the CPU 21 extracts a management map 202, an impact map 203, and an event management table 204, which are stored in the disk drive 23, in the memory, and the CPU 21 executes a monitor control program, which similarly loaded to the memory, and then create contents of the impact map 203.
The business server 1 and the management server 2 may be placed in the same hardware or different hardware.
The backup device 3 is a computer device equipped with a CPU 31, a memory 32, a host bus adapter (HBA) 35, a tape device 37, and an input/output device. The backup device 3 functions as a tape backup server.
The CPU 31 of the backup device 3 executes a program, so that data stored in the storage subsystem 5 is backed up on a tape in response to a backup request from the management server 2.
The backup device 3 may have, instead of the tape device 37, a magnetooptical disk device or other similar device which backs up data in a backup medium.
The business server 1 and the management server 2 are connected via the LAN 4. The LAN 4 makes data and control information communicable between computers with the TCP/IP protocol or the like. For example, Ethernet, can be used as the LAN 4.
The storage subsystem 5 is composed of a disk array controller and a disk array 56. The disk array controller is constituted of a CPU 51, a memory 52, and a host bus adapter (HBA) 55. The CPU 51 executes a control program to control input and output of data in the disk array 56 in response to a request from the business server 1.
The disk array 56 is a Redundant Array of Independent Disks (RAID) and stores data with redundancy. Because of this redundancy, a loss of stored data is prevented when a failure takes place in some of disk drives that constitute the disk array 56.
The storage subsystem 5 is connected by the SAN 6 to the business server 1, the management server 2, and the backup device 3. The SAN 6 is a network capable of data communication with a protocol suitable for transfer of data, for example, the fiber channel protocol.
The storage subsystem 5 is connected via the SAN 6 to a host A and a host B, which access the disk drives of the storage subsystem 5.
The host A executes an application program (e.g. database program) and handles plural database instances. The database instances are each mounted with single file system or plural file systems. Each of the file systems is associated with one or more logical volumes. The logical volume is the unit of disks that can be recognized by a host. Each logical volume is composed of single logical device file or plural logical device files. Logical devices are each composed of single physical disk drive or plural physical disk drives.
The computer system where many servers share a single storage subsystem and the configuration of the storage subsystem is desired to be shielded from application programs, is managed in such hierarchies unlike usual PCs. It is an advantage of the hierarchical management that an operating system or an application program does not need to consider how data it uses is arranged. But it is a disadvantage of the management of plural layers which is unavoidable in managing storage.
In this invention, an event is issued to the monitor control program before a management program carries out a management job. There are three types of event issued from a management program to the monitor control program.
The first event of the three is an execution request event. The execution request event is issued when a management program is about to carry out a management job. No management program is allowed to execute its management job until it receives a management job execution command. Upon receiving the execution request event, the monitor control program carries out processing which will be described later (
The second event is a configuration changing event. The configuration changing event is issued, upon a change in configuration, from a management program that is executing a management job. Basically, a management program that has received a management job execution command issues the configuration changing event whereas other management programs may issue the configuration changing event when a failure occurs or on other occasions. The monitor control program rewrites the management map and the impact map upon receiving the configuration changing event.
The third event is an execution completion event, which is issued from a management program that is executing a management job as a notification of completion of the management job.
No management program is allowed to execute its management job unless it receives a management job execution command from the monitor control program. A management job cannot be executed when an execution permission is not received.
Upon receiving an event, the monitor control program chooses an impact map from an event management table that corresponds to the received event, and marks up the extent of the impact onto the management map. Then the monitor control program sequentially sends management job execution commands to management programs based on the event management table.
Thus events defined in an event management table are sequentially transferred to designated management programs and the management programs inquire of the monitor control program upon executing processing for management jobs that involve referring or changing a configuration. Each management program checks whether other management programs are operating in its operation range before carrying out its operation. This way the system can be run automatically with fewer contradictions.
The monitor control program accumulated, in the management map extents of the impact marked up by the management programs.
The management map shown in
Resource information of each layer is obtained by using a command or the like. The obtained resource information of each layer is kept in a repository while maintaining the association between layers. The management map 202 is created from information accumulated in the repository. The repository is an area for storing resource information of each layer, and is placed in the disk drive 23 of the management server 2 or in the memory 22 of the management server 2.
The monitor control program is capable of mapping a map range of a specific impact map onto the management map and undoing the mapped range. The monitor control program also determines whether a received event is to be executed by referring to the mapping range on the management map.
The impact map (
The CPU 21 which executes the monitor control program receives an event from a management program (Step S101) and performs processing that corresponds to the type of the event received.
When the received event is a configuration changing event, the CPU 21 makes the configuration change reflected on the management map and the impact map (Step S114) and waits for the arrival of the next event (Step When the received event is an execution request event, the CPU 21 consults the event management table to extract a management job associated with the received event, and determines how to cope with the event (Step S102).
Then the CPU 21 judges whether it is necessary to register the extracted management job that is associated with the event in the management map (Step S103). Whether the management job to be executed affects the resource configuration determines the judgment result. For instance, when the event is addition of a logical device, marking up in the management map is necessary whereas marking up in the management map is unnecessary when the event is to display on a screen of the management server.
When marking up in the management map is judged to be unnecessary, the procedure moves to Step S106. On the other hand, when marking up in the management map is judged to be necessary, the CPU 21 judges whether other management jobs whose extent of the impact overlaps the extent of the impact of the extracted management job are being executed (Step S104). When the other management jobs whose extent of the impact overlaps are being executed, the CPU 21 performs queuing so that the extracted management job that is associated with the event is stored in an execution event queue (Step S112).
The management job stored in the execution event queue is then executed provided that there is no change in configuration in the extent of the impact. When the configuration is to be changed in the extent of the impact, the CPU 21 notifies the management program of the configuration change. The event stored in the event queue is searched upon completion of the event processing that has been executed, and again subjected to the judging in Step S104. When there is a configuration change in the extent of the impact, the CPU 21 notifies the management program of the configuration change.
In the case where no management job whose extent of the impact overlaps the extent of the impact of the extracted management job is being executed, the CPU 21 sets the extent of the impact of the mapping table 202 to “referring/updating” and marks up the extent of the impact in the management mapping table 202 (Step S105).
The above processing completes an update of the management map 202, and now the CPU 21 is ready to send a management job execution command to the management program, thereby starting execution of the management job defined in the event management table (Step S106).
When the received event is an execution completion event, the CPU 21 judges whether a management job defined in the event management table is still left (Step S108). When it is found as a result that a management job that has not been executed yet is left in the event management table, the procedure returns to Step S106 where a management job execution command is issued to a management program to continue execution of management jobs.
On the other hand, in the case where every management job defined in the event management table has already been executed, the CPU 21 checks whether “referring/updating” is marked-up in the management map 202 (Step S109). When the “referring/updating” marking is found, the marking is removed from the management map 202 and marking of the extent of the impact is also removed from the management map 202 (Step S10).
The CPU 21 then checks whether the event queue is empty (Step S116). When the event queue is empty, the processing is ended without further operation. When the event queue is not empty, the CPU 21 checks whether a configuration changing event has taken place while the current event is being processed (Step S111). When it is found as a result that a configuration changing event has taken place, the management program notified of the event stored in the event queue is notified issuer of request of cancellation of the management job (Step S113), and the processing is ended. On the other hand, when it is found as a result that no configuration changing event has taken place during the period, the event is searched from the forefront event queue to be executed (Step S115). To retrieve events from the event queue, the event that is stored in the event queue earliest is retrieved first. Alternatively, events may have priority attributes so that the event that has the highest priority of the events stored in the event queue is retrieved first.
A management program issues an execution request event to a monitor control program 201 before starting its management job (Step S121). In response to the request event, the monitor control program sends either a management job execution command or a management job halting command, which is received by the management program.
When the management program receives the management job halting command, a user is notified of cancellation of the management job (Step S124).
On the other hand, reception of the management job execution command causes the management program to execute the management job (Step S122). Whether execution of the management job involves a configuration change is checked (Step S125).
When it is found as a result that a change in configuration takes place, the management program sends a configuration changing event to the monitor control program (Step S126). Subsequently, whether the management job is finished or not is judged (Step S127) and, when the management job is not completed yet, the procedure returns to Step S122.
When the management job is judged to be completed, the management program sends an execution completion event to the monitor control program (Step S123).
The event management table states what event serves as an execution request event which starts execution, what program issues the event, what management job is to be executed upon occurrence of the event, and what program is to execute the management job. In other words, the event management table defines, for each of execution request events issued from the management programs, a management job executed by a management program as event and the procedure of carrying out the management job in context of operations of other management (software) programs.
While the above event management tables of
Another case, besides the case of
The description given above is about one event management table. In the case where a management job executed on one event management table serves as an event on another event management table, the event management table is traced more hierarchically to specify the range of influence of the event.
The first embodiment of this invention uses a configuration changing event to notify whether there is a configuration change in the respective management jobs. Alternatively, a configuration change may be notified when an execution completion event is sent to the monitor control program as incidental attributes of the event.
A second embodiment of this invention will be described next. An event management table in the second embodiment has a rule field (
A rule (2), on the other hand, provides to trace relevant resources but not recursively. According to the rule (2), the resource where the event in question takes place is traced by tracing the hierarchy upward and the tracing is ended upon reaching the uppermost layer to specify the area covered as the extent of the impact. Alternatively, the hierarchy is traced downward and the tracing is ended upon reaching the lowermost layer to specify the area covered as the extent of the impact (
The management map 202 of the second embodiment (
According to this embodiment, the monitor control program applies the rule (1) when, for example, the execution request event in
As has been described, the second embodiment does not need to define a table for each resource where an event could take place since specification of the extent of the impact is made into rules based on the management map and the event type.
While the present invention has been described in detail and pictorially in the accompanying drawings, the present invention is not limited to such detail but covers various obvious modifications and equivalent arrangements, which fall within the purview of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
2004-208501 | Jul 2004 | JP | national |