The present invention relates to a method of managing a work operation of a network computer system comprising a plurality of computers.
As a system has become greater in size and global, works executed daily have been decentrailized and globalized, too. For instance, there may be the case where one work is dividedly allocated to, and executed by, a plurality of computers because throughput of one computer is not sufficient. There is the case where large quantities of data dispersedly stored in a plurality of computers are processed by each computer to obtain an intermediate result and these results are gathered together at one position to obtain a final result. In such a case, because management has been made in the past only as to which computer executes which work, a manager cannot easily grasp the relationship of divided works if one work is divided and allocated to a plurality of computers. It is therefore desired to provide a service which can supervise the execution of works and which is directed to a manager for supervising the execution of the works by managing divided works as one work even when a certain work is divided and allocated to a plurality of computers such a server would eliminate the need for examining in advance the computer or computers that execute the work or the divided works.
It has been customary to immediately transfer an information representing various events (e.g. start, termination, abnormal termination, etc.) that occur with the execution of the work to a computer for supervising the work. Therefore, a problem remains unsolved that those events which are rarely looked up usually (start, termination, etc.) are transferred in vain though they are necessary for the supervision purpose. Thus, the service should have as its aim reducing the transfer quantity of information by transferring only the reformation having high urgency such as the occurrence of abnormality when the events occur.
It has also been customary in the past to altogether store various definition informations relating to the execution of the work (e.g. execution schedules, execution parameters, etc.) in one place. This renders the problem that a long time is necessary for looking up the definition information for executing the work, though management is easy.
The purpose of the present invention is to reduce the time required for looking up the definition information by duplicately storing the definition information in a plurality of computers.
To accomplish the purpose described above, the present invention assorts and classifies the objects handled by each program that constitutes a work and puts common terms together into one system, and generates ID information that discriminates works. The present invention adds this ID information to various definition informations (execution schedules, execution parameters, etc.) relating to the execution of the work and to an information representing all the events that occur with the execution of the work (start, termination, abnormal termination, etc.). The present invention further unitarily manages these informations so that supervision of the execution of work and its control (change of the schedules, change and addition of executing computers, etc.) can be done by using the ID information as a key.
The present invention assigns a priority to all the events (start, termination, abnormal termination, etc.) occurring with the execution of the work in accordance with their urgency. Whether or not the information representing the event is to be immediately transferred to a computer supervising the work or whether or not the information is to be stored in the computers in which the respective events occur or in a computer or computers near the computers inviting such events, are decided in the order of this priority. The supervisor need not know the events in the normal operation (start and termination of the work, etc.) so long as the work is executed normally and smoothly (particularly when a computer system includes the huge number of computers), and they are not transferred to the computer supervising the execution of the work. In consequence, the transfer quantity of information can be reduced. The ID information for discriminating the work is added to the information that is dispersedly stored. Therefore, whenever necessary (such as when the status of a specific work is desired to be known), the information can be retrieved upon request from the computer supervising the work and only necessary information can be taken out. The information that is dispersedly stored can be retrieved simultaneously by a plurality of computers having the stored information, and the time for retrieving is short.
Further, the present invention stores duplicately various definition informations (execution schedules, execution parameters, etc.) relating to the execution of the work in a plurality of computers. Generally, updating frequency of such definition informations is low. Therefore, the present invention can reduce the time necessary for looking up the definition informations at the time of the execution of the work. When the definition information is to be updated, only the changed portion must be transferred. Therefore, the time for transfer can be reduced, too.
An embodiment of the present invention will be explained hereinafter with reference to
Each of the management objective computers 106, 111 comprises a work execution unit 106, 113 for handling the actual work itself or a management application (hereinafter called “AP”) execution unit 110, 114 for managing the work execution such as a job scheduler, an integrated management assistance unit 107, 112 for accomplishing the integrated management, and a local data base 108. When the management objective computer has the local data base, the integrated management assistance unit transfers to the supervisory manager computer 100 those event informations which must be dealt with urgently or quickly, such as abnormal termination of the work, shortage of the disk capacity, etc. among the events occurring with the execution of the work by the integrated management assistance unit itself or by the management objective computer not having such a local data base, stores other event informations in the local data base 108, executes retrieval upon instruction from the supervisory manager computer 100 and transfers the retrieving result to the integrated manager computer. When the management objective computer does not have the local data base, the integrated management assistance unit transfers all those events which occur with the execution of the work by itself to the management objective computers having the local data base. Incidentally, the supervisory manager computer 100 and the management objective computers 106 and 111 are connected with one another through a communication network 115.
Each of the management objective computers 206 and 211 comprises a work execution unit 209, 213 for executing the practical work itself or a management AP execution unit 210, 214 for managing the execution of the work such as a job scheduler, an integrated management assistance unit 207, 212 for achieving the integrated management and the local data base 208. When the management objective computer has the local data base, the integrated management assistance unit transfers various definition informations about the executions of the work such as the execution schedules and the execution parameters of its own and those of the management objective computers not having the local data base, stores such definition informations in the local data base 208, updates the information upon instruction from the integrated manager computer 200 and transfers the updating result to the integrated management assistance unit itself and to the work executed by the management objective computers not having the local data base. When the management objective computers do not have the local data base, the integrated management assistance unit transfers all of various definition informations about the execution of the work such as the execution schedule and the execution parameters executed by itself to the management objective computers having such a local data base, and transfers the retrieving result transferred thereto from the management objective computers having the local data base to the work executed by the management objective computers. Incidentally, the supervisory manager computer 200 and the management objective computers 206 and 211 are connected with one another through the communication between 215.
Though this condition is not essential, such a network construction can reduce the quantity of data flowing through the network and can also reduce the time necessary for looking up the information stored in the data base from each computer.
The supervisory manager computer 402 includes an integrated supervisory manager 404 and has a user interface for managing the entire system but processing for concrete display for the manager and for the operation of the manager varies with each management function. This processing is broadly divided into an indirect display/operation by using the management AP for managing the work and a direct display/operation of the work. Because the interface with each operation unit as viewed from the integrated management assistance unit is common, the interfaces between the management AP display processing unit 415 and the management AP operation unit 411 and between the work B display processing unit 414 and the work B operation unit 410 and the interface (413) of the display control unit 416 having the direct interface with the management AP display unit 415, the work B display processing unit 414 and the manager can be made common.
The procedures for collecting and looking up various event informations that occur with the execution of the work in the construction shown in
In this way, supervision of the work executed dividedly by a plurality of computers can be made without determining in advance by which computer the work is to be done.
The procedures for collecting, updating and distributing various definition informations relating to the execution of the work in the construction shown in
First, the supervisory manager displays the work list (801) and the work ID corresponding to the work selected by the manager is determined (802). The supervisory manager retrieves the master data base by using the work ID so determined as the key and displays it on the supervisory screen (803). When the manager updates the definition information (804), an updating request is generated for the integrated management assistance unit of all the management objective computers having the local data base (805). The integrated management assistance unit of each management objective computer having the local data base retrieves and updates the local data base by using the designated work ID as the key (806). The integrated management assistance unit reports the changed definition information to the work or management AP (807). The work or management AP executes the processing in accordance with the new definition (808). In this way, control of the execution of the work carried out dividedly by a plurality of computers can be made without examining in advance the computer which stores the definition of the work.
In a network computer system comprising a plurality of computers, the present invention can execute various processings relating to the operations of the work such as confirmation of the work executed dividedly by a plurality of computers, supervision of the occurrence of any abnormality, change of the work definition, etc., without examining in advance in which computer or computers the work is being executed or in which computer or computers the definition of the work is stored. Therefore, the present invention provides the effect that the execution of the work can be managed unitarily from one supervisory manager computer. Since the transfer of the information which is hardly looked up during steady supervision can be reduced in this instance, the present invention provides another effect that any excessive burden for the communication network is not applied to the operation management. Further, because the definition information which is frequently looked up during the execution of the work can be posted in the proximity of the computer that looks up the definition information, the present invention provides still another effect that the time necessary for looking up the information from each computer can be reduced.
Number | Date | Country | Kind |
---|---|---|---|
9-306066 | Nov 1997 | JP | national |
The present application is a continuation of application Ser. No. 09/188,245, filed Nov. 9, 1998, now abandoned the contents of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
5193172 | Arai et al. | Mar 1993 | A |
5293620 | Barabash et al. | Mar 1994 | A |
5313632 | Sekizawa et al. | May 1994 | A |
5361369 | Kametami | Nov 1994 | A |
5377352 | Tanaka et al. | Dec 1994 | A |
5655081 | Bonnell et al. | Aug 1997 | A |
5793365 | Tang et al. | Aug 1998 | A |
6067634 | Nelson | May 2000 | A |
6098091 | Kisor | Aug 2000 | A |
6148322 | Sand et al. | Nov 2000 | A |
6289382 | Bowman-Amuah | Sep 2001 | B1 |
6425017 | Dievendorff et al. | Jul 2002 | B1 |
6445968 | Jalla | Sep 2002 | B1 |
6597688 | Narasimhan et al. | Jul 2003 | B1 |
Number | Date | Country |
---|---|---|
6358568 | Mar 1988 | JP |
63300328 | Dec 1988 | JP |
04172045 | Jun 1992 | JP |
5204783 | Aug 1993 | JP |
8314763 | Nov 1996 | JP |
09006697 | Oct 1997 | JP |
Number | Date | Country | |
---|---|---|---|
20030200310 A1 | Oct 2003 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 09188245 | Nov 1998 | US |
Child | 10417267 | US |