As a backup technique, there is a method for calculating a necessary backup medium amount by using the file size in the backup method table and the medium capacity described in the medium definition file and outputting a message to prepare for the necessary backup medium amount, thereby performing backup (for example, see JP-A-07-210430).
An appropriate backup method is decided upon backup.
A backup management server collects disk access information and an available backup method from an environment information collection unit on the work server and selects an appropriate backup method upon backup.
Other objects, features and advantages of the invention will become apparent from the following description of the embodiments of the invention taken in conjunction with the accompanying drawings.
Description will now be directed to an embodiment of the present invention with reference to the attached drawings.
In
The data protection unit 110 includes an environment information management unit 111, a volume evaluation unit 112, and a rule creation unit 113.
The environment information management unit 111 collects RAID disk apparatus access information, file information, and a rule template (represents respective entries of the rule definition table (
The volume evaluation unit 112 selects a use application and importance of a disk volume as an object of backup according to various information collected such as the access information and the file information of the RAID disk apparatus. Moreover, the volume evaluation unit 112 selects the backup method by calculating the important viewpoint as the weighting parameter when a user performs backup processing.
The rule creation unit 113 selects an appropriate backup method for the selected disk volume (disk volume to be backed up). Moreover, the viewpoint which is considered to be important by a user is calculated as a weighting parameter and the backup method is selected.
The internal database 120 is a database including an environment information table 121, a volume information table 122, a rule definition table 123, and a backup information table 124. Anything other than the database that includes these information can be used instead.
The environment information table 121 contains information indicating what kind of application service is being executed on the work server 150 and what kind of tape apparatus exists on the network. The table may also contain software and hardware (storage apparatus, computer, switch, router, and the like) information other than the aforementioned.
The volume information table 122 contains information indicating what kind of accesses (access frequency, access type, and the like) have been performed to the respective disk volume and the existing file amount (file size, the number of files, and the like).
The rule definition table 123 contains backup table template information indicating which backup method is to be used according to the disk volume use application and the importance degree.
The backup definition table 124 contains information on the backup method selected for each disk volume. The RAID disk apparatus operation unit 130 controls the copy function, data read out and data write of an RAID disk apparatus 160. The tape backup execution unit 140 controls the data backup to a tape apparatus 170. Additionally, it is possible to provide a processing unit for executing control according to the backup apparatus connected. For example, when executing a backup of the volume ID V0000001 volume, the RAID disk apparatus operation unit 130 extracts the rule ID (R0000002 or R0000005) related to the V0000001 in the backup definition table 124 and can perform backup processing according to the action content related to the rule ID and stored in the rule definition table 123. It should be noted that the instruction of performing backup may be performed by specifying the disk volume but it also may be performed by specifying the database name or the file name. The instruction of backup to the RAID disc apparatus is performed by specifying the disk volume to be backed up. For this, when the database name or the file name is specified, it is necessary to identify the disk volume containing them. The information for identifying the disk volume containing them from the database name and the file name can be managed, for example, by a table (volume logic configuration map table) as shown in
The work server 150 is a computer on which a program including functions of an application service unit 151 and an environment information collection unit 152 operates. These processing units (application service unit 151 and the environment information unit 152) may also be executed separately by a plurality of hosts connected on the network.
The application service section 151 indicates database operated on the work server 150, a program providing a service, an application program, and the like.
The environment information collection unit 152 collects application service information operated on the work server 150 and access information on the disk volume used by the application service and collects information on the tape apparatus (various storage apparatuses) available by the work server 150.
The RAID disk device 160 is a large-size disk array apparatus connected to the network. The tape apparatus 170 represents a plurality of magnetic tape apparatuses connected to the network. It should be noted that, although not depicted, it is possible to connect various storage apparatuses such as a DVD (digital versatile disc) and an MO (magneto-optical disc).
There has been exemplified a system including: a backup management server for deciding a backup method according to a use application and a use state by collecting application service information, RAID disk apparatus access information, and an available backup method; and an internal database managing rule definition information serving as a template when deciding the backup method and the selected backup definition information. However, it is also possible to connect to a system other than this.
By using the aforementioned processing unit and the like, the access information of the RAID disk apparatus, file information, environment information referenced when creating a backup method serving as a template are collected to automatically select a backup method corresponding to a current access condition.
The host name 303 indicates the host in which the object is operated or the host name where the object can be used.
In
However, it is possible to recognize other than these as an object and use an object type different from these.
For example, the object of the object ID “A0000001” in
The access count 401 represents the number of accesses to the disk volume per day, the access type 402 represents an access type to a disk volume such as “read” or “read/write”, and the access interval 403 represents an average access interval to the disk volume.
The size use capacity 404 indicates a size of the disk volume currently used.
The size difference capacity 405 indicates a difference of the disk volume from the previous backup to the current moment. It should be noted that the value of the size difference capacity 405 may be a difference between the disk capacity at the N-th backup and the disk capacity at the N+1 backup. Moreover, the value may be a difference between the disk capacity at the N-th backup and the disk capacity at the latest backup. The value may be other difference values. A user can select in advance what kind of difference information is to be acquired.
The file count 406 indicates the number of files existing on the disk volume. The volume name 407 indicates the mount point of the disk volume. The backup definition information 411 indicates an identifier of the backup method of the disk volume. It should be noted that the backup method will be explained with reference to
The user weight parameter access 408, the user weight parameter size 409, and the user weight parameter file count 410 represent weight values indicating the viewpoint considered to be important by a user. The access 408 is a weight value of the access information to the disk volume, the size 409 is a weight value of the size information on the disk volume, and the file count 410 indicates the weight value of the number of files of the disk volume.
Here, explanation will be given on the entries included in the volume information table. For example, the disk volume “V000001” indicates that 50000 read accesses are performed at an average interval of 10 seconds per day, that the current use capacity is 50 GB, no disk volume capacity difference is present from the previous backup, and that 300 files are present. Moreover, it indicates that they are mounted on the mount point “/disk01/dbms” and backed up by the backup method “B0000001” and “B0000002”. Furthermore, it is indicated that the user weight value for this disk volume is in the ratio of “8:2:0” for the “access:size:file count”.
It should be noted that when a plurality of identifiers are assigned for the backup definition information 411, backup is performed for the backup apparatus related to each identifier. For example, when two identifiers “B0000001” and “B0000002” are present for the backup definition information 411, according to the respective rules, backup is performed to the two backup apparatuses: “high speed standard tape” and “multiple-generation disk backup”.
In
The rule template may be created by a manager (or a user) in advance and stored in a storage device so that the one adaptive to the information collected by the environment information management unit 111 and the environment information collection unit 152 is selected. It is also possible to create and use the rule template by other method.
The rule ID 500 is an identifier of a rule managed by the backup management server. This rule indicates that backup is performed by the backup method of the action element 501 for the disk volume in which the condition element 502 is satisfied. The action element 501 may be a specific backup method or description of a series of commands executed for backup or a pointer to a file and a program used for backup or the like.
Here, explanation will be given on the action element 501 and the condition element 502. For example, the rule ID “R0000004” indicates that when the disk volume use amount is “large” and the number of files is “large”, the disk volume is backed up by the “disk backup” using the copy function of the RAID disk apparatus. Here, the copy function represents a function of writing data of the same content as the data stored in a disk volume into another disk volume.
When judging whether the access count to the disk volume and the use state contained in the item of the condition element 502 are “large” or “small”, it is possible to use a threshold value defined by the manager in advance for each of the items of the condition elements 502.
Thus, the backup definition table is information relating the backup rule (backup method and condition), the disk volume, and the object to be backed up.
In the input item 700 of the weight parameter, a parameter of importance is set.
In the update item 701 of the environment information, the environment information collection and its evaluation interval are determined. When the update interval is set, environment information collection and its evaluation are performed at this update interval and an appropriate backup method is selected for each interval. When no update interval is set, the backup method initially set is held.
It should be noted that the collected environment information (or judgment) and selection of the backup method corresponding to the evaluation result will be detailed with reference to
In the information collection processing 800, the environment information management unit 111 on the backup management server 100 collects information on software such as application service information, RAID disk apparatus access information, and information on hardware such as a tape apparatus information (collects environment information) from the environment information collection unit 152 on the work server 150. As has been described above, the information collection processing 800 is performed at the update interval set in
It should be noted that the processing to judge whether the update interval has elapsed in the information collection processing 800 may be performed for one volume or two or more volumes successively. The volume ID to be judged may be extracted, for example, at random from the volume ID's stored in
In the volume information updating processing 801, the volume evaluation unit 112 on the backup management server 100 updates the volume information table 122 of the internal database 120 by using the collected disk volume information.
In the processing 802, it is judged whether the collected environment information has been modified. The collected information is compared to the information (object type 301, name 302, host name 303, and the like) registered in the environment information table shown in
If a new tape apparatus or a backup object is detected, in the rule template update processing 803, the rule creation unit 113 again creates the rule template of the backup method and updates the rule definition table 123 of the internal database 120. The processing 803 may be started upon notification of the end of the judgment processing of the processing 802 and its result from the volume evaluation unit.
The rule template can be created by preparing a predetermined proformas in advance according to the object type (4 mm tape apparatus, high speed standard tape apparatus, RAID disk apparatus, or the like) and adding an entry of the rule template according to the object detected. The rule template may also be created by other method.
It should be noted that when the object type and name of the environment information collected are already stored in the environment information table, modification of the environment information according to the detected object is checked.
For example, when the detected object is a tape apparatus connected to a host computer, the IP address or MAC address or host name of the host computer is identified and it is judged whether the apparatus is the same as the tape apparatus already stored in the environment information table. When the detected object is a tape apparatus connected to a SAN (Storage Area Network), it is possible to judge whether the apparatus is the same as the tape apparatus already stored in the environment information table by using the path name to the tape apparatus. When the detected object is software, it is possible to perform judgment by using the software name, version number, the host where the software is executed, and the IP address. It is also possible to perform judgment according to other object (WWN (World Wide Name) and unit number for various storage devices).
In the processing 804, for all the rule templates, the following template selection processing is repeated. The processing 804 can be started when the judgment processing end and its result are notified from the volume evaluation unit. It should be noted that the disk volumes serving as objects of the processing 804 may be all the disk volumes processed by the information collection processing 800 or may be some of the disk volumes. The judgment processing 805 judges whether the disk volume is matched with (similar to) the condition element 502 of the rule template.
Explanation will be given on an example that the processing 805 uses a threshold value for judging whether the condition element 502 of the rule template is similar to the use state of the volume. As the judgment method, the volume information stored in the volume information table of
Volume use size 404÷threshold value of volume use size
Value of Expression 1>=1 . . . (judged to be “large”)
Value of Expression 1<1 . . . (judged to be “small”) [Expression 1]
In the aforementioned example, when the value of the actual volume use size divided by the threshold value of the volume use size is equal to or greater than 1, judgment is made to be “large” and when the value is smaller than 1, judgment is made to be “small”. Moreover, Expression 1 shows an example of the volume use size 404 and the use size of the condition element 502 and judgment for the other items is made in the same way.
Thus, for each item of the disk volume attribute, the threshold value is compared to the actual volume information (
It should be noted that the similarity range (or item to be judged to be similar or not) and threshold value setting may be selected in advance in the program performing the processing or may be set by accepting the input from the manager or by other method.
When the judgment of the processing 805 shows presence of the rule template having the volume attribute similar to the disk volume to be backed up, the processing 806 selects and extracts the backup method of the rule template judged to be similar in the processing 805.
Furthermore, in the processing 807, by using the weight value of the viewpoint considered to be important by the user, the volume information is again calculated. The recalculated volume information is compared to the condition element contained in the rule template, added to the rule template selected in the processing 806, and selected.
That is, in the processing 807, judgment is made by using the user weight parameter and the rule template is extracted. In the processing 807, when judging the volume attribute, the volume information table of
A calculation example for making judgment by using the weight parameter of
Volume use size 404÷threshold value of volume use size×{(100+user weight parameter size 409)÷100}
Value of Expression 2>=1 . . . (judged to be “large”)
Value of Expression 2<1 . . . (judged to be “small”) [Expression 2]
For each item of disk volume attribute, a threshold value is compared to actual volume information (
Thus, by using the weight parameter input by the user, a backup method having an allowance for the volume attribute for which the user has specified the weight parameter can be extracted. It is possible to extract a backup method of a performance better than a backup method selected without using the weight parameter.
For example, even when the actual number of files stored in the disk volume is small, the user can increase the weight for the number of the files so that the backup method used when the number of files is large is extracted for the disk volume. Thus, it is possible to extract a backup method (rule ID 500 “R0000004”, “R0000005”, “R0000006”) having a better performance than the backup method (rule ID 500 “R0000001”, “R0000002”) used when the number of files is small.
In the backup definition information setting processing 808, the rule creation unit 1113 on the backup management server 100 registers the selected backup method on the backup definition table 124 of the internal database 120. By using the selected backup method, an instruction to execute the backup is transmitted to another computer. It should be noted that transmission of an instruction of backup is performed upon reception of an instruction of backup execution from a user. Otherwise, information on the selected backup method is only stored in the backup definition table.
For example, in
Moreover, although not depicted in
Moreover, when an environment information collection change is detected, it is possible to extract and present a new backup method to a user upon detection of the environment information change. It should be noted that it is possible to use the aforementioned method not only for backup but also for restoring. Moreover, it is possible to transmit an instruction for backup to another computer and various devices according to the backup method selected by using the aforementioned method.
Thus, even when the use application and use state of the storage device such as a disk volume is changed as the time elapses, volume information is periodically collected and it is possible to update necessary information such as the rule template deciding the volume information and the backup method, thereby responding to a change as needed. Accordingly, it is possible to select a backup method in accordance with the time elapse, and change of specification use application, hardware environment, and software environment.
Moreover, it is possible to select an appropriate backup method upon backup so as to reduce the restore time according to the change of the disk volume use application and the attribute of the disk volume which is considered to be important by the user.
By deciding a rule template for backup in advance and creating a new rule template or modifying the rule template upon detection of environment information so as to prepare a series of operation required for backup and restoring in advance, it is possible to operate backup or restoring when the user sets the necessary item such as a parameter for backup.
Instead of performing all the necessary processes for backup and restoring by a user, there is prepared a proformas of a series of processes and the backup method is selected so as to reduce the work required for the user to perform backup and restoring.
Next, explanation will be given on a method for selecting a backup method according to another embodiment with reference to
After the processing is started, information on the disk volume such as the access count is compared to the condition element 502 of the rule definition table 123 so as to judge whether a similar rule ID is present (S1302). This processing is performed as has been explained in 804 and 805 of
Next, still another embodiment of the method for selecting the backup method will be explained with reference to
Firstly, S1402 judges whether a rule selection reference is specified as a condition for narrow down the rule templates stored in the rule definition table (
The rule selection reference management table (
In S1402, when it is judged that the rule selection reference is specified, a selection condition related to the instruction information specified is extracted and the rule template having the backup method satisfying the selection condition (or a rule ID identifying this) is extracted (S1403). For example, consideration is taken when the “high-speed restore” is specified as instruction information for specifying the rule selection reference. Here, S1402 of
As has been described above, when the backup method is selected according to the state of the disk volume to be backed up, by narrowing down the backup method to be selected according to the user needs, it is possible to select a backup method matched with the user needs.
Next, explanation will be given on an embodiment for displaying information on the result of selection of the backup method according to the aforementioned embodiment.
In
As the method for displaying the information of
By providing such a view, in the disk volume state displayed 1203, 1204, it is possible to know whether the selection of the backup method displayed in 1201 has been appropriate (moreover, it is possible to know whether the setting condition of the condition element 502 in the rule definition table (
It should be further understood by those skilled in the art that although the foregoing description has been made on embodiments of the invention, the invention is not limited thereto and various changes and modifications may be made without departing from the spirit of the invention and the scope of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
2003-052229 | Feb 2003 | JP | national |
2003-360128 | Oct 2003 | JP | national |
The present application is a continuation of application Ser. No. 10/743,737, filed Dec. 24, 2003, now U.S. Pat. No. 7,007,144 the contents of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
4607346 | Hill | Aug 1986 | A |
5778165 | Saxon | Jul 1998 | A |
6112257 | Mason et al. | Aug 2000 | A |
6411969 | Tam | Jun 2002 | B1 |
6513101 | Fisher et al. | Jan 2003 | B1 |
6539461 | Suzuki et al. | Mar 2003 | B2 |
6542975 | Evers et al. | Apr 2003 | B1 |
6611896 | Mason et al. | Aug 2003 | B1 |
6795834 | Higashiura et al. | Sep 2004 | B2 |
6854034 | Kitamura et al. | Feb 2005 | B1 |
6931422 | Gusler et al. | Aug 2005 | B1 |
6941328 | Cannon | Sep 2005 | B2 |
6952758 | Chron et al. | Oct 2005 | B2 |
6954833 | Yochai et al. | Oct 2005 | B1 |
6959368 | St. Pierre et al. | Oct 2005 | B1 |
6976039 | Chefalas et al. | Dec 2005 | B2 |
6986008 | Colvig et al. | Jan 2006 | B2 |
20020002561 | Higashiura et al. | Jan 2002 | A1 |
20020065999 | Kikuchi et al. | May 2002 | A1 |
20030229653 | Nakanishi et al. | Dec 2003 | A1 |
20030233518 | Yamagami et al. | Dec 2003 | A1 |
20040030852 | Coombs et al. | Feb 2004 | A1 |
20040044827 | Carlson et al. | Mar 2004 | A1 |
20040107315 | Watanabe et al. | Jun 2004 | A1 |
20040148485 | Suzuki | Jul 2004 | A1 |
20050010733 | Mimatsu et al. | Jan 2005 | A1 |
Number | Date | Country |
---|---|---|
07-210430 | Aug 1995 | JP |
Number | Date | Country | |
---|---|---|---|
20060069892 A1 | Mar 2006 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10743737 | Dec 2003 | US |
Child | 11281506 | US |