1. Field of the Invention
The present invention relates to a snapshot maintenance apparatus and method, and for instance is suitably employed in a disk array device.
2. Description of the Related Art
Conventionally, as one function of a NAS (Network Attached Storage) server and disk array device, there is a so-called snapshot function for retaining an image of an operation volume (a logical volume for the user to read and write data) designated at the time when a snapshot creation order is received. A snapshot function is used for restoring the operation volume at the time such snapshot was created when data is lost due to man-caused errors or when restoring the operation volume to a state of a file system at a desired time.
The image (also referred to as a virtual volume) of the operation volume to be retained by the snapshot function is not the data of the overall operation volume at the time of receiving the snapshot creation order, but is rather configured from the data of the current operation volume, and the difference data which is the difference between the operation volume at the time of receiving the snapshot creation order and the current operation volume. And the status of the operation volume at the time such snapshot creation order was given is restored based on the foregoing difference volume and current operation volume. Therefore, according to the snapshot function, in comparison to a case of storing the entire operation volume as is, there is an advantage in that an image of the operation volume at the time a snapshot creation order was given can be maintained with a smaller storage capacity.
Further, in recent years, a method of maintaining a plurality of generations of snapshots has been proposed (c.f. Japanese Patent Laid-Open Publication No. 2004-342050; hereinafter “Patent Document 1”). For instance, Patent Document 1 proposes the management of a plurality of generations of snapshots with a snapshot management table which associates the respective blocks of an operation volume and the blocks of the difference volume storing difference data of the snapshots of the respective generations.
However, according to the maintenance method of a plurality of generations of snapshots disclosed in Patent Document 1, when a failure occurs in the difference volume, there is a problem in that the system cannot be ongoingly operated unless the snapshots of the respective generations acquired theretofore are abandoned.
Nevertheless, the failure in a difference volume could be an intermittent failure or an easily-recoverable failure. The loss would be significant if the snapshots of all generations must be abandoned for the ongoing operation of the system even for brief failures. Therefore, if a scheme for maintaining the snapshot even when a failure occurs in the difference volume can be created, it is considered that the reliability of the disk array device can be improved.
The present invention was devised in view of the foregoing points, and an object thereof is to propose a snapshot maintenance apparatus and method capable of maintaining a snapshot in a highly reliable manner.
In order to achieve the foregoing object, the present invention provides a snapshot maintenance apparatus for maintaining an image at the time of creating a snapshot of an operation volume for reading and writing data from and to a host system, including: a volume setting unit for setting a difference volume and a failure-situation volume in a connected physical device; and a snapshot management unit for sequentially saving difference data, which is the difference formed from the operation volume at the time of creating the snapshot and the current operation volume, in the difference volume according to the writing of the data from the host system in the operation volume, and saving the difference data in the failure-situation volume when a failure occurs in the difference volume.
As a result, with this snapshot maintenance apparatus, even when a failure occurs in the difference volume, the difference data during the period from the occurrence of such failure to the recovery thereof can be retained in the failure-situation volume and, therefore, the system can be ongoingly operated while maintaining the snapshot.
Further, the present invention also provides a snapshot maintenance method for maintaining an image at the time of creating a snapshot of an operation volume for reading and writing data from and to a host system, including: a first step of setting a difference volume and a failure-situation volume in a connected physical device; and a second step of sequentially saving difference data, which is the difference formed from the operation volume at the time of creating the snapshot and the current operation volume, in the difference volume according to the writing of the data from the host system in the operation volume, and saving the difference data in the failure-situation volume when a failure occurs in the difference volume.
As a result, according to this snapshot maintenance method, even when a failure occurs in the difference volume, the difference data during the period from the occurrence of such failure to the recovery thereof can be retained in the failure-situation volume and, therefore, the system can be ongoingly operated while maintaining the snapshot.
According to the present invention, a snapshot maintenance apparatus and method capable of maintaining the snapshot in a highly reliability manner can be realized.
An embodiment of the present invention is now explained in detail with reference to the attached drawings.
A storage device (not shown) such as a hard disk drive is connected to the storage interface 4, and logical volumes VOL are defined in a storage area provided by such storage device. User data subject to writing transmitted from a host system as a higher-level device (not shown) is stored in the logical volume VOL defined as an operation volume P-VOL among the logical volumes VOL defined as described above.
Various programs such as a block I/O program 5 and a snapshot program 6 are stored in the memory 3. The CPU 2 controls the input and output of data between the host system and operation volume P-VOL according to the block I/O program 5. Further, the CPU 2 defines a difference volume D-VOL in relation to the operation volume P-VOL according to the snapshot program 6, and saves the difference data obtained at the time of creating the snapshot in the difference volume D-VOL. Meanwhile, the CPU 2 also creates a plurality of generations of snapshots (virtual volumes V-VOL 1, V-VOL 2, . . . ) based on the difference data stored in the difference volume D-VOL and the user data stored in the operation volume.
Next, the basic snapshot function in the NAS server 1 is explained in detail.
As shown in
Each block address column 13 stores block addresses (“0” to “7”) of the blocks 11 corresponding to the respective operation volumes P-VOL. Further, each CoW bitmap column 14 stores a bit string (hereinafter referred to as a CoW bitmap) having the same number of bits as the number of generations of the snapshots that can be created. Each bit of this CoW bitmap corresponds to the respective snapshots of the first to fourth generations in order from the far left, and these are all set to “0” at the initial stage when no snapshot has been created.
Meanwhile, four save destination block address columns 15 are provided to each block 11 of the operation volume P-VOL. These save destination block address columns 62 are respectively associated with the first to fourth generation snapshots. In
Each save destination block address column 62 stores block addresses of the blocks in the difference volume D-VOL saving the difference data of the snapshot generation of the corresponding blocks 11 in the operation volume P-VOL (blocks 11 of the block addresses stored in the corresponding block address column 13). However, when the difference data of the snapshot generation of the corresponding blocks 11 in the operation volume P-VOL has not yet been saved; that is, when the user data has not yet been written in the blocks 11 in the snapshot generation thereof, a code of “None” representing that there is no block address of the corresponding save destination is stored.
And, when the CPU 2 is given a creation order of the first generation snapshot from the host system when the snapshot management table 10 is in the initial state illustrated in
Incidentally, status of the operation volume P-VOL and the difference volume D-VOL in the foregoing case is depicted in
Thereafter, for instance, as shown in
And, when the CPU 2 confirms that the value of the bit is “1”, as shown in
In addition, as shown in
Further, as shown in
Thus, here, as shown in
Meanwhile, when a snapshot creation order for the next generation (second generation) is thereafter given from the host system, a shown in
Thereafter, as shown in
Further, as shown in
Here, with respect to the block 11 in which the block address in the operation volume P-VOL is “2”, the bit at the far left associated with the first generation snapshot of the corresponding CoW bitmap is also “1”, and it is evident that there was no change in the data up to the creation start time of the second generation snapshot; that is, the data contents of the first generation snapshot creation start time and second generation snapshot creation start time are the same.
Thus, here, the CPU 2 clears the first generation bit of the snapshot in the CoW bitmap of the snapshot management table 10 associated with the block 11 in which the block address of the operation volume P-VOL is “2”, and stores a block address that is the same as the block address stored in the save destination block address column 62 associated with the second generation snapshot in the save destination block address column 62 associated with the first generation snapshot in the snapshot management table 10.
And, when the update of this snapshot management table 10 is complete, the CPU 2 writes the user data in the operation volume P-VOL. Status of the operation volume P-VOL and difference volume D-VOL after the completion of write processing of user data is shown in
(1-2) Snapshot Data Read Processing
Next, the contents of processing performed by the CPU 2 when a read request is given from the host system for reading data of the snapshot created as described above are explained. Here, let it be assumed that the operation volume P-VOL and difference volume D-VOL are in the state shown in
The data to be used during read processing of data of the first generation snapshot is the region surrounded with a dotted line in
In actuality, as shown in
As a result of performing the foregoing mapping processing, it will be possible to create a first generation snapshot as shown in
Meanwhile, the data to be used during read processing of data of the second generation snapshot is the region surrounded with a dotted line in
In actuality, as shown in
(1-3) Problems of Basic Snapshot Function and Description of Snapshot Function According to Present Embodiment
Meanwhile, when a failure occurs in the difference volume D-VOL during the execution of the snapshot function in the NAS server (
In the foregoing cases, in order to realize a fault-tolerant operation of the NAS server 1, there is no choice but to adopt the latter method as the operation mode of the NAS server 1. Nevertheless, when this method is adopted, snapshots of all generations created theretofore will have to be abandoned. This is because, as shown in
In the case of
As a means for overcoming the foregoing problems, the present invention, for instance, as shown in
According to the snapshot maintenance method described above, even when a failure occurs in the difference volume D-VOL, when the difference volume D-VOL is recoverable, the previous snapshots can be maintained without having to stop the snapshot function or abandoning the snapshots of any generation created theretofore.
The snapshot function according to the foregoing embodiment is now explained.
(2-1) Configuration of Network System
The host system 21 is a computer device having an information processing resource such as a CPU (Central Processing Unit) and memory, and, for instance, is configured from a personal computer, workstation, mainframe and the like. The host system 21 has an information input device (not shown) such as a keyboard, switch, pointing device or microphone, and an information output device (not shown) such as a monitor display or speaker.
The network 22, for example, is configured from a SAN (Storage Area Network), LAN (Local Area Network), Internet, public line or dedicated line. Communication between the host system 21 and disk array device 23 via this network 22, for instance, is conducted according to a fibre channel protocol when the network 22 is a SAN, and conducted according to a TCP/IP (Transmission Control Protocol/Internet Protocol) protocol when the network 22 is a LAN.
The disk array device 23 is configured from a storage device unit 31 formed from a plurality of disk units 30 for storing data, a RAID controller 32 for controlling the user data I/O from the host system 21 to the storage device unit 31, and a plurality of NAS units 33 for exchanging data with the host system 21.
The respective disk units 30 configuring the storage device unit 31, for instance, are configured by having an expensive disk such as a SCSI (Small Computer System Interface) disk or an inexpensive disk such as a SATA (Serial AT Attachment) disk or optical disk built therein.
Each of these disk units 30 is operated under the RAID system with the RAID controller 32. One or more logical volumes VOL (
Further, another part of the logical volume VOL is defined as a difference volume D-VOL (
A unique identifier (LU: Logical Unit number) is provided to each logical volume VOL. In the case of the present embodiment, the input and output of user data is conducted based on an address obtained by combining this identifier and a number unique to the logical block thereof (LBA: Logical Block Address) provided to the respective logical blocks, and designating this address.
The RAID controller 32 has a microcomputer configuration including a CPU, ROM and RAM, and controls the input and output of user data between the NAS unit 33 and storage device 31. The NAS unit 33 has a blade structure, and is removably mounted on the disk array device 23. This NAS unit 33 is equipped with various functions such as a file system function for providing a file system to the host system 21 and a snapshot function according to the present embodiment described later.
The snapshot program 40, as shown in
Among the above, the operation volume read processing program 41 and operation volume write program 42 are programs for executing the read processing of user data from the operation volume P-VOL or write processing of user data in the operation volume P-VOL, respectively. The operation volume read processing program 41 and operation volume write program 42 configure the block I/O program 5 depicted in
The snapshot creation processing program 44 and snapshot deletion processing program 45 are programs for executing generation processing of a new generation snapshot or deletion processing of an existing snapshot. Further, the switching processing program 46 is a program for executing switching processing for switching the save destination of difference data from the difference volume D-VOL to the reproduction volume R-VOL. The difference data recovery processing program 47 is a program for executing difference data recovery processing of migrating difference data saved in the reproduction volume R-VOL to the difference volume D-VOL when the difference volume D-VOL is recovered.
Meanwhile, the snapshot management table 48, as shown in
Further, the failure-situation snapshot management table 49 is used for data management of snapshots in the respective generations when the difference data is not saved in the reproduction volume R-VOL. This failure-situation snapshot management table 49 has the same configuration as the snapshot management table 48 other than that a “Failure” address column 66 is provided in addition to being provided with an address column 64, a CoW bitmap column 65 and a plurality of address columns 67 respectively associated with the first to third generation snapshots in correspondence with each block 11 of the operation volume P-VOL.
However, in the failure-situation snapshot management table 49, the generation of the latest snapshot corresponds to “Failure” when a failure occurs in the difference volume D-VOL, and any snapshot created thereafter corresponds, in order, to a first generation (“V-VOL 1”), a second generation (“V-VOL 2”) and a third generation (“V-VOL 3”). Accordingly, for instance, when a failure occurs in the difference volume D-VOL when creating a second generation snapshot, even when a third generation snapshot is created thereafter, such snapshot will correspond to the first generation in the failure-situation snapshot management table 49.
The CoW bitmap cache 50 is a cache for storing a bit string formed by extracting and arranging bits corresponding to the latest snapshot in the order of block addresses among the respective CoW bitmaps stored in each CoW bitmap column 61 in the snapshot management table 48. For example, in the state shown in
The status flag 51 is a flag showing the status of the difference volume D-VOL in relation to the failure status, and retains a value of “Normal”, “Failure” or “Recovered”. Further, the latest snapshot generation information 52 stores the generation of the latest snapshot with the time in which the failure occurred in the difference volume D-VOL as the reference. For example, when a failure occurs in the difference volume D-VOL upon creating the second generation snapshot, a value of “2” is stored in the latest snapshot generation information 52.
(2-2) Various Processing of Disk Array Device
Next, the contents of processing to be performed by the CPU 2 (
(2-2-1) Write Processing of User Data in Operation Volume
Foremost, the contents of processing to be performed by the CPU 2 in the write processing of user data in the operation volume P-VOL are explained.
In other words, when the CPU 2 receives this write request, it starts the write processing (SP0), and foremost accesses the snapshot management table 48 (
To obtain a negative result at step SP1 (SP1: NO) means that the current snapshot generation has already been saved in the difference data D-VOL. Thus, the CPU 2 in this case proceeds to step SP8.
Contrarily, to obtain a positive result in the determination at step SP1 (SP1: YES) means that the difference data of the current snapshot generation has not yet been saved. Thus, the CPU 2 in this case reads the status flag 51 in the snapshot program 40, and determines whether or not this is a “Failure” (SP2).
And, when the CPU 2 obtains a negative result in this determination (SP2: NO), it saves the difference data in the difference volume D-VOL (SP3), and thereafter determines whether or not the writing of difference data in such difference volume D-VOL was successful (SP4). When the CPU 2 obtains a positive result in this determination (SP4: YES), it updates the snapshot management table 48 in accordance therewith (SP5), and further determines whether or not the update of such snapshot management table 48 was successful (SP6).
When the CPU 2 obtains a positive result in this determination (SP6: YES), it updates the contents of the CoW bitmap cache 50 according to the updated snapshot management table 48 (SP7), thereafter writes in the operation volume P-VOL the user data subject to writing provided from the host system 21 together with the write request (SP8), and then ends this write processing (SP12).
Contrarily, when the CPU 2 obtains a positive result in the determination at step SP2 (SP2: YES), it saves the difference data in the reproduction volume R-VOL (SP9), updates the failure-situation snapshot management table 49 in accordance therewith (SP10), and thereafter proceeds to step SP7. And, the CPU 2 thereafter performs the processing of step SP7 and step SP8 in the same manner as described above, and then ends this write processing (SP12).
Meanwhile, when the CPU 2 obtains a negative result in the determination at step SP4 or step SP6 (SP4: NO, SP6: NO), it proceeds to step SP11, and thereafter switches the save destination of user data from the difference volume D-VOL to the reproduction volume R-VOL based on the switching processing program 46 (
In other words, when the CPU 2 proceeds to step SP11 of the foregoing write processing, it starts this switching processing (SP20), and foremost sets “Failure” to the status flag 51 in the snapshot program 40 (SP21).
Next, the CPU 2 respectively stores the CoW bitmap cache 50 of the snapshot program 40 and the latest snapshot generation information 52 (SP22, SP23), and thereafter reflects the contents of the CoW bitmap cache 50 in the failure-situation snapshot management table 49. Specifically, as shown in
Next, the CPU 2 changes the generation of the snapshot to which a failure has occurred being stored as the latest snapshot generation information 52 into a “Failure” snapshot generation in the failure-situation snapshot management table 49 (SP25), and thereafter ends this switching processing (SP26). And, the CPU 2 thereafter returns from step SP11 to step SP1 of the foregoing write processing described with reference to
Accordingly, when the writing of difference data in the difference volume D-VOL or the update of the snapshot management table 48 ends in a failure (SP4: NO, SP6: NO), after the save destination volume of the difference data is switched from the difference volume D-VOL to the reproduction volume R-VOL, difference data is stored in the reproduction volume R-VOL according to the procedures of steps in the order of SP1-SP2-SP9-SP10-SP7-SP8.
(2-2-2) Read Processing of User Data from Operation Volume
Although the read processing of user data from the operation volume P-VOL is performed under the control of the CPU 2 based on the operation volume read processing program 42 (
(2-2-3) Read Processing of Snapshot Data
Next, the contents of processing to be performed by the CPU 2 in the data read processing of the created snapshot is explained.
In other words, when the CPU 2 if given a snapshot data read request designating the snapshot generation, block address and so on, it starts this snapshot data read processing (SP30), and foremost reads the status flag 51 (
When a negative result is obtained in the determination at step SP31 (SP31: NO), this means that the difference volume D-VOL is currently being operated, and the difference data is saved in the difference volume D-VOL. Thus, the CPU 2 in this case reads the block address stored in the save destination block address column 62 associated with the snapshot generation and block address designated in the snapshot management table 48 (SP32), and thereafter determines whether the reading of such block address was successful (SP33).
When the CPU 2 obtains a positive result in this determination (SP33: YES), it determines whether or not the read block address is “None” (SP34). When the CPU 2 obtains a positive result (SP34: YES), it proceeds to step 43, and, when the CPU 2 obtains a negative result (SP34: NO), it reads the user data stored in the block 12 of block address read at step SP32 in the difference volume D-VOL (SP35).
Further, the CPU 2 thereafter determines whether or not the reading of user data from the difference volume D-VOL was successful (SP36), and, when the CPU 2 obtains a positive result (SP36: YES), it ends this snapshot data read processing (SP44).
Contrarily, when the CPU 2 obtains a negative result in the determination at step SP33 or in the determination at step SP36 (SP33: NO, SP36: YES), it switches the save destination of difference data from the difference volume D-VOL to the reproduction volume R-VOL (SP37) by executing the switching processing described with reference to
Meanwhile, to obtain a negative result in the determination at step SP31 (SP31: YES) means that the difference volume D-VOL is not currently being operated, and that the difference data is saved in the reproduction volume R-VOL. Thus, the CPU 2 in this case determines whether or not the block subject to data reading designated by the user is a block belonging to either the snapshot of the generation to which a failure occurred, or the difference volume D-VOL (SP38).
And, when the CPU 2 obtains a positive result in this determination (SP38: YES), it error-ends this snapshot data read processing (SP45), and, contrarily, when the CPU 2 obtains a negative result (SP38: NO), it reads the block address stored in the address column 67 (
When the CPU 2 obtains a negative result in this determination (SP40: NO), it reads the user data stored in the block of the block address acquired at step SP39 in the reproduction volume R-VOL (SP41), and thereafter ends this snapshot data read processing (SP44).
Meanwhile, when the CPU 2 obtains a positive result in the determination at step SP40 (SP40: YES), it reads the status flag 51 (
To obtain a positive result in this determination (SP42: YES) means that the user data saved in the reproduction volume R-VOL is currently being written in the difference volume D-VOL that recovered from the failure. Thus, the CPU 2 in this case returns to step SP32, and thereafter executes the processing subsequent to step SP32 as described above.
Contrarily, to obtain a negative result in the determination at step SP42 (SP42: NO) means that a failure occurred in the difference volume D-VOL, and that the difference volume D-VOL has not yet been recovered. Thus, the CPU 2 in this case reads data from the operation volume P-VOL (SP43), and thereafter ends this snapshot data read processing (SP44).
(2-2-4) Snapshot Creation Processing
In other words, when the CPU 2 is given a snapshot creation order, it starts the snapshot creation processing (SP50), and foremost reads the status flag 51 in the snapshot program 40, and determines whether or not “Failure” is set to this status flag 51 (SP51).
When the CPU 2 obtains a negative result in this determination (SP51: NO), it sets the respective values of the bits corresponding to the generation of the snapshot to be created in each CoW bitmap in the snapshot management table 48 to 1 (SP52), and thereafter determines whether or not the update of the snapshot management table 48 was successful (SP54).
When the CPU 2 obtains a negative result in this determination (SP54: NO), it switches the save destination of difference data from the difference volume D-VOL to reproduction volume R-VOL (SP55) by executing the foregoing switching processing described with reference to
Contrarily, when the CPU 2 obtains a positive result in the determination at step SP54 (SP54: YES), it sets every value of the respective bits of the bit string stored in the CoW bitmap cache 50 of the snapshot program 40 to 1 (SP57). Further, the CPU 2 thereafter updates the latest snapshot generation information 52 to the value of the generation of the snapshot at such time (SP58), and then ends this snapshot creation processing (SP59).
Meanwhile, when the CPU 2 obtains a negative result in the determination at step SP51 (SP51: YES), it sets the respective values of the bits corresponding to the generation of the snapshot to be created in each CoW bitmap in the failure-situation snapshot management table 49 to 1 (SP53). Then, the CPU 2 sets every value of the respective bits of the bit string stored in the CoW bitmap cache 50 of the snapshot program 40 to 1 (SP57), updates the latest snapshot generation information 52 to the value of the generation of the snapshot at such time (SP58), and thereafter ends this snapshot creation processing (SP59).
(2-2-5) Snapshot Deletion Processing
Meanwhile,
In other words, when the CPU 2 is given a snapshot creation order, it starts the snapshot deletion processing (SP60), and foremost reads the status flag 51 in the snapshot program 40, and determines whether “Failure” is set to this status flag 51 (SP61).
When the CPU 2 obtains a negative result in this determination (SP61: NO), it sets the respective values of the bits corresponding to the generation of the snapshot to be deleted in each CoW bitmap in the snapshot management table 48 to “0” (SP62), and thereafter determines whether the update of the snapshot management table 48 was successful (SP63).
When the CPU 2 obtains a positive result in this determination (SP63: YES), it updates the contents of the CoW bitmap cache 50 in the snapshot program 40 to the contents corresponding to the snapshot of a generation preceding the snapshot subject to deletion when the snapshot subject to deletion is the latest snapshot (SP64). Specifically, the CPU 2 reads the respective values of the bits associated with the generation preceding the snapshot subject to deletion in each CoW bitmap in the snapshot management table 48, and arranges these in the order of the corresponding block addresses and writes these in the CoW bitmap cache 50 (SP64).
And, when the CPU 2 thereafter determines whether the update of the CoW bitmap cache 50 was successful (SP65) and obtains a positive result (SP65: YES), it updates the value of the latest snapshot generation information 52 in the snapshot program 40 to the value of the snapshot generation (SP69), and thereafter ends this snapshot deletion processing (SP70).
Contrarily, when the CPU 2 obtains a negative result in the determination at step SP63 or step SP65 (SP63: NO, SP65: NO), it switches the save destination of difference data from the difference volume D-VOL to the reproduction volume R-VOL (SP71) by executing the foregoing switching processing described with reference to
Meanwhile, when the CPU 2 obtains a positive result in the determination at step SP61 (SP61: YES), it determines whether or not the snapshot in which a failure occurred is the snapshot subject to deletion (SP66). And, when the CPU 2 obtains a positive result in this determination (SP66: YES), it error-ends this snapshot deletion processing (SP72).
Contrarily, when the CPU 2 obtains a negative result in the determination at step SP66 (SP66: NO), it sets the respective values of the bits corresponding to the generation of the snapshot to be deleted in each CoW bitmap in the failure-situation snapshot management table 49 to “0” (SP67).
Further, when the snapshot subject to deletion is the latest snapshot, the CPU 2 thereafter updates the contents of the CoW bitmap cache 50 in the snapshot program 40 to the contents corresponding to the snapshot of a generation preceding the snapshot subject to deletion (SP68). Specifically, the CPU 2 reads the respective values of the bits corresponding to the generation preceding the snapshot subject to deletion in each CoW bitmap in the failure-situation snapshot management table 49, and arranges these in the order of the corresponding block addresses and writes these in the CoW bitmap cache 50 (SP68).
And, the CPU 2 thereafter updates the value of the latest snapshot generation information 52 in the snapshot program 40 to the new snapshot generation (SP69), and then ends this snapshot deletion processing (SP70).
(2-2-6) Difference Data Recovery Processing
Next, difference data recovery processing is explained. This difference data recovery processing is executed when a recovery processing order of difference data is given from the system administrator in a case where the difference volume D-VOL in which a failure had occurred has recovered, or in a case where a new difference volume D-VOL is created since the difference volume D-VOL was irrecoverable.
For example, when the difference volume D-VOL recovers from its failure, the difference data saved in the reproduction volume R-VOL is migrated to the difference volume D-VOL, and the contents of the failure-situation snapshot management table 49 are reflected in the snapshot management table 48 pursuant thereto. Data migration in such a case is performed based on the latest snapshot generation information 52 in the snapshot program 40. Further, the saving of difference data from the operation volume P-VOL during this time is conducted based on the contents of the CoW bitmap cache 50 in the snapshot program 40. Further, data migration of difference data from the reproduction volume R-VOL is conducted while retaining the consistency of the snapshot management table 48 and the failure-situation snapshot management table 49.
Since data migration from the reproduction volume R-VOL targets the difference data stored in the block where the value of the bit in the CoW bitmap in the failure-situation snapshot management table 49 is “0”, this can be performed in parallel without being in conflict with the saving of difference data from the operation volume P-VOL.
Here, “None” is stored in the address column 67 in the failure-situation snapshot management table 49 of the difference data migrated to the difference volume D-VOL. However, during this difference data recovery processing, snapshots acquired prior to the occurrence of a failure cannot be accessed. This is because unrecovered difference data in the reproduction volume R-VOL may be referred to, and mapping from the snapshot management table 48 to an area in the reproduction volume R-VOL is not possible.
When the difference volume D-VOL cannot be recovered from its failure, only the difference data corresponding to the snapshot acquired after the occurrence of the failure; that is, the difference data regarding the snapshot of generations after the first generation snapshot in the failure-situation snapshot management table 49, is migrated to the newly set difference volume D-VOL.
In this case, the determination of whether the failure of the difference volume D-VOL is recoverable or irrecoverable is conducted by the system administrator. When the system administrator determines that the difference volume D-VOL is recoverable, he/she performs processing for recovering the difference volume D-VOL, and, contrarily, when the system administrator determines that the difference volume D-VOL is irrecoverable, he/she sets a new difference volume D-VOL.
However, the configuration may also be such that the CPU 2 of the NAS unit 33 automatically determines whether the difference volume D-VOL is recoverable or irrecoverable, and automatically creates a new difference volume D-VOL when it determines that the original difference volume D-VOL is irrecoverable. Specifically, for instance, the CPU 2 calculates the mean time to repair (MTTR: Mean Time To Repair) relating to the disk failure from past log information or the like, waits for the elapsed time from the occurrence of the failure to the current time to exceed the mean time to repair, and determines that the failure of the difference volume D-VOL is recoverable at the stage when such elapsed time exceeds the mean time to repair. As a result, it is anticipated that the response to failures in the difference volume D-VOL can be sped up in comparison to cases of performing this manually.
Details regarding the contents of difference data recovery processing are now explained. The difference data recovery processing is conducted in the order of reflecting the CoW bitmap cache in the snapshot management table 49, and then migrating the difference data to the difference volume D-VOL.
In other words, when the CPU 2 is given a recovery order of difference data, it starts the difference data recovery processing (SP80), and foremost reads the status flag of the snapshot program, and determines whether “Failure” is set thereto (SP81). And when the CPU 2 obtains a negative result in this determination (SP81: NO), it error-ends this difference data recovery processing (SP94).
Contrarily, when the CPU 2 obtains a positive result in this determination (SP81: YES), it stores the failure-situation snapshot management table 49, and thereafter determines whether or not the values of the bits corresponding to the latest snapshot in each CoW bitmap in the current snapshot management table 48 completely coincides with the values of the corresponding bits in the bit string stored in the CoW bitmap cache 50 at the time the failure occurred that was stored at step SP22 of the switching processing shown in
To obtain a positive result in this determination (SP83: YES) means that the current difference volume D-VOL is a difference volume D-VOL that was subject to a failure but recovered thereafter. Thus, the CPU 2 in this case sequentially copies the bit at the far left in each CoW bitmap in the failure-situation snapshot management table 49 to the bit corresponding to the current snapshot generation to the bit position of the corresponding generation of the corresponding CoW bitmap in the snapshot management table 48 (SP84). Here, the CPU 2 conducts the association of the snapshot generation in the failure-situation snapshot management table 49 and the snapshot generation in the snapshot management table 48 based on the latest snapshot generation information 52.
For instance, in the example shown in
Thus, the CPU 2 respectively copies the bit at the far left in each CoW bitmap of the failure-situation snapshot management table 49 to the bit (second bit from the far left) corresponding to the current snapshot generation (“V-VOL 1”) in the failure-situation snapshot management table 49 to a portion after the bit (second from the far left) corresponding to the second generation snapshot in the corresponding CoW bitmap in the snapshot management table 48. As a result of this kind of processing, it will become possible thereafter to perform the saving of difference data from the operation volume P-VOL to the difference volume D-VOL in parallel with the migration of difference data from the reproduction volume R-VOL to the difference volume D-VOL.
Incidentally,
Contrarily, to obtain a negative result in the determination at step SP83 (SP83: NO) means that the current difference volume D-VOL was created newly since the difference volume D-VOL subject to a failure was irrecoverable. Thus, the CPU 2 in this case respectively copies the bit at the far left in each CoW bitmap in the failure-situation snapshot management table 49 to the bit associated with the current snapshot generation to the portion after the bit at the far left in each CoW bitmap in the snapshot management table 48 (SP85). Accordingly, in this case, the difference data prior to the occurrence of a failure in the difference volume D-VOL will be lost.
Further, when the CPU 2 completes the processing of step SP84 or step SP85, it sets “Recovered” to the status flag in the snapshot program 40 (SP86).
The, the CPU 2 thereafter migrates the difference data saved in the reproduction volume R-VOL to the difference volume D-VOL in order from the oldest generation as of the generation of the snapshot at the time it was subject to a failure (SP87 to SP91).
Specifically, the CPU 2 confirms the generation of the snapshot at the time it was subject to a failure based on the latest snapshot generation information 52 in the snapshot program 40, and selects one block 11 (
Then, the CPU 2 thereafter migrates the difference data of the target snapshot generation of this target block 11 from the reproduction volume R-VOL to the difference volume D-VOL (SP88), and then, as shown in
Further, as shown in
Further, as shown in
And, the CPU 2 thereafter sets “None” as the block address in the respective address columns 66, 67 in the failure-situation snapshot management table 49 corresponding to the respective save destination block address columns 62 in the snapshot management table 48 updated at step SP88 (SP90).
Next, the CPU 2 determines whether the same processing steps (step SP87 to step SP90) have been completed for all blocks in the operation volume P-VOL from which the difference volume was saved in the reproduction volume R-VOL (SP91), and returns to step SP87 upon obtaining a negative result (SP91: NO). Then, while sequentially changing the blocks 11 to be targeted, the CPU 2 repeats the same processing steps (step SP87 to step SP91) to all blocks 11 in which difference data has been saved in the reproduction volume R-VOL.
And, when the CPU 2 eventually completes the processing to all blocks 11 (SP91: YES), it sets “Normal” to the status flag 51 in the snapshot program 40 (SP92), and thereafter ends this difference data recovery processing (SP93).
Here, the processing contents of the migration processing for migrating difference data from the reproduction volume R-VOL to the difference volume D-VOL conducted at step SP87 to step SP89 of the difference data recovery processing, and the update processing of the snapshot management table 48 and failure-situation snapshot management table 49 are explained in further detail with reference to
With the example shown in
Further, with respect to the block 11 having a block address of “0”, since the values of the respective bits corresponding to the first and second generation snapshots among the respective bits of the corresponding CoW bitmaps in the snapshot management table 49 are both “1”, it is evident these first and second generation snapshots share the same difference data. Meanwhile, since the same block address is not stored in the corresponding address column 66 of the “Failure” row and the address column 67 of the “V-VOL 1” row in the failure-situation snapshot management table 49, it is evident that the second and third generation snapshots do not share the same difference data.
Thus, the CPU 2 stores the block addresses (“11”) of the difference volume D-VOL, which is the migration destination of the difference data thereof, in the respective save destination block address columns 62 corresponding to the respective rows “V-VOL 1” and “V-VOL 2” in the snapshot management table 48. Further, the CPU 2 updates the corresponding CoW bitmap of the snapshot management table 48 to “0010”, and further sets “None” in the corresponding column 66 of the “Failure” row in the failure-situation snapshot management table 49.
Moreover, with respect to the block 11 in the operation volume P-VOL having a block address of “1” in the second generation snapshot, as shown in
Further, with respect to the block 11 having a block address of “1”, since the values of the respective bits corresponding to the first and second generation snapshots among the respective bits of the corresponding CoW bitmaps in the snapshot management table 48 are both “1”, it is evident these first and second generation snapshots share the same difference data. Meanwhile, since the same block address of “10” is stored in the corresponding address column 66 of the “Failure” row and the address column 67 of the “V-VOL 1” row in the failure-situation snapshot management table 49, it is evident that the second and third generation snapshots share the same difference data.
Thus, the CPU 2 stores the block addresses (“5”) of the difference volume D-VOL, which is the migration destination of the difference data thereof, in the respective save destination block address columns 62 corresponding to the respective rows “V-VOL 1” to “V-VOL 3” in the snapshot management table 48. Further, the CPU 2 updates the corresponding CoW bitmap of the snapshot management table 48 to “0000”, and further sets “None” in each of the corresponding columns 66, 67 of the “Failure” row and “V-VOL 1” row in the failure-situation snapshot management table 49.
Meanwhile, with respect to the respective blocks 11 in the operation volume P-VOL where the block addresses are “2” and “3” in the second generation snapshot, as evident from
Contrarily, with respect to the block 11 in the operation volume P-VOL having a block address of “4” in the second generation snapshot, as shown in
Further, with respect to the block 11, since the values of the respective bits corresponding to the first and second generation snapshots among the respective bits of the corresponding CoW bitmaps in the snapshot management table 48 are both “0”, it is evident these first and second generation snapshots do not share the same difference data. Meanwhile, with respect to the block 11, since different block addresses are stored in each of the corresponding address columns 66, 67 of the “Failure” row and the “V-VOL 1” row in the failure-situation snapshot management table 49, it is evident that the second and third generation snapshots do not share the same difference data.
Thus, the CPU 2 stores the block addresses (“8”) of the difference volume D-VOL, which is the migration destination of the difference data thereof, in the respective save destination block address columns 62 corresponding to the “V-VOL 2” row in the snapshot management table 48. Further, the CPU 2 stores “None” in the corresponding address column 66 of the “Failure” row in the failure-situation snapshot management table 49.
Moreover, with respect to the block 11 in the operation volume P-VOL having a block address of “5” in the second generation snapshot, as shown in
Further, with respect to the block 11, since the values of the respective bits corresponding to the first and second generation snapshots among the respective bits of the corresponding CoW bitmaps in the snapshot management table 48 are both “0”, it is evident these first and second generation snapshots do not share the same difference data. Meanwhile, with respect to this block 11, since the same block address of “2” is stored in the corresponding address column 66 of the “Failure” row and the address column 67 of the “V-VOL 1” row in the failure-situation snapshot management table 49, it is evident that the second and third generation snapshots share the same difference data.
Thus, the CPU 2 stores the block addresses (“8”) of the difference volume D-VOL, which is the migration destination of the difference data thereof, in the save destination block address columns 62 corresponding to the rows of “V-VOL 2” and “V-VOL 3” in the snapshot management table 48. Further, the CPU 2 stores “None” in the corresponding address columns 66, 67 of the respective rows of “Failure” and “V-VOL 1” in the failure-situation snapshot management table 49.
Moreover, with respect to the block 11 in the operation volume P-VOL having a block address of “6” in the second generation snapshot, as shown in
Thus, the CPU 2 stores the block addresses (“9”) of the difference volume D-VOL, which is the migration destination of the difference data thereof, in the save destination block address columns 62 corresponding to the rows of “V-VOL 1” and “V-VOL 2” in the snapshot management table 48. Further, the CPU 2 updates the corresponding CoW bitmap of the snapshot management table 48 to “0000”, and further stores “None” in the corresponding address column 66 of the “Failure” row in the failure-situation snapshot management table 49. Moreover, with respect to the block 11 in the operation volume P-VOL having a block address of “7” in the second generation snapshot, as shown in
Meanwhile, with respect to the block 11 in the operation volume P-VOL having a block address of “0” to “2” in the third generation snapshot, as shown in
Contrarily, with respect to the block 11 in the operation volume P-VOL having a block address of “3” in the third generation snapshot, as shown in
Further, with respect to this block 11, it is evident that it shares the difference data with the snapshot of a generation before the occurrence of the failure as described above, and that it also shares the difference data with the snapshot of subsequent generations from each of the corresponding address columns 67 of the respective rows of “V-VOL 1” and “V-VOL 2” in the failure-situation snapshot management table 49.
Thus, the CPU 2 stores the block addresses (“10”) of the difference volume D-VOL, which is the migration destination of the difference data thereof, in the save destination block address column 62 corresponding to the “V-VOL 3” in the snapshot management table 48. Further, the CPU 2 sets “None” in the corresponding column 67 of the “V-VOL 1” row in the failure-situation snapshot management table 49.
Meanwhile, with respect to the respective blocks 11 in the operation volume P-VOL where the block addresses are “4” and “5” in the third generation snapshot, as evident from
Meanwhile, with respect to the block 11 in the operation volume P-VOL having a block address of “6” in the third generation snapshot, as shown in
Further, with respect to this block 11, it is evident that it does not share the difference data with the snapshot of a generation before the occurrence of the failure as described above, and that it also does not share the difference data with the snapshot of subsequent generations from each of the corresponding address columns 67 of the respective rows of “V-VOL 1” and “V-VOL 2” in the failure-situation snapshot management table 49.
Thus, the CPU 2 stores the block addresses (“13”) of the difference volume D-VOL, which is the migration destination of the difference data thereof, in the save destination block address column 62 corresponding to the “V-VOL 3” in the snapshot management table 48. Further, the CPU 2 sets “None” in the corresponding column 67 of the “V-VOL 1” row in the failure-situation snapshot management table 49.
Meanwhile, with respect to the respective blocks 11 in the operation volume P-VOL where the block address is “7” in the third generation snapshot, as evident from
As a result of the series of processing described below, the difference data saved in the reproduction volume R-VOL can be migrated to the difference volume D-VOL while retaining the consistency of the snapshot management table 48 and failure-situation snapshot management table 49.
Further, according to this kind of snapshot maintenance method, even when a failure occurs in the difference volume D-VOL during the creation of a snapshot, the new difference data created based on the write processing of user data to the operation volume P-VOL until the difference volume D-VOL is recovered can be retained in the reproduction volume R-VOL, and the difference data can thereafter be migrated to the difference volume D-VOL at the stage when the failure in the difference volume D-VOL is recovered. Further, even with respect to the snapshot management table 48, inconsistencies until the failure in the difference volume D-VOL is recovered can be corrected with the failure-situation snapshot management table 49.
Therefore, according to this snapshot maintenance method, even when a failure occurs in the difference volume D-VOL, since a part or the whole of the snapshots created theretofore can be maintained while performing the ongoing operation, the reliability of the overall disk array device can be improved dramatically.
In the embodiment described above, although a case of employing the present invention in the NAS unit 33 (
Further, in the embodiments described above, although a case of respectively configuring the snapshot management table 48 as the first difference data management information and the failure-situation snapshot management table 49 as the snapshot management table 48 as shown in
In addition to the application in a disk array device, the present invention may also be widely employed in a NAS device or the like.
Number | Date | Country | Kind |
---|---|---|---|
2005-274125 | Sep 2005 | JP | national |