This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2012-179480, filed on Aug. 13, 2012, the entire contents of which are incorporated herein by reference.
The embodiments discussed herein are related to a storage system, a storage control method, and a program for a storage control.
A disk array device is frequently used as a storage system. The disk array device is a storage device including a plurality of magnetic storage devices such as a hard disk. The disk array device manages a plurality of magnetic storage devices as one RAID (Redundant Array of Inexpensive Disks) group with a RAID technique. To a RAID group, one or more logical volumes are allocated, and data locations are decided.
Additionally, an idea called “aggregate” that handles a plurality of RAID groups as one logical pool in order to increase an upper limit of a storage capacity that can be provided to a host is known.
In relation to the above described techniques, a storage system that reduces power needed for storage resources allocated to a pool and avoids the lifetime of the storage resources from being shortened in a storage system implemented with an AOU (Allocation on Use) technique is known. In this storage system, a storage device defined as belonging to a pool is allocated to a virtual volume. The storage device is powered off before the device is allocated to a virtual volume, and powered on when the device is allocated to the virtual volume.
Additionally, a storage control device for selecting a physical storage device where a logical storage device is arranged depending on whether a type of an access pattern to a logical volume of a higher-level device is either a random access or a sequential access, and a first performance index preset in the logical volume is known.
Furthermore, a computer system for preventing the performance of a virtualized storage system from being degraded is known. This computer system includes a storage device, a host computer, and a management computer. The management computer provides storage resources to the host computer based on storage configuration information obtained from the storage device. Moreover, the management computer obtains use information of the hardware resource including storage resources, and decides the configuration of the storage device in order to prevent a load from concentrating on a particular hardware resource based on the use information of the hardware resource.
Note that related art is described, for example, Japanese Laid-open Patent Publication No. 2007-293442, Japanese Laid-open Patent Publication No. 2008-123132 and Japanese Laid-open Patent Publication No. 2008-165620.
According to an aspect of the embodiments, a storage system includes: a grouping unit configured to generate one or more storage device sub-groups, each of the storage device sub-groups including a storage device used to store data, from storage devices included in a plurality of storage device groups that respectively include a plurality of storage devices; a selection unit configured to select any of the one or more storage device sub-groups; and a control unit configured to shut off power supply to a non-selected device group, which is a storage device sub-group other than a selected storage device sub-group and included in a storage device group including the selected storage device sub-group, and shut off power supply to storage devices included in a storage device group other than the storage device group including the selected storage device sub-group.
The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention.
In the above described storage control device for selecting a physical storage device depending on the type of an access pattern and the first performance index, stored data is distributed and a plurality of disks to be accessed are needed when a RAID group is distributed and arranged according to an access from a host. Accordingly, the number of disks made to continuously run increases, leading to excessive power consumption.
Additionally, in the above described computer system for preventing the performance of a virtualized storage system from being degraded, access data is distributed and arranged to select a logical device according to an access from a host, so that excessive power is consumed due to an increase in the number of disks made to continuously run.
Embodiments according to the present invention are described below with reference to
Each of the storage device groups 140, 150, . . . includes a plurality of storage devices, each of which can be operated as one storage device like a RAID group. For example, the storage device group 140 includes storage devices, such as storage devices #0, . . . , #a, #b, . . . , #c, and the like, used to store data, and storage devices, such as storage devices #d, #e and the like, used to store information other than data, for example, parity information. In this case, the storage device group 140 is operable as a RAID-DP (RAID Double Parity).
The grouping unit 110 generates one or more storage device sub-groups by dividing, into groups, storage devices used to store data among storage devices included in the plurality of storage device groups 140, 150, . . . .
The selection unit 120 selects any of storage device sub-groups generated by the grouping unit 110, such as the storage device sub-groups 141, 142, 151 and 152 in
The control unit 130 performs a control such that power is not supplied to storage device sub-groups other than the selected storage device sub-group 141, which are included in the storage device group 140 including the storage device sub-group 141 selected by the selection unit 120. In other words, the control unit 130 shuts off power supply to a non-selected storage device sub-group within the storage device group 140 including the selected storage device sub-group 141. In
As described above, the storage system 100 performs a control such that power is not supplied to storage device sub-groups other than a selected storage device sub-group, which are included in a storage device group including the selected storage device sub-group selected by the selection unit 120. Moreover, the storage system 100 performs a control such that power is not supplied to storage devices included in a storage device group other than the storage device group including the selected storage device sub-group. As a result, the storage system 100 can reduce power consumed by the storage system 100.
The above provided description has referred to the case where the storage device group 140 operates as a RAID-DP. For example, if only the storage device #e is used to store parity information, the storage device group 140 can operate as a RAID 4. Moreover, for example, if all storage devices are used to store data, the storage device group 140 can operate as RAID 0+1.
a, b, c, and d, which are illustrated in
The control device 220 includes a memory 221, a RAID control unit 222, and a cache control unit 224.
The memory 221 can be used as a disk cache for temporarily storing data received from the host 250 via the CAs 210. Moreover, a program executed by a CPU, which is included in the storage system 200 and not illustrated, can be stored in the memory 221 in order to implement the RAID control unit 222 and the cache control unit 224. Moreover, an aggregate management table 221a, a RAID management table 221b, a disk group management table 221c, and a write management table 221d, and the like can be stored in the memory 221. For example, a nonvolatile memory such as a RAM (Random Access Memory) or the like can be used as the memory 221.
The RAID control unit 222 performs a control for generating a RAID group including some or all of disks in the disk group 230, and for reading/writing data from/to a RAID group. The RAID control unit 222 generates an aggregation of RAID groups called an “aggregate”, and can use the generated aggregate as one logical pool. Moreover, the RAID control unit 222 can provide a virtual volume to the host 250 by using some or all of storage resources included in the aggregate.
The RAID control unit 222 includes a disk control unit 223. The RAID control unit 222 can execute the disk control unit 223 and stop the disk control unit 223 according to an instruction issued from a user, or setting information stored in a nonvolatile memory or the like not illustrated. The RAID control unit 222 can implement the storage system 200 according to this embodiment to be described later with reference to
The disk control unit 223 generates disk groups by dividing disks included in a RAID group within the disk group 230 into a plurality of groups. Then, the disk control unit 223 powers off disk groups other than a disk group that satisfies a specified condition among the generated disk groups. Moreover, the disk control unit 223 reads/writes data from/to a disk group by a request issued from the data control unit 225.
The cache control unit 224 performs a control for implementing a disk cache with the memory 221, such as for temporarily storing data and storing data in a RAID group within the disk group 230 when needed. Moreover, the cache control unit 224 includes the data control unit 225.
The data control unit 225 performs striping for written data in a specified stripe size when the data is written to a disk group included in a RAID group within the disk group 230. Then, the data control unit 225 generates data groups by dividing the striping data into groups by the number of disks included in the disk group. In this embodiment, striping data included in a data group is distributed and stored on disks included in a disk group. For example, when a data group including striping data #1 and #2 is written to a disk group including disks #1 and #2, the striping data #1 and #2 are respectively distributed and stored on the disks #1 and #2.
The data control unit 225 generates, for each of disks, a data list, in which the order of striping data written to a disk is rearranged, in order to sequentially write the striping data to each of the disks of a disk group included in a RAID group within the disk group 230. The data control unit 225 requests the disk control unit 223 to write the data based on the generated data list.
The disk group 230 may include a plurality of storage devices. As a storage device included in the disk group 230, a magnetic disk device such as a hard disk or the like may be used. A storage device included in the disk group 230 is referred to simply as a “disk” throughout the explanation of this embodiment.
For example, when the storage system 200 is activated or when a user performs a specified operation for the storage system 200 via an input device, the disk control unit 223 generates a plurality of disk groups by dividing disks included in a RAID group into groups.
The disk control unit 223 powers off disk groups other than a disk group that satisfies a particular condition among generated disk groups after the disc control unit 223 has generated the disk groups. In this case, all disks included in a RAID group that does not include the disk group satisfying the particular condition are powered off. For example, in
Note that the discs for parity are not powered off in RAID #0, since RAID #0 includes the disc group which satisfies the particular condition. On the other hand, all discs including a disc for parity are powered off in RAID #1, since RAID #1 does not include a disc group which satisfies the particular condition.
The data control unit 225 partitions the data 410 into striping data of a specified size. Then, the data control unit 225 generates data groups by dividing the striping data into groups by the number of used disks of a disk group at a write destination of the data 410, namely, by each number of disks included in the disk group. In
In this embodiment, striping data included in a data group is distributed and stored on disks included in a disk group at a write destination. For example, in the case of
Accordingly, the data control unit 225 generates a data list, in which the order of striping data is rearranged, in order to sequentially write the striping data to each of disks at a write destination. In
The data control unit 225 requests the disk control unit 223 to write the striping data #1-#8 to the disk group #1 based on the data list after the data control unit 225 has generated the data list. The disk control unit 223 that has received the request sequentially writes the striping data to the disks at the write destination based on the data list. For example, the disk control unit 223 sequentially writes the striping data #1, #3, #5, and #7 to the disk #1 in this order. Similarly, for example, the disk control unit 223 sequentially writes striping data #2, #4, #6, and #8 to the disk #2 in this order.
The aggregate number is identification information assigned to each aggregate. The number of stored RAIDs is information indicating the number of RAID groups included in an aggregate. The stored RAID number is information indicating a RAID number of a RAID group included in the aggregate. The RAID number is identification information assigned to each RAID group. The number of disk groups is information indicating the total number of disk groups included in a RAID group within the aggregate. The stripe size is a size in which striping is performed for data received from the host 250.
The RAID management table 610 illustrated in
The disk group management table 710 illustrated in
The disk group number is identification information assigned to each disk group. The disk group number according to this embodiment is a serial number of all disk groups. In the example of
The storing RAID group number is information indicating a RAID number of a RAID group including disk groups. The number of used disks is information indicating the number of disks included in a disk group. The available capacity is information indicating an available capacity of a disk group. The write priority is information indicating a priority with which data is written. The power supply state is information indicating whether a power supply of a disk group is either in an ON or OFF state.
The write destination disk group number is a disk group number of a disk group as a write target. The number of write data is information indicating the number of striping data generated by performing striping for write data to be written to a disk group as a write target. The number of written disks is information indicating the number of disks included in a disk group as a write target. The data list is a data list indicating the order of writing data to a disk for each of disks included in a disk group as a write target.
The disk control unit 223 references the aggregate management table 221a of the target aggregate, which is stored in the memory 221. Then, the disk control unit 223 obtains a RAID number of a RAID group to be processed from a field of the stored RAID number in the aggregate management table 221a (step S901). The disk control unit 223 according to this embodiment obtains all RAID numbers registered as stored RAID numbers in the aggregate management table 221a. Then, the disk control unit 223 selects one of the obtained RAID numbers. Hereinafter, the RAID group having the selected RAID number is referred to as a “target RAID group”, and a RAID number of the target RAID group is referred to as a “target RAID number”.
The disk control unit 223 obtains the number of configuring disks and a RAID type from the RAID management table 221b, which is stored in the memory 221, of the target RAID group, namely, the RAID management table 221b having a RAID number that matches the target RAID number (step S902).
The disk control unit 223 calculates the number of disk groups of the target RAID group based on the number of configuring disks and the RAID type, which have been obtained in step S902 (step S903). For the calculation of the number of disk groups, the following expression (1) may be used.
{(the number of configuring disks)−(the number of parity disks)}/(the number of used disks) (1)
where “the number of configuring disks” is the number of configuring disks obtained in step S902, “the number of parity disks” is the number of parity disks used in the RAID group of the RAID type obtained in step S902, and “the number of used disks” is the number of disks used in a disk group. “the number of used disks” may be determined in advance. “the number of used disks” according to this embodiment is assumed to be 2, although it is not particularly limited.
For example, when the target RAID group is the RAID group having the RAID number #1, the disk control unit 223 obtains “9” and “RAID-DP” respectively as the number of configuring disks and the RAID type from the RAID management table 610 illustrated in
The disk control unit 223 divides disks included in the target RAID group into disk groups by the number of disk groups obtained in step S903 (step S904). Moreover, the disk control unit 223 assigns a disk group number and a write priority to each of the disk groups generated by dividing the disks into groups. For example, the disk control unit 223 sequentially assigns disk group numbers #1, #2, #3, . . . to the generated disk groups in the order where the disk groups are generated. Similarly, the disk control unit 223 sequentially assigns write priorities 1, 2, 3, . . . to the generated disk groups in the order where the disk groups are generated. In this case, the write priority “1” is the highest, and the write priorities “2”, “3”, . . . descends in this order. However, the assignments of disk group numbers and write priorities are not limited to the order where disk groups are generated. Disk group numbers and write priorities may be assigned to disk groups based on various policies when needed.
The disk control unit 223 generates the disk group management table 221c for each of the disk groups generated in step S904, and stores the disk group management table 221c in the memory 221. Then, the disk control unit 223 writes a storing RAID group number, the number of used disks, and a power supply state in each generated disk group management table 221c (step S905). Moreover, the disk control unit 223 writes the write priority assigned in step S904 to each generated disk group management table 221c (step S905).
The disk control unit 223 calculates an available capacity based on a capacity of a used storage area for each of the disk groups generated in step S904, and writes the calculated capacity to the disk group management table 221c (step S906).
The disk control unit 223 powers off disk groups other than the disk group having the write priority “1” assigned in step S904 among the disk groups included in the target RAID group (step S907). Moreover, the disk control unit 223 powers off disks included in RAID groups other than the target RAID group (step S907). The disk control unit 223 writes an OFF state to a field of the power supply state of the disk group management table 221c of each of the disk groups that have been powered off.
If a RAID group for which the processes of steps S902-S907 are not executed is included in a target aggregate (“YES” in step S908), the process of the disk control unit 223 moves to step S902. Then, the disk control unit 223 repeats the processes of steps S902-S908. When the processes of steps S902-S907 have been executed for all the RAID groups included in the target aggregate (“NO” in step S908), the disk control unit 223 terminates the disk group generation process (step S909).
The data write process according to this embodiment is executed by the data control unit 225 and the disk control unit 223. The write process executed on the data control unit 225 is described below with reference to
The data control unit 225 writes, to the memory 221, data that the host 250 has requested to write, and returns a write completion signal of the data to the host 250 (step S1001). The data that the host 250 has requested to write is hereinafter referred to as “write data”.
The data control unit 225 obtains a RAID number registered as a stored RAID number in the aggregate management table 221a by referencing the aggregate management table 221a (step S1002). Then, the data control unit 225 selects a RAID group having the highest selection priority set in the RAID management table 221b among RAID groups having the RAID number obtained in step S1002 (step S1003).
When an available capacity is not sufficient for the RAID group selected in step S1003 (“NO” in step S1004), the data control unit 225 selects a RAID group by re-executing step S1003. In this case, a RAID group already selected in step S1003 is excluded from selection targets.
When the available capacity is sufficient for the RAID group selected in step S1003 (“YES” in step S1004), the process of the data control unit 225 moves to step S1005. In this case, the data control unit 225 references a disk group management table 221c of each of disk groups included in the RAID group selected in step S1003, and extracts a disk group having an available capacity that can store write data (step S1005). Then, the data control unit 225 selects a disk group having the highest write priority included in the disk group management table 221c from among disk groups extracted in step S1005 (step S1006).
When the data size of the write data is equal to or smaller than a stripe size (“YES” in step S1007), the process of the data control unit 225 moves step S1010. When the data size of the write data is larger than the stripe size (“NO” in step S1007), the process of the data control unit 225 moves to step S1008. In this case, the data control unit 225 generates data groups by dividing the write data into groups according to the number of used disks and the stripe size (step S1008). In this embodiment, striping is performed for write data in a stripe size, and striping data is divided into groups by the number of used disks.
The data control unit 225 generates a data list for each of disks included in the disk group as a write target, which is selected in step S1006, and writes the data list to a field of the data list in the write management table 221d (step S1009). In this data list, the order of striping data included in a data group generated in step S1008 is rearranged in order to sequentially write the data to a disk.
Additionally, in the process of step S1009, the data control unit 225 writes values also to the write destination disk group number, the number of write data, and the number of written disks. To the write destination disk group number in the write management table 221d, a number of the disk group selected in step S1006 is written. To the number of write data in the write management table 221d, the number of striping data generated by performing the striping in step S1008 is written. To the number of written disks in the write management table 221d, the number of used disks of the disk group selected in step S1006 is written.
Upon termination of the above described processes, the data control unit 225 requests the disk control unit 223 to execute the write process (step S1010), so that the write process executed on the data control unit 225 is terminated (step S1011).
The disk control unit 223 references the write management table 221d stored in the memory 221 (step S1101). At this time, the disk control unit 223 can identify the disk group as a data write target based on the write destination disk group number in the write management table 221d. The disk group as a write target is hereinafter referred to as a “target disk group”.
The disk control unit 223 obtains a data list for each of disks within the target disk group from the write management table 221d (step S1102). Then, the disk control unit 223 reads, from the memory 221, the striping data in order where the data is registered to the data list, and writes the read data to disks within the target disk group (step S1102). Generation of parity at a data write will be described later with reference to
The disk control unit 223 reads the disk group management table 221c of the target disk group (step S1103). Then, the disk control unit 223 calculates an available capacity after the data has been written in step S1102 by subtracting the amount of data written in step S1102 from the available capacity included in the read disk group management table 221c. Then, the disk control unit 223 updates the available capacity of the disk group management table 221c to the calculated available capacity (step S1104).
When the used capacity of the target disk group exceeds a threshold, for example, when a ratio of the used capacity to the entire storage capacity of the target disk group exceeds 80 percent (“YES” in step S1105), the process of the disk control unit 223 moves to step S1106. In this case, the disk control unit 223 identifies a disk group having a priority second highest to the target disk group by referencing the disk group management table 221c. Then, the disk control unit 223 powers on the disk group having the priority second highest to the target disk group (step S1106). When the disk group that has been powered on belongs to a RAID group different from the target disk group and when a parity disk is included in the RAID group to which the disk group belongs, the disk control unit 223 powers on also the parity disk. Note that the used capacity of the target disk group can be obtained by subtracting the available capacity calculated in step S1104 from the entire capacity of the target disk group.
The disk control unit 223 updates information about the power supply state of the disk group management table 221c of the disk group that has been powered on in step S1106 to an ON state (step S1107).
Upon completion of the update of the disk group management table 221c, the process of the disk control unit 223 moves to step S1108. Moreover, when the used capacity of the target disk group is equal to or smaller than the threshold (“NO” in step S1105), the process of the disk control unit 223 also moves to step S1108. Then, the disk control unit 223 terminates the write process (step S1108).
The RAID #0 illustrated in
Since the RAID #0 illustrated in
As described above, this embodiment assumes that “0” is written to disks included in a disk group that is in a power-off state at a data write. This is based on the configuration in which “0” is written to disks by executing a process of zero-padding for each of the disks included in a generated RAID group when the RAID group is generated.
The above embodiments have been described by taking, as an example, the case where a RAID group is a RAID-DP. However, a RAID group available in this embodiment is not limited to a RAID-DP. For example, the RAID group may be a RAID group, such as a RAID 4 or a RAID 0+1, which is configured by including data disks and a parity disk or configured by only data disks.
As the number of used disks in the disk group management table 221c, a value obtained by counting duplicated disks as one disk is used. For example, the disk group #1 illustrated in
Additionally, for example, when data is written to the disk group #1 illustrated in
The storage system 200 divides data disks included in a RAID group into a plurality of disk groups. Then, the storage system 200 powers off disk groups other than a disk group having the highest write priority. As a result, the storage system 200 can reduce power consumed by the storage system 200.
The storage system 200 divides striping data into a plurality of data groups when the data is written to a disk group. Then, the storage system 200 respectively distributes and stores each of the data groups to disks within the disk group. At this time, the storage system 200 generates a data list where the striping data included in each of the data groups is rearranged for each of the written disks. Then, the storage system 200 writes the data to the disks within the disk group based on the generated data list. As a result, the storage system 200 writes the striping data included in each of the data groups to the disks at a write destination of the striping data as sequential data.
In a storage system implemented with the AOU technique, storage area allocation and power supply control are performed in units of disks. Therefore, accesses may concentrate on a certain disk. Accordingly, access performance of the storage system is sometimes reflected unchanged as that of a disk. The storage system 200 according to this embodiment, however, performs a power supply control in units of disk groups each including a plurality of disks. As a result, even if accesses concentrate on one disk group, data is distributed and stored on disks within the disk group. Consequently, performance of a data access can be avoided from being degraded.
As described above, the disclosed storage system can reduce power consumption.
All examples and conditional language provided herein are intended for the pedagogical purposes of aiding the reader in understanding the invention and the concepts contributed by the inventor to further the art, and are not to be construed as limitations to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although one or more embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
2012-179480 | Aug 2012 | JP | national |