The present application claims priority under 35 U.S.C. § 119 to Japanese Patent Application No. 2021-132037 filed on Aug. 13, 2021. The above application is hereby expressly incorporated by reference, in its entirety, into the present application.
The present disclosure relates to an information processing apparatus, an information processing method, and an information processing program.
WO2014/141355A discloses a technique of determining, in a storage having a hierarchical structure, based on a usage status of an upper layer storage or a lower layer storage, which of update processing and data migration processing is to be executed with priority, the update processing being processing of updating search indexes of pieces of data stored in the upper layer storage, and the data migration processing being processing of migrating data from the upper layer storage to the lower layer storage.
On the other hand, there is known a technique of configuring a storage pool using a plurality of magnetic tapes and providing a logical volume to a user. In addition, there is also known a technique of multiplexing and storing data by recording the same data in a plurality of storage pools. Further, in a storage system using magnetic tapes, for the purpose of storing data for a long term, increasing a capacity per magnetic tape, and improving a transmission speed, processing of migrating data recorded in an old-generation magnetic tape to a new-generation magnetic tape (hereinafter, referred to as “inter-generation migration processing) is performed.
In a case of performing inter-generation migration processing in a plurality of storage pools in which the same data is recorded and each of which includes a plurality of magnetic tapes, it is considered to perform processing of migrating data (hereinafter, referred to as “inter-generation duplication and migration processing) from an old-generation magnetic tape of one storage pool to new-generation magnetic tapes included in each of the plurality of storage pools. This is, for example, to shorten a reading time in data migration and to reduce the number of tape drives used for reading.
Further, in a tape library in which a plurality of generations of magnetic tapes are used, in many cases, a plurality of generations of tape drives are prepared. In this case, an old-generation tape drive cannot read and write data from and to a new-generation magnetic tape, but a new-generation tape drive can often read and write data from and to an old-generation (for example, previous-generation) magnetic tape.
The new-generation tape drive has higher performance than the old-generation tape drive. Thus, in normal data reading and writing other than data migration, it is preferable to shorten a response time to a user by using the new-generation tape drive. However, in inter-generation duplication and migration processing, data is written to the new-generation magnetic tapes included in each of the storage pools by being multiplexed. As a result, a relatively large number of new-generation tape drives are occupied.
The technique described in WO2014/141355A does not take into consideration that an occupancy time of the new-generation tape drive is increased in the tape library including the plurality of generations of tape drives.
An object of the present disclosure is to provide an information processing apparatus, an information processing method, and an information processing program capable of migrating data while preventing an increase in an occupancy time of a new-generation tape drive.
According to an aspect of the present disclosure, there is provided an information processing apparatus including: at least one processor that performs control of migrating data recorded in a plurality of storage pools in which the same data is recorded with a preset multiplicity and each of which includes a plurality of magnetic tapes, in which the processor is configured to perform first migration processing of migrating data from a relatively-old-generation magnetic tape included in one storage pool of the plurality of storage pools to relatively-new-generation magnetic tapes included in each of the plurality of storage pools in a case where the number of tape drives which are available for data migration is equal to or larger than a threshold value, the tape drives being relatively-new-generation tape drives among a plurality of generations of tape drives; and perform second migration processing of migrating data from migration-source magnetic tapes as relatively-old-generation magnetic tapes included in the storage pool to migration-destination magnetic tapes of which the number is smaller than the multiplicity in a case where the number of the tape drives is smaller than the threshold value.
In the information processing apparatus according to the aspect of the present disclosure, the migration-source magnetic tapes may be relatively-old-generation magnetic tapes, and the processor may be configured to perform, in the second migration processing, control of not migrating, among pieces of data recorded in the migration-source magnetic tapes, data recorded in the relatively-new-generation magnetic tapes of the storage pool to which the migration-source magnetic tapes belong.
In the information processing apparatus according to the aspect of the present disclosure, the processor may be configured to perform, in the second migration processing, control of migrating, among pieces of data recorded in the migration-source magnetic tapes, valid data to the migration-destination magnetic tapes and not migrating invalid data to the migration-destination magnetic tapes.
In the information processing apparatus according to the aspect of the present disclosure, the processor may be configured to perform, as a result of the first migration processing or the second migration processing, control of initializing, among the migration-source magnetic tapes, the magnetic tape from which all pieces of recorded data are migrated to the relatively-new-generation magnetic tapes.
In the information processing apparatus according to the aspect of the present disclosure, the threshold value may be set according to the multiplicity.
In the information processing apparatus according to the aspect of the present disclosure, the threshold value may be set according to a frequency of an access to the relatively-new-generation magnetic tapes.
In the information processing apparatus according to the aspect of the present disclosure, the plurality of generations of tape drives may be provided in a tape library in which the magnetic tapes are included.
According to another aspect of the present disclosure, there is provided an information processing method executed by at least one processor of an information processing apparatus, the processor performing control of migrating data recorded in a plurality of storage pools in which the same data is recorded with a preset multiplicity and each of which includes a plurality of magnetic tapes, the method including: performing first migration processing of migrating data from a relatively-old-generation magnetic tape included in one storage pool of the plurality of storage pools to relatively-new-generation magnetic tapes included in each of the plurality of storage pools in a case where the number of tape drives which are available for data migration is equal to or larger than a threshold value, the tape drives being relatively-new-generation tape drives among a plurality of generations of tape drives; and performing second migration processing of migrating data from migration-source magnetic tapes as relatively-old-generation magnetic tapes included in the storage pool to migration-destination magnetic tapes of which the number is smaller than the multiplicity in a case where the number of the tape drives is smaller than the threshold value.
According to still another aspect of the present disclosure, there is provided an information processing program causing at least one processor of an information processing apparatus to execute a process, the processor performing control of migrating data recorded in a plurality of storage pools in which the same data is recorded with a preset multiplicity and each of which includes a plurality of magnetic tapes, the process including: performing first migration processing of migrating data from a relatively-old-generation magnetic tape included in one storage pool of the plurality of storage pools to relatively-new-generation magnetic tapes included in each of the plurality of storage pools in a case where the number of tape drives which are available for data migration is equal to or larger than a threshold value, the tape drives being relatively-new-generation tape drives among a plurality of generations of tape drives; and performing second migration processing of migrating data from migration-source magnetic tapes as relatively-old-generation magnetic tapes included in the storage pool to migration-destination magnetic tapes of which the number is smaller than the multiplicity in a case where the number of the tape drives is smaller than the threshold value.
According to the present disclosure, it is possible to migrate data while preventing an increase in an occupancy time of the new-generation tape drive.
Hereinafter, an example of an embodiment for performing a technique according to the present disclosure will be described in detail with reference to the drawings.
First, a configuration of an information processing system 10 according to the present embodiment will be described with reference to
The tape library 14 includes a plurality of slots (not illustrated) and a plurality of tape drives 18, and each slot includes a magnetic tape T as an example of a recording medium. Each tape drive 18 is connected to the information processing apparatus 12. The tape drive 18 writes or reads data to or from the magnetic tape T under a control of the information processing apparatus 12. Examples of the magnetic tape T include a linear tape-open (LTO) tape.
In a case where the information processing apparatus 12 writes or reads data to or from the magnetic tape T, the magnetic tape T as a write target or a read target is loaded from the slot into a predetermined tape drive 18. In a case where data is written or read to and from the magnetic tape T loaded into the tape drive 18, the magnetic tape T is unloaded from the tape drive 18 into the slot in which the magnetic tape T is originally included.
Next, a hardware configuration of the information processing apparatus 12 according to the present embodiment will be described with reference to
The storage unit 22 is realized by a hard disk drive (HDD), a solid state drive (SSD), a flash memory, or the like. An information processing program 30 is stored in the storage unit 22 as a storage medium. The CPU 20 reads the information processing program 30 from the storage unit 22, develops the read information processing program 30 in the memory 21, and executes the developed information processing program 30.
Further, the storage unit 22 stores a tape management table 32 for managing the magnetic tape T.
The invalid data is, for example, data for which a deletion instruction is input by the user and which is logically deleted. In order to physically delete the data for which the deletion instruction is input by the user from the magnetic tape T, all pieces of data recorded in the magnetic tape T are temporarily saved in another storage area, the magnetic tape T is initialized, and pieces of data excluding the logically-deleted data among all pieces of the saved data are recorded in the magnetic tape T. For this reason, in order to physically delete the data from the magnetic tape T, the tape drive 18 is occupied for a relatively long time. Therefore, in the present embodiment, the data for which the deletion instruction is input by the user is logically deleted by storing “1” in the invalid flag column of the tape management table 32. Thereby, the data will not be accessed by the user.
Further, as an example, as illustrated in
Further, as illustrated in
The tape library 14 according to the present embodiment includes a plurality of generations (two generations in the present embodiment) of the tape drives 18 in accordance with the generations of the magnetic tapes T. The old-generation tape drive 18 can read and write data only from and to the old-generation magnetic tape T among the two-generation magnetic tapes T. The new-generation tape drive 18 can read and write data from and to each of the two-generation magnetic tapes T.
Further, in the information processing system 10 according to the present embodiment, data is recorded with redundancy. Specifically, as an example, as illustrated in
Further, the information processing apparatus 12 performs control of migrating the data recorded in the two storage pools SPs. In the control, the information processing apparatus 12 performs the following two types of data migration processing. The first migration processing is processing of migrating data recorded in an old-generation magnetic tape T to a new-generation magnetic tape T. In the present embodiment, as an example, as illustrated in
Although the same data is recorded in each of two storage pools SPs, the data is not always recorded in the same physical manner in each storage pool SP. For example, as illustrated in
In this case, depending on the storage pool SP, the old-generation magnetic tape T cannot be reused or discarded, and thus the number of the magnetic tapes Ts in use is increased. In the example of
Therefore, as an example, as illustrated in
Further, as illustrated in
Further, as illustrated in
In the second migration processing, the migration-source magnetic tape T and the migration-destination magnetic tape T may be the same generation magnetic tape T, or the migration-destination magnetic tape T may be a new-generation magnetic tape T. Further, in the second migration processing, at least one of the migration-destination magnetic tapes Ts may be the migration-source magnetic tape T. In this case, the information processing apparatus 12 performs control of reading data recorded in the migration-source magnetic tape T, then performs control of initializing at least one of the migration-source magnetic tapes Ts, and use the initialized magnetic tape T as the migration-destination magnetic tape T. Further, the data migration destination in the first migration processing and the second migration processing may be the magnetic tape T in which data is not recorded, or a free area of the magnetic tape T in which data is recorded.
In the first migration processing, since the new-generation magnetic tapes Ts included in each of the two storage pools SPs are the migration-destination magnetic tapes Ts, at least two new-generation magnetic tapes T are used. In the second migration processing, in a case where the migration-destination magnetic tape T is the old-generation magnetic tape T, the new-generation magnetic tape T is not used. Further, in the second migration processing, in a case where the migration-destination magnetic tape T is the new-generation magnetic tape T, one new-generation magnetic tape T is used.
In the present embodiment, in each of the first migration processing and the second migration processing, in a case where data is read and written from and to the old-generation magnetic tape T, the old-generation tape drive 18 is used. Further, in each of the first migration processing and the second migration processing, in a case where data is read and written from and to the new-generation magnetic tape T, the new-generation tape drive 18 is used. That is, in a case where the first migration processing is performed, many new-generation tape drives 18 are occupied as compared with a case where the second migration processing is performed.
In reading and writing of data recorded in the magnetic tape T based on an access from the user terminal, in many cases, the reading and writing is often required to be performed at a higher speed than in the first migration processing and the second migration processing. In the following, the reading and writing of data recorded in the magnetic tape T based on an access from the user terminal are referred to as “normal access”. That is, in the normal access, it is preferable to perform reading and writing of data from and to the new-generation magnetic tape T by using the new-generation tape drive 18. Therefore, the information processing apparatus 12 has a function of selectively executing the first migration processing and the second migration processing based on a usage situation of the new-generation tape drive 18.
Next, a functional configuration of the information processing apparatus 12 according to the present embodiment will be described with reference to
The determination unit 40 determines whether or not the number of the tape drives 18 that are available for data migration is equal to or larger than a threshold value, the tape drives 18 being new-generation tape drives 18 among the plurality of generations of the tape drives 18 included in the tape library 14. The number of the tape drives 18 that are available for data migration means, for example, the number of the tape drives 18 that are not in use due to a normal access. For example, in a case where the tape library 14 includes four new-generation tape drives 18 of which two new-generation tape drives 18 are used for a normal access, the number of the new-generation tape drives 18 that are available for data migration is 2 (=4−2).
The threshold value in this case is set according to the multiplicity. Specifically, the threshold value is set to, for example, the same value as the multiplicity. Further, the threshold value may be set according to a frequency of an access to the new-generation magnetic tape T. Specifically, the threshold value is set to a larger value, for example, every unit time (for example, one hour) as a frequency of an access to the new-generation magnetic tape T increases. Thereby, the threshold value is set to a larger value during a time period for which the frequency of the normal access is high. Therefore, the second migration processing is easily executed, and as a result, the number of the new-generation tape drives 18 that are available for the normal access is increased.
In a case where the determination unit 40 determines that the number of the new-generation tape drives 18 which are available for data migration is equal to or larger than the threshold value, the controller 42 performs the first migration processing described above. Further, in a case where the determination unit 40 determines that the number of the new-generation tape drives 18 which are available for data migration is smaller than the threshold value, the controller 42 performs the second migration processing described above. Further, as a result of the first migration processing, the controller 42 performs control of initializing, among the magnetic tapes Ts as the data migration source, the magnetic tape T from which all pieces of recorded data are migrated to the new-generation magnetic tape T. As a result of the first migration processing, the controller 42 may perform control of unloading the magnetic tape T from which all pieces of recorded data are migrated to the new-generation magnetic tape T among the magnetic tapes Ts as the data migration source, from the tape library 14.
Further, as a result of the second migration processing, the controller 42 performs control of initializing, among the magnetic tapes Ts as the data migration source, the magnetic tape T from which all pieces of recorded data are migrated to the new-generation magnetic tape T. As a result of the second migration processing, the controller 42 may perform control of unloading the magnetic tape T from which all pieces of recorded data are migrated to the new-generation magnetic tape T among the magnetic tapes Ts as the data migration source, from the tape library 14.
Next, an operation of the information processing apparatus 12 according to the present embodiment will be described with reference to
In step S10 of
In a case where the determination in step S10 is NO, the process proceeds to step S14. In step S14, the controller 42 performs the second migration processing. As a result of the processing of step S12 or step S14, in step S16, the controller 42 performs control of initializing, among the magnetic tapes Ts as the data migration source, the magnetic tape T from which all pieces of recorded data are migrated to the new-generation magnetic tape T. In a case where the processing of step S16 is completed, data migration processing is completed.
As described above, according to the present embodiment, it is possible to migrate data while preventing an increase in an occupancy time of the new-generation tape drive 18.
In the above embodiment, the storage pool SP may include magnetic tapes Ts of three or more generations.
Further, in the embodiment, for example, as a hardware structure of a processing unit that executes various processing such as the determination unit 40 and the controller 42, the following various processors may be used. The various processors include, as described above, a CPU, which is a general-purpose processor that functions as various processing units by executing software (program), and a dedicated electric circuit, which is a processor having a circuit configuration specifically designed to execute a specific processing, such as a programmable logic device (PLD) or an application specific integrated circuit (ASIC) that is a processor of which the circuit configuration may be changed after manufacturing such as a field programmable gate array (FPGA).
One processing unit may be configured by one of these various processors, or may be configured by a combination of two or more processors of the same type or different types (for example, a combination of a plurality of FPGAs or a combination of a CPU and an FPGA). Further, the plurality of processing units may be configured by one processor.
As an example in which the plurality of processing units are configured by one processor, firstly, as represented by a computer such as a client and a server, a form in which one processor is configured by a combination of one or more CPUs and software and the processor functions as the plurality of processing units may be adopted. Secondly, as represented by a system on chip (SoC) or the like, a form in which a processor that realizes the function of the entire system including the plurality of processing units by one integrated circuit (IC) chip is used may be adopted. As described above, the various processing units are configured by using one or more various processors as a hardware structure.
Further, as the hardware structure of the various processors, more specifically, an electric circuit (circuitry) in which circuit elements such as semiconductor elements are combined may be used.
Further, in the embodiment, an example in which the information processing program 30 is stored (installed) in the storage unit 22 in advance has been described. On the other hand, the present disclosure is not limited thereto. The information processing program 30 may be provided by being recorded in a recording medium such as a compact disc read only memory (CD-ROM), a digital versatile disc read only memory (DVD-ROM), or a Universal Serial Bus (USB) memory. Further, the information processing program 30 may be downloaded from an external device via a network.
Number | Date | Country | Kind |
---|---|---|---|
2021-132037 | Aug 2021 | JP | national |