This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2011-146631, filed on Jun. 30, 2011, the entire contents of which are incorporated herein by reference.
The embodiments discussed herein are related to transference control.
As a method for realizing database (DB) duplication for the purpose of one of countermeasures against disaster, load distribution, and so on, a method is available in which a log representing an update result of a copy source database based on a structured query language (SQL) application in a primary system is transmitted to and used in a copy destination database in a secondary system.
In such a database duplication system, a fault may occur from which self-recovery by the secondary system is not possible. In this case, by extracting DB data to be transmitted to the secondary system, transferring the extracted DB data to the secondary system, and storing the DB data transmitted from the primary system in the copy destination database, the copy destination database may be logically restored using the DB data from the copy source database.
Data synchronization techniques in a database duplication system include, for example, the technique described next. That is, a primary storage system transfers, by synchronous remote copying, write data written to a log logical disk to a secondary storage system on a recovery site. The primary storage system also transfers, by asynchronous remote copying in which the writing order is guaranteed, write data written to a DB data logical disk to the secondary storage system on the recovery site. The log logical disk and the DB data logical disk are discriminated from each other. The write data is data written to the primary storage system as a result of a transaction but is not data separately extracted for recovery or the like.
According to an aspect of the invention, a transference control apparatus includes a communication controller, and a processor that executes a procedure, the procedure including processing a control that causes first processing, in which an update log representing the update content of updating performed for a first database is transmitted, by the communication controller, to a second database serving as a copy destination for the first database, to be performed with priority over second processing, in which transmission target data contained in the first database is transmitted to the second database, and restraining, in the case that updating of the transmission target data is allowed in the first database, the control that causes the first processing to be performed with priority.
The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention, as claimed.
An example of the outline and state of a system configuration according to an embodiment will be explained with reference to
As illustrated in
The method described above has the following advantage compared to a method for duplicating the disk volume in which data in a database resides. That is, the amount of data transfer between systems is smaller, and a secondary system may be utilized (for example, for a reference service) while duplication of a database continues.
In a database duplication system such as the database duplication system described above, a fault from which self-recovery by the secondary system is not possible may occur. In this case, as illustrated in
The primary system includes a copy source database that includes a plurality of database resources used for a plurality of services, a DBMS that manages the copy source DB, a duplication unit, and a DB data extracting unit that extracts part of data in the copy source DB for recovery or the like. The secondary system includes a copy destination DB that is to store data identical to that stored in the copy source DB, a DBMS that manages the copy destination DB, a duplication unit, and a DB data storing unit. The primary system and the secondary system are connected via a transfer path.
For example, as indicated by the dotted-line rectangle A in
In a case such as where, for example, a fault occurs concerning a database resource in the copy destination DB, the DB data extracting unit extracts DB data in the database resource and transmits the extracted DB data to the secondary system via the same transfer path as that used to transfer an update log, in accordance with an instruction from an administrator or the like, as indicated by a dotted-line rectangle B in
As described above, it is assumed that an update log and DB data are transmitted across the transfer path between the primary system and the secondary system occurs. For the database resource that is operating normally as indicated by the dotted-line rectangle A, it is desirable that synchronization of data between the copy source DB and the copy destination DB be achieved as quickly as possible. This is because the secondary system may have other uses such as being separately used for reference.
In contrast, for the database resource that is performing a recovery operation as indicated by the dotted-line rectangle B, the service may completely be stopped, or the service may be asked to continue even though the copy destination DB is also originally meant to be used in terms of the characteristics and the like of the service but unavoidably only the copy source DB may be used. Even if the service is completely stopped, there may be little time left to restart the service, or there may be enough time left to restart the service.
The processing for causing the DB data extracting unit of the primary system to extract DB data and the processing for causing the DB data storing unit of the secondary system to store DB data impose loads on the copy source DB and the copy destination DB respectively. In general, however, some load distribution is done in database designing, and the loads do not greatly affect the service. Regarding the transfer path, the data transfer amount significantly differs between where only an update log is transferred and where both DB data and an update log are transferred. Thus, transfer of DB data through the transfer path without any consideration may greatly affect the service that is normally operating as indicated by the dotted-line rectangle A.
In view of the above-described points, it is desirable that DB data be transferred to the extent that the transfer of DB data does not to affect the transfer of an update log. That is, transfer of an update log is prioritized as much as possible. However, for transfer of DB data, the transfer status is changed to take into account at least one of the transfer status of DB data and the service operating status, which includes whether a service using a database resource in which a fault has occurred is stopped or operating and whether or not the time to restart the service is close.
The duplication unit 110 includes a log obtaining unit 111 that acquires data of a log generated by the DBMS 120, a data reading unit 115 that performs processing for obtaining DB data from the DB data extracting unit 130, a first transmission buffer 112 that stores data of a log, a second transmission buffer 116 that stores DB data, a data transmission unit 114 that performs processing for transmitting a log and processing for transmitting DB data, and a request queue 113 for managing requests and the like from the log obtaining unit 111 and the data reading unit 115. The data transmission unit 114 performs processing for reading and transmitting data for a request that is determined to be a transmission target from either the first transmission buffer 112 or the second transmission buffer 116.
The secondary system 200 illustrated in
The data reception unit 213 of the duplication unit 210 in the secondary system 200 receives log data from the primary system 100, and outputs the received log data to the update log reflection unit 211. The data reception unit 213 receives a DB file from the primary system 100, and outputs the received DB file to the DB data storing unit 214. The update log reflection unit 211 causes the DBMS 220 to update the DB 230 with reference to the update log. The DB data storing unit 214 causes the DBMS 220 to write DB data to the DB 230. Since processing performed by the duplication unit 210 in the secondary system 200 is performed in a method similar to a known method, the processing will not be explained.
In a system such as that described above, an administrator or the like instructs, using the administrator terminal 300, the DB data extracting unit 130 to extract data, while specifying the identifier of a database resource from which the data is to be extracted and a guaranteed transfer completion time. In accordance with this, the DB data extracting unit 130 requests the DBMS 120 to extract DB data for the specified database resource, and acquires from the DBMS 120 the DB data corresponding to the request.
At this time, the DBMS 120 acquires a checkpoint log representing the update status of the extracted DB data.
In general, a checkpoint log representing the update status of DB data is utilized for order guarantee in which after DB data that is transferred from a primary system to a secondary system is stored in the DB of the secondary system, an update log generated after extraction of the DB data is applied to the DB. In this embodiment, a checkpoint log may be used for a different purpose.
When receiving an SQL statement involving DB updating for a certain service, the DBMS 120 updates the DB 140, and generates a corresponding update log.
Furthermore, when receiving a log from the DBMS 120, the log obtaining unit 111 stores a checkpoint log and an update log into the first transmission buffer 112, and outputs a log transmission request to the data transmission unit 114. The DB data extracting unit 130 outputs a notification including the identifier of a database resource, the overall size (volume) of extracted DB data, and the guaranteed transfer completion time to the data reading unit 115, in accordance with an instruction from an administrator or the like. In accordance with the notification, the data reading unit 115 outputs a DB data start notification including the database resource ID to the data transmission unit 114. The DB data start notification may contain the overall size and the guaranteed transfer completion time. In this case, the processing of S21 and S23 described below may be omitted.
Furthermore, the data reading unit 115 divides DB data extracted by the DB data extracting unit 130 into data portions each having a certain size and stores the data portions into the second transmission buffer 116. The data reading unit 115 also outputs to the data transmission unit 114 a transmission request for each of the data portions generated by data division. Data regarding notifications is stored, for example, until a request is issued from the data transmission unit 114. After transmitting transmission requests for all the data portions of DB data, the data reading unit 115 outputs to the data transmission unit 114 a DB data completion notification containing the database resource ID.
The data transmission unit 114 stores an update log transmission request, a checkpoint log transmission request, a DB data transmission request, a DB data start notification, and a DB data completion notification in the request queue 113. For example, the request queue 113 has the structure illustrated in
In the example illustrated in
The data transmission unit 114 stores a management table 1141 to manage the data transfer status of DB data. Each entry of the management table 1141 has, for example, the data format illustrated in
Processing performed by the data transmission unit 114 will now be explained with reference to
As described above, an update log and a checkpoint log (CP log) for extraction of DB data are stored in the first transmission buffer 112. Extracted DB data is divided into a plurality of data portions and stored in the second transmission buffer 116.
In this embodiment, for example, the processing described below is repeatedly performed, for example, for initial requests or the like of the first to third queues 1131 to 1135 until, for example, a process termination request is received from the administrator terminal 300 or the like (S3). When a termination request is received, the process is terminated.
If a termination request has not been received, the data transmission unit 114 determines whether or not an update log transmission request is stored in the first queue 1131 of the request queue 113 (S5). If an update log transmission request is stored in the first queue 1131, the data transmission unit 114 sets the update log transmission request as a transmission target (S7). More specifically, the data transmission unit 114 moves the update log transmission request to the fourth queue 1137. Then, the process proceeds to S9.
If an update log transmission request is not stored in the first queue 1131 of the request queue 113 or after the processing of S7 is performed, the data transmission unit 114 determines whether or not a CP log transmission request is stored in the second queue 1133 of the request queue 113 (S9). If a CP log transmission request is stored in the second queue 1133, the data transmission unit 114 sets the CP log transmission request as a transmission target (S11). Then, the data transmission unit 114 determines whether the CP type of the CP log stored in the first transmission buffer 112 represents that DB updating is possible (S13). Here, the CP log has the data format illustrated in
If the CP type represents that DB updating is allowed, the data transmission unit 114 sets an entry in which “high” priority level is set for the CP log transmission request in the management table 1141 (S15). More specifically, an entry including a database resource ID contained in the CP log and “high” priority level is registered in the management table 1141. In this fashion, by setting the priority level as equivalent to that for an update log or the like, transfer of DB data may be completed early. Then, the process proceeds via a terminal A to S21 of
If the CP type represents that DB updating is not possible, the data transmission unit 114 sets an entry in which “low” priority level is set for the CP log transmission request in the management table 1141 (S17). More specifically, an entry including a database resource ID contained in the CP log and “low” priority level is registered in the management table 1141. As described above, basically, the priority level for transfer of DB data is lower than the priority level for transfer of an update log or the like, except for a specific case in which a recovery operation is performed while a service for a database resource is operating in the state where DB updating is possible.
The process then proceeds via the terminal A to S21 of
Referring to
After the processing of S23 is performed or if a DB data start notification is not stored in the third queue 1135, the data transmission unit 114 determines whether or not a DB data transmission request is stored in the third queue 1135 (S25). If a DB data transmission request is stored in the third queue 1135, the data transmission unit 114 performs a priority evaluation process (S27). The priority evaluation process will be explained with reference to
In the priority evaluation process, the data transmission unit 114 determines whether or not “low” priority level is set in the corresponding entry in the management table 1141 (S41 of
If “low” priority level is set, the data transmission unit 114 calculates the average transmission speed for DB data, and stores the calculated average transmission speed into a storage device, such as, for example, a main memory (S43). For example, it is assumed that the entry of the management table 1141 illustrated in
DB data for a plurality of database resources may be transferred at the same time. For example, if DB data for two database resources are transferred at the same time, when “low” priority level is set for the DB data for both the database resources, the same bandwidth as that illustrated in
Then, the data transmission unit 114 evaluates, using the average transmission speed, the time to be taken for transfer of non-transmitted data (S45). More specifically, the value obtained by subtracting the amount of transmitted data from the overall size is divided by the average transmission speed. In order to allow a margin, a specific value may be added to the value obtained by the above-mentioned calculation. In the example explained above, 1.56 hours (=(5−3.6)/0.9) is calculated.
Then, the data transmission unit 114 determines whether or not the time evaluated in S45 is longer than the time to the guaranteed transfer completion time (S47). The time to the guaranteed transfer completion time is calculated by subtracting the current time from the guaranteed transfer completion time. In the example explained above, the time to the guaranteed transfer completion time is 1 hour (=13:00−12:00), and the evaluated time is 1.56 hours. Thus, the evaluated time is longer than the time to the guaranteed transfer completion time. In this case, transfer of DB data is less likely to be completed by the guaranteed transfer completion time.
If the evaluated time is longer than the time to the guaranteed transfer completion time, the data transmission unit 114 changes the priority level for the corresponding entry in the management table 1141 to “high” (S49). That is, the state where priority is given to transfer of an update log is changed to the state where priority is equally given to both the transfer of an update log and transfer of DB data. Then, the process returns to the calling process.
In this case, as schematically illustrated in
If the evaluated time is less than or equal to the time to the guaranteed transfer completion time, even if “low” priority level is set at the current time, the transfer of DB data is scheduled to be completed by the guaranteed transfer completion time. Thus, the priority level is not changed. That is, the process returns to the calling process.
In this case, priority is given to log transfer. However, if a log does not exist in a queue, DB data is transferred. Thus, as schematically illustrated in
If “high” priority level is set in a CP log, as schematically illustrated in
As described above, it is determined, in accordance with the data transfer status, whether or not a “low” priority level, which is lower than the priority level for an update log or the like, is to be changed to a “high” priority level. If transfer of DB data is less likely to be completed by the guaranteed transfer completion time, a “high” priority level, which is equivalent to the priority level for an update log or the like, is set.
Referring back to
If “low” priority level is set, the data transmission unit 114 determines whether or not an update log transmission request that is set as a transmission target or a CP log transmission request that is set as a transmission target exists (S31). More specifically, the data transmission unit 114 checks if an update log transmission request or a CP log transmission request is stored in the fourth queue 1137. If a log transmission request that is set as a transmission target exists, the process proceeds to S35. If a log transmission request that is set as a transmission target does not exist, the process proceeds to S33. That is, since no log to be transmitted exists, DB data is transmitted.
The data transmission unit 114 determines whether or not a DB data completion notification exists in the third queue 1135 (S35). If a DB data completion notification is stored in the third queue 1135, transfer of DB data relating to the DB data completion notification has been completed. Thus, the data transmission unit 114 discards the corresponding entry in the management table 1141 (S37). Then, the process proceeds via a terminal C to S51 of
Referring to
Then, if the data transmitted in S53 contains DB data (Yes in S55), the data transmission unit 114 updates the amount of transmitted data in the corresponding entry of the management table 1141 according to the database resource ID of the DB data in such a manner that the amount of transmitted data is increased by the amount of transmitted DB data (S57). Then, the process returns via the terminal B to S1 of
By performing processing such as the above, for example, the determination as to whether or not the priority level for DB data is to be changed from “low” to “high” may be appropriately made in accordance with whether or not a service for a database resource relating to a recovery operation is operating, and whether or not the transfer status of the DB data for the database resource relating to the recovery operation is able to guarantee the guaranteed transfer completion time. That is, a transfer path may be efficiently utilized in accordance with the status.
Furthermore, the influence on a user service for a normal database resource may be reduced to close to if not the minimum, thus contributing to a flexible recovery operation.
By transmitting data of the management table 1141 to the administrator terminal 300, the data transfer status may be presented to the administrator. For example, the administrator may change the guaranteed transfer completion time or may perform processing such as manually changing the priority level.
Furthermore, in the example explained above, it is assumed that a fault has occurred in the secondary system 200. However, this embodiment is also applicable to the case where DB data of a specific database resource is transmitted from a copy source DB in the primary system 100 to the secondary system 200 for a different reason.
Although the embodiment of this technique has been described above, this technique is not limited to the embodiment. For example, the functional block diagrams illustrated in
Furthermore, the processing flows explained above are merely examples. The processing order may be changed or a plurality of processing steps may be performed in a parallel manner as long as the processing results are not changed. For example, the data transmission unit 114 may perform processing steps in a parallel manner for each type of transmission request.
In addition, the example in which the priority level for a DB data transmission request is set to “high”, which is the same level as that for a log transmission request, has been explained. However, if a larger number of priority levels may be set, the priority level for a DB data transmission request may be set higher than the priority level for a log transmission request.
The primary system 100 and the secondary system 200 each may be a single computer or each may share functions between a plurality of computers.
Each of the primary system 100 and the secondary system 200 is formed of one or more computers. As illustrated in
Features of the embodiments described above may be summarized as below.
A data communication method according to an embodiment includes (A) a step of receiving a first request for requesting transmission of first data, which includes the update content for a first database in a first system, to a second system that is connected to the first system via a transfer path and that includes a second database to store data identical to that stored in the first database, (B) a step of receiving a second request for requesting transmission of second data, which is at least part of the data in the first database, to the second system via the transfer path, (C) a step of determining, in accordance with at least one of the transfer status of the second data and the operating status of a service for at least a portion corresponding to the second data in the first database, whether or not a first state, in which transmission of the first data has a higher priority than transmission of the second data, is to be changed, and (D) a step of changing, if it is determined, in accordance with at least one of the operating status of the service and the transfer status of the second data, that the first state is to be changed, the first state to a second state, in which transmission of the second data has a priority greater than or equal to the priority of the transmission of the first data.
Accordingly, the state in which priority is given to the first data, which is a basic state, may be appropriately changed to the state in which priority is either shared with or given to the second data. Thus, data transfer appropriate for the status may be achieved.
In addition, the data communication method may further include (E) a step of receiving a third request for requesting transmission of third data to the second system, wherein the third data represents that data updating for at least the portion corresponding to the second data is allowed when the second data is extracted from the first database. In this case, in the determining step, it may be determined, in accordance with the third request, that the first state is to be changed. The third data represents that a service for a database resource for the second data is operating. Thus, in such a case, it is desirable that the second data be transmitted to the second system as soon as possible. Output of the third data may represent that data updating for at least the portion corresponding to the second data is allowed when the second data is extracted from the first database.
Furthermore, the data communication method may further include (F) a step of receiving a fourth request for requesting transmission of fourth data to the second system, wherein the fourth data represents whether or not data updating for at least the portion corresponding to the second data is allowed when the second data is extracted from the first database. In this case, the determining step may include (C1) a step of determining, in accordance with the fourth request, whether or not data updating is allowed by the fourth data and (C2) a step of determining, if the fourth data represents that data updating is allowed, that the first state is to be changed. If the fourth data is generated, the processing described above is performed.
In addition, the data communication method may further include (G) a step of managing, in accordance with reception of the second request, the guaranteed transfer completion time for the second data, the data amount of the second data, and the amount of the transmitted second data. In this case, the determining step may include (C3) a step of determining whether or not the time to be taken for transmission of the second data, which is evaluated from the transmission speed, the amount of the second data, and the amount of the transmitted second data, is longer than the time to the guaranteed transfer completion time and (C4) a step of determining, if the time to be taken for transmission of the second data is longer than the time to the guaranteed transfer completion time, that the first state is to be changed. As described above, if the guaranteed transfer completion time for the second data is designated, it may be estimated, from the current transfer status of the second data, that data transfer is expected not to be completed by the guaranteed transfer completion time. Thus, by determining, if the time to be taken for transmission of the second data is longer than the time to the guaranteed transfer completion time, that the first state is to be changed and by changing the state, data transfer of the second data may be completed by the guaranteed transfer completion time.
In addition, as illustrated in
Furthermore, as illustrated in
A program for causing a computer to execute the processing described above may be created. The program is stored, for example, in a computer-readable storing medium, such as a floppy disk, an optical disk such as a compact disc read only memory (CD-ROM), a magneto-optical disk, a semiconductor memory (for example, a ROM), or a hard disk, or a storage device. Data in the course of processing is temporarily stored in the storage device such as a random access memory (RAM).
According to an aspect of the embodiments explained above, in a database duplication system, delay of data transfer from a primary system to a secondary system due to an increase in the amount of update processing for a database in the primary system may be limited if not suppressed.
All examples and conditional language recited herein are intended for pedagogical purposes to aid the reader in understanding the invention and the concepts contributed by the inventor to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although the embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
2011-146631 | Jun 2011 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
6453325 | Cabrera et al. | Sep 2002 | B1 |
20050210073 | Oeda et al. | Sep 2005 | A1 |
20090172142 | Hanai et al. | Jul 2009 | A1 |
20100274759 | Takeuchi et al. | Oct 2010 | A1 |
Number | Date | Country |
---|---|---|
2005-210606 | Aug 2005 | JP |
2005-267301 | Sep 2005 | JP |
2010-157884 | Jul 2010 | JP |
Entry |
---|
Oracle, Guide to Hot Standby Databases, 1996. |
Oracle, Oracle Data Guard, Aug. 2008. |
Number | Date | Country | |
---|---|---|---|
20130006930 A1 | Jan 2013 | US |