The present invention relates to data storage systems for data processors and, more specifically, to data storage systems software that automates the process of creating a remote mirror of a relational database or other application.
Typically, in a data processing system a backup subsystem will save a recent copy, version or a portion of one or more data sets on some form of backup data storage device. At present, backup subsystems include magnetic or optical disk drives, tape drives or other memory devices. The backup subsystem will protect against the loss of storage data. For example, if one or more data sets are destroyed, corrupted, deleted or changed then the latest version of those data sets that are stored in a backup subsystem can restore the data sets. Consequently, the backup system minimizes the risk of loss of data. However, the processes of setting up a remote mirroring data process backup system are error prone and time consuming.
The business and/or critical information are frequently stored on external storage servers. Frequently, this information is contained in relational database management systems (RDBMS). Remote data centers and redundant hardware store critical information in order to ensure continuity and prevent data loss in the event of a catastrophic failure. The configuration of the redundant system at a remote location is a complex manual process. For example, the initial manual process consists of configuring the server hardware and software, configuring the storage subsystem(s) and restoring a backup copy of the database.
The storage configuration for large RDBMS systems is very complex and performance of the configuration is critical. The most important factor influencing performance of the configuration is the physical layout of the database on the storage subsystem. After completing the initial configuration of the remote mirror, changes to the remote mirror must occur when there are any changes to the storage allocation at the primary site. This ensures the completeness and viability of the mirrored database copy. In another instance, the overall storage subsystem has not changed but the physical location of one or more components of the storage subsystem has changed. Updating the physical layout of the mirrored database guarantees completeness and viability of the RDBMS.
The current process of setting up a remote mirror involves many steps. These steps include locating all of the volume groups on which that database is located, mapping the volume group to logical volumes, mapping the logical volumes to physical disks, and potentially multi-pathed physical disks, and finally mapping the physical disk to storage subsystem volumes or logical unit numbers (LUNs). The next step is choosing an appropriate target LUN on the target storage subsystem for each of the LUN sources. The target properties include the same type (fixed-block (FB) for open systems or count-key-data (CKD) for mainframes). The target LUN also must be of the same size. The next step physically connects all of the remote mirroring links between the two storage subsystems. Creation of a logical path between each source subsystem and target subsystem occurs over each physical link. The created number of paths is equal to the number of source subsystems multiplied by the number of target subsystems multiplied by the number of physical links. If the logical paths are not created for every physical link, then the remote mirror will not be valid in case of a disaster or link failure. Furthermore, every physical link must be used in order to maximize remote mirroring performance. The final step creates a task to establish remote mirroring from each source to each target. If a user makes a mistake in any of these steps, the remote mirror may not be valid. Furthermore, the user may not discover the mistake until after a disaster, which by that time is too late. Finally, if the configuration changes, the user will have to go through these steps again to reconfigure the mirror.
U.S. Patent Application Publication No. US 2002/0103969 A1, entitled “Mirroring Agent Accessible To Remote Host Computers, And Accessing Remote Data-Storage Devices, VIA A Communications Medium,” discloses a hardware-based mirroring agent that provides a LUN based input/output (I/O) interfaced to remote host computers including mirrored LUNs. The hardware-based mirroring agent is similar to a disk array, but manages and provides to host computers an interface to remote data storage devices. Available to the mirroring agent are the location, addresses, remote data storage devices and/or specifications of mirror relationships to set up and initialize through a configuration and administration interface. The mirroring agent then provides a LUN-based interface to the remote data storage devices via a communications medium to host computers. The host computer can remap remote devices accessible via the communications medium via an automated discovery process, during which updating of the volume manager tables or host I/O tables occur. The mirroring agent establishes and synchronizes groups of mirrored data storage devices using well-known disk mirroring techniques. However, the processes of setting up the hardware-based mirroring agent are error prone and time consuming. It is a manual process and not an automatic process. The mirroring agent requires human intelligence to select the source and target volumes of the mirroring.
It is apparent that there is a need for a method and system that would perform the tasks of remotely automatically mirroring a database and other applications.
One aspect of the present invention is a computer for dynamically mirroring a data storage configuration. The computer includes a data interface, a software agent, a communications interface, and a data processor. The data interface is coupled to a data storage medium, and information related to a storage configuration of the first data storage medium is communicated to the computer through the data interface. The software agent is embodied on a computer readable medium, and compares the storage configuration information received through the data interface, termed a first storage configuration, to a second storage configuration. The second storage configuration is received through the communications interface. The software agent uses the second storage configuration to automatically conform the first storage configuration of the first data storage medium to mirror the second storage configuration. This is done at least when a storage configuration parameter differs between the first and the second storage configurations, and possibly more often. The data processor is coupled to the data interface, the communications interface, and the computer readable medium on which the software agent is embodied, and coordinates those various components.
Another aspect of the present invention is a computer program for dynamically mirroring a local assemblage of data. The computer program includes a remote software agent embodied on a computer readable storage medium that is configured to couple to at least one remote storage server and to a local software agent. The remote software agent includes computer instructions for receiving a local storage server configuration including a local storage parameter from the local software agent, for determining a remote storage parameter corresponding to the local storage parameter from the at least one remote storage server, and for configuring the remote storage server in accordance with the received storage parameter to mirror the local storage server configuration.
Another aspect of the present invention is a method of facilitating self-configuring of a remote mirroring system. This method includes at least discovering a primary storage configuration and database layout, and then mapping the discovered primary storage configuration and database layout to create at least one primary storage subsystem volume. The method further includes receiving information concerning a remote storage subsystem, polling the primary storage subsystem volume and a relational database management system (RDBMS), and comparing current information from the primary storage subsystem volume to the received information. At least when certain differences are noted in the comparison, the method includes transmitting storage changes to the remote storage subsystem.
Another aspect of the present invention is a method of automatically extending a storage systems hardware mirroring function. This method includes mapping volumes received from a particular local storage system corresponding to the physical LUNs. The LUNs are those being mirrored to a remote storage subsystem. The method also includes evaluating remote mirror LUNs based on at least one of size, type, performance and reliability to find a suitable LUN. If a suitable LUN is not found, the method includes creating a suitable remote mirror LUN and furthermore, if a volume is to be added, creating a suitable target and mirroring a volume.
These and other aspects of the claimed invention will become apparent from the following description, the description being used to illustrate a preferred embodiment of the claimed invention when read in conjunction with the accompanying drawings.
While the claimed invention is described below with reference to database volumes of a primary host being mirrored to volumes of a backup host, a practitioner in the art will recognize the principles of the claimed invention are applicable to other applications including those applications as discussed supra.
The local agent 20a is connected to the first external storage server 10a processing the relational database management system (RDBMS). The local agent discovers the configuration of the first external storage server 10a and then discovers the database layout on it.
The remote agent 30a is connected to the second external storage server 10b processing the RDBMS. The remote agent 30a receives the first external storage server 10a first configuration information 21 from the local agent 20a. The remote agent 30a then creates suitable second configuration information 31 on the second external storage server 10b and begins to mirror the local volumes 21a through one or more remote mirror links 40. The remote mirror logical unit numbers (LUNs) 31a are evaluated for suitability based primarily on size and type criteria. Alternately, the evaluation is extendable to include performance and reliability criteria. If no suitable LUNs 31a are found, the software agents will create one or more secondary LUNs 31b based upon type and size of the first configuration information 21 (local volumes 21a). Furthermore, the software agents can create secondary LUNs 31b based upon a user-defined policy. The remote agent 30a receives the physical database layout 22 from the local agent 20a at the local site 20 and then mirrors the identical configuration on the remote site 30.
After an initial configuration of the first configuration information 21, the local agent 20a processes in the background, periodically checking for changes in storage allocation or database configuration. If the local agent 20a detects changes that require replication at the remote site 30, it sends a message to the remote agent 30a to make the appropriate configuration changes to the second external storage server 10b. For example, changes that require re-configuration include, but are not limited to, a new volume added to the database, volume(s) removed from the database, and the database is moved to different volumes for performance or other reasons. Further changes that require reconfiguration include an error condition that causes a different or backup volume to be used. Alternately, the remote mirror link(s) 40 (path) between the first server 10a and the second server 10b have failed wherein another path must be used or a new path created. The remote agent 20a upon receipt of the configuration change information will effect the required changes on the first and second external storage servers. If no suitable volumes are available, then the local volumes 21a and the remote volumes 31c are creatable based upon, for example, a user-defined policy.
As is understood by the practitioner in the art, the self-configuring remote mirroring system 10, and in particular the software agents, are not limited to databases. In addition, the system 10, and in particular the software agents, are extendable to all of the volumes of a particular host or group of host users, to different applications, to the configuration of the entire storage subsystem, or to a storage area network (SAN).
The step 112 discovers the storage configuration and database layout. The storage subsystem layout depends upon the different software tools from the storage system supplier, or alternatively it can use standards-based interfaces (such as the Storage Networking Industry Association's Storage Management Interface (SMI)). The physical database layout is discoverable by collecting information at each layer of the system, that is, database, operating system, volume manager and storage subsystem. At step 114, the software agent determines the logical unit number (LUN) assignment, that is, which LUNs are assigned to which hosts (local site 20 and remote site 30 on
The steps 114, 116, 118 and 120 are the creation of mapping from the database/operating system container to one or more storage subsystem volumes. The relationship between the storage subsystem volumes and database/operating system containers is a large number to a large number. For example, a single container includes multiple storage volumes and a single storage volume is useable in multiple database/operating system containers. Furthermore, subsystem volumes are mapable to corresponding logical unit numbers LUNs at step 122. The LUNs are placed into logical groupings. For example, logical groupings include but are not limited to all volumes used by a database, all volumes used by a particular host, user-defined groupings or all volumes used for a set of business applications.
The step 134, the software agent relays the mapped information to a duplicate remote storage subsystem. A software agent (local agent 20a) that processes on a first external storage server 10a (
At step 124 the software agent queries the state of the mirroring. If the applications are already properly configured to perform mirroring, then at step 125 a decision is made to go to the polling mode and poll the storage subsystem at step 126. This allows the software agent to be used with existing mirroring configurations as well as new configurations. At step 125, if the application is not properly configured to perform mirroring then the software agent directs the process to step 133 where the change is noted.
The software agent processes continuously, polling for changes in storage allocation and application configuration. At step 132, the software agent determines if a change is detected in the local storage subsystem. If no change is detected the software agent proceeds back to step 126 and polls the storage sub-system. However, if a change is detected to the local storage subsystem at step 132 then a change is noted at step 133. For example, adding a new volume to the database is detected and the software agent identifies and understands the usage of a new volume. Once the change is noted at step 133, the software agent will act appropriately at step 140 depending on the change. If the change is a command to stop mirroring, then the process ends at step 138. Otherwise, the software agent at step 134 makes the appropriate modification on the remote systems. Then the software agent at step 136 will assign the new volume to the remote host and/or format the volume. In addition, at step 136 the software agent will add the new volume to the operating system logical volume and update the database and/or application configuration. Once step 136 is complete the software agent proceeds to step 137 invoking procedures to mirror the new volumes. When step 137 is complete the software agent returns to the polling mode in step 126 repeating the process of automatically self-configuring a remote mirroring of a dynamic relational database or application.
By way of a summary,
The backup computer 62 includes a data interface 64 that couples the backup computer 62 to a backup data storage that may include a series of backup volumes for storing data to be backed up. The backup data storage need not be of the same model or type as the source data storage, as the present invention only requires mirroring of the configuration. Where the backup and source data storages are not the same model and/or type, the respective data interfaces 44, 64 may not be identical physically, though they function similarly in transferring configuration data to and from each other through the source and backup computers 42, 62.
The backup computer 62 also includes a second data processor 66, one or more second stored programs 68 that are stored on one or more computer readable storage mediums, and a second memory 70 that may include volatile and/or non-volatile memory. The backup computer 62 further includes a backup communications interface 72 for receiving and transmitting data such as configuration information relating to the source data storage and the backup data storage, according to the present invention. Interconnects between the second processor 66, the stored second programs 68, the second memory 70, and the second data interface 64 depicted in
The source computer 42 and the backup computer 62 are coupled to one another via one or more communications links 80, which may be through the internet, an intra-net, a local area network, a piconetwork, an infrared or microwave link, a remote mirror link 40 as previously described, or any other viable communications means, whether wired, wireless, or a combination.
Operation of the source computer 42 and the backup computer 62 is as described above, wherein first and second agents may be resident in the source and remote stored programs areas 48, 68.
While there has been illustrated and described what is at present considered to be a preferred embodiment of the claimed invention, it will be appreciated that numerous changes and modifications are likely to occur to those skilled in the art. It is intended in the appended claims to cover all those changes and modifications that fall within the spirit and scope of the claimed invention.