Data replication is the process of storing the same data in multiple sites (e.g., at production/source site and at a disaster recovery (DR)/target site) to improve system resilience and reliability. Replication involves copying data from the production/source site to the DR/target site. Data can be copied on demand or be transferred in bulk or in batches according to a schedule, or be replicated in real time as the data is written, changed, or deleted at the production/source site. If the production/source site goes down due to faulty hardware, malware attack or other problem, or if the production/source site is scheduled for maintenance, the DR/target site can takeover operations normally performed by the production/source site without much interruption to clients.
A method and system for data consistency across failure and recovery of infrastructure. In one embodiment of the method, copies of first data blocks stored in a source storage are sent to a target site via a data link. While sending one or more of the copies of the first data blocks to the target site, source hashes for second data blocks stored in the source storage are calculated, wherein the first data blocks are distinct from the second data blocks. While sending one or more of the copies of the first data blocks to the target site, target hashes of data blocks stored in a target storage of the target site are received. While sending one or more of the copies of the first data blocks to the target site, the source hashes are compared with the target hashes, respectively. After sending the first data blocks to the target site via the data link, copies of only those second data blocks are sent to the target site with source hashes that do not compare equally with respective target hashes.
Embodiments of methods and systems such as those disclosed herein may be better understood, and its numerous objects, features, and advantages made apparent to those skilled in the art by referencing the accompanying drawings.
While embodiments such as those presented in the application are susceptible to various modifications and alternative forms, specific embodiments are provided as examples in the drawings and description of example embodiments. It should be understood that the drawings and description of example embodiments are not intended to limit the embodiments to the particular form disclosed. Instead, the intention is to cover all modifications, equivalents and alternatives falling within the spirit and scope of methods and systems such as those described herein, as defined by the appended claims.
Source site 102 includes one or more source hosts (e.g., servers) that receive and process transaction requests from client computer systems. For purpose of explanation only, the present disclosure will be described with reference to a single source host 108, it being understood the present disclosure should not be limited. Source host 108 is in data communication with a source storage 110, which stores source data in contiguous logical data blocks identified by number. Source host 108 is also in communication with data mover (e.g., replication appliance) 112, and map memory 114, which includes source active and synchronization maps 116 and 118, respectively, which will be more fully described below. Host 108 includes at least one real application or virtual machine (App/VM) 120 executing on one or more processors (not shown). App/VM 120 generates I/O transactions to update data blocks of source data in source storage 110 in response to receiving transaction requests from client computer systems.
Target site 104 includes target host 140 (e.g., a server) in data communication with a target storage 142 that can store a replica of the source data 202. Target host 140 is in data communication with data mover (e.g., replication appliance) 144, and map memory 146, which includes active and synchronization maps 148 and 150, respectively, which will be more fully described below. Host 140 includes App/VM 152 that is similar to App/VM 120. When activated during a takeover, App/VM 152 generates I/O transactions to update data in blocks of the source replica in storage 142 in response to receiving and processing transaction request from client computer systems. Target host 140 further includes filter and block drivers 154 and 156, in addition to synchronization and DRL modules 160 and 162, respectively. Each of the components 154-162 may take form in software executing on one or more processors of target host 140. Moreover, each of the components 154-162 is in data communication with each other and data mover 144.
As noted above target site 104 can takeover for source site 102 for any one of many different reasons. Before target site 104 can takeover for source site 102 for the first time, a copy of source data in source storage 110 must be created in target storage 142.
As the source data 202 is being migrated block-by-block to target storage 142 during steps 306-312, App/VM 120 may concurrently issue I/O transactions for updating blocks of data in source data 202. Filter driver 122 taps the I/O transactions received from App/VM 120 as these I/O transactions are forwarded to block driver 124. The I/O transaction should identify the logical block number N (or numbers N-M) or include information that can be used to identify the logical block number N (or numbers N-M) within source data 202 that will be updated by the I/O transaction. For ease of illustration, the present disclosure will be described with reference to I/O transactions that update one block of data at a time. Filter driver 122 can set the appropriate bit (i.e., A(N)) in active map 118 that corresponds to the logical data block N identified in each I/O transaction that is tapped during steps 306-312.
The updates to source data 202 during steps 306-312, should be replicated to target storage 142 after migration process steps 306-312 are complete. To that end synchronization module 126 copies the contents of active map 118 to synchronization map 116 as shown in step 314. Synchronization module 126 resets variable ST=1 in step 316. In step 320, synchronization module 126 accesses synchronization map 116, to read synchronization bit S(ST). If this bit is set to logical 1 then in step 322, data D(ST) is copied from source data 202 to target storage 142 via data movers 112 and 114. Then, the synchronization module 126 clears the synchronization bit S(ST) to logical 0 in the synchronization map 116. Synchronization module 126 then compares variable ST to Z, the total number of data blocks in source data 202 as shown in step 324. If ST is less than Z, synchronization module 126 increments ST and synchronization bit S(ST) is read from the synchronization map 116 for analysis in step 320. If, however, ST is greater than Z in step 324, the process ends, and contents of target storage 142 will be consistent with source data 202.
App/VM 120 continues to issue I/O transactions to update source data 202 after the migration process of
Target site 104 may takeover for source site 102 for any one of many different reasons. At takeover, App/VM 152 should be in a state where it can immediately receive, process and properly respond to transaction requests from client computer systems (not shown). During the process of responding to transaction requests App/VM 152 issues I/O transactions for updating the replicated source data 204 within target storage 142. Filter driver 154 taps each of these I/O transactions as they are sent to block driver 156. Filter driver 154 can set bits in the active map 148 corresponding to logical blocks in storage 142 that are updated by I/O transactions from App/VM 152. Periodically, DRL module 162 may clear the bits of active map 148 after the contents of active map 148 are copied to synchronization map 150. In this manner, active map 148 and/or synchronization map 150 track the logical data blocks within the source replica 204 that have been updated since the takeover. Updates to logical data blocks in storage 142 are not replicated back to source storage 110. As a result, the contents of memories 110 and 142 are out of synchronization. Eventually, operations will fallback to source site 102. Before source site 102 can resume responding to transaction requests it receives from client computers systems, however, data source 202 must be synchronized or resynchronized to the contents of the updated replica in target storage 142. The migration process described with reference to
With continuing reference to
The process of
When TB is determined to be greater than TT in step 622, the process proceeds to step 624 where synchronization module 160 accesses its merged map 220 to read the value in M(TT). If this value is set to logical 1, the process proceeds to step 626 where synchronization module 160 sends a copy of the data D(TT) and its identity TT to source site 102. Block TT in source data 202 is replaced with data D(TT) that is sent by target site 104 in step 626. In step 630 synchronization module 160 increments TT. And then in step 632 synchronization module 160 compares TT to Z. If these values are equal to, the process ends and as a result source data 202 should be synchronized or resynchronized to the updated replica at the target site. Otherwise steps 624 through 632 are repeated.
It is noted that the processes of
An Example Computing and Network Environment
The methods described herein can be implemented using a variety of computing systems (e.g., servers) and networks. Examples of such computing systems and network environments are described below with reference to
Bus 712 allows data communication between central processor 714 and system memory 717, which may include read-only memory (ROM) or flash memory (neither shown), and random access memory (RAM) (not shown), as previously noted. RAM is generally the main memory into which the operating system and application programs, drivers, and modules like those described above, are loaded. The ROM or flash memory can contain, among other code, the Basic Input-Output System (BIOS) which controls basic hardware operation such as the interaction with peripheral components. Applications, drivers, or modules like those described above, resident with computer system 710 are generally stored on and accessed from a computer-readable storage medium, such as a hard disk drive (e.g., fixed disk 744), an optical drive (e.g., optical drive 740), or other computer-readable storage medium.
Storage interface 734, as with the other storage interfaces of computer system 710, can connect to a standard computer-readable medium for storage and/or retrieval of information, such as a fixed disk drive 744. Fixed disk drive 744 may be a part of computer system 710 or may be separate and accessed through other interface systems. Modem 747 may provide a direct connection to a remote server via the Internet and an internet service provider (ISP). Network interface 748 may provide a direct connection to a remote server via a direct network link to the Internet via a POP (point of presence). Network interface 748 may provide such connection using wireless techniques, including digital cellular telephone connection, Cellular Digital Packet Data (CDPD) connection, digital satellite data connection or the like.
Many other devices or subsystems (not shown) may be connected in a similar manner (e.g., document scanners, digital cameras and so on). Conversely, all of the devices shown in
Moreover, regarding the signals described herein, those skilled in the art will recognize that a signal can be directly transmitted from a first block to a second block, or a signal can be modified (e.g., amplified, attenuated, delayed, latched, buffered, inverted, filtered, or otherwise modified) between the blocks. Although the signals of the above described embodiment are characterized as transmitted from one block to the next, other embodiments may include modified signals in place of such directly transmitted signals as long as the informational and/or functional aspect of the signal is transmitted between blocks. To some extent, a signal input at a second block can be conceptualized as a second signal derived from a first signal output from a first block due to physical limitations of the circuitry involved (e.g., there will inevitably be some attenuation and delay). Therefore, as used herein, a second signal derived from a first signal includes the first signal or any modifications to the first signal, whether due to circuit limitations or due to passage through other circuit elements which do not change the informational and/or final functional aspect of the first signal.
With reference to computer system 710, modem 747, network interface 748 or some other method can be used to provide connectivity from each of client computer systems 810, 820 and 830 to network 850. Client systems 810, 820 and 830 are able to access information on server 840A or 840B using, for example, a web browser or other client software (not shown). Such a client allows client systems 810, 820 and 830 to access data hosted by server 840A or 840B or one of storage devices 860A(1)-(N), 860B(1)-(N), 880(1)-(N) or intelligent storage array 890.
The systems and processes described herein are well adapted to attain the advantages mentioned as well as others inherent therein. While such systems and processes have been depicted, described, and are defined by reference to particular descriptions, such references do not imply a limitation on the claims, and no such limitation is to be inferred. The systems and processes described herein are capable of considerable modification, alteration, and equivalents in form and function, as will occur to those ordinarily skilled in the pertinent arts in considering the present disclosure. The depicted and described embodiments are examples only, and are in no way exhaustive of the scope of the claims.
The foregoing describes embodiments including components contained within other components. Such architectures are merely examples, and, in fact, many other architectures can be implemented which achieve the same functionality. In an abstract but still definite sense, any arrangement of components to achieve the same functionality is effectively “associated” such that the desired functionality is achieved. Hence, any two components herein combined to achieve a particular functionality can be seen as “associated with” each other such that the desired functionality is achieved, irrespective of architectures or intermediate components. Likewise, any two components so associated can also be viewed as being “operably connected,” or “operably coupled,” to each other to achieve the desired functionality.
The foregoing detailed description has set forth various embodiments of the systems and methods described herein via the use of block diagrams, flowcharts, and examples. It will be understood by those within the art that each block diagram component, flowchart step, operation and/or component illustrated by the use of examples can be implemented (individually and/or collectively) by a wide range of hardware, software, firmware, or any combination thereof.
The systems described herein have been described in the context of fully functional computer systems; however, those skilled in the art will appreciate that the systems described herein are capable of being distributed as a program product in a variety of forms, and that the systems described herein apply equally regardless of the particular type of computer-readable media used to actually carry out the distribution. Examples of computer-readable media include computer-readable storage media, as well as media storage and distribution systems developed in the future.
The above-discussed embodiments can be implemented by software modules that perform one or more tasks associated with the embodiments. The software modules discussed herein may include script, batch, or other executable files. The software modules may be stored on a machine-readable or computer-readable storage media such as magnetic floppy disks, hard disks, semiconductor memory (e.g., RAM, ROM, and flash-type media), optical discs (e.g., CD-ROMs, CD-Rs, and DVDs), or other types of memory modules, or downloaded to one of these devices. A storage device used for storing firmware or hardware modules in accordance with an embodiment can also include a semiconductor-based memory, which may be permanently, removably or remotely coupled to a microprocessor/memory system. Thus, the modules can be stored within a computer system memory to configure the computer system to perform the functions of the module. Other new and various types of computer-readable storage media may be used to store the modules discussed herein.
The above description is intended to be illustrative and should not be taken to be limiting. As will be appreciated in light of the present disclosure, other embodiments are possible. Those skilled in the art will readily implement the steps necessary to provide the structures and the methods disclosed herein, and will understand that the process parameters and sequence of steps are given by way of example only and can be varied to achieve the desired structure as well as modifications that are within the scope of the claims. Variations and modifications of the embodiments disclosed herein can be made based on the description set forth herein, without departing from the scope of the claims, giving full cognizance to equivalents thereto in all respects.
Although the systems described herein have been described in connection with several embodiments, these embodiments and their descriptions are not intended to be limited to the specific forms set forth herein. On the contrary, it is intended that such embodiments address such alternatives, modifications, and equivalents as can be reasonably included within the scope of the appended claims.
This application claims the domestic benefit under Title 35 of the United States Code § 119(e) of U.S. Provisional Patent Application Ser. No. 62/859,092, entitled “Method and System for Data Consistency Across Failure and Recovery of Infrastructure,” filed Jun. 8, 2019, which is hereby incorporated by reference in its entirety and for all purposes as if completely and fully set forth herein.
Number | Name | Date | Kind |
---|---|---|---|
5649152 | Ohran et al. | Jul 1997 | A |
5835953 | Ohran | Nov 1998 | A |
5875479 | Blount et al. | Feb 1999 | A |
5915098 | Palmer et al. | Jun 1999 | A |
6073222 | Ohran | Jun 2000 | A |
6085298 | Ohran | Jul 2000 | A |
RE37601 | Eastridge et al. | Mar 2002 | E |
6728898 | Tremblay et al. | Apr 2004 | B2 |
6732245 | Kaiya et al. | May 2004 | B2 |
6779093 | Gupta | Aug 2004 | B1 |
7103796 | Kekre et al. | Sep 2006 | B1 |
7386752 | Rakic et al. | Jun 2008 | B1 |
7987368 | Zhu | Jul 2011 | B2 |
8577850 | Genda | Nov 2013 | B1 |
8874508 | Mittal | Oct 2014 | B1 |
9087088 | Bose | Jul 2015 | B1 |
9442806 | Bardale | Sep 2016 | B1 |
9509697 | Salehpour | Nov 2016 | B1 |
9575789 | Rangari | Feb 2017 | B1 |
9740422 | Ozdemir | Aug 2017 | B1 |
10394485 | Chen | Aug 2019 | B1 |
10496494 | Haloi | Dec 2019 | B1 |
10592149 | Jenkins | Mar 2020 | B1 |
20030014433 | Teloh et al. | Jan 2003 | A1 |
20030061366 | Musante et al. | Mar 2003 | A1 |
20030126388 | Yamagami | Jul 2003 | A1 |
20030158869 | Micka | Aug 2003 | A1 |
20040049365 | Keller et al. | Mar 2004 | A1 |
20070185937 | Prahlad et al. | Aug 2007 | A1 |
20080279204 | Pratt, Jr. | Nov 2008 | A1 |
20090187923 | McKinney | Jul 2009 | A1 |
20100070447 | Pfunter et al. | Mar 2010 | A1 |
20100169720 | Lumpp et al. | Jul 2010 | A1 |
20110126099 | Anderson et al. | May 2011 | A1 |
20120259825 | Tashiro | Oct 2012 | A1 |
20130054523 | Anglin | Feb 2013 | A1 |
20150169613 | Bshara | Jun 2015 | A1 |
20150278395 | Ben Jemaa | Oct 2015 | A1 |
20150280999 | Chart et al. | Oct 2015 | A1 |
20160306560 | Maranna et al. | Oct 2016 | A1 |
20160371007 | Shani | Dec 2016 | A1 |
20170286690 | Chari | Oct 2017 | A1 |
20170289187 | Noel | Oct 2017 | A1 |
20170300244 | Crawford et al. | Oct 2017 | A1 |
20180109425 | Chart et al. | Apr 2018 | A1 |
20180285201 | Bangalore | Oct 2018 | A1 |
20180295029 | Shivaana | Oct 2018 | A1 |
20190138995 | Currin | May 2019 | A1 |
20200097198 | Bansal | Mar 2020 | A1 |
Number | Date | Country |
---|---|---|
109154905 | Jan 2019 | CN |
2020047107 | Mar 2020 | JP |
WO 2017014814 | Jan 2017 | WO |
Entry |
---|
PCT International Written Opinion and Search Report of the International Searching Authority for Application No. PCT/US2021/015379 dated May 12, 2021, 12 pages. |
PCT International Written Opinion and Search Report of the International Searching Authority for Application No. PCT/US2021/015384 dated Apr. 13, 2021, 12 pages. |
Bandopadhyay, Tushar et al.; “System and Method to Propagate Information Across a Connected Set of Entities Irrespective of the Specific Entity Type”; U.S. Appl. No. 15/857,577, filed Dec. 28, 2017; consisting of Specification, Claims, Abstract, and Drawings; 46 pages. |
Bandopadhyay, Tushar et al.; “Method and System for Dependency Analysis of Workloads for Orchestration”; U.S. Appl. No. 15/844,359, filed Dec. 15, 2017; consisting of Specification, Claims, Abstract, and Drawings; 77 pages. |
Patil, Rushikesh et al.; “Methods and Systems for Data Resynchronization in a Replication Environment”; U.S. Appl. No. 16/805,294, filed Feb. 28, 2020; consisting of Specification, Claims, Abstract, and Drawings; 67 pages. |
Patil, Rushikesh Patil et al.; “Methods and Systems for Data Resynchronization in a Replication Environment”; U.S. Appl. No. 16/805,292, filed Feb. 28, 2020; consisting of Specification, Claims, Abstract, and Drawings; 49 pages. |
Zhang, Shuangmin et al.; “Optimize Backup from Universal Share”; U.S. Appl. No. 16/835,657, filed Mar. 31, 2020; consisting of Specification, Claims, Abstract, and Drawings; 28 pages. |
Alferes, Jose Julio et al.; “Chapter 3—Evolution and Reactivity in the Semantic Web”; F. Bry and J. Maluszynski (Eds.): Semantic Techniques for the Web; Lecture Notes in Computer Science—5500; Sep. 29, 2009; pp. 161-200. |
Deng, Li et al., “vMerger: Server Consolidation in Virtualized Environment,” 2013 IEEE 11th Int'l Conference on Dependable, Autonomic and Secure Computing, pp. 606-612. |
PCT International Written Opinion and Search Report of the International Searching Authority for Application No. PCT/US2021/024850 dated Aug. 2, 2021, 14 pages. |
EP Application No. 18 707 994.2-1224, EP Examination Report dated Sep. 6, 2021, 9 pages. |
Number | Date | Country | |
---|---|---|---|
62859092 | Jun 2019 | US |