The technical field is live migration.
Live migration is the process whereby a running application is moved between different physical machines without disconnecting a client or the application. Applications to which live migration may be applied include virtual machines, containers or other processes or tasks run by an operating system. A virtual machine is an emulation of a computer system, such as an operating system or application environment. A container or lightweight virtualization is an operating system level virtualization environment for running multiple isolated operating systems on a single host. Each container is isolated from each other and has access to a subset of the resources of the operating system.
In live migration, the memory, storage, and network connectivity of the application are transferred from the original host machine to the destination host machine. Live migration allows an administrator to shut down a physical server for maintenance or upgrades without subjecting the system's users to downtime. It also facilitates proactive maintenance since, if an imminent failure is suspected, the potential problem can be resolved before disruption of service occurs. Live migration can also be used for load balancing, in which work is shared among computers in order to optimize the utilization of available CPU resources.
In order to be effective, live migration must cause the minimum of disruption to continued execution. To achieve this aim, two main techniques are used, namely, pre-copying and post-copying.
In pre-copy memory migration, the data is copied from source memory to destination memory while the application is still running on the source. If some memory pages change (become ‘dirty’) during this process, they will be re-copied. This first stage is referred to as the “warm-up” phase. After the warm up phase, the “stop and copy” phase is performed. This involves the stopping of the application on the source host and the remaining “dirty” pages being copied to the destination host. There is a downtime between the stopping of the application on the source host and its resumption on the destination host.
In post-copying, the application is suspended at the source host and a minimal subset of the execution state of the application is transferred to the destination host. The application is then resumed at the destination host. Concurrently the source host pushes the data in the remaining memory pages to the destination memory, in a process known as pre-paging. If at the destination host, the application tries to access a memory page that has not yet been transferred, it develops a so-called “network fault”. If this occurs, then the destination host accesses the memory in the source host. Too many network faults can degrade the performance of applications running in the application.
A number of attempts have been made to improve the performances of both pre-copy and post-copy live migration, and combinations of the two techniques. An early proposal was made by Clark et al (“Live Migration of Virtual Machines”, 2ndSymposium Networked Systems Design and Implementation 2005). This paper proposes a pre-copy process, but identifies a “writable working set”, which is a set of memory pages which are continuously updated and which are therefore not worth copying in the pre-copy phase. These memory pages are only copied in the downtime period.
In US2014/0089393, a tracking tool is used to track modifications made in the data by the application at the source host. When the source host is shutdown, these modifications are passed to the destination host. In U.S. Pat. No. 8,689,211 a recovery point in case of migration failure is defined.
These methods either impose a downtime (e.g., pre-copy) or are subject of possible corruption in case of any failures during the process (e.g., post-copy, and combination of pre-copy and post-copy). U.S. Pat. No. 8,671,238 discloses a method of overcoming these problems by using a shared data-store that stores a file with the instance memory, which is accessible to both source and destination hosts in case the direct communication between hosts fails. It does not however suite all cases, namely in the case of highly time critical processing instance and an approach that does not require a third component (i.e. a shared data-store) is more suitable and efficient.
Accordingly, in a first aspect of the present invention there is provided a method of migrating an application from a source host to a destination host, wherein the application is associated with a plurality of memory pages, the source host comprises a first instance of the application and a source memory region, and each memory page has an associated source memory block in the source memory region. The method comprises, at the destination host, reserving a destination memory region such that each memory page has an associated destination memory block in the destination memory region and launching a second instance of the application at the destination host. On receipt of an input to the application, the method further comprises processing the input in parallel at the first and second instances. At the source host, if the processing requires a read or a write call to a memory page, the method comprises respectively reading from or writing to the associated source memory block. At the destination host, if the processing requires a write call, the write call is made to the associated destination memory block. If the processing requires a read call, then the read call is sent to the source host requesting that the source host read the memory page from the associated source memory block and provide the memory page to the destination host. On receipt of the memory page by the destination host, memory page is written to the associated destination memory block.
The parallel execution of two instances of the application writing to their own memory allows the overcoming of the problems of other solutions to the implementation of live migration. The continued operation of the source host guarantees a faithful execution of the application in the event of a network failure. This overcomes the problems in post copying techniques, in which a degradation in performance can occur if a network fault occurs and the destination host does not have access to necessary memory in the source host. The parallel execution at the destination host means that there is no problem with the “dirtying” of memory as occurs in pre-copying systems.
Preferably, on receipt of an input whose processing requires a read call to a memory page, the method further comprises determining whether the read call is a first access of a memory page by the second instance. If the read call is not the first access, then the associated destination memory block is read. If the read call is the first access, then the associated source memory block is read.
Preferably, at the source host, on receipt of an input, the processing of which requires a write instruction to a source memory block, it is determined whether a read call has been made for the block by the first instance and whether a read call has been made for the block by the second instance. If a read call has been made by the first instance but not by the second instance, then execution of the write call is postponed. On receipt of a read call from the second instance, the read call from the second instance is executed followed by the execution of the write call.
Preferably, a subset of the plurality of memory pages are copied from the source memory region to the destination memory region prior to the launching of the second instance.
Preferably the subset of memory pages is selected based on the results of a statistical analysis of one or more of the network load, memory usage and data usage.
Optionally a statistical analysis of one or more of the network load, memory usage and data usage is used to determine a start time.
Optionally the opportunistic start time is determined by selecting a period of low activity.
Optionally the first instance is terminated upon completion of transfer of all memory pages to the destination memory region.
Preferably the migration is controlled by a migration layer located on one or both of the source and destination hosts.
According to a second aspect of the invention there is provided a computing platform comprising a central processing unit, a memory and a disk, programmed to, in response to an instruction to commence live migration from a source host, launch an instance of an application, the application being associated with a plurality of memory pages and to reserve a destination memory region such that every memory page associated with the application has an associated memory block in the destination memory region. In response to an input to the application, the computing platform is programmed to process the input and if the processing requires a read call to a memory page read from an associated source memory block at the source host and data acquired from the associated source memory block to the associated destination memory block. If the processing requires a write call to a memory page, the platform is programmed to write to the associated destination memory block.
Preferably the computing platform is further programmed to track read and write calls to the destination memory blocks. On receipt of an input, the processing of which requires a read call to a memory page, the platform is programmed to determine whether the read call is a first access of the memory page during the live migration and then either read the associated destination memory block, if the read call is not the first access or read the associated source memory block, if the read call is the first access.
According to a third aspect of the invention, there is provided a computing platform comprising a central processing unit, a memory and a disk, programmed to host a first instance of an application, host memory pages associated with the application in a source memory region, each memory page being stored in a source memory block, track read calls to source memory blocks, and receive and execute read calls to the source memory region from a second instance of the application. In response to an input to the application, the processing of which requires a write call to a source memory block, the platform is programmed to determine whether a read call has been made for the block by the first instance, determine whether a read call has been made for the block by the second instance and if the block has been read by the first instance but not by the second instance, postpone execution of the write call. On receipt of a read call from the second instance, the read call from the second instance is executed, followed by the write call.
According to a fourth aspect of the invention there is provided a computer program, comprising instructions which, when executed on at least one processor, cause the at least one processor to carry out a method according to the first aspect.
According to a fifth aspect of the invention, there is provided a computer program product comprising a computer program according to the fourth aspect.
According to a sixth aspect of the invention, there is provided a carrier containing the computer program product according to the fifth aspect, wherein the carrier optionally includes an electrical signal, an optical signal, a radio signal, a magnetic tape or disk, an optical disk or a memory stick.
The above and other aspects of the present invention will now be described by way of example only, with reference to the following figures:
Embodiments will now be described more fully hereinafter with reference to the accompanying drawings, in which certain embodiments are shown. However, other embodiments in many different forms are possible within the scope of the present disclosure. Rather, the following embodiments are provided by way of example so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
Each of the instances has a migration layer and a memory region associated with it. In
In addition to read and write calls to the destination memory, the second instance is able to perform read calls (16) from the source memory. As for calls to its own memory, these calls are made through the migration layer. In an embodiment, data may be transferred (17) from the source memory to the destination memory.
The parallel execution of two instances of the application writing to their own memory allows the overcoming of the problems of other solutions to the implementation of live migration. The continued operation of the source host guarantees a faithful execution of the application in the event of a network failure. This overcomes the problems in post copying techniques, in which a degradation in performance can occur if a network fault occurs and the destination host does not have access to necessary memory in the source host. The parallel execution at the destination host means that there is no problem with the “dirtying” of memory as occurs in pre-copying systems.
When an input to the application is received, it is sent to both the source and the destination hosts. At the source host, the processing of the input proceeds as in a normal operation of the application, with data being read from and written to the source memory as required. At the destination host, when the second instance is launched, a memory region is reserved for the application data, but none of that data is available on the destination host. The process of transferring memory occurs as operations are executed by second instance. An input is received and if stored data is required, it is read from the source memory. After processing the resultant data is written to the appropriate memory block in the destination memory. If the processing requires only a write call, then the data is written to the appropriate memory block in the destination memory. The exact write calls are being made in parallel with the same data at the source host. This means that with each operation, identical data is being written to the destination host memory. The data is copied after processing and hence there is no issue of the “dirtying” of memory. The external client of the application will continue to receive the output of the application from the first instance. This means that, if there is a network problem and the destination host is unable to read from memory, there will not be a degradation of performance as in post-copying techniques. The inability to access the source memory will merely mean a postponement of the data transfer for that input.
Each memory access is tracked by the migration layer. Once all the data has been successfully transferred to the destination host memory, the first instance may be shut down and the live migration is then complete.
Although it is possible for the destination host to read data from the source memory throughout the migration process, it is an inefficient use of network resources. In an embodiment this problem is overcome by reading locally from the destination memory when possible.
The memory transfer occurs as a result of successive inputs to the application, each of which results in the writing of further data to the destination host memory.
Not every input to the application will result in a transfer of data however.
In an embodiment, the destination host and the source host may both require access to the source host's memory. A problem may arise if an input requires a read call followed by a write call which changes the data in the memory block. Although the instances are operating in parallel, processing and network delays must be allowed for. For example, if the first instance processes an input and makes both its read and write calls to a memory block before the arrival at the source memory of the corresponding remote read call from the second instance, then the data will have been changed in the block. The result will be that incorrect data will be passed to the destination host. In an embodiment, this problem is solved by the source migration layer checking if a remote read has been made before a write call is executed at a memory block. A prerequisite to this method is the recording of read calls to memory blocks.
The above process of migration relies on inputs to the application to initiate data transfer. In an embodiment, the mechanisms described above are supplemented by using pre-copy techniques to accelerate the memory transfer process. In an embodiment, the process comprises a statistical analysis in terms of load, memory and data usage. The process can start in an opportunistic manner, i.e. selecting a time when the activity level is low.
The present disclosure has mainly been described above with reference to a few embodiments. However, as is readily appreciated by a person skilled in the art, other embodiments than the ones disclosed above are equally possible within the scope of the present disclosure, as defined by the appended claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2015/063417 | 6/16/2015 | WO | 00 |