Transactional memory (TM) typically attempts to simplify concurrent programming by allowing a set of load and store instructions to execute in an atomic manner. Software transactional memory (STM) is a concurrency control process for controlling access to shared memory in concurrent computing. For STM, a transaction occurs when a piece of code executes a series of reads and writes to shared memory. These reads and writes logically occur at a single instant in time, with intermediate states being invisible to other transactions.
Features of the present disclosure are illustrated by way of example and not limited in the following figure(s), in which like numerals indicate like elements, in which:
For simplicity and illustrative purposes, the present disclosure is described by referring mainly to examples. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present disclosure. It will be readily apparent however, that the present disclosure may be practiced without limitation to these specific details. In other instances, some methods and structures have not been described in detail so as not to unnecessarily obscure the present disclosure.
Throughout the present disclosure, the terms “a” and “an” are intended to denote at least one of a particular element. As used herein, the term “includes” means includes but not limited to, the term “including” means including but not limited to. The term “based on” means based at least in part on.
TM allows parallelism, and may be used to detect and resolve conflicts without resorting to the complexities of fine grained locking or non-blocking synchronization. Further, STM allows programs to take advantage of disjoint access parallelism on both the read-side and write-side. However, STM based systems can have a higher overhead compared to systems based on other synchronization techniques. To reduce some of the challenges associated with STM, data may be accessed non-transactionally. For example, privatization may be used to access data non-transactionally. However, the non-transactional accesses may negate the isolation guarantees of a transactional memory system.
According to examples, a copy-on-write (COW) update-triggered consistency apparatus and a method for COW update-triggered consistency are disclosed herein. Generally, the COW process may be based on the principle that when multiple separate tasks use identical copies of the same information (i.e., data stored in a computer memory bank), separate copies of that information are not needed for each process. Instead of the separate copies, each process may be provided with a pointer to the same resource. The apparatus and method disclosed herein may provide for COW to be applied to a shared memory for version control. Each application that is to access data stored in a shared central repository may be provided with an isolated local copy (i.e., a local version) of a shared memory segment (i.e., shared data from the shared central repository). A process associated an application may be free to read/write to the isolated local copy of its shared memory segment. Changes may be maintained locally until the process associated with the application uses a commit process to commit the changed results to the shared central repository. An update process may be invoked in order for the process associated with the application to update its view of its shared memory segment. Similarly, the update process may be invoked in order for other processes associated with other applications to update their respective views of their shared memory segments.
According to examples, the apparatus and method disclosed herein may be implemented as part of a virtual memory manager of an operating system. With respect to the commit process disclosed herein, a call to commit from any of the processes sharing a memory segment may make all the isolated local copies visible to the apparatus and method disclosed herein, which, in turn, may make all the isolated local copies available to other processes. A most recent version in the memory bank may be located by maintaining a page table (e.g., a hash table). For the page table, the key may be designated as the creation date of a page, and the value may be designated as the actual content of the memory segment. A lookup table (i.e., a translation lookaside buffer (TLB)) may be flushed after a commit operation is successfully completed. Updates in local copies may be update/commit driven.
As described herein, the modules and other elements of the apparatus 102 may be machine readable instructions stored on a non-transitory computer readable medium. In addition, or alternatively, the modules and other elements of the apparatus 102 may be hardware or a combination of machine readable instructions and hardware.
Referring to
With respect to the local version 200 for the particular example of
The local version 200 may be based on a duplicate mapping of the same physical memory frames (e.g., from the shared central repository 110) referenced by a page table 206. The page table 206 may generally indicate a location of the appropriate page and frame in the shared central repository 110. For the page table 206, a key may be designated as the creation date, and a value (e.g., P2, P5, etc., for the example of
According to a further example, assume V1 in the shared central repository 110 includes (P6,F6) that is a read-only frame. If an application attempts to write to (P6,F6), a new frame F7 may be allocated with the content P6 of the original frame F6. A new pair (P7,F7) may added to the local set of changes, where P7 may represent the new content originally associated with the content P6.
When a process involving one of the applications 106 attempts to write, the COW update-triggered consistency apparatus 102 may execute a COW page fault, as disclosed herein with respect to a write page fault process 400 described in further detail with reference to
Referring to
With respect to the commit process 300 that is implemented by the commit module 114, generally, committing local changes to the shared central repository 110 may prompt the addition of the new version 204 to the shared central repository 110. If needed, a new pair (page, frame#) may be added to the new version 204. For the example of
For the commit process 300, at block 302, the commit module 114 may further determine whether there is a previous reference to the offset pair (e.g., (P2, F2) and (P5,F5) from the local version 200 for the App(1) for the example of
At block 304, if the determination at block 302 is yes, the reference in the previous version may be removed. For the example of
At block 306, if the determination at block 302 is no or following block 304, the page and frame may be added to the central version. For the example of
At block 308, the entry for a page and frame (e.g., (P2,F2) and (P5,F5) for App(1) for the example of
With respect to the commit process 300, the local version 200 may represent a duplicate mapping of the same physical memory frames referenced by the page table 206. Any frames shared between processes may be marked as read-only in their respective page entries.
The shared central repository 110 may be subject to extensive contention between processes. The COW update-triggered consistency apparatus 102 may use a single read-write lock to protect the shared central repository 110 from race conditions. Race conditions may generally represent behavior of a system where the output is dependent on a sequence or timing of other uncontrollable events.
With respect to scalability for the shared central repository 110, it may not be thread safe to allow an update to traverse the page table 206 while a commit may be removing old entries for the very same pages being committed (e.g., P2 for the App(1) for the example of
With respect to concurrency, for the COW update-triggered consistency apparatus 102, concurrency may be implemented by providing committers (e.g., different ones of the applications 106 that initiate a commit process) access to different pages in a concurrent manner. Pages with a single committer (e.g., a single application 106 that initiates a commit process) may be committed concurrently. Since a thread acquires a new version on calling commit, pages with more than one committer may let the committing proceed in version order. A version map (not shown) may store the arrival order so that committers themselves can write when it is safe to do so (e.g., they are first on the version map).
Referring to
With respect to the write page fault process 400 that is implemented by the write page fault module 116, at block 402, a new page and frame may be allocated. For the example of
At block 404, the new page and frame may be initialized with contents from the read-only frame. For the example of
At block 406, the new and page frame may be written to. For the example of
At block 408, the new and page frame may be added to the local version. For the example of
At block 410, the COW update-triggered consistency apparatus 102 may retain reference to the original page and frame in the page table 206. For the example of
Referring to
With respect to the update process 500 that is implemented by the update module 118, when a process needs an updated view of the shared memory bank 108, and thus the shared central repository 110, the caller's view (e.g., the local version 200 for the example of
For the update process 500, at block 502, the update module 118 may determine if there are any local changes or central changes. For the example of
If the determination at block 502 is no, at block 504, the update module 118 may obtain a set of changes from the central version. For the example of
For each entry in the set of changes, at block 506, the update module 118 may modify the entries in the page table. For the example of
For each entry in the set of changes, at block 508, the update module 118 may flush the TLB (not shown) for the address at the given page number (page_i).
With respect to the update process 500, according to an example, the local version 200 of the process may not change (e.g., the central array V2 may be changed by receiving commits from other processes). Alternatively, if the local copy per the local version 200 and the central version (e.g., V1 for the example of
The merging of the conflicting local and central versions may then proceed such that if the local version differs from the local copy of the original, the local version is used. For example, if the determination at block 502 is yes, at block 510, the update module 118 may determine if the current local version differs from the local copy of the original. For the example of
If the determination at block 510 is yes, at block 512, the update module 118 may use the local version. For the example of
If the determination at block 510 is no, at block 504, the version from the set of changes (e.g., from the shared central repository 110) may be used. In other words, at block 504, the update module 118 may obtain a set of changes from the central version (e.g., difference between V1 and V2 for the example of
The update module 118 may allow users to indicate a modification unit (i.e., granularity level) for any given segment, and merge any disjoint set of changes of the given unit size. If the sets are not disjoint, the local version (e.g., local version 200 for the example of
Referring to
At block 604, the method may include assigning first and second local versions respectively to the shared set of data associated with the first and second applications. For example, referring to
At block 606, the method may include determining initiation of a commit process by the first or second applications respectively related to the shared set of data associated with the first or second applications. For example, referring to
At block 608, in response to initiation of the commit process, the method may include generating a second shared version related to the shared set of data based on the respective processes involving the first and the second applications. For example, referring to
According to an example, for the method 600, the commit process may further includes determining (e.g., at block 302 of
According to an example, the method 600 may further include determining the write associated with the process involving the first application or the process involving the second application, allocating (e.g., at block 402 of
According to an example, the method 600 may further include determining an update associated with the process involving the first application or the process involving the second application, and determining (e.g., at block 502 of
According to an example, the method 600 may further include determining an update associated with the process involving the first application or the process involving the second application, and determining (e.g., at block 502 of
Referring to
At block 704, the method may include assigning first and second local versions respectively to the shared set of data associated with the first and second applications.
At block 706, the method may include determining initiation of a commit process by the first or second applications respectively based on a write to the first or second local versions respectively related to the shared set of data associated with the first or second applications.
At block 708, in response to initiation of the commit process, the method may include generating a second shared version related to the shared set of data based on the respective processes involving the first and the second applications.
At block 710, the method may include assigning first and second segments of the memory bank respectively to the first and second local versions respectively associated with the first and second applications. For example, referring to
At block 712, the method may include generating a page table to indicate a location of the page and the frame in the shared central repository of the memory bank. For example, referring to
The computer system 800 may include a processor 802 that may implement or execute machine readable instructions performing some or all of the methods, functions and other processes described herein. Commands and data from the processor 802 may be communicated over a communication bus 804. The computer system may also include a main memory 806, such as a random access memory (RAM), where the machine readable instructions and data for the processor 802 may reside during runtime, and a secondary data storage 808, which may be non-volatile and stores machine readable instructions and data. The memory and data storage are examples of computer readable mediums. The memory 806 may include a COW update-triggered consistency module 820 including machine readable instructions residing in the memory 806 during runtime and executed by the processor 802. The COW update-triggered consistency module 820 may include the modules of the apparatus 102 shown in
The computer system 800 may include an I/O device 810, such as a keyboard, a mouse, a display, etc. The computer system may include a network interface 812 for connecting to a network. Other known electronic components may be added or substituted in the computer system.
What has been described and illustrated herein is an example along with some of its variations. The terms, descriptions and figures used herein are set forth by way of illustration only and are not meant as limitations. Many variations are possible within the spirit and scope of the subject matter, which is intended to be defined by the following claims—and their equivalents—in which all terms are meant in their broadest reasonable sense unless otherwise indicated.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2013/067759 | 10/31/2013 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2015/065429 | 5/7/2015 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6075938 | Bugnion et al. | Jun 2000 | A |
7620766 | Waldspurger | Nov 2009 | B1 |
8041855 | Ou | Oct 2011 | B1 |
9678997 | Burckart | Jun 2017 | B2 |
20030200398 | Harris | Oct 2003 | A1 |
20050262313 | Holt | Nov 2005 | A1 |
20070106993 | Largman et al. | May 2007 | A1 |
20070288526 | Mankad | Dec 2007 | A1 |
20070294319 | Mankad | Dec 2007 | A1 |
20080034364 | Lam et al. | Feb 2008 | A1 |
20100122041 | Nakaike | May 2010 | A1 |
20110307668 | Ferr et al. | Dec 2011 | A1 |
20120011509 | Husain | Jan 2012 | A1 |
Entry |
---|
Extended European Search Report, EP Application No. 13896540.5, dated Mar. 27, 2017, pp. 1-7, EPO. |
Calif Cascaval et al., “Software Transactional Memory: Why Is It Only A Research Toy?,” Sep. 2008, pp. title page and 47-58, ACM QUEUE. |
Didier Donsez et al., “WEA, A Distributed Object Manager based on a Workspace Hierarchy,” International Conference on Applications in Parallel and Distributed Computing, Apr. 1994, pp. 1-10. |
Eric Koskinen et al., “Coarse-Grained Transactions,” Jan. 2010, pp. 1-12, ACM. |
International Search Report and Written Opinion, International Application No. PCT/US2013/067759, dated May 30, 2014, pp. 1-8, KIPO. |
Luke Dalessandro et al., “NOrec: Streamlining STM by Abolishing Ownership Records,” Jan. 2010, pp. 67-77, ACM. |
Philip W. Howard and Jonathan Walpole, “A Relativistic Enhancement to Software Transactional Memory,” 2011, pp. 1-6, USENIX HotPar 2011. |
Timothy Merrifield and Jakob Eriksson, “Conversion: Multi-Version Concurrency Control for Main Memory Segments,” University of Illinois at Chicago, Mar. 8, 2013, pp. 1-13, ACM, Available at: <cs.uic.edu/˜jakob/papers/merrifield-eurosys13.pdf>. |
Unknown, “Memory Sharing Among Containers and Copy-On-Write Protection,” 2007, pp, 1-2, Available at: <download.parallels.com/vz/v4/docs/en/win/VzWindowsUG/19930.htm>. |
Vincenzo Innocente, “Sharing Memory: A Kernel Approach,” AA Meeting, Mar. 4, 2009, pp. 1-11, Available at: <indico.cern.ch/getFile.py/access?contribId=38tresId=1&materialId=slides&confId=52890>. |
Number | Date | Country | |
---|---|---|---|
20160232197 A1 | Aug 2016 | US |