Embodiments of the present invention relate to computer systems, and more particularly to such systems that use lock variables to control access to data.
Computer systems including multiprocessor (MP) and single processor systems may include a plurality of threads, each of which executes program instructions independently from other threads. Use of multiple processors and/or threads allows various tasks or functions (and even multiple applications) to be handled more efficiently and with greater speed. When using multiple threads or processors, two or more processors or threads can share the same data stored within the system. However, care must be taken to maintain memory ordering when sharing data.
For data consistency purposes, if multiple threads or processors desire to read, modify, or write data at a shared memory location, the multiple agents may not be allowed to perform operations on the data simultaneously. Further complicating the use of multiple processors is that data is often stored in a cache associated with a processor. Because such caches are typically localized to a specific processor, multiple caches in a multiprocessor computer system can contain different copies of a given data item. Any agent accessing this data should receive a valid or updated (i.e., latest) data value, and data being written from the cache back into memory must be the current data so that cache coherency is maintained.
Multithreaded (MT) software uses different mechanisms to interact and coordinate between different threads. Two common forms of synchronization are barriers and semaphores (locks). A barrier mechanism helps a program to synchronize different threads at predefined points in the program, where each thread waits for a memory variable to reach a predetermined barrier level. Synchronization is achieved once all threads have completed the updates. When the barrier is reached, all threads can then proceed.
A semaphore lock mechanism is used to guarantee mutual exclusion across multiple threads while accessing a shared memory variable or structure (i.e., a shared element). In order to provide a unique and consistent view of the shared element, it is guarded by a lock variable. Different types of locks exist. For example, a spin-lock mechanism is typically implemented such that a thread needing access to the shared element must acquire the guarding lock (i.e., locking) via an atomic semaphore operation. When a lock is acquired, the remaining threads can only acquire the lock after it is released (i.e., unlocking) by the original requester. Locking is performed by designating a particular value to represent a locked state, and a different value to represent an unlocked state.
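By way of illustration only, a spin-lock of the kind described above may be sketched in C11 as follows. The type name `SpinLock`, the function names, and the choice of 0 for the unlocked state and 1 for the locked state are assumptions made for this sketch and are not taken from any particular implementation.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical encoding: one value for "unlocked", a different one for "locked". */
enum { SPIN_UNLOCKED = 0, SPIN_LOCKED = 1 };

typedef struct {
    atomic_int state;   /* the lock variable guarding the shared element */
} SpinLock;

static inline void spin_init(SpinLock *l) {
    atomic_init(&l->state, SPIN_UNLOCKED);
}

/* One atomic attempt: succeeds only if the lock was in the unlocked state. */
static inline bool spin_try_lock(SpinLock *l) {
    int expected = SPIN_UNLOCKED;
    return atomic_compare_exchange_strong(&l->state, &expected, SPIN_LOCKED);
}

/* Locking: retry the atomic operation until it succeeds. */
static inline void spin_lock(SpinLock *l) {
    while (!spin_try_lock(l)) {
        /* busy-wait while another thread holds the lock */
    }
}

/* Unlocking: only the current holder is expected to call this. */
static inline void spin_unlock(SpinLock *l) {
    assert(atomic_load(&l->state) == SPIN_LOCKED);
    atomic_store(&l->state, SPIN_UNLOCKED);
}
```

A thread would call `spin_lock`, access the shared element, and then call `spin_unlock`; every other thread calling `spin_lock` in the meantime spins until the release.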
Reader-writer locks allow multiple concurrent readers or a single writer to acquire the lock at any time. Reader-writer locks are used in sophisticated concurrent systems, for example, in implementing a software transactional memory (STM). To design software applications that scale on multi-core processors, reader-writer locks may be used to allow more parallelism to be exploited.
Many modern languages include transactions as the basic synchronization primitive. A hardware transactional memory (HTM) is insufficient for these languages since these languages use nested transactions, partial aborts, non-transactional instructions and a number of other features. An STM implementation can provide these features. However, the usual implementation of an STM is optimistic, as each thread executes operations in an atomic block as if no other threads exist. When the atomic block finishes, data accessed by the block is checked for consistency with current data at a given memory location. If consistency is verified, the transaction is committed; otherwise the atomic block is aborted and must be restarted. Typical locks, however, are not optimized for use in an STM.
In various embodiments, a lock for a shared memory structure may be in the form of a data structure having two portions, namely a first portion and a second portion. The first portion may correspond to an identifier portion that is used to identify a write owner of the lock or to indicate the number of reader owners of the lock. The second portion may correspond to a control portion that may be accessed and written to by various entities (e.g., threads) to acquire access to the lock or to implement or change features or modes of operation of the lock.
In many implementations, the lock may be a reader-writer lock and may take the form of a data structure that can be sized differently in different embodiments. In one implementation, the lock may be a 32-bit structure that includes the first portion (i.e., an identifier portion) and the second portion (i.e., a control portion). In this implementation, the control portion may correspond to the low order 4 bits, while the identifier portion may correspond to the upper 28 bits, although the scope of the present invention is not so limited. The term “lockword” is used herein to refer to a lock variable in accordance with an embodiment of the present invention. Furthermore, while the term “lockword” is used throughout, it is to be understood that this term is not limited to any particular size of lock variable and instead a lockword may be any size desired for a particular implementation. Additional structures may be associated with a lockword, including a shared data structure that is to be accessed when a lock is acquired. A mutual exclusion structure (MUTEX) may also be associated with the lockword. Furthermore, wait variables and the like may be associated with the lockword as will be described below.
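For the 32-bit implementation just described, the split into a 28-bit identifier portion and a 4-bit control portion may be illustrated with the following C sketch. The macro and function names are hypothetical and chosen only for this example.

```c
#include <assert.h>
#include <stdint.h>

#define LW_CTRL_BITS 4u      /* width of the control portion (low-order bits) */
#define LW_CTRL_MASK 0xFu    /* selects the control portion */

/* Extract the control portion (low-order 4 bits). */
static inline uint32_t lw_control(uint32_t lw) {
    return lw & LW_CTRL_MASK;
}

/* Extract the identifier portion (upper 28 bits), right-aligned. */
static inline uint32_t lw_identifier(uint32_t lw) {
    return lw >> LW_CTRL_BITS;
}

/* Build a lockword from a 28-bit identifier and a 4-bit control value. */
static inline uint32_t lw_pack(uint32_t id, uint32_t ctrl) {
    assert(id < (1u << 28));
    assert(ctrl <= LW_CTRL_MASK);
    return (id << LW_CTRL_BITS) | ctrl;
}
```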
In various implementations, the control portion of the lock may be used to enable different lock features and modes of operation via a single control structure. Accordingly, entities may access the control portion, read its contents and/or write thereto in order to acquire the lock and/or modify properties or features of the lock. While only a few representative control mechanisms are described herein, it is to be understood that the scope of the present invention is not limited in this regard, and a lock may include other features and modes of operation controlled by elements in a control portion.
Referring now to
As further shown in
In one embodiment, N element 22 may be used to indicate that a reader seeks notification after a writer has acquired and released lockword 10. In addition to writing to N element 22, a reader may also store an identifier in a wait variable or other location. The reader performs these operations after acquiring the reader lock but before it has released the reader lock. This operation may be idempotent; that is, even if multiple readers want notification, a single bit suffices to tell the writer to wake up all readers waiting at a corresponding wait variable. Because a reader cannot acquire the lock (and hence will not try to set the notification bit) when a writer has acquired the lock, there is no race condition between setting this N element and a writer waking up the readers, since the writer wakes up the readers only at the time of release. In one embodiment, this scheme of notification allows an implementation via instructions to monitor a memory region and wait for a store thereto, e.g., MONITOR and MWAIT instructions in an Intel Architecture (IA)-32 environment. In one embodiment, N element 22 may be written using a bit test and set instruction (e.g., the BTS instruction in an IA-32 environment).
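The idempotent setting of the notification element may be sketched as below. This is an illustration only: the bit positions `LW_N` and `LW_R` are assumed for the example, and the portable C11 `atomic_fetch_or` stands in for a bit test and set instruction.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

#define LW_N 0x1u  /* hypothetical position of the notification (N) bit */
#define LW_R 0x8u  /* hypothetical position of the reader (R) bit */

/* Called by a reader that currently holds the read lock.
 * Atomically sets N and returns true if N was already set, so any
 * number of readers may call this and the stored result is the same. */
static inline bool request_notification(_Atomic uint32_t *lw) {
    uint32_t old = atomic_fetch_or(lw, LW_N);
    /* A reader holds the lock, so no writer owns it and R is still set. */
    assert(old & LW_R);
    return (old & LW_N) != 0;
}
```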
In one embodiment, U element 24 may be used as an upgrade indicator. If a reader needs to be upgraded to a writer, it atomically tries to set U element 24. If it succeeds, it waits until all readers have released their read locks. Correspondingly, if a would-be writer or reader sees U element 24 set, it does not try to acquire lockword 10. When all readers have released their locks, the upgrader acquires lockword 10 as a write lock. If it fails to atomically set U element 24, the reader may stop trying to upgrade itself to a writer. Depending on the context in which the reader-writer lock is being used, the reader may take further actions; for example, if the reader is executing a software transaction, then it may abort its transaction. In one embodiment, to effect the abort, the reader may release all locks it has acquired.
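The contention on the upgrade indicator may be illustrated as follows. The sketch covers only the atomic attempt to set the U element and the check made by would-be acquirers; waiting for the remaining readers and the final write acquisition are omitted. The bit position `LW_U` and all names are assumptions of this example.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define LW_U 0x2u  /* hypothetical position of the upgrade (U) bit */

typedef enum { UPGRADE_WON, UPGRADE_LOST } UpgradeResult;

/* A reader atomically tries to set U. Exactly one caller can observe
 * the bit as previously clear; that caller becomes the upgrader. */
static inline UpgradeResult try_begin_upgrade(_Atomic uint32_t *lw) {
    assert(lw != NULL);
    uint32_t old = atomic_fetch_or(lw, LW_U);
    return (old & LW_U) ? UPGRADE_LOST : UPGRADE_WON;
}

/* Would-be readers and writers check this on a snapshot of the
 * lockword and do not try to acquire while an upgrade is pending. */
static inline bool upgrade_pending(uint32_t snapshot) {
    return (snapshot & LW_U) != 0;
}
```

A reader that receives `UPGRADE_LOST` would stop trying to upgrade and, in a software transaction, might abort and release its locks as described above.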
In one embodiment, I element 26 may be used as an inflation indicator. It may be set to one if lockword 10 is inflated, and to zero if lockword 10 is not inflated. Operation using I element 26 will be described further below. In one embodiment, a reader indicator, i.e., R element 28, may always be set to zero if a writer has acquired lockword 10; otherwise it may be set to one.
While these particular features and states for the control elements of control portion 20 have been described, it is to be understood that the scope of the present invention is not limited in this regard and in other embodiments fewer, additional, or different elements and indicators for different modes of operation or features can be present.
Referring now to
As shown in
Still referring to
Upon release of the lockword, the thread may acquire a write lock and set the lockword with its thread identifier (TID) (block 140). In one implementation, the write lock may be acquired by setting predetermined values for the elements or bits within the control portion of the lockword. Furthermore, to identify itself as the owner of the lockword, the thread may insert its thread identifier into the first (i.e., identifier) portion of the lockword. Accordingly, at this time the thread has successfully gained ownership of the lockword and thus may write data to the shared memory location associated with the lockword (block 150).
After this write, the thread may release the write lock and set the lockword to its initial value (block 160). For example, the thread may clear its TID from the identifier portion and may further place a predetermined value in the control portion. This predetermined value may correspond to an initial value of the control portion, in some embodiments. In one implementation, the initial value may correspond to a value of 0x8, although the scope of the present invention is not so limited. Note that the events performed in blocks 130, 140, 150 and 160 may also correspond to the events for obtaining a write lock (without first upgrading from reader status).
After release of the write lock, method 100 may conclude. While described with this particular implementation in the embodiment of
In various embodiments, reader-writer locks can be used in multiple modes of operation. More specifically, these reader-writer locks can be used in multiple concurrency schemes, namely an optimistic concurrency mode and a pessimistic concurrency mode. In an optimistic concurrency mode, readers read data associated with the shared memory of a lockword without taking any form of lock and use the data as desired. When the reader reaches a commitment phase (e.g., of a transaction using the data), the lockword is analyzed to validate the data by confirming that the value of the lockword has not changed since the reader read the data. In this way, the reader validates that the value of the data read has not changed. Such optimistic concurrency can be relatively efficient and provide for improved caching effects.
However, optimistic concurrency can lead to a high number of abort operations when used in an STM, at least during certain execution periods. That is, when the data associated with a lockword is modified after it has been read by a reader and before the reader commits the operation that used the data, that operation and other pending operations, e.g., of a transaction, are aborted to avoid data inconsistencies. Accordingly, depending on given system conditions, a lockword may be used instead in a pessimistic concurrency scheme. In such a pessimistic concurrency scheme, reader-writer locks enable read concurrency, but explicitly prevent writers from accessing the data while a read lock is present. Thus the data remains coherent; however, performance can be degraded, as a writer cannot acquire the lockword (and the associated shared memory) until the one or more readers have released the lockword.
In various embodiments, an adaptive approach may be used to switch between these different concurrency modes based on system conditions. In some embodiments, a control element within the control portion of the lockword may be used to enable adaptive switching between these concurrency modes. Referring back to
Different manners of providing for adaptive switching between concurrency modes may be realized. Referring now to
Still referring to
Still referring to
If instead at diamond 240 it is determined that the count exceeds the threshold, control passes to block 250. There operation of the lockword may be dynamically changed to the second concurrency mode (block 250). This concurrency mode may correspond to a pessimistic concurrency mode, in various implementations. According to such a pessimistic concurrency mode, in order to read data at a shared memory location corresponding to a lockword, the reader must first acquire a lock. Accordingly, control passes to block 260, where the thread may acquire a read lock in order to access the shared memory location (block 260). After reading the data at the shared memory location and performing other actions (e.g., committing the transaction within which the shared data was read), control passes to block 270, where the thread may release the read lock (block 270). Subsequently, a writer may acquire a lock on the lockword. Accordingly, method 200 may conclude.
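The threshold comparison that drives the change of concurrency mode may be sketched as follows. This is a simplified illustration under stated assumptions: the structure `AdaptiveState`, the bit position `LW_I`, and the function name are hypothetical, and the sketch does not address how the inflation indicator interacts with a writer that owns the lockword at the same moment.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define LW_I 0x4u  /* hypothetical position of the inflation (I) bit */

typedef struct {
    _Atomic uint32_t lockword;
    _Atomic uint32_t abort_count;   /* number of aborted transactions seen */
    uint32_t abort_threshold;       /* switch modes once the count exceeds this */
} AdaptiveState;

/* Record one aborted transaction. When the count exceeds the threshold,
 * set the inflation indicator so that later readers take a read lock.
 * Returns true if the lockword is in the pessimistic (inflated) mode
 * after this call. */
static inline bool note_abort(AdaptiveState *s) {
    assert(s != NULL);
    uint32_t count = atomic_fetch_add(&s->abort_count, 1u) + 1u;
    if (count > s->abort_threshold) {
        atomic_fetch_or(&s->lockword, LW_I);
    }
    return (atomic_load(&s->lockword) & LW_I) != 0;
}
```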
While described with this particular implementation in the embodiment of
Referring now to
The algorithm for a reader acquiring a read lock may be as follows in Table 1, in one embodiment:
In one embodiment, the algorithm for a reader releasing the lock may be as shown in Table 2:
Note that a reader increments the value of the lockword by 0x10 on acquire and decrements it by the same amount on release. Because 0x10 is the lowest bit above the four-bit control portion, the lower bits are unperturbed by the read lock operation. For example, if the notification bit was set, it does not get erased. Also, when a reader has the lock, the R indicator remains set.
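A read lock acquire and release of this kind may be sketched as below. The sketch is not a reproduction of the tables referenced above; it assumes a reader count kept in the identifier portion, an increment of 0x10 (the lowest bit above the four control bits), and hypothetical bit positions `LW_R` and `LW_U`.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

#define LW_U          0x2u   /* hypothetical upgrade (U) bit */
#define LW_R          0x8u   /* hypothetical reader (R) bit */
#define LW_READER_INC 0x10u  /* one reader: lowest bit above the control bits */

/* One attempt to take a read lock. Fails if a writer owns the lockword
 * (R clear), an upgrade is pending, or the lockword changed underneath. */
static inline bool read_try_acquire(_Atomic uint32_t *lw) {
    uint32_t old = atomic_load(lw);
    if (!(old & LW_R) || (old & LW_U)) {
        return false;
    }
    return atomic_compare_exchange_strong(lw, &old, old + LW_READER_INC);
}

/* Release a read lock previously taken with read_try_acquire. */
static inline void read_release(_Atomic uint32_t *lw) {
    uint32_t old = atomic_fetch_sub(lw, LW_READER_INC);
    /* At least one reader was recorded and R was set throughout. */
    assert(old >= LW_READER_INC && (old & LW_R));
    (void)old;
}
```

Because the count changes in steps of 0x10, a previously set notification bit survives both operations.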
To acquire a lock on the initial state of a lockword, a writer may clear the reader element to indicate a write lock, as shown in
In one embodiment, the algorithm for a writer acquiring the lock is as set forth in Table 3:
Note that when the lock is acquired, a thread identifier (TID) is shifted into the lockword. This preserves the invariant that when a writer has the lock, the lower four bits are always zero.
When the writer releases the lock, the bit pattern shown in
The write lock release sets the lockword to the initial value. If some readers had asked for notification, then the writer wakes them up at the corresponding wait variable.
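A write lock acquire and release consistent with the description above may be sketched as follows. It is an illustration under assumptions: the bit positions, the initial value `LW_INIT`, and the use of an output flag to remember a pending notification request are choices made for this example, and the wake-up of waiting readers is left to the caller because the wait mechanism is implementation specific.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

#define LW_N         0x1u  /* hypothetical notification (N) bit */
#define LW_U         0x2u  /* hypothetical upgrade (U) bit */
#define LW_R         0x8u  /* hypothetical reader (R) bit */
#define LW_CTRL_BITS 4u
#define LW_INIT      0x8u  /* initial value: no owner, R set */

/* One attempt to take the write lock. On success the TID is shifted
 * into the lockword, so the lower four bits become zero, and *notify
 * reports whether readers had asked to be woken on release. */
static inline bool write_try_acquire(_Atomic uint32_t *lw, uint32_t tid,
                                     bool *notify) {
    assert(tid != 0 && tid < (1u << 28));
    uint32_t old = atomic_load(lw);
    /* Require: no readers recorded, no writer (R set), no upgrade pending. */
    if ((old >> LW_CTRL_BITS) != 0 || !(old & LW_R) || (old & LW_U)) {
        return false;
    }
    if (!atomic_compare_exchange_strong(lw, &old, tid << LW_CTRL_BITS)) {
        return false;
    }
    *notify = (old & LW_N) != 0;
    return true;
}

/* Release: restore the initial value. */
static inline void write_release(_Atomic uint32_t *lw, uint32_t tid) {
    assert(atomic_load(lw) == (tid << LW_CTRL_BITS));
    (void)tid;
    atomic_store(lw, LW_INIT);
}
```

A caller that obtained `notify == true` would wake the readers waiting at the corresponding wait variable after calling `write_release`.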
In optimistic concurrency, a lock is in one of two phases: it is either owned by a writer or it contains a version number. When a writer releases the lock, it increments the version number. Thus, the version number increases monotonically and is guaranteed to change if a writer has acquired the lock. A reader never acquires the lock. During a read, a reader tests whether the lock is free, and if so remembers the version number of the lock. At commit, it tests the version number again and if the version numbers match, then no writer has acquired the lock in between. This may provide better cache effects than a reader-writer lock mechanism, as the optimistic versioning approach does not cause a store on a read operation.
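The reader side of this versioning scheme may be sketched as below. The names are hypothetical, and the test for a free lock assumes, for illustration, that a set R bit marks a lockword that holds a version number.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define LW_R 0x8u  /* hypothetical reader (R) bit: set when a version is held */

/* At read time: if the lock is free, remember the version number.
 * Performs only a load, so the read causes no store to the lockword. */
static inline bool version_sample(_Atomic uint32_t *lw, uint32_t *version) {
    assert(version != NULL);
    uint32_t v = atomic_load(lw);
    if (!(v & LW_R)) {
        return false;   /* owned by a writer */
    }
    *version = v;
    return true;
}

/* At commit time: the read is valid only if the version is unchanged. */
static inline bool version_validate(_Atomic uint32_t *lw, uint32_t version) {
    return atomic_load(lw) == version;
}
```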
In one embodiment, optimistic concurrency may have the R indicator set to one if the lockword contains a version number and zero if the lockword is owned by a writer. To perform a write lock acquire, a thread remembers the old version number, and as before inserts its TID (by left shifting by 4 bits). This preserves the invariant that the lower four bits on a write lock acquire are zero. On a lock release, the writer increments the old version number by 0x10. This ensures that the lower 4 bits remain unperturbed; in particular, the R indicator remains set, which gives a valid version number.
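The writer side of the versioning scheme may be sketched as follows. The function names are hypothetical, and the increment of 0x10 is assumed so that the four control bits, including the R indicator, are carried over unchanged from the old version.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

#define LW_R           0x8u   /* hypothetical reader (R) bit */
#define LW_CTRL_BITS   4u
#define LW_CTRL_MASK   0xFu
#define LW_VERSION_INC 0x10u  /* lowest bit above the control bits */

/* One attempt to take the write lock on a versioned lockword.
 * On success the old version is handed back for use at release. */
static inline bool vwrite_try_acquire(_Atomic uint32_t *lw, uint32_t tid,
                                      uint32_t *old_version) {
    assert(tid < (1u << 28));
    uint32_t old = atomic_load(lw);
    if (!(old & LW_R)) {
        return false;   /* already owned by a writer */
    }
    if (!atomic_compare_exchange_strong(lw, &old, tid << LW_CTRL_BITS)) {
        return false;
    }
    *old_version = old;
    return true;
}

/* Release: publish the next version. The low four bits of the old
 * version, including R, are preserved by the 0x10 increment. */
static inline void vwrite_release(_Atomic uint32_t *lw, uint32_t old_version) {
    assert((atomic_load(lw) & LW_CTRL_MASK) == 0u);  /* writer-owned */
    atomic_store(lw, old_version + LW_VERSION_INC);
}
```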
Embodiments of the present invention may thus provide for adaptivity between optimistic and pessimistic forms. The adaptive reader-writer lock structure may be arranged as follows in one embodiment:
Every lockword may have an associated MUTEX, but the MUTEX is used only when inflation is in effect. Every lockword may also have an associated field that counts the number of readers that have acquired the read lock explicitly. Again, it is used only when inflation is in effect. Thus, given a lockword, the associated MUTEX as well as the count field can be obtained, since they are arranged sequentially in memory. Implementations can choose to associate the lockword with the MUTEX and count fields in different ways.
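One possible sequential arrangement is sketched below. It is illustrative only: the structure name `AdaptiveLock`, the field names, and the use of a POSIX `pthread_mutex_t` as the MUTEX are assumptions of this example.

```c
#include <assert.h>
#include <pthread.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdint.h>

/* Lockword, MUTEX and reader count laid out one after another. */
typedef struct {
    _Atomic uint32_t lockword;      /* identifier and control portions */
    pthread_mutex_t  mutex;         /* used only when inflation is in effect */
    _Atomic uint32_t reader_count;  /* explicit readers, used only when inflated */
} AdaptiveLock;

/* The lockword is the first member, so its address is also the
 * address of the enclosing structure. */
static_assert(offsetof(AdaptiveLock, lockword) == 0,
              "lockword must be the first member");

/* Given only a lockword, recover the enclosing structure. */
static inline AdaptiveLock *lock_from_lockword(_Atomic uint32_t *lw) {
    return (AdaptiveLock *)(void *)lw;
}

/* Given only a lockword, obtain the associated MUTEX. */
static inline pthread_mutex_t *mutex_of(_Atomic uint32_t *lw) {
    return &lock_from_lockword(lw)->mutex;
}

/* Given only a lockword, obtain the associated reader count field. */
static inline _Atomic uint32_t *reader_count_of(_Atomic uint32_t *lw) {
    return &lock_from_lockword(lw)->reader_count;
}
```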
To perform versioning with a reader, the algorithm of Table 5 may be used:
Note that for obtaining the proper version number, the inflation indicator may be masked. A validation algorithm for the reader may be implemented as shown in Table 6, in one embodiment:
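The masking of the inflation indicator may be illustrated as follows. The bit position `LW_I` and the function names are assumptions of this sketch, which shows only why a change of the inflation indicator between read and commit need not be treated as a version change.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define LW_I         0x4u  /* hypothetical inflation (I) bit */
#define LW_CTRL_MASK 0xFu

static_assert((LW_I & LW_CTRL_MASK) == LW_I,
              "inflation bit lies within the control portion");

/* Version number with the inflation indicator masked out. */
static inline uint32_t version_of(uint32_t lw) {
    return lw & ~LW_I;
}

/* Validation compares masked values, so inflation alone does not
 * invalidate a previously sampled version. */
static inline bool versions_match(uint32_t seen, uint32_t now) {
    return version_of(seen) == version_of(now);
}
```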
Suppose a reader wants to perform read locking and not use versioning. Then the algorithm of Table 7 may be used, in one embodiment:
The read lock release in an adaptive scheme may work as shown in the algorithm of Table 8:
Next, if one of the readers desires to upgrade to a writer status, the bit pattern of
Note that this algorithm preserves the invariant that a write lock acquire sets the lower 4 bits to zero. When the lockword is inflated to a pessimistic mode of operation, the bit pattern shown in
The write lock release increments the version number by 0x10, which means that the lower bits, including the R indicator, remain unperturbed. This preserves the invariant that the R indicator is set for a valid version number.
Finally,
Referring now to
As shown in
For purposes of illustration,
Furthermore, as shown in
Lock manager 350 may be used to control a concurrency mode of operation for lockword 310. As one example, upon initial configuration lockword 310 may be set for an optimistic concurrency mode to avoid the expense of acquiring locks and cache effects associated therewith. However by operating in an optimistic mode, one or more threads 365 may have to abort a transaction when a value of lockword 310 changes from the time that data in shared memory 340 is accessed and when an instruction related to the data later commits. Upon such aborts, a counter 352 within lock manager 350 may be incremented. Lock manager 350 may further include an inflation logic 354. Inflation logic 354 may be adapted to compare the value in counter 352 to a threshold. This threshold may correspond to a threshold number of transaction aborts. If greater than this threshold number of transaction aborts occurs, lock manager 350 may cause lockword 310 to be inflated to a pessimistic mode of operation. As described above, such mode of operation may be implemented by setting an inflation indicator within second portion 320, although the scope of the present invention is not so limited. While shown with this particular implementation in the embodiment of
As described above, reader-writer locks in accordance with an embodiment of the present invention may be used in connection with an STM. In such embodiments, transactions may be performed by threads in different concurrency modes, based upon a particular system operation. When operating in an optimistic concurrency mode, a thread may need to abort a transaction if a value of an accessed data associated with a lockword changes during use of the data. In a pessimistic concurrency mode, reader concurrency may be guaranteed at the expense of lower performance.
Different system architectures may implement an STM for use with reader-writer locks. Referring now to
Still referring to
As further shown in
Embodiments may be implemented in code and may be stored on a storage medium having stored thereon instructions which can be used to program a system to perform the instructions. The storage medium may include, but is not limited to, any type of disk including floppy disks, optical disks, compact disk read-only memories (CD-ROMs), compact disk rewritables (CD-RWs), and magneto-optical disks, semiconductor devices such as read-only memories (ROMs), random access memories (RAMs) such as dynamic random access memories (DRAMs), static random access memories (SRAMs), erasable programmable read-only memories (EPROMs), flash memories, electrically erasable programmable read-only memories (EEPROMs), magnetic or optical cards, or any other type of media suitable for storing or transmitting electronic instructions.
While the present invention has been described with respect to a limited number of embodiments, those skilled in the art will appreciate numerous modifications and variations therefrom. It is intended that the appended claims cover all such modifications and variations as fall within the true spirit and scope of the present invention.
This application is a divisional of U.S. patent application Ser. No. 13/325,688, filed Dec. 14, 2011, which is a continuation of U.S. patent application Ser. No. 11/392,381, filed Mar. 29, 2006, now U.S. Pat. No. 8,099,538, issued Jan. 17, 2012, the content of which is hereby incorporated by reference.
Number | Date | Country |
---|---|---|
20130173869 A1 | Jul 2013 | US |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 13325688 | Dec 2011 | US |
Child | 13778318 | | US |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 11392381 | Mar 2006 | US |
Child | 13325688 | | US |