Enterprises, such as business enterprises, operate enterprise systems to provide software functionality to customers and employees. In some examples, an enterprise system can include back-end enterprise servers that host enterprise applications. Example enterprise applications include enterprise resource planning (ERP) systems, client-relationship management (CRM) systems, product lifecycle management (PLM) systems, supply chain management (SCM) systems, and supplier relationship management (SRM) systems. During operation of an enterprise application, application data is accessed, which is stored in main memory of the enterprise server. In this manner, the application data is immediately accessible by processors of the enterprise server.
Increasingly large amounts of application data are stored in the main memory of enterprise servers. Main memory can include dynamic random access memory (DRAM), which consumes a relatively high amount of static energy (in both active and idle states) due to continuous leakage and refresh power. Non-volatile memory (NVM), also referred to as storage class memory (SCM) (e.g., phase change memory (PCM)), can address fundamental limitations of DRAM. Characteristics that differentiate NVM from DRAM include data persistence, high latency, high write energy, low static energy, and low write endurance (e.g., wear-out of cells). Physically, NVM is inserted into a memory bus along with DRAM.
Implementations of the present disclosure include computer-implemented methods for placement of data in differing memory types of hybrid memory systems. In some implementations, methods include actions of determining a first cost and a second cost associated with a virtual memory page accessed during execution of an application, the first cost being associated with a first memory type, and the second cost being associated with a second memory type in a hybrid memory system, comparing the first cost and the second cost to provide a comparison result, determining a current location of the virtual memory page, the current location including one of the first memory type and the second memory type, and selectively migrating the virtual memory page from the current location based on the comparison result and the current location.
These and other implementations can each optionally include one or more of the following features: each of the first cost and the second cost is determined based on a read energy and a write energy expended for the virtual memory page with respect to the first memory type and the second memory type, respectively; selectively migrating the virtual memory page includes migrating the virtual memory page from the second memory type to the first memory type in response to determining that the second cost exceeds the first cost, and that the virtual memory page is stored in the second memory type; selectively migrating the virtual memory page includes migrating the virtual memory page from the first memory type to the second memory type in response to determining that the second cost does not exceed the first cost, and that the virtual memory page is stored in the first memory type; migrating the virtual memory page includes copying the virtual memory page from one of the first memory type and the second memory type to the other of the first memory type and the second memory type, and deleting the virtual memory page from the one of the first memory type and the second memory type; determining the first cost and the second cost is performed in response to the virtual memory page being accessed; and the first memory type includes dynamic random access memory (DRAM), and the second memory type includes non-volatile memory (NVM).
The present disclosure also provides one or more non-transitory computer-readable storage media coupled to one or more processors and having instructions stored thereon which, when executed by the one or more processors, cause the one or more processors to perform operations in accordance with implementations of the methods provided herein.
The present disclosure further provides a system for implementing the methods provided herein. The system includes one or more processors, and a computer-readable storage medium coupled to the one or more processors having instructions stored thereon which, when executed by the one or more processors, cause the one or more processors to perform operations in accordance with implementations of the methods provided herein.
It is appreciated that methods in accordance with the present disclosure can include any combination of the aspects and features described herein. That is, methods in accordance with the present disclosure are not limited to the combinations of aspects and features specifically described herein, but also include any combination of the aspects and features provided.
The details of one or more implementations of the present disclosure are set forth in the accompanying drawings and the description below. Other features and advantages of the present disclosure will be apparent from the description and drawings, and from the claims.
Like reference symbols in the various drawings indicate like elements.
Implementations of the present disclosure are generally directed to placement of data in differing memory types of hybrid memory systems. In some implementations, actions can include determining a first cost and a second cost associated with a virtual memory page accessed during execution of an application, the first cost being associated with a first memory type, and the second cost being associated with a second memory type in a hybrid memory system, comparing the first cost and the second cost to provide a comparison result, determining a current location of the virtual memory page, the current location including one of the first memory type and the second memory type, and selectively migrating the virtual memory page from the current location based on the comparison result and the current location.
To provide context for implementations of the present disclosure, the demand for main memory capacity in data centers has increased significantly in the last decade, and the energy consumed by data centers has correspondingly increased. For example, from 2005 to 2010, worldwide energy consumption in data centers increased by approximately 50%. As one example, electricity consumption in European data centers was approximately 56 TWh per year, and is projected to increase to approximately 104 TWh per year by 2020. However, studies have shown that average server utilization in a data center ranges from 20-30%, for example. Further studies have revealed that main memory (DRAM) accounts for 30% of the total power consumed in a data center. Several studies have proposed techniques to minimize power consumption in DRAM; however, the problem is more fundamental. For example, because DRAM is volatile, it requires continuous static power (e.g., leakage power, refresh power).
DRAM scaling has been used to address management of application data in main memory of enterprise servers. However, there are limits to DRAM scaling. For example, the scaling of DRAM to below 40 nm is questionable. As there are limits to DRAM scaling, storage class memory (SCM), such as byte-addressable non-volatile memory (NVM) (e.g., phase change memory (PCM)), is being considered as a replacement for main memory.
NVM is an emerging category of memories that offer non-volatility, byte-addressability, high density, energy efficiency, and low cost. For example, NVM provides byte-addressability like DRAM, and persistence like hard disk drives (HDDs). The scaling properties and energy efficiency of NVMs, such as PCM, have made them popular in research communities. NVM provides low-cost and low-energy alternatives to DRAM, and also offers core features like persistence and high scalability. In some examples, studies on conventional DRAM have shown that the leakage energy is a significant portion of the total power consumed by DRAM, and should be a primary focus of energy conservation approaches. Unlike DRAM, SCM consumes extremely low leakage power and no refresh power, and the scaling property of SCM is promising (e.g., unlike DRAM, PCM can scale down to approximately 10 nm). NVM, however, has certain disadvantages, which can vary between NVM technologies. Generally, disadvantages of NVM include increased latency and dynamic energy for NVM accesses, and reduced memory bandwidth and a fast wear-out of NVM devices as compared to DRAM.
Hybrid main memory, including multiple types of memory (e.g., DRAM, NVM), is implemented to address the advantages and disadvantages of DRAM and NVM. The concept behind hybrid main memory is that a small amount of DRAM holds frequently accessed data (hot data) and provides low latency and low dynamic energy, while a large amount of NVM is used to implement a scalable memory system and store the majority of less frequently accessed data (cold data).
As introduced above, implementations of the present disclosure provide an end-to-end approach to allocating data from an application by using memory allocators. Implementations of the present disclosure provide an interface between the application and an OS, and map pages (virtual memory pages) to underlying physical memory devices. In some implementations, an NVM allocator is provided as a default allocator for a hybrid memory system, and allocations identified as critical allocations are conducted using a DRAM allocator.
Implementations of the present disclosure further provide data management policies at the granularity of application objects and virtual memory pages (pages). In some implementations, application knowledge and behavior are used to derive the best initial placement for an application object at allocation time. In some examples, after the initial allocation, pages can be moved between DRAM and NVM in the hybrid memory system in order to adapt to dynamic workload characteristics. Accordingly, implementations of the present disclosure combine the static, initial page placement with a dynamic page placement. In this manner, implementations of the present disclosure conserve energy in hybrid memory systems.
In some examples, the example memory architecture 100 can be implemented in an in-memory database system. In some examples, an in-memory database system is a database management system that uses main memory for data storage. In some examples, main memory includes random access memory (RAM) that communicates with one or more processors (e.g., central processing units (CPUs)), over a memory bus. An in-memory database system can be contrasted with database management systems that employ a disk storage mechanism. In some examples, in-memory database systems are faster than disk storage databases, because internal optimization algorithms can be simpler and execute fewer CPU instructions. In some examples, accessing data in an in-memory database system eliminates seek time when querying the data, which provides faster and more predictable performance than disk-storage databases. In some examples, an in-memory database can be provided as a column-oriented in-memory database, in which data tables are stored as sections of columns of data (rather than as rows of data). An example in-memory database system includes HANA, provided by SAP SE of Walldorf, Germany.
As introduced above, implementations of the present disclosure provide a programming interface (not depicted) between the application and the OS.
In some examples, the application 202 is the application, for which data allocations between the DRAM 216 and NVM 218 are to be made. In some examples, the virtual address space 209 is provided as a set of binary addresses that is used by the operating system 204 to allocate memory addresses to any process requesting memory. In some examples, the virtual address space 209 enables the processes to use more memory addresses than the actual DRAM memory available in the system. In some implementations, the operating system 204 manages the mappings between virtual addresses and physical addresses. In some examples, the storage 210 is provided as a hard disk drive that is used for permanent storage of data.
In accordance with implementations of the present disclosure, the OS enables the application programmer to decide whether data should be allocated on NVM or on DRAM. In some examples, the mechanism to create new virtual memory pages is through a mmap system call. Implementations of the present disclosure enable a flag argument to mmap to indicate whether the new pages should be mapped to DRAM or NVM. In some implementations, mapping to NVM is provided as a default. That is, the OS maps the pages to NVM, unless it is otherwise determined to map the pages to DRAM. In some examples, and if the pages are to be mapped to DRAM, the OS records the target memory request and allocates the physical page on DRAM, if space is available. If space is not available, the physical page is allocated on NVM.
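By way of illustration, such an allocation could look as follows. This is a minimal sketch: standard mmap has no notion of memory type, so the MAP_DRAM flag value and the hybrid_mmap wrapper below are hypothetical placeholders for whatever flag a modified OS actually defines.

```c
#include <stddef.h>
#include <sys/mman.h>

/* Hypothetical flag understood by the modified OS; the value shown here is
 * illustrative only and is not part of standard mmap. */
#define MAP_DRAM 0x100000

/* Map n bytes of anonymous memory. Without MAP_DRAM, the OS maps the new
 * pages to NVM by default; with MAP_DRAM, the OS records the target memory
 * request and allocates the physical pages on DRAM if space is available. */
static void *hybrid_mmap(size_t n, int want_dram)
{
    int flags = MAP_PRIVATE | MAP_ANONYMOUS;
    if (want_dram)
        flags |= MAP_DRAM;
    void *p = mmap(NULL, n, PROT_READ | PROT_WRITE, flags, -1, 0);
    return p == MAP_FAILED ? NULL : p;
}
```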
In some examples, applications use finer-grain mechanisms for memory allocation, such as the malloc family of functions. The malloc functions manage a set of virtual memory pages obtained through mmap, and allocate objects from these pages. In order to differentiate between DRAM and NVM allocations, implementations of the present disclosure provide distinct memory allocators in the application, one managing pages mapped to DRAM, and the other managing pages mapped to NVM. In some examples, each memory allocator provides standard memory allocation functions prefixed with either “dram_” (e.g., dram_malloc) or “nvm_” (e.g., nvm_malloc). In some examples, the standard function names are retained and are mapped to the NVM allocator. In some implementations, only dynamically allocated memory is allocated through mmap or malloc, and it can be assumed that static variables and stack variables are always mapped to NVM.
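For example, call sites in the application could then read as follows. This is a sketch under the assumption that the allocators mirror the standard malloc/free signatures; the dram_free and nvm_free names are assumed counterparts to the prefixed allocation functions named above, and the variable names are purely illustrative.

```c
#include <stddef.h>
#include <stdint.h>

/* Allocator interfaces assumed to mirror malloc/free (prefixes from the text). */
void *dram_malloc(size_t size);
void  dram_free(void *ptr);
void *nvm_malloc(size_t size);
void  nvm_free(void *ptr);

void build_structures(size_t n_buckets, size_t column_bytes)
{
    /* Hot, latency-critical structure: allocate explicitly on DRAM. */
    uint64_t *hash_index = dram_malloc(n_buckets * sizeof *hash_index);

    /* Cold bulk data: the NVM allocator is the default choice. */
    char *column_buffer = nvm_malloc(column_bytes);

    /* ... populate and use the objects ... */

    dram_free(hash_index);
    nvm_free(column_buffer);
}
```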
In some implementations, a migration process is provided to migrate data between types of memory (e.g., moving objects between NVM and DRAM). In some examples, if migration is to be performed, the programmer can allocate a new copy of the object on the opposing memory type and copy the data to the opposing memory type. For example, if an object is to be migrated from NVM to DRAM, a new copy of the object can be allocated on DRAM, and the data of the object can be copied to DRAM. In some implementations, migrations coincide with existing memory copying code. For example, migrations can be performed only when data set sizes grow to exceed the last level cache (LLC) capacity. Because data set sizes are hard to predict in advance, data is stored in incrementally growing buffers. Growing these buffers involves allocating a new, larger buffer, and copying the data over. Implementations of the present disclosure leverage these existing copies to avoid additional migration cost, as shown in the sketch below.
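A hedged sketch of this idea follows, reusing the assumed allocator declarations from the previous example; the caller is presumed to know which memory type currently holds the buffer and which type the placement policy has chosen for it.

```c
#include <stddef.h>
#include <string.h>

void *dram_malloc(size_t size);
void  dram_free(void *ptr);
void *nvm_malloc(size_t size);
void  nvm_free(void *ptr);

/* Grow a buffer, choosing the target memory type per the placement policy.
 * Because growth already requires a copy, reallocating on the opposing
 * memory type migrates the object at no extra copy cost. */
void *grow_buffer(void *old, size_t old_size, size_t new_size,
                  int old_on_dram, int put_on_dram)
{
    void *fresh = put_on_dram ? dram_malloc(new_size) : nvm_malloc(new_size);
    if (fresh == NULL)
        return NULL;
    memcpy(fresh, old, old_size);   /* the existing copy doubles as the migration */
    if (old_on_dram)
        dram_free(old);
    else
        nvm_free(old);
    return fresh;
}
```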
In some implementations, an object, and an allocation thereof, can be identified as critical, as described in further detail herein. For example, an object can be determined to be critical if accesses to main memory for the object exceed a threshold number of accesses. It has been determined that a significant number (e.g., more than 98%) of allocations are not critical. In accordance with implementations of the present disclosure, non-critical allocations use the default allocators (e.g., nvm_malloc, nvm_mmap). In some examples, non-critical allocations either have few accesses to main memory, or have good cache locality, such that most of their accesses are served from the caches.
Implementations of the present disclosure have been implemented in MonetDB (an open source column-oriented database management system), and it was found that there are approximately 3250 call sites in MonetDB for dynamic allocation using malloc and mmap. In accordance with implementations of the present disclosure, the default is to use NVM. It was determined that there are only 7 call sites in MonetDB, where a decision was to be made as to whether an allocation goes to DRAM or NVM. As such, adapting known database systems to run in hybrid memory requires minimal code changes.
In order to analyze the most critical objects in terms of main memory accesses, implementations of the present disclosure use cost models for performance and energy. In some examples, an object is an individual program variable or memory allocation within the application code. In some implementations, the cost model for performance takes into account only access latency. Bandwidth issues arise only when objects are frequently accessed, in which case the object will already have been placed on DRAM due to the latency benefits (e.g., NVM has higher access latency than DRAM).
In some implementations, the performance cost of an object o is provided as the average memory access time (AMAT) incurred by memory accesses to the object o stored in a memory of technology τ, which can be determined based on the following example relationship:
$$\mathrm{AMAT}_{\tau}(o) = \mu_r(o)\,L_{\tau,r} + \mu_w(o)\,L_{\tau,w} + \bigl(1 - \mu_r(o)\bigr)\,L_{LLC} \qquad (1)$$
where $\tau$ is either DRAM or NVM, $L_{\tau,r}$ is the latency to read a cache block in memory, $L_{\tau,w}$ is the latency to write a cache block in memory, $L_{LLC}$ is the latency to access the LLC, $\mu_r(o)$ is the number of read memory accesses made to $o$ per load or store operation, and $\mu_w(o)$ is the number of write memory accesses made for $o$ normalized per load or store operation. The example relationship (1) addresses a single-level cache, but can be extended to address multi-level caches. In some examples, for read accesses, $\mu_r(o)$ is set equal to the global cache miss rate for the LLC (e.g., the number of LLC misses divided by the number of load or store operations). Accordingly, $L_{\tau,r}$ corresponds to the average read latency for a cache block. In the case of write accesses, $\mu_w(o)$ is a measure of write-back traffic, and $L_{\tau,w}$ corresponds to the average write latency.
Besides estimating performance, implementations of the present disclosure estimate the energy impact of storing an object over its lifetime. In some examples, static energy is considered, which is always present throughout the lifetime of an object and includes leakage and refresh energy, and dynamic energy, which is proportional to the frequency of memory accesses. Average memory access energy (AMAE) can be determined based on the following example relationship:
$$\mathrm{AMAE}_{\tau}(o) = \mu_r(o)\,E_{\tau,r} + \mu_w(o)\,E_{\tau,w} + S(o)\,P_{\tau}\,T(o) \qquad (2)$$
where $E_{\tau,r}$ and $E_{\tau,w}$ are the energy for reading and writing, respectively, a cache block from or to memory type $\tau$, the parameters $\mu_r(o)$ and $\mu_w(o)$ represent the read accesses and write accesses to memory, respectively, as in the definition of AMAT, $P_{\tau}$ is the average leakage power per byte for memory type $\tau$, and the parameters $S(o)$ and $T(o)$ represent the size and lifetime, respectively, of the object $o$ normalized per load or store operation.
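As a concrete illustration only, relationships (1) and (2) translate directly into code; the struct layouts and the placeholder $L_{LLC}$ value below are assumptions, and real parameter values would come from device characterization and profiling.

```c
/* Per-technology parameters (placeholder fields, not measured device data). */
typedef struct {
    double L_r, L_w;   /* read/write latency per cache block          */
    double E_r, E_w;   /* read/write energy per cache block           */
    double P;          /* average leakage power per byte              */
} MemTech;

/* Per-object profile, normalized per load/store operation. */
typedef struct {
    double mu_r, mu_w; /* read/write memory accesses per operation    */
    double S, T;       /* size and lifetime of the object             */
} ObjProfile;

static const double L_LLC = 10.0;  /* LLC access latency (assumed)   */

/* Relationship (1): average memory access time of object o on tech t. */
double amat(const MemTech *t, const ObjProfile *o)
{
    return o->mu_r * t->L_r + o->mu_w * t->L_w + (1.0 - o->mu_r) * L_LLC;
}

/* Relationship (2): average memory access energy of object o on tech t. */
double amae(const MemTech *t, const ObjProfile *o)
{
    return o->mu_r * t->E_r + o->mu_w * t->E_w + o->S * t->P * o->T;
}
```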
In accordance with implementations of the present disclosure, for static placement of objects, objects are placed in memory, such that energy is minimized and latency is raised by no more than a fixed percentage (λ) over a DRAM-only system (e.g., λ=5%). To this end, the objects are sorted in order of increasing performance difference (ΔAMAT(o)), and objects are placed on DRAM in this order until DRAM is full. In some examples, the performance difference is determined based on the following example relationship:
$$\Delta\mathrm{AMAT}(o) = \mathrm{AMAT}_{DRAM}(o) - \mathrm{AMAT}_{NVM}(o) \qquad (3)$$
which provides an estimation of the potential slowdown by placing the object on NVM. Further, an AMAE delta is determined based on the following relationship:
$$\Delta\mathrm{AMAE}(o) = \mathrm{AMAE}_{DRAM}(o) - \mathrm{AMAE}_{NVM}(o) \qquad (4)$$
which provides an estimation of the energy gain by placing the object on NVM. The latter is typically a function of the trade-off between static and dynamic energy for the object.
In some implementations, the list of sorted objects $(o_i,\ 1 \le i \le N)$ is partitioned by splitting the list at an index $s$, such that objects $o_i,\ i \le s$ are placed on DRAM, and objects $o_i,\ i > s$ are placed on NVM. In some examples, the index $s$ is selected such that the expected overall slowdown is comparable to that of the DRAM-only memory system.
In accordance with implementations of the present disclosure, the performance model (AMAT) and energy model (AMAE) are used for static data placement, and dynamic data placement. More particularly, and as described in further detail herein, static data placement is performed to place an object on DRAM or NVM at the start of the application. During execution of the application, dynamic data placement is performed to selectively migrate data between DRAM and NVM.
A list of objects is received (402). In some examples, the list of objects includes objects (e.g., $o_1, \ldots, o_n$) that are to be allocated to memory of a hybrid memory system during execution of an application. A counter $i$ is set equal to 1 (404). $\mathrm{AMAT}(o_i)$ is determined for object $o_i$ (406). $\mathrm{AMAE}(o_i)$ is determined for object $o_i$ (408). It is determined whether $i$ is equal to $n$ (410). In other words, it is determined whether all objects in the list of objects have been considered. If $i$ is not equal to $n$, the counter $i$ is incremented (412), and the example process 400 loops back. If $i$ is equal to $n$, the list of objects is partitioned (414). For example, the list of objects is partitioned as described in detail herein. Objects in the list of objects are allocated to respective memory types based on the partitioned list (416). For example, static allocation of objects is performed to allocate objects as described herein.
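A compact sketch of this static-placement pass is shown below; the Obj fields and the DRAM byte budget are assumptions. The sort key is $\Delta\mathrm{AMAT}(o)$ from relationship (3), so objects with the largest potential NVM slowdown come first and are placed on DRAM until DRAM is full.

```c
#include <stdlib.h>

typedef struct {
    double delta_amat;  /* AMAT_DRAM(o) - AMAT_NVM(o), relationship (3) */
    size_t size;        /* object size in bytes                         */
    int    on_dram;     /* resulting placement decision                 */
} Obj;

static int by_delta_amat(const void *a, const void *b)
{
    double da = ((const Obj *)a)->delta_amat;
    double db = ((const Obj *)b)->delta_amat;
    return (da > db) - (da < db);   /* ascending: biggest NVM slowdown first */
}

/* Sort objects by increasing delta-AMAT and place them on DRAM in that
 * order until the DRAM budget is exhausted; remaining objects go to NVM. */
void static_placement(Obj *objs, size_t n, size_t dram_bytes)
{
    size_t used = 0;
    qsort(objs, n, sizeof *objs, by_delta_amat);
    for (size_t i = 0; i < n; i++) {
        objs[i].on_dram = (used + objs[i].size <= dram_bytes);
        if (objs[i].on_dram)
            used += objs[i].size;
    }
}
```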
Implementations of the present disclosure also provide for dynamic page migration in hybrid memory systems. In some implementations, static placement of data in hybrid memory systems can occur as discussed herein, and/or as discussed in commonly assigned U.S. Ser. No. 14/704,461, filed on May 5, 2015, and entitled "Managed Energy-Efficient Hybrid Main Memory Systems," the disclosure of which is expressly incorporated herein by reference in the entirety for all purposes. As discussed in U.S. Ser. No. 14/704,461, application source code is instrumented with a profiling tool that profiles all of the allocations made by the application during the execution phase. In some examples, the profiling tool collects statistics including memory loads, memory stores, off-chip memory accesses, memory allocations, allocation sizes, callpath and lifetime. In some examples, a callpath of an object is the source code locations within the application where the object is first allocated, initialized and accessed during the execution of the application. In some examples, the lifetime of an object is the time between allocation and de-allocation of the object.
In some examples, the statistics are provided in a statistics file. In some examples, the performance model and energy model, as described herein, and/or in U.S. Ser. No. 14/704,461 are used to determine data placement for each object of an application. In the examples of U.S. Ser. No. 14/704,461, the programmer can change the executable of the application to place objects on DRAM or NVM according to the performance and energy models.
As introduced above, implementations of the present disclosure provide dynamic OS page migration for hybrid memory systems. As discussed in further detail herein, implementations of the present disclosure use both offline object placement (e.g., as described above, and in U.S. Ser. No. 14/704,461), and online object placement (e.g., dynamic OS page migration, as described herein) to achieve energy savings with minimal performance degradation. For example, implementations of the present disclosure have been tested for an in-memory database running an analytical workload. More specifically, the analytical workload included benchmark queries provided in the TPC Benchmark H (TPC-H) provided by the Transaction Processing Performance Council of San Francisco, Calif. The TPC-H is a decision support benchmark that includes a set of business oriented ad-hoc queries (e.g., a set of benchmark queries), and concurrent data modifications. An energy savings of up to 57% with a performance degradation of up to 3% was observed (relative to a DRAM-only memory system).
In accordance with implementations of the present disclosure, for each page requested by an application, the page is initially placed in NVM. That is, NVM is provided as a default placement for pages. In some examples, if static placement of a particular data object provides for placement of the data object on DRAM, the object will instead be placed on DRAM.
In some implementations, each time a page is accessed (subsequent to the page's initial placement in NVM), energy costs for storage of the page on DRAM and NVM are respectively determined, and a page migration decision is made based on the energy costs. More particularly, for a page $p$, respective energy costs $C_{DRAM}$ and $C_{NVM}$ can be determined based on the following example relationship:
$$C_{\tau}(p) = \mu_r(p)\,E_{\tau,r} + \mu_w(p)\,E_{\tau,w} \qquad (5)$$
where $\tau$ is either DRAM or NVM, $E_{\tau,r}$ and $E_{\tau,w}$ are the energy for reading and writing, respectively, from or to memory type $\tau$, and the parameters $\mu_r(p)$ and $\mu_w(p)$ represent the read accesses and write accesses, respectively, to the page in memory. In some examples, if $C_{NVM}$ is greater than $C_{DRAM}$ and the page $p$ is located in NVM, the page is migrated to DRAM. In some examples, if $C_{NVM}$ is not greater than $C_{DRAM}$ and the page $p$ is located in DRAM, the page is migrated to NVM. In some examples, the OS performs the page migration. In some examples, during page migration, the contents of the page are copied over to the other memory (e.g., from NVM to DRAM) using memcpy, and the page is deleted from the original memory (e.g., deleted from NVM).
Pages are stored in NVM (502). For example, during execution of an application using a hybrid memory system, virtual memory pages are initially stored in NVM (e.g., NVM is a default). A page is accessed (504). For example, during execution of the application, an already stored page is accessed (e.g., read from, written to). $C_{DRAM}$ and $C_{NVM}$ are determined for the page (506). For example, $C_{DRAM}$ and $C_{NVM}$ are determined based on Equation (5) above.
It is determined whether $C_{NVM}$ is greater than $C_{DRAM}$ (508). If $C_{NVM}$ is not greater than $C_{DRAM}$, it is determined whether the page is currently stored in DRAM (510). If the page is stored in DRAM, the page is migrated to NVM (512). If the page is not stored in DRAM, the page is maintained in NVM (514). If $C_{NVM}$ is greater than $C_{DRAM}$, it is determined whether the page is currently stored in NVM (516). If the page is stored in NVM, the page is migrated to DRAM (518). If the page is not stored in NVM, the page is maintained in DRAM (520).
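The decision logic of the example process 500 can be sketched as follows; the Page counters, the EnergyParams struct, and the migrate_page hook are assumptions standing in for the OS's actual bookkeeping and its memcpy-based migration.

```c
/* Per-page access counters maintained by the OS (assumed bookkeeping). */
typedef struct {
    double mu_r, mu_w;  /* read/write accesses to the page   */
    int    on_dram;     /* current location of the page      */
} Page;

/* Per-technology energy per cache-block read/write, as in relationship (5). */
typedef struct { double E_r, E_w; } EnergyParams;

/* Hypothetical OS hook: copies the page contents to the other memory type
 * (e.g., via memcpy) and releases the original physical page. */
void migrate_page(Page *p, int to_dram);

/* Relationship (5): energy cost of keeping page p on one technology. */
static double page_cost(const EnergyParams *t, const Page *p)
{
    return p->mu_r * t->E_r + p->mu_w * t->E_w;
}

/* Invoked on each access to an already-placed page (steps 506-520). */
void maybe_migrate(Page *p, const EnergyParams *dram, const EnergyParams *nvm)
{
    double c_dram = page_cost(dram, p);
    double c_nvm  = page_cost(nvm, p);

    if (c_nvm > c_dram) {
        if (!p->on_dram) { migrate_page(p, 1); p->on_dram = 1; }  /* to DRAM */
    } else {
        if (p->on_dram)  { migrate_page(p, 0); p->on_dram = 0; }  /* to NVM  */
    }
}
```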
Referring now to the example computing system 600 shown in the drawings.
The memory 620 stores information within the system 600. In one implementation, the memory 620 is a computer-readable medium. In one implementation, the memory 620 is a volatile memory unit. In another implementation, the memory 620 is a non-volatile memory unit. The storage device 630 is capable of providing mass storage for the system 600. In one implementation, the storage device 630 is a computer-readable medium. In various different implementations, the storage device 630 may be a floppy disk device, a hard disk device, an optical disk device, or a tape device. The input/output device 640 provides input/output operations for the system 600. In one implementation, the input/output device 640 includes a keyboard and/or pointing device. In another implementation, the input/output device 640 includes a display unit for displaying graphical user interfaces.
The features described can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. The apparatus can be implemented in a computer program product tangibly embodied in an information carrier (e.g., in a machine-readable storage device) for execution by a programmable processor; and method steps can be performed by a programmable processor executing a program of instructions to perform functions of the described implementations by operating on input data and generating output. The described features can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. A computer program is a set of instructions that can be used, directly or indirectly, in a computer to perform a certain activity or bring about a certain result. A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment.
Suitable processors for the execution of a program of instructions include, by way of example, both general and special purpose microprocessors, and the sole processor or one of multiple processors of any kind of computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. Elements of a computer can include a processor for executing instructions and one or more memories for storing instructions and data. Generally, a computer can also include, or be operatively coupled to communicate with, one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and optical disks. Storage devices suitable for tangibly embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, ASICs (application-specific integrated circuits).
To provide for interaction with a user, the features can be implemented on a computer having a display device such as a CRT (cathode ray tube) or LCD (liquid crystal display) monitor for displaying information to the user and a keyboard and a pointing device such as a mouse or a trackball by which the user can provide input to the computer.
The features can be implemented in a computer system that includes a back-end component, such as a data server, or that includes a middleware component, such as an application server or an Internet server, or that includes a front-end component, such as a client computer having a graphical user interface or an Internet browser, or any combination of them. The components of the system can be connected by any form or medium of digital data communication such as a communication network. Examples of communication networks include, e.g., a LAN, a WAN, and the computers and networks forming the Internet.
The computer system can include clients and servers. A client and server are generally remote from each other and typically interact through a network, such as the described one. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
In addition, the logic flows depicted in the figures do not require the particular order shown, or sequential order, to achieve desirable results. Moreover, other steps may be provided, or steps may be eliminated, from the described flows, and other components may be added to, or removed from, the described systems. Accordingly, other implementations are within the scope of the following claims.
A number of implementations of the present disclosure have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the present disclosure. Accordingly, other implementations are within the scope of the following claims.
U.S. Appl. No. 14/704,461, filed May 5, 2015, Ahmad Hassan.
U.S. Appl. No. 14/831,567, filed Aug. 20, 2015, Ahmad Hassan.
U.S. Appl. No. 14/831,624, filed Aug. 20, 2015, Ahmad Hassan.
U.S. Appl. No. 15/677,700, filed Aug. 15, 2017, Hassan.
Dhiman et al., "PDRAM: A hybrid PRAM and DRAM main memory system," Proceedings of the 46th Annual Design Automation Conference, Jul. 26-31, 2009, pp. 664-669.
Hassan et al., "Analytical models and techniques for Software-Managed Energy-Efficient Hybrid DRAM/NVM Main Memory," ACM International Conference on Computing Frontiers 2015, May 18-21, 2015.
Hassan et al., "Energy-Efficient In-Memory Data Stores on Hybrid Memory Hierarchies," Eleventh International Workshop on Data Management on New Hardware, Jun. 2015, last retrieved from https://event.cwi.nl/damon2015/slides/slides-hassan.pdf on Jan. 5, 2018.
Hu et al., "Data allocation optimization for hybrid scratch pad memory with SRAM and nonvolatile memory," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Jun. 2013, 21(6): 1094-1102.
Li et al., "Assert(!Defined(Sequential I/O))," Proceedings of the 6th USENIX Conference on Hot Topics in Storage and File Systems, Jun. 17-18, 2014, 1-5.
Luk et al., "Pin: Building Customized Program Analysis Tools with Dynamic Instrumentation," ACM SIGPLAN Notices, Jun. 2005, 40(6): 190-200.
Mogul et al., "Operating system support for NVM+DRAM hybrid main memory," Proceedings of the 12th Conference on Hot Topics in Operating Systems, May 18-20, 2009, 1-5.
Ramos et al., "Page placement in hybrid memory systems," Proceedings of the International Conference on Supercomputing, May 31-Jun. 4, 2011.
U.S. Office Action in related U.S. Appl. No. 15/213,621 dated Dec. 13, 2018, 12 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,621 dated May 17, 2018, 11 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,654 dated Dec. 1, 2017, 21 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,654 dated Jul. 2, 2018, 41 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,654 dated Mar. 16, 2018, 31 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,654 dated Nov. 27, 2018, 7 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,816 dated Jul. 26, 2018, 27 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,930 dated Jun. 19, 2018, 20 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,930 dated Mar. 9, 2018, 20 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,930 dated Oct. 20, 2018, 23 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,960 dated Dec. 13, 2018, 22 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,960 dated Jan. 11, 2018, 22 pages.
U.S. Office Action in related U.S. Appl. No. 15/213,960 dated Jul. 12, 2018, 24 pages.
U.S. Office Action in related U.S. Appl. No. 15/214,019 dated Aug. 27, 2018, 8 pages.
U.S. Office Action in related U.S. Appl. No. 15/214,019 dated Dec. 22, 2017, 12 pages.
U.S. Office Action in related U.S. Appl. No. 15/214,019 dated Jun. 14, 2018, 10 pages.
U.S. Office Action in related U.S. Appl. No. 15/214,082 dated Aug. 27, 2018, 27 pages.
U.S. Office Action in related U.S. Appl. No. 15/214,102 dated Jul. 24, 2018, 33 pages.
Wang et al., "Optimized Allocation of Data Variables to PCM/DRAM-based Hybrid Main Memory for Real-Time Embedded Systems," Embedded Systems Letters, IEEE, Sep. 2014, 6(3): 61-64.
U.S. Office Action in related U.S. Appl. No. 15/213,816 dated Feb. 7, 2019, 33 pages.
U.S. Office Action in related U.S. Appl. No. 15/214,102 dated Feb. 6, 2019, 41 pages.
U.S. Office Action in U.S. Appl. No. 15/213,930 dated Feb. 26, 2019, 35 pages.
Final Office Action issued in U.S. Appl. No. 15/214,082 dated Mar. 8, 2019, 41 pages.
Final Office Action issued in U.S. Appl. No. 15/213,626 on Oct. 18, 2019, 41 pages.
Final Office Action issued in U.S. Appl. No. 15/213,654 on Jul. 18, 2019, 21 pages.
Final Office Action issued in U.S. Appl. No. 15/213,674 on Oct. 18, 2019, 43 pages.
Final Office Action issued in U.S. Appl. No. 15/213,816 on Jan. 2, 2020, 43 pages.
Final Office Action issued in U.S. Appl. No. 15/214,082 on Mar. 19, 2020, 38 pages.
Final Office Action issued in U.S. Appl. No. 15/677,700 on May 28, 2020, 26 pages.
Final Office Action issued in U.S. Appl. No. 15/213,626 on Dec. 24, 2020, 37 pages.
Non-Final Office Action issued in U.S. Appl. No. 15/213,626 on Apr. 12, 2019, 23 pages.
Non-Final Office Action issued in U.S. Appl. No. 15/213,626 on Jun. 9, 2020, 45 pages.
Non-Final Office Action issued in U.S. Appl. No. 15/213,674 on Jan. 30, 2020, 38 pages.
Non-Final Office Action issued in U.S. Appl. No. 15/213,674 on Apr. 12, 2019, 27 pages.
Non-Final Office Action issued in U.S. Appl. No. 15/213,816 on Jun. 18, 2019, 46 pages.
Non-Final Office Action issued in U.S. Appl. No. 15/214,082 on Sep. 6, 2019, 36 pages.
Non-Final Office Action issued in U.S. Appl. No. 15/677,700 on Dec. 16, 2020, 23 pages.
Non-Final Office Action issued in U.S. Appl. No. 15/677,700 on Nov. 18, 2019, 60 pages.
Chen et al., "FSRAM: Flexible Sequential and Random Access Memory for Embedded Systems," Laboratory for Advanced Research in Computing Technology and Compilers Technical Report No. ARCTiC, Mar. 1, 2004, 6 pages.
Dulloor et al., "Data tiering in heterogeneous memory systems," Proceedings of the Eleventh European Conference on Computer Systems, ACM, Apr. 18, 2016, 16 pages.
Ouyang et al., "SSD-Assisted Hybrid Memory to Accelerate Memcached over High Performance Networks," 2012 41st International Conference on Parallel Processing, IEEE, Sep. 10, 2012, 10 pages.
Wang et al., "NVMalloc: Exposing an Aggregate SSD Store as a Memory Partition in Extreme-Scale Machines," 2012 IEEE 26th International Parallel and Distributed Processing Symposium, May 21, 2012, 12 pages.
Zakai, "Emscripten: An LLVM-to-JavaScript Compiler," Proceedings of the ACM international conference companion on Object oriented programming systems languages and applications companion (OOPSLA), Portland, Oregon, Oct. 22-27, 2011, 12 pages.
Publication: US 2018/0024754 A1, Jan. 2018, US.