Systems and methods for rebuilding a cache index

Information

  • Patent Grant
  • Patent Number
    11,106,586
  • Date Filed
    Thursday, June 13, 2019
  • Date Issued
    Tuesday, August 31, 2021
Abstract
Systems and methods for rebuilding an index for a flash cache are provided. The index is rebuilt by reading headers of containers stored in the cache and inserting information from the headers into the index. The index is enabled while being rebuilt, so lookup operations can be performed using the index even when it is incomplete. New containers can be inserted into used or unused regions of the cache while the index is being rebuilt.
Description
FIELD OF THE INVENTION

Embodiments of the invention relate to systems and methods for managing memory in a computing environment. More particularly, embodiments of the invention relate to systems and methods for rebuilding an index to data stored in a cache such as a flash cache.


BACKGROUND

Advances in computing technology have made it possible to store ever-growing amounts of data. However, the performance of data storage systems is often limited by hard disk drive (HDD) latency, which has been relatively constant for years. To improve performance, data storage systems use caching layers. Typically, each caching layer performs better than the layer below it.


In systems that deal with large amounts of data, flash memory can be used as a caching layer and can be much larger than DRAM (dynamic random access memory). In fact, caches configured from flash memory (flash cache) may be very large (e.g., hundreds of gigabytes to hundreds of terabytes in size). Flash memory has both higher IOPS (input/output operations per second) and lower latency compared to HDDs.


The performance of a storage system can be improved by placing the most valuable data or metadata into the flash cache for faster access. Unlike DRAM, flash is persistent across system restarts. Consequently, content stored in the flash cache is not lost when a system restarts and the contents can be advantageously used. This is referred to as a warm cache and is distinct from starting with a cold cache that needs to be repopulated with data.


However, an index is needed to access the contents of the flash cache. The index is usually stored in memory such as DRAM and maps an identifier (e.g., a fingerprint, hash, key, or the like) to a location in the flash cache. The data stored in the flash cache may be data such as file blocks, content-defined chunks, or meta-data such as directory records, file indirect blocks, or the like. Because the index in DRAM is lost across restarts, it is necessary to rebuild the index before the content of the flash cache can be used.


The index could be stored in the flash cache instead of memory. When the index is stored in the flash cache, it may not be necessary to rebuild the index or load it into memory. A drawback of this approach is that the index has to be kept up-to-date in the flash cache, which causes high churn in the flash cache and can impact its performance. Flash has limited endurance and supports only a limited number of writes before it becomes read-only. As one example, consider a 100 GB flash device that supports only one full overwrite per day for five years. That device supports 100 GB times 365 days times 1 write per day times five years, or approximately 178 TB of writes, before it becomes read-only. Frequent index updates can use up the writes supported by the flash device. Additionally, index updates are usually very small, such as only a few bytes, but flash updates occur at the granularity of a page, usually 4 KB, so each small update requires a page to be read, modified, and written to a new location.
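Spelled out, and assuming the approximately 178 TB figure reflects a binary (1024-based) conversion from gigabytes to terabytes, which is our reading of the example, the arithmetic is:

```latex
\[
100\ \mathrm{GB/write} \times 1\ \mathrm{write/day} \times 365\ \mathrm{days/year} \times 5\ \mathrm{years}
= 182{,}500\ \mathrm{GB} \approx \frac{182{,}500}{1024}\ \mathrm{TB} \approx 178\ \mathrm{TB}.
\]
```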


In another example, the flash cache can be completely scanned and the index can be rebuilt in memory from the scan. Reading the entire cache, however, requires a lot of time (depending on the size of the cache) and consumes I/O that could be used for other purposes. This is expensive and can negatively impact the performance of the flash cache. Systems and methods are needed for building or for rebuilding an index for a flash cache.





BRIEF DESCRIPTION OF THE DRAWINGS

In order to describe the manner in which at least some aspects of this disclosure can be obtained, a more particular description will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings. Understanding that these drawings depict only example embodiments of the invention and are not therefore to be considered to be limiting of its scope, embodiments of the invention will be described and explained with additional specificity and detail through the use of the accompanying drawings, in which:



FIG. 1 illustrates an example of a computing environment that includes an index used to access the content of a flash cache;



FIG. 2 illustrates an example of an index stored in a memory and illustrates a relationship between the index and content stored in the flash cache;



FIG. 3 illustrates an example method for rebuilding an index; and



FIG. 4 illustrates an example method for inserting data segments into the flash cache.





DETAILED DESCRIPTION OF SOME EXAMPLE EMBODIMENTS

Embodiments of the invention relate to systems and methods for managing memory in a computing system. More particularly, embodiments of the invention relate to systems and methods for building or rebuilding indexes used to access content in a cache such as a flash cache.


A computing system may include multiple or tiered storage. The different tiers may include different types of storage. A computing system may include DRAM, a flash memory cache (flash or flash cache), and storage such as hard disk drives (HDD). The content stored in the flash cache is accessed using an index that maps an identifier of the content or data to a location of the data in the cache. In one example, the identifier may identify a location of the data in a container stored in the cache. A container is an example of a structure to store multiple data segments.


Unlike DRAM, flash is persistent across system restarts, so when a storage system restarts, the content in the flash cache can be used. This allows the storage system to start with a warm cache instead of starting from a cold cache that has to be repopulated. However, the index to the cache, which was stored in memory, is typically lost and must be rebuilt.


Embodiments of the invention relate to systems and methods for rebuilding or reconstructing the index when the flash cache is loaded. In one example, the process of rebuilding the index is improved by managing the manner in which content is stored in the cache. More specifically, the data may be arranged or stored in containers. Each container includes a header that contains enough metadata to index the content of the container. As a result, rebuilding the index is faster because only the headers of the containers need to be read. In addition, the container headers in the flash cache can be read using multiple parallel threads. The order of reading from the flash cache has little impact on the index because the index stores relationships between an identifier of the data (a data segment) and a location of the data segment in the flash cache, and more specifically a location of the data segment in a container in the cache. The index may also store other data or metadata. Even if the flash cache includes multiple flash devices, one or more threads can be created to read from each device independently and add entries to the index.
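By way of a non-limiting illustration, the following Python sketch shows the kind of container layout just described: a header carrying per-segment metadata sufficient to index the container without reading its data segments. The field names are hypothetical and are not drawn from the patent itself.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class SegmentRecord:
    """Per-segment metadata kept in the container header (hypothetical fields)."""
    fingerprint: bytes   # identifier of the data segment (e.g., a hash or key)
    offset: int          # byte offset of the segment within the container
    size: int            # segment size in bytes
    seg_type: str        # e.g., "data" or "metadata"

@dataclass
class ContainerHeader:
    """Header with enough metadata to index the container without reading its segments."""
    container_id: int
    segments: List[SegmentRecord] = field(default_factory=list)

@dataclass
class Container:
    header: ContainerHeader
    payload: bytes       # packed data segments; never touched during an index rebuild
```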


Further, multiple threads can be used to read from a single flash device when the flash device supports multiple outstanding I/Os. Each thread, for example, may be configured to read a distinct set of container identifiers (IDs). Reading the flash cache with multiple threads can shorten the time required to rebuild the index. When rebuilding the index using multiple threads, the index or subregions of the index may be locked as necessary to avoid conflicting updates.
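A minimal sketch of this parallel header scan follows, assuming container IDs are dense integers from 0 up to the highest ID recorded in the cache status header. The helper names read_header and index_insert are placeholders for flash-read and index-update routines, not functions defined by the patent.

```python
import threading

def rebuild_index_parallel(read_header, index_insert, highest_id, num_threads=4):
    """Read container headers in parallel, each worker covering a distinct ID range."""
    def worker(start, stop):
        for cid in range(start, stop):
            header = read_header(cid)      # read only the container header from flash
            if header is not None:         # region may be unused or hold no container
                index_insert(header)       # add the header's segment metadata to the index

    step = (highest_id + num_threads) // num_threads  # ceil((highest_id + 1) / num_threads)
    threads = [
        threading.Thread(target=worker, args=(i * step, min((i + 1) * step, highest_id + 1)))
        for i in range(num_threads)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
```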


Traditionally, a cache is not available until the index is fully reconstructed. Embodiments of the invention, however, allow access to the cache even when the index is not fully reconstructed. In this scenario, some requests to the index may fail to find data that is actually located in the flash cache. However, more of the data becomes available over time as the index is built. In addition, content can be inserted into the flash cache while the index is being rebuilt. Insertions are simply added to the index.


To facilitate the process of rebuilding the index, an overall cache state may be stored in a reserved region of the flash cache. The cache state may be referred to as a cache status header. The cache status header may include the highest container ID stored to flash as well as an allocation table indicating which regions of the flash cache contain containers and which regions are unused. When a client inserts new content into the flash cache, the content is packed into a new container (or multiple containers if necessary), a header representing the container is created, a new container ID is assigned, the container is written to flash, and the content is added to the index. The content is added to the index by adding at least an identifier (e.g., a key, a hash, or a partial identifier of the content) and a location of the data segment. The location may specify the container plus an offset within the container where the content or data segment is stored.
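The following sketch illustrates, under the same hypothetical naming, one way the cache status header and the insertion path described above could be modeled. The write_container callable stands in for the routine that persists the container and its header to flash.

```python
from dataclasses import dataclass, field
from typing import Dict, Set, Tuple

@dataclass
class CacheStatusHeader:
    """Overall cache state kept in a reserved flash region (illustrative fields)."""
    highest_container_id: int = 0
    used_regions: Set[int] = field(default_factory=set)    # regions currently holding containers
    unused_regions: Set[int] = field(default_factory=set)  # regions with no containers

def insert_content(status: CacheStatusHeader,
                   index: Dict[bytes, Tuple[int, int]],
                   segments: Dict[bytes, bytes],
                   write_container) -> int:
    """Pack new segments into a fresh container, write it to flash, and index its contents."""
    status.highest_container_id += 1          # a new container ID is created
    cid = status.highest_container_id
    write_container(cid, segments)            # the container (header plus segments) goes to flash
    offset = 0
    for fingerprint, data in segments.items():
        index[fingerprint] = (cid, offset)    # identifier -> (container ID, offset within container)
        offset += len(data)
    return cid
```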


During the rebuild process, the location in the flash cache in which the container is written can be selected in different ways. If unused regions are available in the flash cache, the container may be written to an unused region. If a region of the flash cache is used but not yet indexed, this region can be selected for replacement without reading the replaced container's header and without removing entries from the index. Otherwise, a region that is both used and indexed can be selected for replacement.
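The selection order just described might be expressed as follows; the three region sets and the eviction-policy callback are assumptions made for the sake of the sketch.

```python
def choose_region(unused_regions, used_unindexed_regions, used_indexed_regions, evict):
    """Pick a flash region for a new container while the index is still being rebuilt."""
    if unused_regions:
        # 1) Prefer a region holding no containers at all.
        return unused_regions.pop()
    if used_unindexed_regions:
        # 2) Replace a container that was never indexed: its header need not be read
        #    and no entries have to be removed from the index.
        return used_unindexed_regions.pop()
    # 3) Otherwise evict an indexed container according to a policy (e.g., LRU).
    return evict(used_indexed_regions)
```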


Various policies can be used to manage the content in the cache. Least recently used (LRU) is an example of a policy for evicting content from the cache. Thus, content can be added to the cache in parallel with the reconstruction of the index.



FIG. 1 illustrates an example of a computing system 100. The computing system 100 may be implemented as an integrated device or may include multiple devices that are connected together using a network, bus, or other suitable connection (wired and/or wireless). The computing system 100 may be configured to perform a special purpose. For example, the computing system 100 may be configured to perform data protection operations. Example data protection operations include, but are not limited to, backing up data from one or more clients, restoring data to one or more clients, de-duplicating data backed up in the computing system, indexing data stored in the computing system, optimizing the data stored in the computing system, reading a cache, writing to a cache, rebuilding an index to a cache, or the like or combination thereof.


The computing system 100 may include a processor 102 (or multiple processors), a memory 104, a flash cache 108 (or other suitable memory type), and storage 114. The memory 104 and the flash cache 108 may both be configured as a cache. The memory 104, for example, may be DRAM or the like. The memory 104 is typically faster and smaller than the flash cache 108. The flash cache 108 is typically smaller and faster than the storage 114. The storage 114 may include multiple HDDs or other storage type.


The memory 104, flash cache 108, and storage 114 are arranged to improve performance of the computing system 100. Over time, by way of example, data that is requested more frequently tends to reside in the flash cache 108.


In the computing system 100, an index 106 is maintained in the memory 104. The index 106 includes multiple entries and each entry corresponds to data or content stored in the flash cache 108. In one example, the index 106 may be implemented as a hash index. The hash in an entry of the index is an identifier of content in the flash cache 108 corresponding to the entry. In one example, the index 106 may not store the complete identifier. The index 106 may include identifiers of data or content stored in the flash cache 108. Each entry in the index 106 may also store other information or metadata such as a segment size, segment type, or the like or other combination thereof. In another example, the metadata in the index may be a logical block address or file handle and offset within a file.
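For illustration only, an entry of such an index might be modeled as below; the field names are hypothetical, and the index itself can be viewed as a hash table keyed by the (possibly partial) identifier.

```python
from dataclasses import dataclass
from typing import Dict

@dataclass
class IndexEntry:
    """One in-memory index entry (illustrative fields; the full identifier need not be stored)."""
    container_id: int   # which container in the flash cache holds the segment
    offset: int         # offset of the segment within that container
    size: int           # segment size in bytes
    seg_type: str       # e.g., "block", "chunk", or a metadata record type

# The index maps a (partial) identifier such as a hash or fingerprint to its entry:
Index = Dict[bytes, IndexEntry]
```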


The content in the flash cache 108 may be stored in containers 110. Each of the containers 110 may include a container header. The container header contains enough metadata to index the content stored in the corresponding container. Each container may include multiple segments. When rebuilding the index, the container headers may be read and entries in the index 106 may be generated from the metadata in the container header. More specifically, each container may include multiple data segments. The container header may include the identifier and location of each data segment in the container. The container header may also identify a segment type and a segment size for each data segment. By reading the container headers, all information needed to index each of the data segments can be obtained without having to read or process the data segments themselves.


A cache status header 112 may also be stored in the flash cache 108. The cache status header 112 may be stored at a known location such that information contained therein can be extracted across restarts without having to search for the cache status header.


The cache status header 112 may include information that can be used at least when the index is being rebuilt. For example, the cache status header 112 may identify the highest container identifier included in the containers 110. When rebuilding the index, the container headers can be read in parallel using multiple threads. Knowing the highest container identifier allows the computing system to know when all container headers have been read and allows the computing system to allocate a range of container headers to different threads. This eliminates redundancy when reading the container headers such that container headers are only read once.


The cache status header 112 may also identify regions of the flash cache 108 that are unused and/or regions that are used and/or regions recommended for replacement. This provides flexibility when inserting new content into the flash cache 108 while the index is being rebuilt.



FIG. 2 illustrates the process of rebuilding an index in the computing system 100. In FIG. 2, the index 106 is rebuilt using threads 226 (e.g., the thread 228 and the thread 230, which represent multiple threads) or other suitable executable or process.


More specifically, the flash cache 108 includes, by way of example, a container 214 and a container 220. In practice, the flash cache 108 may include a large number of containers. The container 214 includes a container header 216 and data segments 218. The container header 216 includes metadata that allows the data segments 218 to be indexed without having to read the data segments 218. The metadata includes information such as the identifiers of the data segments and the locations of the data segments in the container. This metadata can be used to rebuild the index. The container 220 similarly includes a container header 222 and corresponding data segments 224.


In one embodiment, the container headers can be read using multiple threads 226. In this example, the thread 228 reads the container header 216 and the thread 230 reads the container header 222. After reading the container headers 216 and 222, the threads 226 insert the metadata into the index 106.


After reading the container header 216, at least an identifier 206 (e.g., a key, hash, fingerprint, or partial identifier of the corresponding segment) and a location 208 are written to the entry 202 of the index 106. The location stored in the index 106 may be represented in different ways. The location 208, for example, may point to the container 214 and include an offset to the relevant segment. Alternatively, the location 208 could point directly to a particular segment. Similarly, an identifier 210 and a location 212 are written to the entry 204 of the index 106. Once this is completed, the containers 214 and 220 are indexed. The containers can be read in parallel, and a lock may be applied to the index 106 as necessary during index updates.


Until all of the containers have been indexed in this manner, the index 106 is only partially rebuilt. However, embodiments of the invention allow the index to be used in a partially reconstructed state. This allows the computing system to be used more quickly compared to waiting for the index to be completely rebuilt.



FIG. 3 illustrates an example of a method for rebuilding an index. The method 300 begins by reading container headers in box 302. Multiple threads may be used so that the container headers are read in parallel. When reading the container headers, the cache status header may be used to identify the number of container headers to be read. Further, different container identifiers are allocated to different threads; each thread may read its assigned range of container headers in sequence, and the ranges are read in parallel.


In box 304, the index is updated by the threads. Because multiple threads are reading the container headers, the index or portions of the index may be locked during an update. In one example, the index may include index buckets, and each index bucket may include multiple entries. Locking the index, or locking only the affected bucket, ensures that updates are performed without different threads interfering with each other's writes and potentially creating invalid index entries.
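A minimal sketch of bucket-level locking follows, assuming a simple hash-bucketed in-memory index; the class and method names are ours, not the patent's.

```python
import threading

class BucketedIndex:
    """In-memory index split into buckets, each guarded by its own lock (illustrative)."""
    def __init__(self, num_buckets=1024):
        self.buckets = [dict() for _ in range(num_buckets)]
        self.locks = [threading.Lock() for _ in range(num_buckets)]

    def _bucket(self, key: bytes) -> int:
        return hash(key) % len(self.buckets)

    def insert(self, key: bytes, location):
        b = self._bucket(key)
        with self.locks[b]:          # lock only the affected bucket, not the whole index
            self.buckets[b][key] = location

    def lookup(self, key: bytes):
        b = self._bucket(key)
        with self.locks[b]:
            return self.buckets[b].get(key)
```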


After a container has been indexed, the data segments associated with the container can be accessed via the cache. Thus, lookup operations can be performed in box 306. Stated differently, a data segment that has been indexed can be retrieved from the flash cache even if the cache index is not completely rebuilt. A data segment that has not yet been indexed may be retrieved from storage even if it is actually present in the flash cache. In other words, cache misses are handled normally in the computing system.
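The lookup path with a partially rebuilt index might look like the following sketch, where read_from_flash and read_from_hdd are placeholders for the cache and storage tiers; a miss is served from storage exactly as an ordinary cache miss would be.

```python
def read_segment(key, index, read_from_flash, read_from_hdd):
    """Serve a read request while the index may still be only partially rebuilt."""
    entry = index.lookup(key)
    if entry is not None:
        # The container holding this segment has already been indexed: serve from flash.
        return read_from_flash(entry)
    # Not yet indexed (or genuinely absent): fall back to the slower storage tier,
    # exactly as any ordinary cache miss would be handled.
    return read_from_hdd(key)
```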



FIG. 4 illustrates an example of a method for inserting a data segment into the cache while the index is being rebuilt. The method 400 allows data segments to be inserted into the cache while the index is under construction. In box 402, the cache state is accessed to determine the highest container number and to identify unused regions of the flash cache. The cache state can be determined from the cache status header, which may be read into memory for faster repeated access. This information can be used when inserting data segments into the flash cache.


In box 404, a request to insert a data segment is evaluated using the index. In one example, the request is evaluated to determine whether the index includes the data segment associated with the request. This may occur, for example, when the container containing the data segment has been indexed. In one example, an identifier of the requested data segment is compared with identifiers presently indexed. If the identifier is found, then the data segment is stored in the flash cache and has been indexed. The response to the request in box 404 may be to read the data segment from the flash cache.


In one example, when the identifier is not found in the index in box 404, the data segment may be inserted into the cache in box 406 and the index updated. The data segment can be inserted in different ways. In one example, the cache status header may be read to identify the highest container number and to obtain information about free space. The data segment is then written to a container (which is given the next container number), and the container is written to an unused region of the flash cache if an unused region is available.


In another example, the container containing the data segment can be written to a region of the flash cache that has been used but has not been indexed. In this example, the header of the container being replaced does not need to be read and no entries need to be removed from the index. Information for the new container, however, may be added to the index.


In another example, a region that is already indexed and used can be selected for the new data segment. The selection of the used region can be made using various caching eviction policies, such as least recently used, lowest container number, or the like.


This process may continue until the index is rebuilt. Once the index is rebuilt, normal caching policies may be applied.


The embodiments disclosed herein may include the use of a special purpose or general-purpose computer including various computer hardware or software modules, as discussed in greater detail below. A computer may include a processor and computer storage media carrying instructions that, when executed by the processor and/or caused to be executed by the processor, perform any one or more of the methods disclosed herein.


As indicated above, embodiments within the scope of the present invention also include computer storage media, which are physical media for carrying or having computer-executable instructions or data structures stored thereon. Such computer storage media can be any available physical media that can be accessed by a general purpose or special purpose computer.


By way of example, and not limitation, such computer storage media can comprise hardware such as solid state disk (SSD), RAM, ROM, EEPROM, CD-ROM, flash memory, DRAM, phase-change memory (“PCM”), or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other hardware storage devices which can be used to store program code in the form of computer-executable instructions or data structures, which can be accessed and executed by a general-purpose or special-purpose computer system to implement the disclosed functionality of the invention. Combinations of the above should also be included within the scope of computer storage media. Such media are also examples of non-transitory storage media, and non-transitory storage media also embraces cloud-based storage systems and structures, although the scope of the invention is not limited to these examples of non-transitory storage media.


Computer-executable instructions comprise, for example, instructions and data which cause a general purpose computer, special purpose computer, or special purpose processing device to perform a certain function or group of functions. Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts disclosed herein are disclosed as example forms of implementing the claims.


As used herein, the term ‘module’ or ‘component’ can refer to software objects or routines that execute on the computing system. The different components, modules, engines, and services described herein may be implemented as objects or processes that execute on the computing system, for example, as separate threads. While the system and methods described herein can be implemented in software, implementations in hardware or a combination of software and hardware are also possible and contemplated. In the present disclosure, a ‘computing entity’ may be any computing system as previously defined herein, or any module or combination of modules running on a computing system.


In at least some instances, a hardware processor is provided that is operable to carry out executable instructions for performing a method or process, such as the methods and processes disclosed herein. The hardware processor may or may not comprise an element of other hardware, such as the computing devices and systems disclosed herein. A controller may include a processor and memory and/or other computing chips.


In terms of computing environments, embodiments of the invention can be performed in client-server environments, whether network or local environments, or in any other suitable environment. Suitable operating environments for at least some embodiments of the invention include cloud computing environments where one or more of a client, server, or target virtual machine may reside and operate in a cloud environment.


The present invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.

Claims
  • 1. A method for operating a flash memory cache while rebuilding an index for the flash memory cache, the method comprising: rebuilding the index by: reading headers of containers stored in the flash memory cache, each of the containers storing segments, wherein the headers contain sufficient information to index the segments stored in the containers without accessing the segments stored in the containers; andupdating the index with information extracted from the headers of the containers, wherein the information includes identifiers of the segments stored in the containers and locations of the segments stored in the containers; andadding a new segment to the flash memory cache in parallel with updating the index.
  • 2. The method of claim 1, further comprising adding information including an identifier for the new segment and a location of the new segment in a container to the index.
  • 3. The method of claim 1, further comprising operating the flash memory cache in a normal manner with a partially rebuilt index, wherein the normal manner includes at least evicting segments or containers from the flash memory cache using the partially rebuilt index and performing lookup operations using the partially rebuilt index.
  • 4. The method of claim 1, further comprising adding the new segment to a portion of the flash memory cache that has not been used or that does not currently store any containers or segments.
  • 5. The method of claim 1, further comprising adding the new segment to a portion of the flash memory cache that has been used but not yet indexed while rebuilding the index.
  • 6. The method of claim 1, further comprising adding the new segment to the flash memory cache in accordance with first policies while the index is rebuilt and operating the flash memory cache in accordance with normal policies after the index is rebuilt, wherein the first policies relate to selecting a used region of the flash memory cache in which to add the new segment, wherein the first policies include at least one of least recently used or lowest container number.
  • 7. The method of claim 1, further comprising performing lookup operations into the flash memory cache while rebuilding the index.
  • 8. The method of claim 1, further comprising reading the flash memory cache with multiple threads, wherein each thread is assigned a range of containers to read.
  • 9. The method of claim 1, further comprising storing a cache state in a reserved region of the flash memory cache.
  • 10. The method of claim 9, wherein the cache state identifies a highest container identifier stored in the flash memory cache and identifies which regions of the flash memory cache are unused, wherein the highest container identifier ensures that all containers are read when rebuilding the index.
  • 11. The method of claim 1, further comprising adding a new container to the flash memory cache while rebuilding the index, wherein the new container is added to an unused region of the flash memory cache or wherein the new container replaces an existing container in a used region of the flash memory cache that has not been indexed, wherein a header of the new container in the used region is not read during the rebuilding process.
  • 12. A computing system configured to rebuild an index into flash memory, the computing system comprising: a flash memory configured to cache data segments in the flash memory;a processor configured to execute computer executable instructions for performing a method for rebuilding the index, the method comprising:rebuilding the index into the flash memory, wherein the index associates identifiers of the data segments with locations of the data segments in the flash memory, by: reading headers of containers stored in the flash memory, each of the containers stored in the flash memory and storing at least one of the data segments in the flash memory, wherein the headers contain sufficient information to index the data segments stored in the containers stored in the flash memory without accessing the data segments stored in the containers stored in the flash memory; andupdating the index with information extracted from the headers of the containers, wherein the information includes identifiers of the data segments stored in the containers and locations of the data segments in the containers; andadding a new data segment to the flash memory in parallel with updating the index.
  • 13. The computing system of claim 12, further comprising adding information including an identifier for the new data segment and a location of the new data segment within one of the containers stored in the flash memory to the index.
  • 14. The computing system of claim 12, further comprising a memory that is separate from the flash memory, wherein the index is stored in the memory, in the flash memory, or in both the memory and the flash memory.
  • 15. The computing system of claim 12, further comprising adding the new data segment to a new container that is added to a portion of the flash memory that has not been used.
  • 16. The computing system of claim 12, further comprising adding the new data segment to a portion of the flash memory that has been used but not yet indexed while rebuilding the index, wherein the new data segment being added is added to a new container stored in the flash memory in accordance with first policies while the index is being rebuilt and operating the flash memory in accordance with normal policies after the index is rebuilt.
  • 17. The computing system of claim 12, further comprising performing lookup operations into the flash memory while rebuilding the index.
  • 18. The computing system of claim 12, further comprising storing a cache state in a reserved region of the flash memory.
  • 19. The computing system of claim 18, wherein the cache state identifies a highest container identifier stored in the flash memory and identifies which regions of the flash memory are unused, wherein identifying the highest container identifier ensures that all containers stored in the flash memory are read while rebuilding the index.
  • 20. The computing system of claim 12, further comprising inserting a new container into a portion of a used region of the flash memory that is not yet indexed without indexing any containers replaced by the insertion of the new container and without removing entries that correspond to data segments stored in the containers being replaced from the index.
Related Publications (1)
Number Date Country
20190294545 A1 Sep 2019 US
Continuations (1)
Number Date Country
Parent 15196150 Jun 2016 US
Child 16440457 US