This disclosure relates to data storage systems. More particularly, the disclosure relates to systems and methods for monitoring data retention in solid-state drives.
Certain solid-state memory devices, such as flash drives, store information in an array of memory cells constructed with floating gate transistors. In single-level cell (SLC) flash devices, each cell stores a single bit of information. In multi-level cell (MLC) devices, each cell stores two or more bits of information. When a read operation is performed, the electrical charge levels of the cells are compared to one or more voltage reference values (also called “voltage read level” or “voltage threshold”) to determine the state of individual cells. In SLC devices, a cell can be read using a single voltage reference value. In MLC devices, a cell is read using multiple voltage reference values. Certain solid-state devices allow for a memory controller to set voltage read levels.
Various factors can contribute to data read errors in solid-state memory devices. These factors include charge loss or leakage over time, and device wear caused by usage. When the number of bit errors on a read operation exceeds the ECC (error correction code) correction capability of the storage subsystem, the read operation fails. Voltage read levels can contribute to a device's ability to decode data.
Various embodiments are depicted in the accompanying drawings for illustrative purposes, and should in no way be interpreted as limiting the scope of the inventions. In addition, various features of different disclosed embodiments can be combined to form additional embodiments, which are part of this disclosure. Throughout the drawings, reference numbers may be reused to indicate correspondence between reference elements.
While certain embodiments are described, these embodiments are presented by way of example only, and are not intended to limit the scope of protection. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms. Furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made without departing from the scope of protection.
Overview
In certain solid-state media, such as NAND Flash, “data retention” refers to a memory failure factor relating to how long memory cells maintain a correct programmed state when the memory array is not under power bias. Data retention can be dependent on external environmental factors, such as, for example, high temperature and number of program/erase (P/E) cycles, which tend to degrade data retention in certain embodiments. Therefore, precise information relating to the data retention of solid-state memory can be useful in managing drive performance. However, obtaining such information can present significant challenges in solid-state memory systems.
In certain embodiments, a real timestamp is utilized to monitor data age in a solid-state data storage device, wherein the timestamp is stored in solid-state memory and provides a record of the time when data is programmed to the solid-state memory. The timestamp may accurately account for the time that elapses after programming for data retention purposes. However, using a timestamp alone, the system temperature, which can be particularly significant with respect to heavily-cycled blocks and can vary over time, may not be systematically accounted for. According to Arrhenius' equation, data retention degradation may accelerate exponentially with elevated temperature, and therefore the temperature effect can be significant. It may therefore be desirable to account for the temperature effect in data retention management systems.
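The temperature dependence described above can be illustrated with a minimal sketch of an Arrhenius acceleration factor. The activation energy, the reference temperature, and the function names below are hypothetical placeholders chosen for illustration and are not taken from this disclosure; actual values would depend on the memory technology and its characterization.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def arrhenius_acceleration(temp_c, ref_temp_c, activation_energy_ev=1.1):
    """Return how much faster retention loss proceeds at temp_c than at ref_temp_c.

    activation_energy_ev is an assumed placeholder value, not a figure
    from the disclosure.
    """
    t = temp_c + 273.15
    t_ref = ref_temp_c + 273.15
    exponent = (activation_energy_ev / BOLTZMANN_EV_PER_K) * (1.0 / t_ref - 1.0 / t)
    return math.exp(exponent)


def equivalent_retention_hours(segments, ref_temp_c=40.0, activation_energy_ev=1.1):
    """Sum (hours, temp_c) segments into hours at the reference temperature."""
    return sum(
        hours * arrhenius_acceleration(temp_c, ref_temp_c, activation_energy_ev)
        for hours, temp_c in segments
    )
```

Under these assumptions, an hour spent above the reference temperature counts as more than one equivalent hour, which is the effect a timestamp alone may not capture.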
Certain embodiments disclosed herein provide systems and methods wherein additional metadata is programmed together with the raw data on the target page. The metadata may contain information relating to the raw data of the page, such as ‘1’ or ‘0’ bit count information. Upon execution of a read request, the data storage device may compare the count of ‘0’ or ‘1’ bits obtained from the read of the raw data against the count metadata stored. The difference between the count(s) for the raw data and the count(s) stored in metadata can provide a measure of data retention effect, wherein such measure may substantially account for both time and temperature. In certain embodiments, an optimal, or adjusted, reading voltage can be determined based on such data retention information when using a calibrated memory. The differential bit count information may also be adopted as an indicator for when certain data refreshing operations, such as data scrubbing, garbage collection, and/or the like, should be performed. For example, such information may be determined from raw bit error rate (RBER), which may be indicated by the difference information. In certain embodiments, data retention management functionality described herein may be performed independently of participation of the system error correction functionality (e.g., ECC engine).
As used in this application, “non-volatile solid-state memory,” “non-volatile memory,” “NVM,” or variations thereof may refer to solid-state memory such as NAND flash. However, the systems and methods of this disclosure may also be useful in more conventional hard drives and hybrid drives including both solid-state and hard drive components. Solid-state memory may comprise a wide variety of technologies, such as flash integrated circuits, Phase Change Memory (PC-RAM or PRAM), Programmable Metallization Cell RAM (PMC-RAM or PMCm), Ovonic Unified Memory (OUM), Resistance RAM (RRAM), NAND memory, NOR memory, EEPROM, Ferroelectric Memory (FeRAM), MRAM, or other discrete NVM (non-volatile solid-state memory) chips. The non-volatile solid-state memory arrays or storage devices may be physically divided into planes, blocks, pages, and sectors, as is known in the art. Other forms of storage (e.g., battery backed-up volatile DRAM or SRAM devices, magnetic disk drives, etc.) may additionally or alternatively be used.
The terms “page,” “target page,” or variations thereof are used herein according to their broad and ordinary meaning. For example, “page” may refer to a block of physical memory cells, or to the physical memory cells themselves. Furthermore, within a multi-level cell (MLC), “page” may refer to either of upper or lower pages.
The term “equivalent,” as used to describe equivalent data retention time, may refer to an effective data retention state of a cell, block, die, or other portion of solid-state memory. Therefore, equivalent data retention time may take into consideration both the temperature experienced by the memory, as well as the time the data has been written to memory.
Data Storage System
The controller 130 may be configured to receive memory access commands from a storage interface (e.g., a device driver) 112 residing on a host system 110. The controller 130 may further execute commands in response to such host-issued memory commands in the non-volatile solid-state memory arrays 150. Storage access commands communicated by the storage interface 112 can include write and read commands issued by the host system 110. The commands can specify a block address in the data storage device 120, and the controller 130 can execute the received commands in the non-volatile solid-state memory array(s) 150. Data may be accessed/transferred based on such commands.
The data storage device 120 can store data received from the host system 110 such that the data storage device 120 acts as data storage for the host system 110. To facilitate this function, the controller 130 can implement a logical interface. The logical interface can present memory to the host system as a set of logical addresses (e.g., sequential/contiguous addresses) where data can be stored. Internally, the controller 130 can map logical addresses to various physical memory addresses in the non-volatile solid-state memory array 150 and/or other memory module(s). Mapping data indicating the mapping of logical addresses to physical memory addresses may be maintained in the data storage device. For example, mapping table data may be stored in non-volatile memory array(s) 150 in order to allow for recreation of mapping tables following a power cycle.
In certain embodiments, on power-up, the controller may cache mapping table data stored in the non-volatile solid-state memory array(s) 150 to volatile memory 132. In response to receipt and execution of host-issued memory commands, the controller 130 may update the cached mapping table data and periodically or sporadically flush such data to the memory array(s) 150. For example, as data stored in volatile memory can be lost when power to the memory is removed, it may be necessary or desirable to flush any cached mapping data to non-volatile memory before power to the controller 130 is lost.
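The caching and flushing of mapping table data described above can be sketched as follows. This is a simplified illustration only: the class name, its methods, and the use of Python dictionaries to stand in for the volatile memory 132 and the memory array(s) 150 are assumptions, not the controller's actual design.

```python
class MappingCache:
    """Toy logical-to-physical mapping table with a volatile cache."""

    def __init__(self, persisted):
        # Stands in for mapping table data kept in the non-volatile array(s).
        self.nvm_copy = dict(persisted)
        # On power-up, the table is cached into volatile memory.
        self.cache = dict(persisted)
        self.dirty = set()

    def update(self, logical_addr, physical_addr):
        """Record a new mapping in the cache and mark it as not yet flushed."""
        self.cache[logical_addr] = physical_addr
        self.dirty.add(logical_addr)

    def lookup(self, logical_addr):
        """Return the cached physical address, or None if unmapped."""
        return self.cache.get(logical_addr)

    def flush(self):
        """Write dirty entries back to the non-volatile copy; return the count."""
        for logical_addr in self.dirty:
            self.nvm_copy[logical_addr] = self.cache[logical_addr]
        flushed = len(self.dirty)
        self.dirty.clear()
        return flushed
```

In this sketch, any entry still in the dirty set when power is removed would be lost, which is why a flush before power loss may be necessary or desirable.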
In an embodiment, the data storage device 120 may be a hybrid disk drive that additionally includes magnetic memory storage (not shown). In such case, one or more controllers 130 may control the magnetic memory storage and the non-volatile solid-state memory array(s) 150.
Over time, memory degradation, loss of data retention, and the like can cause the states of a voltage distribution to widen and overlap.
Data Retention Flag Bytes
Certain embodiments disclosed herein provide for the assignment of one or more bytes in a target page as one or more copies of a data retention flag. The flag may include a count of the number of ‘0’ or ‘1’ bits in the raw data to be written to the target page. In certain embodiments, the data retention flag is stored in a separate memory array or device than the raw data. As described herein, a variable ‘Np’ may be used to represent the count of the number of ‘0’ or ‘1’ bits, which may be stored in usable area of the target page, which is described below with respect to
In certain embodiments, the ‘0’ or ‘1’ count (Np) is stored in the spare area 334 of the page 330. Alternatively, Np may be stored in bytes allocated from the user area of the page. In certain embodiments, Np is stored in a separate page, array, or device than the raw data with which it is associated. The data structure 336 includes interleaved user area and spare area. As shown, the spare area 334 may include one or more ECC sectors, as well as one or more additional flag bytes, or sectors. Np may be stored in one or more of the flag bytes. When the raw data is read, the number of ‘1’ or ‘0’ bits may be extracted from the read operation (represented herein by the variable Nr).
In certain embodiments, Nr may be compared to Np, wherein the difference between Np and Nr is used to estimate the data retention effect between the current read operation and the previous program operation associated with the data. The data retention effect indicated by the difference between Np and Nr may represent a combined effect of both time and temperature.
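The comparison of Nr against Np can be sketched as below. The function names are hypothetical, and counting ‘0’ bits rather than ‘1’ bits is an arbitrary choice for this illustration; either count may be used as described above.

```python
def count_zero_bits(data: bytes) -> int:
    """Count the '0' bits in a byte string."""
    total_bits = len(data) * 8
    one_bits = sum(bin(byte).count("1") for byte in data)
    return total_bits - one_bits


def retention_delta(read_data: bytes, np_stored: int) -> int:
    """Return Nr - Np, where Nr is the '0' count of the data as read."""
    nr = count_zero_bits(read_data)
    return nr - np_stored


# At program time, Np is computed from the raw data and stored as metadata.
programmed = bytes([0b00001111, 0b00000000])
np_value = count_zero_bits(programmed)  # 12 zero bits

# Later, the read returns data in which two cells have changed state.
read_back = bytes([0b00011111, 0b00000001])
delta = retention_delta(read_back, np_value)  # 10 - 12 = -2
```

A count difference only reflects the net change in one logical state, so it may understate the true number of flipped bits if flips occur in both directions.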
Based on the difference between Np and Nr, the data storage device may also be configured to estimate how much the relevant voltage reading level (Vt) for the page, or one or more cells thereof, should be adjusted to reduce bit errors.
In certain embodiments, prior to implementation of methods described herein, it may be desirable for a solid-state drive, or portions thereof, to be calibrated to determine the correlation among data retention, Nr−Np, and/or Vt shift. Such calibration may be performed in any suitable manner, as understood by those having ordinary skill in the art. The characterization and calibration may involve taking data retention measurements of the drive with varying P/E cycling, retention time, temperature, and/or data retention-relevant characteristics. For example, the drive may be baked to accelerate data retention, wherein bit error counts are periodically taken to determine drive data retention behavior.
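Once a drive has been calibrated, the correlation between the bit count difference and the Vt shift could be applied with a simple lookup, as sketched below. The calibration points, the units, and the use of linear interpolation are all hypothetical assumptions for illustration; a real table would come from the characterization measurements described above.

```python
from bisect import bisect_right

# Hypothetical calibration points: (magnitude of Nr - Np, read-level shift in mV).
# These values are invented for illustration only.
CALIBRATION = [(0, 0.0), (50, -20.0), (200, -60.0), (800, -150.0)]


def read_level_shift_mv(delta, table=CALIBRATION):
    """Linearly interpolate a read-level shift for a given count difference."""
    xs = [x for x, _ in table]
    if delta <= xs[0]:
        return table[0][1]
    if delta >= xs[-1]:
        return table[-1][1]  # clamp beyond the calibrated range
    i = bisect_right(xs, delta)
    (x0, y0), (x1, y1) = table[i - 1], table[i]
    return y0 + (y1 - y0) * (delta - x0) / (x1 - x0)
```

For example, with this invented table, a difference of 25 falls halfway between the first two points and yields a shift of -10 mV. Separate tables per P/E cycle range could be kept if calibration shows the correlation varies with wear.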
Systems and methods for determining data retention and/or read level adjustment in solid-state drives, as disclosed herein, may or may not involve ECC. In order to improve the integrity for the data retention flag, more than one set of Np may be programmed. Upon read, the data storage device may access a plurality of stored instances for determining the valid value of Np. In order to improve the reliability of the Np metadata, in certain embodiments, such metadata cells are programmed to lower Vt state(s), such as State 1 (first program state) or State 2 (second program state) as shown in
If the flash page has approximately 8K bytes, a Np number stored in the spare area of the page may utilize approximately 2 bytes, for example. Therefore, in certain embodiments, three sets of Np may utilize approximately 6 bytes of storage space. In a data storage device including a randomizer, the number of ‘0’s and ‘1’s may be substantially close to each other. If, for example, there exists a 5% unbalance between the number of ‘0’s and ‘1’s, then the unbalance between the count of ‘0’ and ‘1’ may be in the vicinity of 1638 for an 8 KB page. Therefore, if the 4 KB point of the page is used as the middle point reference, the value for Np may be limited by a maximum of around 820, which can be programmed using 10 bits. Further, storing three copies of Np may therefore require approximately 30 bits; 4 bytes may provide enough storage space for such bits.
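The storage arithmetic above (a value of up to about 820 fits in 10 bits, and three copies fit in 4 bytes) can be checked with a short sketch that packs three 10-bit copies of Np into a 4-byte field and recovers the value on read. The packing layout, the function names, and the use of a two-out-of-three majority vote to select the valid value are assumptions for illustration; the disclosure does not specify how the valid value is chosen among stored instances.

```python
from collections import Counter

NP_BITS = 10  # enough for values up to 1023, so about 820 fits
COPIES = 3    # 3 copies x 10 bits = 30 bits, which fits in 4 bytes


def pack_np(np_offset: int) -> bytes:
    """Pack COPIES identical NP_BITS-wide copies of np_offset into 4 bytes."""
    if not 0 <= np_offset < (1 << NP_BITS):
        raise ValueError("np_offset does not fit in NP_BITS")
    word = 0
    for _ in range(COPIES):
        word = (word << NP_BITS) | np_offset
    return word.to_bytes(4, "big")


def unpack_np(flag_bytes: bytes):
    """Recover Np by majority vote; return None if no two copies agree."""
    word = int.from_bytes(flag_bytes, "big")
    mask = (1 << NP_BITS) - 1
    copies = [(word >> (NP_BITS * i)) & mask for i in range(COPIES)]
    value, votes = Counter(copies).most_common(1)[0]
    return value if votes >= 2 else None
```

With this layout, a single corrupted copy is outvoted by the other two, while disagreement among all three copies is reported rather than guessed.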
In certain embodiments, reliability of data retention flag bytes can be improved by programming such bytes interleaved with dummy bits between, for example, any two data retention flag bits; the dummy bits may not be programmed. Use of dummy bits may reduce cell-to-cell interference, thereby improving the read margins for the flag bytes.
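One possible interleaving can be sketched as follows. The sketch assumes that an unprogrammed (erased) cell reads as ‘1’ and places one dummy bit after every flag bit; both the layout and the function names are illustrative assumptions rather than the disclosed arrangement.

```python
def interleave_with_dummy(flag_bits, dummy=1):
    """Place one unprogrammed dummy bit after each flag bit.

    dummy=1 assumes erased cells read as '1'.
    """
    cells = []
    for bit in flag_bits:
        cells.extend((bit, dummy))
    return cells


def strip_dummy(cells):
    """Recover the flag bits by dropping every second (dummy) cell."""
    return cells[0::2]
```

The cost of this layout is that the flag occupies twice as many cells, which may be acceptable given how few bits the flag requires.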
With reference to
In certain embodiments, determining upper page data may be relatively complex. As shown in
When the user data is subsequently read from the solid-state memory at block 506, the number of bits of the read user data that are of the certain logical state is determined at block 508. The difference between the number of bits written to the certain logical state and the number of bits read of the certain logical state is determined at block 510. The difference is used to determine data retention characteristics of the solid-state memory.
When the user data is subsequently read from the solid-state memory at block 606, the number of ‘0’ or ‘1’ bits (Nr) in the read user data is determined at block 608. Furthermore, the number Np stored in the memory is also read at block 610. The difference between Nr and Np (Nr−Np) is determined at block 612. The difference is used to determine data retention characteristics of the solid-state memory.
At block 808, the difference between NpL3 and NrL3 is determined, which may be used to determine adjusted voltage read levels for R1 and R3. In certain embodiments, the difference may be used for other purposes, such as to determine a data refreshing schedule, or for other data retention-related purposes. The data may then be read using the adjusted voltage read level values.
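Counting the cells read in State 3 (NrL3) from an upper page and its paired lower page could look like the sketch below. It assumes a hypothetical bit mapping in which a State 3 cell holds an upper-page bit of ‘1’ and a lower-page bit of ‘0’; actual mappings vary between devices, and the function name is illustrative.

```python
def count_state3_cells(upper: bytes, lower: bytes) -> int:
    """Count cells whose (upper, lower) bits are (1, 0).

    The (1, 0) encoding for State 3 is an assumed mapping for this sketch.
    """
    if len(upper) != len(lower):
        raise ValueError("upper and lower pages must be the same length")
    return sum(
        bin(u & ~l & 0xFF).count("1")  # upper bit set AND lower bit clear
        for u, l in zip(upper, lower)
    )
```

The result would then be compared against the stored NpL3 in the same way that Nr is compared against Np for a lower page.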
Certain factors, such as imperfect programming and cell-to-cell interference, may cause the actual number of ‘0’ (or ‘1’) bits, Nr0, to differ from Np immediately after programming, even without data retention effects. Such difference is reflected in the calibration curves (
Certain systems and methods disclosed herein provide for automatically accounting for the combined effects of both time and temperature on data retention. Data retention can affect the failing bit number (transition from higher Vt to lower Vt) and may be reflected in the value of Nr−Np, as described above. In certain embodiments, ECC is not required for equivalent data retention time and/or optimal Vt shift estimation. Therefore, certain embodiments may provide a mechanism for read-level tracking in very poor data retention cases where ECC may fail with the default read level. By reading the raw data and comparing it against the reference flag(s), data retention, and therefore read level adjustment amount, may be determined. Such systems and methods may be effective whether the solid-state drive has been powered off or not.
The implementation of these metadata bytes, as described herein, may indicate a threshold for a data scrubbing alert. Because data retention flag bytes can be written at the page level, certain embodiments may provide an adaptive method for individual pages. If read substantially immediately after programming, comparison of Np and Nr can provide useful information for the page/block. In certain embodiments, data retention flag bytes may use two or a few bytes, which may be assigned from the raw data in each flash page. Therefore, data retention flag bytes may not result in a significant waste of storage capacity.
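A scrubbing alert based on the count difference might be sketched as below. The threshold value and the function name are hypothetical, and treating |Nr−Np| divided by the page size as an estimate of raw bit error rate is a simplifying assumption for this illustration.

```python
def needs_scrub(np_stored, nr_read, page_bits, rber_threshold=1e-3):
    """Flag a page for refresh when the estimated RBER reaches a threshold.

    rber_threshold is an assumed placeholder, not a value from the disclosure.
    """
    estimated_rber = abs(nr_read - np_stored) / page_bits
    return estimated_rber >= rber_threshold
```

Because the decision uses only the stored count and the raw read, it could be evaluated per page without involving the ECC engine.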
With respect to upper page reads, lower page data may be required to determine cells in State 3. Then, based on fail bit increases for cells in State 3, data retention effect may be correctly calculated. The process may include reading the upper page and its paired lower page to determine the fail bit number for State 3 (or State 1) only. This, however, may require the target number of State 3 cells in programming (NpL3), which can be stored in the spare area of the page. Once the difference between Nr and Np is determined for either State 0 or State 1, the process may be similar to that shown in
Those skilled in the art will appreciate that in some embodiments, other types of data storage systems and/or power management can be implemented. In addition, the actual steps taken in the processes discussed herein may differ from those described or shown in the figures. Depending on the embodiment, certain of the steps described above may be removed, and others may be added.
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of protection. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms. Furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the protection. For example, the various components illustrated in the figures may be implemented as software and/or firmware on a processor, ASIC/FPGA, or dedicated hardware. Also, the features and attributes of the specific embodiments disclosed above may be combined in different ways to form additional embodiments, all of which fall within the scope of the present disclosure. Although the present disclosure provides certain preferred embodiments and applications, other embodiments that are apparent to those of ordinary skill in the art, including embodiments which do not provide all of the features and advantages set forth herein, are also within the scope of this disclosure. Accordingly, the scope of the present disclosure is intended to be defined only by reference to the appended claims.
The present application claims the benefit of priority under 35 U.S.C. § 120 as a continuation of U.S. patent application Ser. No. 15/043,443 titled “Data Retention Flags in Solid-State Drives,” filed on Feb. 12, 2016, which claims the benefit of priority under 35 U.S.C. § 120 as a continuation of U.S. patent application Ser. No. 14/090,426, titled “Data Retention Flags in Solid-State Drives,” filed Nov. 26, 2013, now U.S. Pat. No. 9,263,136, which claims the benefit of priority under 35 U.S.C. § 119, as a nonprovisional of U.S. Provisional Application No. 61/873,503 titled “Data Retention Flags in Solid-State Drives,” filed on Sep. 4, 2013, each of the above applications being hereby expressly incorporated herein by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
20170352423 A1 | Dec 2017 | US |
Number | Date | Country | |
---|---|---|---|
61873503 | Sep 2013 | US |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 15043443 | Feb 2016 | US |
Child | 15484044 | | US |
Parent | 14090426 | Nov 2013 | US |
Child | 15043443 | | US |