Technical Field
This disclosure relates to computing systems. More particularly, the disclosure relates to systems and methods for managing data retention in a data storage system.
Description of the Related Art
Solid-state memory may utilize transistor devices to store charge representing data. Over time, such transistor devices can lose charge, causing data retention issues. Data retention in solid-state storage devices can depend of various environmental conditions, such as temperature.
Various embodiments are depicted in the accompanying drawings for illustrative purposes, and should in no way be interpreted as limiting the scope of this disclosure. In addition, various features of different disclosed embodiments can be combined to form additional embodiments, which are part of this disclosure.
While certain embodiments are described, these embodiments are presented by way of example only, and are not intended to limit the scope of protection. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms. Furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made without departing from the scope of protection.
The term “equivalent,” as used to describe equivalent data retention time, may refer to an effective data retention state of a cell, block, die, or other portion of solid-state memory. Therefore, equivalent data retention time may take into consideration both the temperature experienced by the memory, as well as the time the data has been written to memory.
As used in this application, “non-volatile solid-state memory,” “non-volatile memory,” “NVM,” or variations thereof may refer to solid-state memory such as NAND flash. However, the systems and methods of this disclosure may also be useful in more conventional hard drives and hybrid drives including both solid-state and hard drive components. Solid-state memory may comprise a wide variety of technologies, such as flash integrated circuits, Phase Change Memory (PC-RAM or PRAM), Programmable Metallization Cell RAM (PMC-RAM or PMCm), Ovonic Unified Memory (OUM), Resistance RAM (RRAM), NAND memory, NOR memory, EEPROM, Ferroelectric Memory (FeRAM), MRAM, or other discrete NVM (non-volatile solid-state memory) chips. The non-volatile solid-state memory arrays or storage devices may be physically divided into planes, blocks, pages, and sectors, as is known in the art. Other forms of storage (e.g., battery backed-up volatile DRAM or SRAM devices, magnetic disk drives, etc.) may additionally or alternatively be used.
Overview
Floating-gate transistors can be used as charge-storing devices for maintaining data in a solid-state storage system.
The repeated electron tunneling mechanism from writing and erasing the transistor cell can cause the buildup of charge traps in the tunnel oxide layer, which can affect the cell's ability to store charge. De-trapping of charges can cause the cell threshold to decrease from the level set through the relevant program algorithm, leading to a potential “bit flip.” Various factors may affect the effect of charge de-trapping on data retention, such as, for example, time and temperature.
Because charge de-trapping can cause the charges stored in solid-state memory cells to degrade, it may be desirable to implement data refreshing/scrubbing in solid state drive applications in order to refresh stored data (e.g., to other unused blocks) before the data retention degrades to the extent that data cannot be correctly read out. One method for determining when data scrubbing should be performed involves performing a data integrity scan (DIS). Such scans may be performed periodically, such as on at monthly, weekly, or other intervals. In certain embodiments, data integrity scans determine whether data scrubbing is needed based at least in part on calculated row bit error rate. Data scrubbing determinations according to such methods may be based on duration of time since programming.
In addition to storage time, as referenced above, additional environmental factors may contribute to a solid-state drive's data retention characteristics. For example, temperature can adversely affect data retention characteristics, particularly with respect to heavily cycled blocks. In certain implementations, the temperature that a drive is subjected to during power-off periods is not taken into consideration when determining when blocks of data should be refreshed.
The ability of solid-state memory, such as NAND flash, to store and retain data can depends on the temperature that the memory is subjected to during writing, as well as between the time the data is written to the time the data is read. According to Arrhenius' equation, data retention acceleration caused by elevated temperatures is an exponential function, and therefore its effect can be significant.
Thermal activated acceleration can be used for data retention testing of solid-state drives, where an escalated temperature is used to accelerate data retention effects so that a full data retention test can be completed in a shortened period of time. The term “acceleration factor” (AF), or “data retention AF,” which may be used in the context of such testing, refers to the degree to which a relatively elevated temperature accelerates the degradation of solid-state cells. As described herein, an elevated temperature can greatly accelerate the de-trapping process in the context of data retention; AF can be an exponential function with respect to the relationship between temperature and activation energy. Certain embodiments disclosed herein provide improved estimation of equivalent data retention time after data is programmed to a block of solid-state memory. Certain embodiments involve equivalent data retention time calculation based at least in part on calculated or determined acceleration factor values for various sampled temperature points.
Certain embodiments disclosed herein provide a solid-state storage device comprising a non-volatile solid-state memory array including a plurality of non-volatile memory devices configured to store data and a temperature sensor. The solid-state storage device may further include a controller configured to write user data to the non-volatile solid-state memory array at a first time and receive a signal from the temperature indicating a temperature of at least a portion of the non-volatile memory array. In certain embodiments, the controller is further configured to calculate or determine an acceleration factor value based at least in part on the temperature signal and calculate or determine an accumulated effective data retention time period relative to a reference temperature. The controller may be further configured to determine whether the effective data retention time period exceeds a predetermined threshold value and cause a data scrubbing operation to be executed when it is determined that the predetermined threshold value was exceeded.
Data Storage System
In certain embodiments, the controller 130 is configured to receive memory access commands from a storage interface (e.g., a device driver) 112 residing on a host system 110. The controller 130 is configured to execute commands in response to such host-issued memory commands in the non-volatile solid-state memory arrays 150. Storage access commands communicated by the storage interface 112 can include write and read commands issued by the host system 110. The commands can specify a block address in the data storage system 120, and the controller 130 can execute the received commands in the non-volatile solid-state memory array(s) 150. Data may be accessed/transferred based on such commands.
The solid-state storage device 120 can store data received from the host system 110 such that the solid-state storage device 120 acts as memory storage for the host system 110. To facilitate this function, the controller 130 can implement a logical interface. The logical interface can present to the host system 110 storage system memory as a set of logical addresses (e.g., contiguous addresses) where data can be stored. Internally, the controller 130 can map logical addresses to various physical memory addresses in the non-volatile solid-state memory array 150 and/or other memory module(s). Mapping data indicating the mapping of logical addresses to physical memory addresses may be maintained in the solid-state storage device.
In an embodiment, the storage device 120 may be a hybrid disk drive that additionally includes magnetic memory storage (not shown). In such case, one or more controllers 130 may control the magnetic memory storage and the non-volatile solid-state memory array(s) 150.
Acceleration Factor Calculation
In certain embodiments, data retention in solid-state devices, such as NAND flash, involves a typical charge thermal activated de-trapping process, which, as described above, can be significantly affected by temperature. For purposes of discussion, the time-to-fail τ of a portion of solid-state memory can be expressed as:
where τ0 is a reference constant, Eα is the activation energy which is a measure of the difficulty of charge de-trapping (As used herein, Eα=1.1 eV for NAND flash working with F-N tunneling for program and erase), KB is the Boltzmann constant, and T is the temperature to which the device is exposed. As demonstrated by equation (1), as the value of T increases, the time-to-fail may decrease. The acceleration factor
when T is increased from T1 to T2 can be calculated as:
As is apparent from equation (2), when T1 and T2 are known AF may be calculated. For example, the controller 130 of
As shown, in an embodiment, a temperature of 85° C. may correspond to an AF approximately equal to 168. That is, 1 hour data retention at 85° C. may be equivalent to approximately 168 hours of data retention at 40° C.
Equivalent Data Retention Time Calculation
As described herein, the temperature history of a solid-state drive, or portion thereof, may be expressed as a function of time as T=T(t). Therefore, the total equivalent time from programming to a later reference time (relative to, for example, 40° C.) may be expressed as:
teffective=∫0tAF(T)dt (3)
Additionally, the temperature history may be represented as a discrete numerical integral over the temperature history as:
teffective=Σ0nAF(T)Δt (4)
The duration of time represented by Δt may be any suitable period of time, such as, for example, 1 hour. In certain embodiments, the controller 130 may check temperature every Δt interval and calculate or look-up AF values for each measured temperature. Then the equivalent time segment AF(T)Δt may be accumulated to the current, or relevant, total effective time. In certain embodiments, such checking and calculation may persist until teffective achieves a pre-set threshold, such as the equivalent of 30 days at 40° C.
The process 400 may provide a mechanism for tracking “effective” age of the data written to the solid-state storage array. That is, the process 400 may involve maintaining a calculated or determined value, teffective, which represents the equivalent amount of time since the data was written, taking into account the effect of temperature on the data retention characteristics of the memory cells. At block 408, a value for teffective associated with the data write is reset, wherein the value increases over time as data retention of the memory degrades. teffective may correspond to some portion of the solid-state array. For example, the method 400 may track teffective values on a block-level, page-level, or other scheme.
At block 410, the process 400 involves checking system temperature. For example, a storage drive may have one or more temperature sensors disposed in proximity to the solid-state memory array(s), wherein a one or more temperature signals provided by such sensor(s) indicate a temperature of the solid-state memory. Using the system temperature, a data retention acceleration factor (AF) may be calculated or determined at block 412. For example, the AF may be calculated using Equation (2) above. In certain embodiments, discrete AF values are maintained in one or more look-up tables in system memory, wherein AF calculation at block 412 comprises accessing the look-up table and selecting an appropriate AF value based at least in part on the system temperature.
At block 412, teffective is calculated based at least in part on the calculated/determined AF value. For example, teffective may be calculated/determined according to the following equation:
teffective=teffective+AF×Δt (5)
During a first iteration of process 400, where teffective is being calculated or determined for a first time following reset at block 408 (i.e., teffective=0), the calculation of a current value of teffective using Equation (5) is reduced simply to:
teffectiveAF×Δt (6)
where Δt corresponds to a period of time over which the solid-state memory is determined or estimated to have experienced the measured temperature. For example, if the process 400 involves checking system temperature periodically, Δt may correspond to the period between temperature checks. In an embodiment, Δt is approximately one hour. In certain other embodiments, Δt may be dependent on system temperature. For example, Δt may be shorter for higher temperatures due to the data retention effects of increased temperature. Furthermore, Δt may be based at least in part on program/erase (P/E) cycle condition.
After determining teffective, the process 400 involves determining whether the value for teffective exceeds a predetermined threshold. If so, the process 400 proceeds to block 420 where the system is alerted for data scanning or data refreshing. When teffective is below the predetermined threshold, the process 400 proceeds to block 418 to wait for the predetermined time interval, Δt, before checking the temperature again at block 410. In certain embodiments, the pre-set time threshold is a function of P/E cycling (or other equivalent characteristic, such as raw bit error rate RBER). For example, more P/E cycled blocks may receive more frequent scanning; that is, they may correspond to shorter time “thresholds.” The process 400 loops between blocks 410 and 416 until teffective exceeds the threshold, at which point data scanning and/or refreshing of the solid-state memory is performed or scheduled.
Other Variations
Those skilled in the art will appreciate that in some embodiments, other types of data storage systems and/or power management can be implemented. In addition, the actual steps taken in the processes discussed herein may differ from those described or shown in the figures. Depending on the embodiment, certain of the steps described above may be removed, others may be added.
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of protection. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms. Furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the protection. For example, the various components illustrated in the figures may be implemented as software and/or firmware on a processor, ASIC/FPGA, or dedicated hardware. Also, the features and attributes of the specific embodiments disclosed above may be combined in different ways to form additional embodiments, all of which fall within the scope of the present disclosure. Although the present disclosure provides certain preferred embodiments and applications, other embodiments that are apparent to those of ordinary skill in the art, including embodiments which do not provide all of the features and advantages set forth herein, are also within the scope of this disclosure. Accordingly, the scope of the present disclosure is intended to be defined only by reference to the appended claims.
This application is a continuation of U.S. patent application Ser. No. 14/026,017, filed Sep. 13, 2013, which claims priority to provisional U.S. Patent Application Ser. No. 61/859,714, filed on Jul. 29, 2013. The contents of U.S. Patent Application Ser. No. 61/859,714 and Ser. No. 14/026,017 are hereby incorporated by reference in their entirety.
Number | Name | Date | Kind |
---|---|---|---|
6639937 | Doetsch et al. | Oct 2003 | B2 |
6940775 | Schneider et al. | Sep 2005 | B2 |
7248527 | Park | Jul 2007 | B2 |
7395176 | Chung | Jul 2008 | B2 |
7460394 | Happ et al. | Dec 2008 | B2 |
7493531 | Ito et al. | Feb 2009 | B2 |
7539050 | Philipp | May 2009 | B2 |
7623401 | Philipp et al. | Nov 2009 | B2 |
7764561 | Lee | Jul 2010 | B2 |
7813205 | Sako | Oct 2010 | B2 |
7817469 | Avraham et al. | Oct 2010 | B2 |
7843753 | Mayer | Nov 2010 | B2 |
7990776 | Kim | Aug 2011 | B2 |
8111575 | Mori et al. | Feb 2012 | B2 |
8279696 | Shinozaki | Oct 2012 | B2 |
8300488 | Pyeon | Oct 2012 | B2 |
8395323 | Wendt et al. | Mar 2013 | B2 |
8462560 | Furutani | Jun 2013 | B2 |
8719531 | Strange et al. | May 2014 | B2 |
8724415 | Kuroda | May 2014 | B2 |
8737136 | Cometti | May 2014 | B2 |
8982653 | Nurminen | Mar 2015 | B2 |
8995218 | Koyama | Mar 2015 | B2 |
9165668 | Zhao | Oct 2015 | B1 |
20020004882 | Kai | Jan 2002 | A1 |
20110219203 | Nurminen et al. | Sep 2011 | A1 |
20110302353 | Confalonieri et al. | Dec 2011 | A1 |
20120101731 | Joseph | Apr 2012 | A1 |
20120287711 | Kim et al. | Nov 2012 | A1 |
20120324191 | Strange et al. | Dec 2012 | A1 |
20140101371 | Nguyen et al. | Apr 2014 | A1 |
20140229131 | Cohen et al. | Aug 2014 | A1 |
20140254298 | Dally | Sep 2014 | A1 |
Entry |
---|
U.S. Appl. No. 14/026,017, filed Sep. 13, 2013 by Dengtao Zhoa et al. |
Prosecution History from U.S. Appl. No. 14/026,017, dated Jan. 22, 2015 through Jun. 12, 2015, 18 pp. |
Number | Date | Country | |
---|---|---|---|
20160027523 A1 | Jan 2016 | US |
Number | Date | Country | |
---|---|---|---|
61859714 | Jul 2013 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 14026017 | Sep 2013 | US |
Child | 14876567 | US |