The field of invention pertains generally to the computing sciences and, more specifically, to word line read disturb error reduction through fine grained access counter mechanism.
As the storage densities of mass storage devices continues to increase, both data corruption error mechanisms and the manner of trying to avoid them are becoming increasingly complex. An increasingly problematic data corruption mechanism for high density FLASH memory devices is the word line read disturb error mechanism. In the case of the word line read disturb error mechanism, data kept by storage cells that are proximate to a cell being frequently accessed for read operations may become corrupted on account of the signaling being frequently applied to the cell.
A better understanding of the present invention can be obtained from the following detailed description in conjunction with the following drawings, in which:
As observed in
The physical layout of
Additionally, the block of
Other nodes of a storage cell stack, however, are not shared amongst the multiple storage cell stacks of the block. Besides the storage cells of the stacks themselves, note that each storage cell stack in the particular layout embodiment 200 of
In order to access any particular storage cell within the block, such as a cell in the leftmost column in
Note that the horizontal X axis of
According to the perspective of
The storage cells of more recent FLASH memory technologies, however, are designed to store more than one bit. For example, in the case of triple level cell (TLC) FLASH technology, each FLASH storage cell stores three bits of data. Referring to
One of the responsibilities of the control function 221 is the implementation of wear-leveling. As is known in the art, FLASH devices can “wear out” (their ability to keep data becomes unreliable) if they are accessed too frequently. As such, the control function 221 implements a special algorithm that monitors how often each of the storage cell blocks within each FLASH chip 222 are being accessed, and, “swaps” the data content of a more frequently accessed block with the data content of a less frequently used block. By so doing, the reliability of the storage cells of the block that was storing the more frequently accessed data is preserved because, after the swap, they will be storing less frequently accessed data.
Traditionally, the control function 221 has maintained a counter for each block within each FLASH chip 222 that increments each time its corresponding block is accessed. Those blocks whose access counts surpass a first threshold are marked as candidate blocks whose data should be swapped out for wear leveling purposes. Those blocks whose access counts remain beneath a second threshold are marked as candidate blocks for receiving data from blocks whose counts surpass the first threshold. During an actual wear leveling swap, a blocks whose count is below the second threshold has its less frequently accessed data moved into a block whose access count has surpassed the first threshold.
Another data reliability problem, referred to as “word line read disturb”, results from a word line being accessed too frequently for read operations. When a word line is activated for a read operation one or more storage cells that are directly tied to the word line are accessed. Unfortunately, the access induces, e.g., high energy fields and/or voltage potentials, in and/or around certain storage cells that are proximate to the accessed storage cell(s). In cases where a particular word line is activated for read operations too frequently, such proximate storage cells can have their stored data corrupted (one or more bits of the proximate storage cells can be flipped).
Unfortunately, preventing word line read disturb errors by way of the aforementioned block based access count mechanism described just above is becoming increasingly unfeasible. Specifically, traditionally, word line read disturb error rates correlated fairly tightly with uniform block errors induced from other kinds of block accesses. As a consequence, block access count based wear leveling was sufficient to also predict/avoid word line read disturb errors. With the smaller stacked storage cells of newer FLASH technologies, however, word line read disturb error rates are becoming multiple factors higher than uniform block errors rates (e.g., 16 times higher, 128 times higher, etc.). As such, a finer grained access counting mechanism that includes, e.g., some kind of word line access counting mechanism is warranted.
Because pages of information are recognized/understood by the control function 221, one approach for counting word line accesses is to keep a counter for each page that is stored in each of the FLASH chips 222. In this case, whenever a specific page is accessed for a read operation, its specific counter is incremented. Unfortunately, this approach is not feasible because the number of different counters that would need to be maintained by the control function 221 is very large which, in turn, would devote too much of the control function's memory resources to the different counter values.
For example, referring briefly back to
Thus, in order to reduce the sheer number of counters, another approach is to use two counters: 1) a first counter of the form [die_ID; block_ID]; and, 2) a second counter of the form [block_ID; SGS-WL_ID]. Here, for the first counter, die_ID is used to identify a particular one of the FLASH chips 222_1 through 222_N in the SSD and block_ID is used to identify a particular block within the chip identified by the die_ID. By contrast, for the second counter, block_ID corresponds to a particular block within all of the SSD's FLASH chips 222 (also referred to as a “macroblock”) and SGS-WL_ID identifies a particular segment of a word line within the set of blocks identified by the block_ID whose respective storage cell stacks are tied to a same SGS node (also referred to an “SGS-WL segment”).
As there are thousands of blocks per FLASH device, the number of counters of the first type (hereinafter referred to as “per block counters”) should only be in the range of tens of thousands or less for SSD's having tens of FLASH chips or less. Furthermore, as there are typically no more than hundreds of different SGS-WL segments within a block and no more than thousands of blocks per FLASH chip, the number of counters of the second form (hereinafter referred to as “SGS-WL segment counters”) should be in the range of hundreds of thousands or less. Thus, the number of counters is dramatically reduced as compared to an approach that tracks access counters for each page in the SSD.
The per block counters 324 provide a read access count for each block in the SSD and are akin to the counters used by traditional SSDs in that there are separate counters for each block within each die in the SSD. The SGS-WL counters 325 provide a respective read access count for different SGS-WL segments across all the dies in the SSD and are particularly insightful into the prediction of word line read disturb errors.
Referring briefly back to
For the particular block embodiment of
The SGS-WL_ID notation of the SGS-WL counters indicates that there is a separate counter reserved for each unique combination of word line and group of columns, across all the FLASH chips in the SSD, that are tied to a same SGS node. As such there are 16 different counters maintained for the block structure of
Active avoidance of read disturb errors within the SSD can be achieved by observing the state of the per block and SGS-WL counters 324, 325 and implementing specific wear level routines in response. Examples are described immediately below. Here, the control function 321 also includes threshold settings 326, are parameters that, e.g, are stored in the SSD's non volatile storage and/or, e.g., BIOS firmware non volatile storage of the SSD's host platform and loaded into the control function during bring up and/r power-on-reset of the SSD. The control function 321 determines when triggering a wear leveling data swap is appropriate by repeatedly comparing the current state of counters 324 and 325 to the threshold settings 326 (or calculations made from them).
The per block threshold is a first threshold setting and is repeatedly compared against each of the per block counter values 324. If the value of any one of the per block counters 325 surpasses the per block threshold, that data contained by that counter's corresponding block is swapped with the data of another block within the SSD 320 whose corresponding per block counter is, e.g., especially low (e.g., beneath some second threshold).
The per SGS-WL threshold can be used to detect when a particular SGS-WL segment of a block has received a sufficient number of read accesses to justify the swapping of the segment's pages with the pages of another SGS-WL segment that has received a relatively low number of read accesses.
In an embodiment, because SGS-WL counter values for a particular SGS-WL are accumulated across multiple chips of an SSD, an e.g., empirically determined count threshold for the SGS-WL segment of a single chip is scaled upward depending on how evenly or unevenly read accesses to a particular SGS-WL segment is distributed across the dies of the SSD. As such, in various embodiments, the control function 321 is provided the single die SGS-WL threshold as one of thresholds 326. The wear leveling algorithms executed by the control function 321 then dynamically adjust/determine the thresholds that trigger wear leveling swaps for individual SGS-WL segments based on the count state of the SGS-WL counters 326 and the per block counters 325. Here, the per block counters 325 add insight into usage patterns within the SSD 320 that can be used to refine the thresholds for specific SGS-WL segments.
Consider an example where the empirically determined single die SGS-WL threshold corresponds to 3,000 read accesses for word line read disturb errors. Consider further an SSD having four FLASH chips in the SSD (N=4 in
In the depicted SGS-WL counter values 402, the second column 412 is aligned with the exemplary block design of
From the SGS-WL counters 402 it is clear that SGS-WL segment 00 is receiving the vast majority of read accesses that are made to blocks having ID=300 across all four chips. This means the total per block counts for block 300 observed in the read block counters 401 are approximately tallies of the SGS-WL segment 00 by itself. Viewing the counts 401, 402 in this manner it is clear that the accesses to SGS-WL segment 00 are being evenly distributed across the dies. As such, the SGS-WL counter threshold for SGS-WL segment 00 sufficient to trigger page data swapping of SGS-WL segment 00 within each of the four die should be set at the single chip threshold times the number of chips in the SSD (e.g., 3,000×4=12,000).
If the per block counter state 501 of
Referring to the per block counter state 501, it is clear that die 0 is receiving the vast majority of accesses to block 300. In this case, the decision to swap the pages of SGS-WL segment 00 could be made at a SGS-WL counter value that is equal to the single die threshold (3,000) especially since all other SGS-WL segments within block 300 continue to show little/no access. That is, combining the information from both counter value tables 501 and 502, it is clear that SGS-WL segment 00 of block 300 of die 0 has received all 3,000 accesses to block 300 of die 0. Therefore, the threshold for the SGS-WL counter value that triggers the swapping of the pages of SGS-WL segment 00 should be set at 3,000 instead of 12,000 and should only affect SGS-WL segment 00 within die 0. Thus, whereas the scenario of
Other scenarios that result in SGS-WL counter thresholds that reside somewhere between the extremes of
With respect to the aforementioned page swapping, when the storage cells along a particular SGS-WL are swapped out, all of the pages contained by each of the cells is swapped out. Thus, for instance, in the case of TLC FLASH technology, three pages are swapped out per cell.
As observed in
An applications processor or multi-core processor 850 may include one or more general purpose processing cores 815 within its CPU 801, one or more graphical processing units 816, a memory management function 817 (e.g., a memory controller) and an I/O control function 818. The general purpose processing cores 815 typically execute the operating system and application software of the computing system. The graphics processing units 816 typically execute graphics intensive functions to, e.g., generate graphics information that is presented on the display 803. The memory control function 817, which may be referred to as a main memory controller or system memory controller, interfaces with the system memory 802. The system memory 802 may be a multi-level system memory.
Each of the touchscreen display 803, the communication interfaces 804-807, the GPS interface 808, the sensors 809, the camera 810, and the speaker/microphone codec 813, 814 all can be viewed as various forms of I/O (input and/or output) relative to the overall computing system including, where appropriate, an integrated peripheral device as well (e.g., the camera 810). Depending on implementation, various ones of these I/O components may be integrated on the applications processor/multi-core processor 850 or may be located off the die or outside the package of the applications processor/multi-core processor 850. Non volatile storage 820 may include non volatile mass storage which may include one or more three dimensional FLASH SSDs having dual counters to implement fine grained wear leveling as described at length above. Non volatile storage 820 may hold the BIOS and/or firmware of the computing system.
One or more various signal wires within the computing system, e.g., a data or address wire of a memory bus that couples the main memory controller to the system memory, may include a receiver that is implemented as decision feedback equalizer circuit that internally compensates for changes in electron mobility as described above.
Embodiments of the invention may include various processes as set forth above. The processes may be embodied in machine-executable instructions. The instructions can be used to cause a general-purpose or special-purpose processor to perform certain processes. Alternatively, these processes may be performed by specific hardware components that contain hardwired logic for performing the processes, or by any combination of programmed computer components and custom hardware components.
Elements of the present invention may also be provided as a machine-readable medium for storing the machine-executable instructions. The machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs, and magneto-optical disks, FLASH memory, ROMs, RAMs, EPROMs, EEPROMs, magnetic or optical cards, propagation media or other type of media/machine-readable medium suitable for storing electronic instructions. For example, the present invention may be downloaded as a computer program which may be transferred from a remote computer (e.g., a server) to a requesting computer (e.g., a client) by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection).
This application is a divisional of and claims the benefit of U.S. patent application Ser. No. 15/627,928, entitled, “WORD LINE READ DISTURB ERROR REDUCTION THROUGH FINE GRAINED ACCESS COUNTER MECHANISM”, filed Jun. 20, 2017, now U.S. Pat. No. 10,236,069, Issued Mar. 19, 2019 which are incorporated by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
Parent | 15627928 | Jun 2017 | US |
Child | 16357771 | US |