Non-volatile memory storage devices based on NAND technology employ an erase/program cycle, which may impact performance if free blocks are not available at the time of write. Modern NAND-based storage devices erase blocks of storage proactively when they are not in use to improve performance. At present, some host devices can send “trim” commands to NAND storage devices in order to indicate to the storage device that a certain logical block address (LBA) range is no longer in use by the host's file system. However, using trim commands effectively requires tuning and analysis to insure that enough free blocks are available and that the trim commands themselves do not reduce performance/endurance. Microsoft's Performance Toolkit includes the ability to log input/output commands at the driver level, but it lacks the analysis capability required to properly assess the effectiveness of trim commands in order to optimize performance.
Embodiments of the present invention are defined by the claims, and nothing in this section should be taken as a limitation on those claims.
By way of introduction, the below embodiments relate to a method and storage device for assessing execution of trim commands. In one embodiment, a trace of trim and write commands sent to a storage device are obtained. For each trim command in the trace, a subsequent write command to a same logical block address (LBA) as the trim command is identified, and an elapsed time between the trim and write commands is calculated. This information can be used to display a histogram of the data and/or to optimize when the storage device executes trim commands and/or when the host device issues trim commands. Other embodiments are possible, and each of the embodiments can be used alone or together in combination. Accordingly, various embodiments will now be described with reference to the attached drawings.
Exemplary Host and Storage Devices
Turning now to the drawings,
In this embodiment, the storage device 100 is a mass storage device and takes the form of a solid-state drive (SSD), as that type of storage device currently supports the trim command. Other storage devices that can be used if they support the trim command include, but are not limited to, a handheld, removable memory card (such as a Secure Digital (SD) card or a MultiMedia Card (MMC)), a universal serial bus (USB) device, and embedded memory (e.g., a secure module embedded in the host device 50), such as an iNAND™ eSD/eMMC embedded flash drive by SanDisk Corporation.
As shown in
Turning now to the host device 50, the host device 50 comprises a controller 160 that has a storage device interface 161 for interfacing with the storage device 100 The controller 160 also comprises a central processing unit (CPU) 163, read access memory (RAM) 165, and read only memory (ROM) 166. The storage device 100 also contains a memory 172 for storing, for example, applications (apps) and programs (e.g., a browser, a media player, etc.) used in the operation of the host device 50. The controller's RAM 165 and/or the memory 172 can be used as a buffer for storing commands to be sent to the storage device 100. The host device 50 can contain other components (e.g., a display device, a speaker, a headphone jack, a video output connection, etc.), which are not shown in
In some environments, the host device 50 is operable to render content stored in the storage device 100. As used herein, “content” can take any suitable form, including, but not limited to, a song, a movie, a game, an application (“app”), a game installer, etc. Depending on the type of content, “render” can mean playing (e.g., when the content is a song or movie), deciphering (e.g., when the content is a game installer), or whatever action is needed to “enjoy” the content. In some embodiments, the host device 50 contains the necessary software to render the content (e.g., a media player), whereas, in other embodiments, such software is provided to the host device 50 by the memory device 100 or another entity.
Embodiments Related to Assessing Execution of Trim Commands
The host device 50 can issue a “trim” command to inform the storage device 100 if a logical block address (LBA) is no longer in use (e.g., when a user deletes a file, when a file is moved or defragmented, etc.). In response to the trim command, the storage device 100 may erase the page that contains the LBA, which provides additional free space to hold data for subsequent writes. The trim command from the host device 50 is often followed by a write command to write data to the erased LBA. If the storage device 100 executes the trim command when received from the host device 50, the storage device 100 may incur an unnecessary performance penalty, as the erase process takes time and may not be immediately needed if there are free blocks available to execute the write command. Instead, the storage device 100 can execute the write command by storing data in the free blocks and, later during idle time, perform the erase operation. However, typical storage devices execute trim commands upon receipt.
The following embodiments provide techniques that can be used to analyze the elapsed time between when a host device sends a trim command and when it sends a subsequent write command to the same LBA. This time delta can be used to assess trim command efficiency and determine whether the storage device 100 can delay or avoid execution of a trim command and still achieve the same or similar performance level as if the trim command were executed upon receipt. This information can also be used to delay when a trim command is sent by the host device 50 to the storage device 100. The information can be visually displayed to a user (e.g., via a histogram) and/or can be used to manually, automatically, or semi-automatically tune the algorithm for executing or sending trim commands. In addition, the analysis can show whether the trim command, if executed immediately, would improve performance by freeing blocks that would otherwise have to be erased during the execution of a write command.
Returning to the drawings,
Next, it is determined whether there is a trim command in the trace (act 215), and, if there is, an attempt is made to identify the first write command to the same LBA (acts 220 and 225). This may involve skipping read and other commands that may be present between the trim and write commands. If a write to the same LBA is found, the elapsed time between the trim and write commands is calculated (act 230), and the resulting calculation is saved (act 235) (here, for the generation of a histogram; however, as noted above, the calculation can be used to tune the trim execution or transmission algorithms).
If a write to the same LBA is not found, it is determined whether another trim command was performed on the same LBA without an intervening write command (act 240). If another trim command was performed (either by error or due to the programming of the host device 50), the process logs this event as a “double trim” event (act 245). If, on the other hand, another trim command was not performed, the process logs this event as a “never rewritten” event (act 250). This can occur, for example, if the host device 50 did not properly record the first trim command as having occurred.
The above process is repeated for the remaining trims in the trace. At the conclusion, a histogram from the calculations is displayed (act 255), and the process ends (act 260). An example of the resulting histogram is shown in
With this histogram, a user or technician can tune the firmware of the storage device 100, so the storage device 100 will delay at least 15 second before performing a trim command. Also, the information from the histogram on the number of writes issued sooner than 15 seconds as a proportion of the total number of writes in the trace can be used to inform the storage device 100 of the average number of free blocks it should have available. As noted above, the analysis can be performed by any suitable device, and the histogram can be displayed on the host device 50 or some other device. Also, instead of or in addition to displaying a histogram, the information gathered by this process can be used to tune the storage device 100 and/or the host device 50 to adjust the delay in executing or transmitting, respectively, a trim command.
There are many alternatives that can be used with these embodiments. For example, an operating system free disk space query may be used to further enhance the analysis by calculating theoretical free space available to the storage device 100. That is, if the process knows how much free space there is, it can factor in how important it is to perform a trim command to obtain more free space (e.g., by adjusting the frequency of performing trim commands based on available free space). This is shown in the flow chart 400 of
While the process shown in
It is intended that the foregoing detailed description be understood as an illustration of selected forms that the invention can take and not as a definition of the invention. It is only the following claims, including all equivalents, that are intended to define the scope of the claimed invention. Finally, it should be noted that any aspect of any of the preferred embodiments described herein can be used alone or in combination with one another.
This application claims the benefit of U.S. Provisional Patent Application No. 61/654,006, which was filed on May 31, 2012 and is hereby incorporated by reference herein.
Number | Name | Date | Kind |
---|---|---|---|
20020120815 | Zahavi et al. | Aug 2002 | A1 |
20100217927 | Song et al. | Aug 2010 | A1 |
20100262795 | Hetzler et al. | Oct 2010 | A1 |
20110145474 | Intrater | Jun 2011 | A1 |
20110208898 | Shin | Aug 2011 | A1 |
20120059978 | Rosenband et al. | Mar 2012 | A1 |
20120110247 | Eleftheriou et al. | May 2012 | A1 |
20120110249 | Jeong et al. | May 2012 | A1 |
20120117309 | Schuette | May 2012 | A1 |
20120131263 | Yeh | May 2012 | A1 |
20120243329 | Nagashima | Sep 2012 | A1 |
20130135767 | Hall et al. | May 2013 | A1 |
20130219106 | Vogan et al. | Aug 2013 | A1 |
Entry |
---|
Windows Performance Analysis Developer Center, http://msdn.microsoft.com/en-us/performance/cc709422.aspx, 3 pages, printed May 31, 2012. |
International Search Report and Written Opinion for PCT/US2013/036903 dated Sep. 17, 2013, 12 pages. |
“Sierra M6-4 SAS/SATA Protocol Analyzer-User Manual”, Lecroy, retrieved from internet: http://cdn.teledynelecroy.com/files/manuals/sierram6-4usermanual.pdf, May 2011, pp. FP-466. |
Application as filed for U.S. Appl. No. 13/524,924 entitled, “Method and Host Device for Assessing Execution of Trim Commands”, filed on Jun. 15, 2012, 20 pages. |
Office Action for U.S. Appl. No. 13/524,924 dated Jan. 29, 2014, 23 pages. |
Giryoung et al., “Performance Analysis of SSD Write Using TRIM in NFTS and EXT4”, Nov. 29, 2011-Dec. 1, 2011, Computer Sciences and Convergence Information Technology (ICCIT), 2011 6th International Conference, pp. 422-423. |
Koltsidas et al., “Data Management Over Flash Memory”, Jun. 12-16, 2011, SIGMOD '11, Athens, Greece, pp. 1209-1212. |
Office Action for U.S. Appl. No. 13/524,924 dated Aug. 7, 2014, 25 pages. |
Number | Date | Country | |
---|---|---|---|
20130326096 A1 | Dec 2013 | US |
Number | Date | Country | |
---|---|---|---|
61654006 | May 2012 | US |