Hybrid drive writing copy of data to disk when non-volatile semiconductor memory nears end of life

Information

  • Patent Grant
  • 8825977
  • Patent Number
    8,825,977
  • Date Filed
    Tuesday, September 28, 2010
    14 years ago
  • Date Issued
    Tuesday, September 2, 2014
    10 years ago
Abstract
A hybrid drive is disclosed comprising a head actuated over a disk, and a non-volatile semiconductor memory (NVSM). When a write command is received from a host that is mapped to the NVSM, the write command is serviced by writing data to the NVSM, and when a life remaining of the NVSM falls below a threshold, by also writing a copy of the data to the disk.
Description
BACKGROUND

Hybrid drives are conventional disk drives augmented with a non-volatile semiconductor memory (NVSM) such as a flash which helps improve certain aspects of the disk drive. For example, the non-volatile semiconductor memory may store boot data in order to expedite the boot operation of a host computer. Another use of a NVSM may be to store frequently accessed data and/or non-sequential data for which the access time is typically much shorter than the disk (which suffers from mechanical latency including seek and rotational latency). Other policies may reduce write amplification of the NVSM in order to maximize its longevity, such as storing frequently written data to the disk (or data having a write/read ratio that exceeds a predetermined threshold).





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1A shows a hybrid drive according to an embodiment of the present invention comprising a head actuated over a disk, and a non-volatile semiconductor memory (NVSM).



FIG. 1B is a flow diagram according to an embodiment of the present invention wherein when a write command is received from a host that is mapped to the NVSM, and a life remaining of the NVSM falls below a threshold, a copy of the data is also written to the disk.



FIG. 2 shows an embodiment of the present invention wherein the life remaining of the NVSM is determined based on a number of program/erase cycles, program/erase times, or write/read/erase errors.



FIG. 3A is a flow diagram according to an embodiment of the present invention wherein when a read command is received from the host that is mapped to the NVSM, data is read from the NVSM, and when a life remaining of the NVSM falls below a threshold, a copy of the data read from the NVSM is also written to the disk.



FIG. 3B is a flow diagram according to an embodiment of the present invention wherein when a read command is received from the host, and a number of correctable read errors exceeds a threshold when reading data from the NVSM, a copy of the data read from the NVSM is also written to the disk.



FIG. 4 is a flow diagram according to an embodiment of the present invention wherein when data is unrecoverable from the NVSM, the copy is read from the disk and written to a new page of the NVSM.



FIG. 5 is a flow diagram according to an embodiment of the present invention wherein when data is unrecoverable from the NVSM and the NVSM is read only, a second copy of the data is written to the disk.



FIGS. 6A and 6B show an embodiment of the present invention wherein a copy of NVSM data is stored on the disk in an area having an increased number of ECC symbols.





DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION


FIG. 1A shows a hybrid drive according to an embodiment of the present invention comprising a head 2 actuated over a disk 4, and a non-volatile semiconductor memory (NVSM) 6. The hybrid drive further comprises control circuitry 8 operable to execute the flow diagram of FIG. 1B, wherein when a write command is received from a host (step 10) that is mapped to the NVSM (step 12), the write command is serviced by writing the data to the NVSM (step 14), and when a life remaining of the NVSM falls below a threshold (step 16), by also writing a first copy of the data to the disk (step 18).


The disk 4 shown in the embodiment of FIG. 1A comprises a plurality of servo sectors 200-20N that define data tracks 18. The control circuitry 8 processes a read signal 22 emanating from the head 2 to demodulate the servo sectors 200-20N into a position error signal (PES) representing a position error of the head relative to a target data track. The control circuitry 8 comprises a servo compensator for filtering the PES to generate a control signal 24 applied to a voice coil motor (VCM) 26 that rotates an actuator arm 28 about a pivot in order to actuate the head 2 radially over the disk 4 in a direction that reduces the PES.


In the embodiment of FIG. 1A, any suitable NVSM 6 may be employed, such as any suitable electrically erasable/programmable memory (e.g., a flash memory). In one embodiment, the NVSM 6 comprises a plurality of blocks, where each block comprises a plurality of memory segments referred to as pages and each page may store one or more data sectors. The blocks are programmed a page at a time, and an entire block is erased in a unitary operation. In one embodiment, a garbage collection operation may be executed on previously written blocks in order to copy valid pages to new blocks (and/or to the disk) so that the previously written blocks may be erased and re-used.


In one embodiment, there is a limited number of program/erase cycles that may be performed on each block in the NVSM 6 (referred to as endurance). A wear leveling algorithm is typically executed to spread write operations evenly over the blocks so that the blocks wear out together. Prior to degrading to the point where written data may become unrecoverable, the NVSM 6 is typically disabled so that subsequent write operations are serviced by the disk 4. However, there may still be a significant amount of life remaining for a number of the blocks when the NVSM 6 disabled, which reduces performance of the hybrid drive prematurely. Accordingly, in one embodiment of the present invention, when the NVSM 6 nears end of life (or a block within the NVSM 6 nears end of life), the data written to the NVSM 6 is also written to a backup copy on disk 4. In this manner, if data in the NVSM 6 becomes unrecoverable, the backup copy can still be recovered from the disk, thereby enabling use of the NVSM 6 until it literally reaches the end of its life.


Any suitable technique may be employed to determine the life remaining of the NVSM 6 (or a block in the NVSM 6) in order to enable the backup operation. FIG. 2 shows an embodiment wherein the life remaining may be determined from a number of program/erase cycles performed on the NVSM 6 (or block), the program or erase time, or an error detected when writing/reading/erasing data when accessing the NVSM 6. The number of program/erase cycles (endurance) of the NVSM 6 may be pre-characterized such that the life remaining is essentially estimated. In addition, one or both of the program and erase times will typically increase as a block wears out and therefore may also reflect the life remaining of the particular block, as well as the entire NVSM 6. Similarly, an error detected during a write/read/erase operation to the NVSM 6 is typically a sign of wear and therefore a reflection of the life remaining of the particular block, as well as the entire NVSM 6. A write error or erase error may be detected if the NVSM 6 fails a write verify operation (reading the data after writing), and a read error may be detected by an error correction code (ECC). Read errors may be correctable such that the corresponding page is recoverable, or read errors may be uncorrectable such that the corresponding page is unrecoverable. In any event, when a page exhibits one or more signs that it's nearing end of life (life remaining falling below a threshold), the data stored in the failing block may be backed up to the disk (when servicing write or read commands to the block). In one embodiment, when a predetermined number of blocks begin to fail, it is assumed that all blocks are beginning to fail and therefore a disk backup copy is made for all data stored in the NVSM 6 when servicing write/read commands to/from the NVSM 6.


In the embodiment of FIG. 1B, when servicing a write command using the NVSM 6 and the life remaining has fallen below a threshold, the data is written to both the NVSM 6 and the disk 4. In one embodiment, the NVSM 6 (or block) may be marked as failing prior to receiving the write command, and therefore a copy of the data will be stored on the disk 4 regardless as to a result of the current NVSM write operation. In another embodiment, a written block will be detected as failing due to a write error (e.g., a write verify error) and therefore a copy of the data is stored on the disk 4 in response to the write error.



FIG. 3A shows a flow diagram according to an embodiment of the present invention wherein when a read command is received from the host (step 30) and the data is stored in a block of the NVSM (step 32), the read command is serviced by reading data from the block (step 34), and when a life remaining of the NVSM (or block) falls below a threshold (step 35), by also writing a copy of the data read from the NVSM to the disk (step 36). In one embodiment, the entire NVSM is determined to be failing and therefore a copy of the data is always written to the disk when servicing read commands from any block of the NVSM. In an alternative embodiment, each block is determined to be failing independent of the other blocks, and therefore a copy of the data is written to the disk only when servicing a read command from a failing block.


In another embodiment illustrated in the flow diagram of FIG. 3B, when a read command is received from the host (step 30) and the data is stored in a block of the NVSM (step 32), the read command is serviced by reading data the block (step 34) and detecting a number of correctable errors. When the number of correctable errors exceeds a threshold (step 37), a copy of the data read from the NVSM is written to the disk (step 38). In one embodiment, when data is read from other pages of the failing block (and a backup copy has not yet been written to the disk), the data is copied to the disk regardless as to the number of correctable errors detected. In another embodiment, when the number of correctable errors exceeds the threshold for a predetermined number of blocks, a backup copy is written to the disk when data is read from any of the blocks in the NVSM (and a backup copy has not yet been written to the disk).



FIG. 4 shows a flow diagram according to an embodiment of the present invention wherein when a read command is received from the host (step 40), and the data is stored on the NVSM (step 42), the read command is serviced by attempting to read the data from the NVSM (step 44). If the data is unrecoverable from the NVSM (step 46), the backup copy is read from the disk (step 48), and the data read from the disk is written to a new block in the NVSM (step 50). In one embodiment, the block comprising the unrecoverable page is marked as read-only to prevent any further write operations to the block. In an alternative embodiment, the block is removed from the garbage collection operation so that it eventually becomes read only after all empty pages have been written a final time. This embodiment enables continued read operations on the other pages of the block until all of the pages become unrecoverable. That is, a block is not retired as unusable until all of the pages have become unrecoverable (or invalidated due to an overwrite of the corresponding LBAs), thereby realizing the benefit of every page in every block until each page reaches end of life (or is invalidated).


In one embodiment, all of the blocks in the NVSM will eventually become read only as the NVSM nears end of life. Accordingly, in an embodiment shown in the flow diagram of FIG. 5, when a read command is received (step 40) and a corresponding page in a block of the NVSM is unrecoverable (step 46) requiring the backup copy of the data to be read from the disk (step 48), and if all of the blocks in the NVSM have been converted to read only (step 52), then a backup copy of the data is stored on the disk (step 54). This embodiment ensures that at least two copies of the data are stored in non-volatile memory (both copies on the disk) in the event that one of the copies becomes unrecoverable.



FIGS. 6A and 6B show an embodiment of the present invention wherein the disk comprises a first plurality of data tracks for storing data sectors having a first number of error correction code (ECC) symbols, and a second plurality of data tracks having a second (greater) number of ECC symbols. When a backup copy of data stored in the NVSM is written to the disk, it is written to a data sector in the second plurality of data tracks in order to increase the reliability of the data. For example, in one embodiment the NVSM is used to store highly critical data (e.g., operating system data) whereas the first plurality of data tracks on the disk are used to store less critical data (e.g., audio/video files). Therefore, the backup copy of the NVSM data is stored on the disk using extended ECC symbols to help ensure the data is recoverable when the NVSM copy becomes unrecoverable. In the example shown in FIG. 6B, the ECC symbols are generated using any suitable polynomial code (e.g., a Reed-Solomon code) wherein the ECC symbols are appended to the end of each data sector. However, any suitable ECC may be employed, such as any suitable iterative code such as a low-density parity-check (LDPC) code wherein the ECC symbols are distributed throughout each data sector.


Any suitable control circuitry may be employed to implement the flow diagrams in the embodiments of the present invention, such as any suitable integrated circuit or circuits. For example, the control circuitry may be implemented within a read channel integrated circuit, or in a component separate from the read channel, such as a disk controller and/or NVSM controller, or certain steps described above may be performed by a read channel and others by a disk controller and/or NVSM controller. In one embodiment, the read channel and controllers are implemented as separate integrated circuits, and in an alternative embodiment they are fabricated into a single integrated circuit or system on a chip (SOC). In addition, the control circuitry may include a suitable preamp circuit implemented as a separate integrated circuit, integrated into the read channel or disk controller circuit, or integrated into an SOC.


In one embodiment, the control circuitry comprises a microprocessor executing instructions, the instructions being operable to cause the microprocessor to perform the steps of the flow diagrams described herein. The instructions may be stored in any computer-readable medium. In one embodiment, they may be stored on a non-volatile semiconductor memory external to the microprocessor, or integrated with the microprocessor in a SOC. In another embodiment, the instructions are stored on the disk and read into a volatile semiconductor memory when the hybrid drive is powered on. In yet another embodiment, the control circuitry comprises suitable logic circuitry, such as state machine circuitry.

Claims
  • 1. A hybrid drive comprising: a disk;a head actuated over the disk;a non-volatile semiconductor memory (NVSM); andcontrol circuitry operable to: receive a write command from a host, the write command comprising first data;service the write command by: writing the first data to the NVSM; andwhen a life remaining of the NVSM falls below a threshold, writing a first copy of the first data to the disk;after servicing the write command, receive a first read command from the host; andservice the first read command by: reading the first data from the NVSM; andwhen the first data is unrecoverable from the NVSM, reading the first copy of the first data from the disk.
  • 2. The hybrid drive as recited in claim 1, wherein the control circuitry is further operable to determine the life remaining of the NVSM relative to a number of program/erase cycles.
  • 3. The hybrid drive as recited in claim 1, wherein the control circuitry is further operable to determine the life remaining of the NVSM relative to at least one of a program time and an erase time.
  • 4. The hybrid drive as recited in claim 1, wherein the control circuitry is further operable to determine the life remaining of the NVSM relative to a number of read errors.
  • 5. The hybrid drive as recited in claim 1, wherein the control circuitry is further operable to: receive a second read command from the host; andservice the second read command by: reading second data from the NVSM; andwhen the life remaining of the NVSM falls below the threshold, writing a copy of the second data to the disk.
  • 6. The hybrid drive as recited in claim 1, wherein the control circuitry is further operable to: receive a second read command from the host; andservice the second read command by: detecting a number of correctable errors when reading second data from the NVSM; andwhen the number of correctable errors exceeds a threshold, writing a copy of the second data to the disk.
  • 7. The hybrid drive as recited in claim 1, wherein when the first data is unrecoverable from the NVSM, the control circuitry is further operable to write a second copy of the first data to the NVSM.
  • 8. The hybrid drive as recited in claim 1, wherein when the first data is unrecoverable from the NVSM and the NVSM is read only, the control circuitry is further operable to write a second copy of the first data to the disk.
  • 9. The hybrid drive as recited in claim 1, wherein the control circuitry is further operable to: write to a first area of the disk using a first number of error correction code (ECC) symbols;write to a second area of the disk using a second number of ECC symbols, wherein the second number is greater than the first number; andwrite the first copy of the first data to the second area of the disk.
  • 10. A hybrid drive comprising: a disk;a head actuated over the disk;a non-volatile semiconductor memory (NVSM); andcontrol circuitry operable to: receive a first read command from a host;service the first read command by: detecting a number of correctable errors when reading first data from the NVSM; andwhen the number of correctable errors exceeds a threshold, writing a first copy of the first data to the disk;receive a second read command from a host; andservice the second read command by: reading the first data from the NVSM; andwhen the first data is unrecoverable from the NVSM, reading the first copy of the first data from the disk.
  • 11. The hybrid drive as recited in claim 10, wherein when the first data is unrecoverable from the NVSM the control circuitry is further operable to service the second read command by writing a second copy of the first data to the NVSM.
  • 12. The hybrid drive as recited in claim 10, wherein when the first data is unrecoverable from the NVSM and the NVSM is read only the control circuitry is further operable to service the second read command by writing a second copy of the first data to the disk.
  • 13. The hybrid drive as recited in claim 10, wherein the control circuitry is further operable to: write to a first area of the disk using a first number of error correction code (ECC) symbols;write to a second area of the disk using a second number of ECC symbols, wherein the second number is greater than the first number; andwrite the first copy of the first data to the second area of the disk.
  • 14. A method of operating a hybrid drive, the hybrid drive comprising a head actuated over a disk, and a non-volatile semiconductor memory (NVSM), the method comprising: receiving a write command from a host, the write command comprising first data;servicing the write command by: writing the first data to the NVSM; andwhen a life remaining of the NVSM falls below a threshold, writing a first copy of the first data to the disk;after servicing the write command, receiving a first read command from the host; andservicing the first read command by: reading the first data from the NVSM; andwhen the first data is unrecoverable from the NVSM, reading the first copy of the first data from the disk.
  • 15. The method as recited in claim 14, further comprising determining the life remaining of the NVSM relative to a number of program/erase cycles.
  • 16. The method as recited in claim 14, further comprising determining the life remaining of the NVSM relative to at least one of a program time and an erase time.
  • 17. The method as recited in claim 14, further comprising determining the life remaining of the NVSM relative to a number of read errors.
  • 18. The method as recited in claim 14, further comprising: receiving a second read command from the host; andservicing the second read command by: reading second data from the NVSM; andwhen the life remaining of the NVSM falls below the threshold, writing a copy of the second data to the disk.
  • 19. The method as recited in claim 14, further comprising: receiving a second read command from the host; andservicing the second read command by: detecting a number of correctable errors when reading second data from the NVSM; andwhen the number of correctable errors exceeds a threshold, writing a copy of the second data to the disk.
  • 20. The method as recited in claim 14, wherein when the first data is unrecoverable from the NVSM, further comprising writing a second copy of the first data to the NVSM.
  • 21. The method as recited in claim 14, wherein when the first data is unrecoverable from the NVSM and the NVSM is read only, further comprising writing a second copy of the first data to the disk.
  • 22. The method as recited in claim 14, further comprising: writing to a first area of the disk using a first number of error correction code (ECC) symbols;writing to a second area of the disk using a second number of ECC symbols, wherein the second number is greater than the first number; andwriting the first copy of the first data to the second area of the disk.
  • 23. A method of operating a hybrid drive, the hybrid drive comprising a head actuated over a disk, and a non-volatile semiconductor memory (NVSM), the method comprising: receiving a first read command from a host; andservicing the first read command by: detecting a number of correctable errors when reading first data from the NVSM; andwhen the number of correctable errors exceeds a threshold, writing a first copy of the first data to the disk;receiving a second read command from a host; andservicing the second read command by: reading the first data from the NVSM; andwhen the first data is unrecoverable from the NVSM, reading the first copy of the first data from the disk.
  • 24. The method as recited in claim 23, wherein when the first data is unrecoverable from the NVSM, further comprising servicing the second read command by writing a second copy of the first data to the NVSM.
  • 25. The method as recited in claim 23, wherein when the first data is unrecoverable from the NVSM further comprising servicing the second read command by writing a second copy of the first data to the disk.
  • 26. The method as recited in claim 23, further comprising: writing to a first area of the disk using a first number of error correction code (ECC) symbols;writing to a second area of the disk using a second number of ECC symbols, wherein the second number is greater than the first number; andwriting the first copy of the first data to the second area of the disk.
US Referenced Citations (54)
Number Name Date Kind
5333138 Richards et al. Jul 1994 A
5581785 Nakamura et al. Dec 1996 A
5586291 Lasker et al. Dec 1996 A
6044439 Ballard et al. Mar 2000 A
6115200 Allen et al. Sep 2000 A
6275949 Watanabe Aug 2001 B1
6429990 Serrano et al. Aug 2002 B2
6661591 Rothberg Dec 2003 B1
6662267 Stewart Dec 2003 B2
6687850 Rothberg Feb 2004 B1
6754021 Kisaka et al. Jun 2004 B2
6807630 Lay et al. Oct 2004 B2
6898680 Chambers May 2005 B2
6909574 Aikawa et al. Jun 2005 B2
6968450 Rothberg et al. Nov 2005 B1
7017037 Fortin et al. Mar 2006 B2
7028174 Atai-Azimi et al. Apr 2006 B1
7082494 Thelin et al. Jul 2006 B1
7107444 Fortin et al. Sep 2006 B2
7120806 Codilian et al. Oct 2006 B1
7142385 Shimotono et al. Nov 2006 B2
7177184 Chen Feb 2007 B2
7334082 Grover et al. Feb 2008 B2
7395452 Nicholson et al. Jul 2008 B2
7411757 Chu et al. Aug 2008 B2
7461202 Forrer, Jr. et al. Dec 2008 B2
7472222 Auerbach et al. Dec 2008 B2
7477477 Maruchi et al. Jan 2009 B2
7509471 Gorobets Mar 2009 B2
7516346 Pinheiro et al. Apr 2009 B2
7610438 Lee et al. Oct 2009 B2
7613876 Bruce et al. Nov 2009 B2
7644231 Recio et al. Jan 2010 B2
7685360 Brunnett et al. Mar 2010 B1
7752491 Liikanen et al. Jul 2010 B1
20060195657 Tien et al. Aug 2006 A1
20080040537 Kim Feb 2008 A1
20080059694 Lee Mar 2008 A1
20080130156 Chu et al. Jun 2008 A1
20080177938 Yu Jul 2008 A1
20080222353 Nam et al. Sep 2008 A1
20080256287 Lee et al. Oct 2008 A1
20080307270 Li Dec 2008 A1
20090019218 Sinclair et al. Jan 2009 A1
20090031072 Sartore Jan 2009 A1
20090103203 Yoshida Apr 2009 A1
20090106518 Dow Apr 2009 A1
20090144501 Yim et al. Jun 2009 A2
20090150599 Bennett Jun 2009 A1
20090172324 Han et al. Jul 2009 A1
20090249168 Inoue Oct 2009 A1
20090271562 Sinclair Oct 2009 A1
20090327603 McKean et al. Dec 2009 A1
20100088459 Arya et al. Apr 2010 A1
Non-Patent Literature Citations (6)
Entry
Hannes Payer, Marco A.A. Sanvido, Zvonimir Z. Bandic, Christoph M. Kirsch, “Combo Drive: Optimizing Cost and Performance in a Heterogeneous Storage Device”, http://csl.cse.psu.edu/wish2009—papers/Payer.pdf.
Gokul Soundararajan, Vijayan Prabhakaran, Mahesh Balakrishan, Ted Wobber, “Extending SSD Lifetimes with Disk-Based Write Caches”, http://research.microsoft.com/pubs/115352/hybrid.pdf, Feb. 2010.
Xiaojian Wu, A. L. Narasimha Reddy, “Managing Storage Space in a Flash and Disk Hybrid Storage System”, http://www.ee.tamu.edu/˜reddy/papers/mascots09.pdf.
Tao Xie, Deepthi Madathil, “Sail: Self-Adaptive File Reallocation on Hybrid Disk Arrays”, The 15th Annual IEEE International Conference on High Performance Computing (HiPC 2008), Bangalore, India, Dec. 17-20, 2008.
Non-Volatile Memory Host Controller Interface revision 1.0 specification available for download at http://www.intel.com/standards/nvmhci/index.htm. Ratified on Apr. 14, 2008, 65 pages.
U.S. Appl. No. 12/720,568, filed Mar. 9, 2010, 22 pages.