The invention relates to the field of data storage devices and more particularly to apparatus and methods for recovering data when errors occur during reading of data.
Data storage devices used with computers use various techniques for storing the data including optical, magnetic and optomagnetic methods. The following will focus on magnetic disk drives for simplicity, but the invention is not limited to use with magnetically stored data.
Data in magnetic disk drives is recorded in concentric, radially spaced tracks arranged on the planar surfaces of one or more disks. Magnetic films on the disks are used to record bits of information in the form of magnetic domains. When operating the disks are rotated on a spindle and magnetic heads are mechanically positioned over the tracks to read and write data from the disks. Permanent servo information is typically written on the disk at the time of manufacture that allows the system to ascertain the radial and circumferential position of the head within certain limits. The tracks are typically subdivided into many sectors that can be read and written independently.
In addition to the permanent servo information, magnetic signals are written on the disk to record various information for users. Special sync fields are included with the data to aid detecting the starting positions. A sync field typically is architected at the start of the sector. The data is recorded in special patterns that aid in the detection and recovery from errors caused by noise, magnetic defects, and so forth. For example, channel codes are mappings of data bits into symbols that avoid certain characteristics in the bit-stream of symbols that make recovery of the data bits more difficult. In addition, parity frames and error correction code (ECC) blocks are used to detect and correct various bit errors. When errors are encountered that cannot be corrected by these initial error correction techniques, a more extensive data recovery procedure (DRP) is invoked.
In a magnetic data recording device the data channel is the portion of the system responsible for decoding the information recorded in the signals read from the disk. The analog signal is first converted into digital form before being fed through the channel detector. The detector typically uses Partial Response Maximum Likelihood (PRML) principle to convert the string of digitized voltage values into binary data bits. The typical channel hardware can accept variables or parameters that act to modify the way the raw data stream is decoded. Unwanted variations in the amplitude, frequency, phase, resolution and overwrite properties can result in corruption of the written data. Some errors are permanent while others are transient. Re-reading a sector during data recovery procedure (DRP) requires a relatively large amount of time because a full rotation of the disk is needed before the sector comes back around under the head. Although a sector may be re-read many times during DRP, minimizing the number of re-reads is clearly desirable.
U.S. Pat. No. 6,937,415 (Aug. 30, 2005) to Galbraith, et al. describes the use of a sample buffer to store problem data for analysis during a data recovery procedure (DRP). The data in the buffer can be repeatedly processed during the DRP without having to re-read the sector a third time. However, the data in the buffer can be used for characterization of error information for the purpose of enhancing the effectiveness of a subsequent third (or more) re-readings. Using the data stored in the buffer enables data recovery without identification of a sync field and enables DRP to use changed channel data detection settings to aid in recovering the data. Adaptive equalization is used to enhance recovery during a subsequent re-read of the data sector; and extraction of timing information to allow introduction of timing loop changes during a re-read to offset phase or frequency related timing errors. A procedure involving movement of a pointer into the buffer to allow the data to be correctly framed and read with ECC is described. The '415 patent also notes that the buffer can be used to accumulate data from several different readings of the problem sector in effect averaging the data from multiple reads. By accumulating the results of more than one read operation some or all of the noise can be averaged out.
The invention includes apparatus and methods that allow a data storage device to perform an enhanced data recovery procedure (DRP). An embodiment of a DRP according to the invention obtains a new digital sampling of the failing unit of data by re-reading the analog signal and converting it to digital form using an analog-to-digital conversion using a fixed phase clock signal. By using a fixed phase clock the sampling is made asynchronous to content of the analog signal, which is not the case during standard operation. In an embodiment of the invention the analog-to-digital converter (ADC) is driven with an unmodulated phase-locked loop (PLL) clock during the re-reading process. The PLL clock can optionally include positive frequency offset to achieve over-sampling. When the digitized samples are obtained with a stable clock source and fixed signal gain, the samples are not biased or distorted by the gain and timing loops of the normal synchronous techniques which have already failed to properly detect the data. The data is re-interpolated using a programmable delay line with flat magnitude and linear phase response. Fixed signal gain is preferred when the analog sector is re-read to avoid problems induced by automatic gain control. If fixed gain is used, then signal level adjustments may be needed when the data is processed through the detector.
In an embodiment of the invention, digital data is stored in a buffer so that the data can be processed repeatedly at high speed using varied parameters as part of data recovery procedure. For each re-run (iteration), parameters including those related to re-interpolation and amplitude adjustment can be changed in a trial-and-error effort to recovery the data. The success or failure an iteration to decode a unit of data is preferably determined by the ECC system according to the prior art. Defect and erasure locations can be determined. If necessary synchronization can be obtained by varying the assumed starting point in the buffer at which the data is fed into the detector.
When the samples are stored in a buffer, optionally the data can be processed in the reverse direction (from end of sector to beginning of sector). The implementation of this option requires that the data in the buffer be fed into the detector in reverse order, but the Viterbi portion of the detector requires no modification since it inherently works on data processed in either direction. Reverse processing can allow recovery of data when a large defect causes a phase shift in the middle of the sector. In an alternative embodiment sync-field marks are placed at the end of the sector as well as the beginning in order to aid in processing the data in the reverse order.
The data stored in a buffer can be processed by hardware and software that are not used in normal decoding process. This allows optional analysis techniques to be used. These include fast Fourier transform (FFT) and discrete autocorrelation. Optional analysis techniques can be implemented in hardware, software or a combination of hardware and software. The frequency offset of the acquired samples can be determined accurately by analyzing the captured samples. This will allow for accurate setting of the frequency offset for sample interpolation to obtain synchronous samples.
After the standard, initial error recovery attempts have failed, a data storage device typically goes into data recovery procedure (DRP) mode. A data storage device according to the invention includes data recovery procedure (DRP) that can be used in addition to any other prior art data recovery procedures. In an embodiment of the invention the DRP re-reads the analog signal for the unit of data (typically a sector) that has generated an error when processed in the standard way through the detector. Because the raw signal has unknown defects that caused the normal processing to fail, it is beneficial to limit or isolate the effects that these defects have on the decoding process. In an embodiment of the invention a fixed signal gain is used when the problem data sector is re-read in order to avoid the deleterious effects that the defects in the signal may have had on the automatic gain adjustment process that is normally used.
Another way that defects in the raw signal can disrupt the decoding process is through adverse effects on the ADC clocking that arise through the phase-feedback supplied to the phase-locked loop (PLL). The clocking of the ADC in DRP mode according to an embodiment of the invention allows the deleterious effect of defects in the raw signal on the clocking of the ADC to be avoided. Making the phase of the clocking of the ADC independent from the characteristics of the raw signal in turn makes the sampling (digitization) of the raw data stream independent from the characteristics of the raw signal. In an embodiment of the invention the clocking for the analog-to-digital converter (ADC) during the re-read is made to be independent (asynchronous) from the characteristics of the data by eliminating the phase adjustment that is normally used by the phased-locked loop (PLL) circuitry that supplies the clocking. In this way the digital representation of the raw analog signal in the DRP of the invention is different from the standard digital representation of the raw analog signal and will allow recovery from errors that would otherwise not be recoverable.
The clocking for the ADC 102 is supplied by Phase-Locked Loop (PLL) 104 that is controlled by the Channel Controller 106. The PLL 104 has at least two modes of operation that are selectable. In normal operation the PLL 104 adjusts the phase of the clocking pulse using feedback derived from the analog signal. In order to achieve asynchronous clocking in the DRP mode the feedback is internally disconnected (eliminated) inside the PLL circuitry based on control signals from the Channel Controller 106. The clocking for the ADC will then have a fixed phase that is independent (asynchronous) from the characteristics of the raw analog signal in the defective area. The timing (period/frequency) of the fixed clocking signal can be set based on defaults or from a value derived from a previous sector that was read without errors. In addition it may be desirable to increase the rate of the ADC clocking by a small amount, for example, 1% or 2% over nominal to achieve over-sampling.
Although it may be possible to only re-read the failing sector one time using the invention, optionally the failing sector can be re-read additional times. One technique that can be employed is to use signal averaging as described in U.S. Pat. No. 6,937,415 (Aug. 30, 2005) to Galbraith, et al. It is also possible to re-read the failing sector using different parameters that can be derived from analysis of the data in the buffer.
In the embodiment of
The digital data in the Sample Buffer represents sampled voltage values from the analog signal read from media. The voltage values are recorded as N-bits, where N might be, for example, 6 or 8 bits. As in the prior art, a PRML-type detector is used to convert the N-bit digital form of data into a second set of digital data that includes essentially one data bit for each of the N-bit voltage values.
The N-bit voltage values may require adjustment to correct for the fact the analog signal was re-read without automatic gain control (AGC) as described above. Gain Adjustment Unit 97 is used to adjust the amplitude levels of the N-bit data in DRP mode according to the invention before the data is processed through the Detector 109. The Gain Adjustment Unit 97 operates under the control of the Channel Controller 106 and can be implemented as programmable multiplier that scales the amplitude of the N-bit samples up or down. The scaling parameter can be supplied by Channel Controller 106 and can be varied during the processing of a unit of data, as well as between iterations. In an alternative embodiment the amplitude adjustment can take place after the re-interpolation. In this alternative embodiment the Gain Adjustment Unit 97 would conceptually become part of the Detector 109.
The internals of Detector 109 will be further described below in connection with
The sequence of events is controlled by Channel Controller 106 after the failure event that triggers the DRP mode is detected. Channel Controller 106 sets the asynchronous mode on the PLL, sets the signal gain to a fixed value and selects the appropriate switch settings in the DSL-1. The actual re-reading of the failing sector is accomplished by the Hard Disk Controller 99 which controls the mechanical elements of the disk drive.
The Channel Controller 106 sets parameters used when the problem data is processed. These included parameters for the Gain Adjustment Unit 97, as well as the parameters for Detector 109, which in this embodiment include the parameters used in re-interpolating the data. The stored data in the Sample Buffer can also be read by the Channel Controller 106 to enable independent analysis separate from the Detector 109. The stored samples can be reprocessed potentially hundreds of times during the time for one revolution of the disk. For each iteration, variables can be adjusted in an effort to find a set of conditions that will result in correct decoding of the data. Parameters for the decoder such as equalization or Viterbi path weighting can also be set to various values in an effort to recover the data. The Detector 109 may also generate error signals that can be read by the Channel Controller 106 and used to make adjustment to the parameters for subsequent runs. When the correct parameters are found, the N-bit data will decode properly into the binary data and this will be confirmed by a correct ECC result according to prior art techniques. Because ECC can correct some errors, the iterative trial and error process succeeds when all of the remaining errors are correctable by ECC.
Additional analysis techniques can be employed for the N-bit data stored in the buffer using any combination of hardware, software or firmware. Any prior art analysis technique can be used. Examples include fast Fourier transform (FFT) and discrete autocorrelation. Optionally the frequency offset of the acquired samples can be determined accurately by analyzing the captured samples which will allow for accurate setting of the frequency offset for sample interpolation to obtain synchronous samples. Defect and erasure locations can be determined this way as well.
The data in the buffer can also be used develop error information for the purpose of enhancing the effectiveness of a subsequent re-read of the problem unit of data. Adaptive equalization can be used to enhance recovery during a subsequent re-read and extraction of timing information can allow timing loop changes during a re-read to offset phase or frequency related timing errors.
Embodiments of the invention include means for achieving bit-synchronization and word-synchronization in the N-bit sample stream. In the case where the word sync pattern is erased or unreadable at the start of the unit of data, finding the correct starting point in the buffer may require a trial and error process in which various starting positions are tried. The system could, for example, try each of the first few hundred integral positions in the Sample Buffer to cover the uncertainty of where the correct data starts. At each assumed starting position the data would be processed to determine whether the data will decode correctly using that starting point.
A programmable fractional-bit delay function with flat magnitude and linear phase response is used. In the embodiment shown in
The effect of the re-interpolation is illustrated in
In alternative embodiments of the invention, any linear processing of the data can be performed before the data is placed in Sample Buffer 108.
Since the N-bit sample values are stored in a buffer, it is possible to process the data in the opposite direction (backwards) from the standard order of processing. In this option the processing starts at the end of the unit of data and works back toward the beginning. This can allow recovery of data when a large defect of phase shift occurs in the middle of the sector. The DRP processing can include many trial and error steps as part of the overall process of searching for a set of parameters, starting points, etc. that will yield a correctly decoded result. The backwards processing according to the invention is optionally one approach that can be tried in one or more iterations.
In order to facilitate processing in the reverse order in an embodiment of the invention, the system design can include writing sync marks at the end of each unit of data in addition to sync marks at the beginning as is done in the prior art. To implement backwards processing it is necessary to process the N-bit values in the Sample Buffer 108 in the reverse order. The principle of operation of the Viterbi decoder 115 inherently functions the same whether the data is run backwards or forwards; therefore, neither the Viterbi decoder 115 nor Post-Processor Parity (PPP) logic 116 has to be modified. The reverse processing order of the N-bit values in the Sample Buffer 108 can be achieved as a selectable function controlled by the DSL-1107 which is in turn controlled by the Controller 106. Alternatively the reverse order reading from the Sample Buffer 108 could be selected under direct control of the Channel Controller 106.
When the data is processed in reverse order, the data stream coming out of the PPP 116 will be in reverse order and will need to be put back in forward order. This can be accomplished in a variety of ways. One way would be to have the Hard Disk Controller (HDC) 99 load the data into its buffer in reverse order when signaled by the Channel Controller that the data coming out is in reverse order.
The system selects an initial set of parameters for the detector for the first re-run of the data 125. The parameters include those used to re-interpolate sample points and to adjust the signal amplitude. The iterative processing loop includes adjustment the amplitude of the N-bit digital voltage values that are fed into the detector to correct for possible problems arising from the use of flat gain as described above. Optionally the starting point in the sample buffer can be varied between the runs. Varying the starting point in integral N-bits units may help achieve word synchronization. Fractional bit adjustments made by the re-interpolation process may help achieve bit-synchronization where it is otherwise lost. Once a set of parameters has been determined the data can be re-processed through the detector by setting the switching logic to feed the buffer contents into the detector 126. The success or failure of the effort to decode a unit of data can be ultimately determined by the ECC system according to the prior art. The ECC system can conceptually be located inside the Channel or the HDC, but the success or failure signal should be available in the DRP mode in order to know when a trial decoding iteration has succeeded. If the Detector succeeded in decoding the data, then the DRP is finished 127.
The Detector may also supply specific data about the results of the process that can be used by the Channel Controller to modify the parameters for the next run. Unless criteria for ending the DRP process have been met, the Channel Controller modifies the parameters for the next iteration of the processing of the data 128. The process loops back to step 126 to re-run the data again with at least one different parameter. The process terminates upon successful decoding, but there must also be some way to end the process in failure. For example, if the set of allowable parameters has been exhausted or other criteria such as retry limits have reached, the DRP process ends in failure.
The foregoing detailed description is provided for specific embodiments of the invention and should not be taken as limitations on the invention. Although block diagrams are used to describe the conceptual units of circuitry, these conceptual units should not be interpreted to be physically separate elements. It is typical for most or all of the circuitry that is referred to generically as a channel to be implemented on a single integrated circuit.
Number | Name | Date | Kind |
---|---|---|---|
5220569 | Hartness | Jun 1993 | A |
5233482 | Galbraith et al. | Aug 1993 | A |
5918001 | Ueno et al. | Jun 1999 | A |
6049439 | Ono et al. | Apr 2000 | A |
6084733 | Ohzeki et al. | Jul 2000 | A |
6130794 | Christensen | Oct 2000 | A |
6345366 | Asano et al. | Feb 2002 | B1 |
6392830 | Malone, Sr. | May 2002 | B1 |
6490691 | Kimura et al. | Dec 2002 | B1 |
6530034 | Okada et al. | Mar 2003 | B1 |
6594094 | Rae et al. | Jul 2003 | B2 |
6937415 | Galbraith et al. | Aug 2005 | B2 |
20040156138 | Yun | Aug 2004 | A1 |
Number | Date | Country | |
---|---|---|---|
20080144454 A1 | Jun 2008 | US |