The present invention relates generally to reducing the number of errors in data that is read from magnetic tape and, in particular, to the use of a noise predictive maximum-likelihood detector
During a read process of a data storage device, a read head is typically passed over a data record recorded on a storage media in order to convert the recorded data into an analog signal. For example, in a magnetic tape drive, a magneto-resistive read head is passed over a data record that has been previously written as flux reversals on a magnetic tape. As the head is passed over the tape, the head converts the flux reversals into an electrical analog signal that represents the data originally stored on the magnetic tape.
An analog to digital converter (“ADC”) periodically samples the analog signal and converts the sampled analog signal to a digital input signal and creates a digital waveform. The amplitude of the digital waveform is processed to form binary values. The data storage device, such as a magnetic media storage device, processes the digital waveform to reconstruct the data that was originally written to the tape and passes the data to a host device.
Because of noise in the read channel, a binary value corresponding to one or more digital input signals may be indeterminate. For example, one or more of the digital input signals may have a lower amplitude, which is corrupted by noise, increasing the difficulty of reconstructing the data and determining the binary value.
Storage devices such as magnetic tape drives frequently use a maximum-likelihood detector to reconstruct data from the digital waveform. Such a detector employs a plurality of states organized as a logical trellis. The trellis specifies allowable subsequent states for each state. For example, the trellis may specify that the detector may change from a first state to a second or a third state, but not to a fourth state. The detector thus changes from the first state to the second state or to the third state depending on a plurality of branch metrics. A branch metric for the current state is a function of branch metrics for previous states and of a current digital input signal. Thus, path metrics maintain a record of the detector's progression through previous states. The detector begins in an initial state and proceeds to change from state to state in response to the branch metrics, identifying the binary value for one or more digital input signals using information about previous and subsequent digital input values from the path metrics. As a result, the detector is better able to reconstruct data from digital input values.
An 8-state EPR4 maximum-likelihood detector has been commonly used to detect data in current tape drives. However, a disadvantage of such a detector is that the target channel is fixed to a single, predetermined EPR4 target and thus the detector target cannot be changed dynamically or adapt to the signals that are being read from tape.
The present invention reduces the number of errors in data that is read from magnetic tape while avoiding the disadvantage of a fixed 8-state maximum-likelihood detector. A 16-State adaptive NPML (“Noise Predictive Maximum-Likelihood”) detector of the present invention addresses weaknesses of a conventional fixed, 8-state EPR4 detector. Rather than having a fixed target channel like a current maximum-likelihood detector, the detector of the present invention is programmable to allow a range of target channels. The detector can support “classical” partial response channels such as PR4 or EPR4 and can also support a range of target channels by programming two predictor or whitening filter coefficients. In one embodiment two predictor or whitening filter coefficients may be set via XREG inputs or dynamically determined through the use of an LMS (“Least Means Square”) algorithm. The use of an LMS algorithm allows the detector to adapt the predictor coefficients as data from the tape is being read. Another embodiment provides a 16-State adaptive NPML for an EPR4 target channel in which the whitening filter has one coefficient.
Components of the detection system include the detector itself, an LMS engine, a coefficient engine and a noise predictive or whitening filter. Coefficients from the LMS engine may be loaded or stored dynamically based upon conditions in the tape drive.
In order that the advantages of the invention will be readily understood, a more particular description of the invention briefly described above will be rendered by reference to specific embodiments that are illustrated in the appended drawings. Understanding that these drawings depict only typical embodiments of the invention and are not therefore to be considered to be limiting of its scope, the invention will be described and explained with additional specificity and detail through the use of the accompanying drawings, in which:
Some of the functional units described in this specification have been labeled as modules in order to more particularly emphasize their implementation independence. For example, a module may be implemented as a hardware circuit comprising custom VLSI circuits or gate arrays, off-the-shelf semiconductors such as logic chips, transistors, or other discrete components. A module may also be implemented in programmable hardware devices such as field programmable gate arrays, programmable array logic, programmable logic devices or the like. Modules may also be implemented in software for execution by various types of processors. An identified module of executable code may, for instance, comprise one or more physical or logical blocks of computer instructions which may, for instance, be organized as an object, procedure, or function. Nevertheless, the executables of an identified module need not be physically located together, but may comprise disparate instructions stored in different locations which, when joined logically together, comprise the module and achieve the stated purpose for the module. A module of executable code could be a single instruction, or many instructions, and may even be distributed over several different code segments, among different programs, and across several memory devices. Similarly, operational data may be identified and illustrated herein within modules, and may be embodied in any suitable form and organized within any suitable type of data structure. The operational data may be collected as a single data set, or may be distributed over different locations including over different storage devices and may exist, at least partially, merely as electronic signals on a system or network.
Furthermore, the described features, structures, or characteristics of the invention may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided, such as examples of programming, software modules, hardware modules, hardware circuits, etc., to provide a thorough understanding of embodiments of the invention. One skilled in the relevant art will recognize, however, that the invention can be practiced without one or more of the specific details, or with other methods, components and so forth. In other instances, well-known structures, materials, or operations are not shown or described in detail to avoid obscuring aspects of the invention.
In operation, the encoder 112 encodes write data, received from a host (not shown) to be written onto recording medium 10, into a predetermined code. For example, an RLL (Run Length Limited) code, in which the number of adjacent zeros must remain between specified maximum and minimum values, is commonly used for this predetermined code. However, the present invention is not meant to be limited to RLL and other coding may be used. The pre-coder 114 is included to prevent error propagation and the write pre-compensator 116 reduces non-linear influences arising from the read/write head 102. However, because the response of the actual recording channel does not exactly coincide with the resulting transfer function, some subsequent equalization may be required.
The AGC amplifier 122 amplifies an analog signal read by the read/write head 102 from the recording medium 10. The low pass filter 124 removes high frequency noise from and reshapes the signal output from AGC amplifier 122. The signal output from the low pass filter 124 is converted into a discrete digital signal by the ADC 126. The resulting digital signal is then applied to the adaptive equalizer 128, which adaptively controls inter-symbol interference (ISI) to generate desired waveforms. The detector 200 receives the equalized signal output from the adaptive equalizer 128 and generates encoded data. The decoder 130 decodes the encoded data output from detector 128 to generate the final read data to be transmitted to the host. Additionally, in order to correct the analog signal envelope and the digitization sample timing, the gain controller 132 controls the gain of the AGC 122 and the timing controller 134 controls sample timing for the ADC 126. It will be appreciated that the foregoing description of the operation of the read/write channel circuit 100 is only a brief summary of the operation.
A first embodiment of the 16-State adaptive NPML will now be described. A detector module 200 according to the first embodiment is illustrated in
The primary input to the system, GAINADJ, is a sequence of synchronized and gain-adjusted digitized samples from the equalizer 128. The output of the system, DATA_OUT, is the binary data associated with the output of the encoder 112. Two parameters, W1 and W2, are used to define the detection target and to determine the waveform shape at the output of the whitening filter 300. W1 and W2 may be determined dynamically using an LMS algorithm or the may be set statically through a register (XREG) interface. Each of the four blocks will be described in detail.
A block diagram of the whitening filter 300 is illustrated in
Two pre-calculations for the NPML detector 400 are also performed by the whitening filter 300: P1·yk 304A and P2·yk 304B. Similar to the FIR filter 302, the multipliers P1 and P2 are eight-bit words and the multiplicand yk is a nine-bit word. Each product generated P1yk(12:0) and P2yk(12:0) is a 19-bit number that is first truncated to nine-bits and then sign extended to 13-bits. Truncation is preferably performed with rounding.
The following mathematical equations describe a 4th order maximum-likelihood detector 400 that may be used in the detector module 200 of the present invention.
m
k(0)=maximum{mk−1(0)+ln [p(yk|sk=0;ak=0)];mk−1(1)+ln [p(yk|sk=1;ak=0)]}
m
k(1)=maximum{mk−1(2)+ln [p(yk|sk=2;ak=0)];mk−1(3)+ln [p(yk|sk=3;ak=0)]}
m
k(2)=maximum{mk−1(4)+ln [p(yk|sk=4;ak=0)];mk−1(5)+ln [p(yk|sk=5;ak=0)]}
m
k(3)=maximum{mk−1(6)+ln [p(yk|sk=6;ak=0)];mk−1(7)+ln [p(yk|sk=7;ak=0)]}
m
k(4)=maximum{mk−1(8)+ln [p(yk|sk=8;ak=0)];mk−1(9)+ln [p(yk|sk=9;ak=0)]}
m
k(5)=maximum{mk−1(10)+ln [p(yk|sk=10;ak=0)];mk−1(11)+ln [p(yk|sk=11;ak=0)]}
m
k(6)=maximum{mk−1(12)+ln [p(yk|sk=12;ak=0) ];mk−1(13)+ln [p(yk|sk=13;ak=0)]}
m
k(7)=maximum{mk−1(14)+ln [p(yk|sk−14;ak=0) ];mk−1(15)+ln [p(yk|sk=15;ak=0)]}
m
k(8)=maximum{mk−1(0)+ln [p(yk|sk=0;ak=1) ];mk−1(1)+ln [p(yk|sk=1;ak=1)]}
m
k(9)=maximum{mk−1(2)+ln [p(yk|sk=2;ak=1) ];mk−1(3)+ln [p(yk|sk=3;ak=1)]}
m
k(10)=maximum{mk−1(4)+ln [p(yk|sk=4;ak=1)]; mk−1(5)+ln [p(yk|sk=5;ak=1)]}
m
k(11)=maximum{mk−1(6)+ln [p(yk|sk=6;ak=1)]; mk−1(7)+ln [p(yk|sk=7;ak=1)]}
m
k(12)=maximum{mk−1(8)+ln [p(yk|sk=8;ak=1)]; mk−1(9)+ln [p(yk|sk=9;ak=1)]}
m
k(13)=maximum{mk−1(10)+ln [p(yk|sk=10;ak=1)]; mk−1(11)+ln [p(yk|sk=11;ak=1)]}
m
k(14)=maximum{mk−1(12)+ln [p(yk|sk=12;ak=1)]; mk−1(13)+ln [p(yk|sk=13;ak=1)]}
m
k(15)=maximum{mk−1(14)+ln [p(yk|sk=14;ak=1)]; mk−1(15)+ln [p(yk|sk=15;ak=1)]}
The metric calculations shown below for the 16-state detector 400 are implemented in hardware and a similar methodology is described in more detail in the aforementioned U.S. Publication No. 2006/0176982. Upon first inspection, the logic associated with the metric calculations is very large. There are 32 multiplication operations associated with the mean parameters (μ0/0 through μ15/1) and 32 multiplication operations with the mean-squared parameters (μ0/02 through μ15/12). However, the number of multiplication operations may be reduced from 64 to three: two associated with the mean parameters and one associated with the mean-squared parameters.
m
k(0)=maximum{mk−1(0)+2μ0/0yk−μ0/02;mk−1(1)+2μ1/0yk−μ1/02} Eq. 1
m
k(1)=maximum{mk−1(2)+2μ2/0yk−μ2/02;mk−1(3)+2μ3/0yk−μ3/02} Eq. 2
m
k(2)=maximum{mk−1(4)+2μ4/0yk−μ4/02;mk−1(5)+2μ5/0yk−μ5/02} Eq. 3
m
k(3)=maximum{mk−1(6)+2μ6/0yk−μ6/02;mk−1(7)+2μ7/0yk−μ7/02} Eq. 4
m
k(4)=maximum{mk−1(8)+2μ8/0yk−μ8/02;mk−1(9)+2μ9/0yk−μ9/02} Eq. 5
m
k(5)=maximum{mk−1(10)+2μ10/0yk−μ10/02;mk−1(11)+2μ11/0yk−μ11/02} Eq. 6
m
k(6)=maximum{mk−1(12)+2μ12/0yk−μ12/02;mk−1(13)+2μ13/0yk−μ13/02} Eq. 7
m
k(7)=maximum{mk−1(14)+2μ14/0yk−μ14/02;mk−1(15)+2μ15/0yk−μ15/02} Eq. 8
m
k(8)=maximum{mk−1(0)+2μ0/1yk−μ0/12;mk−1(1)+2μ1/1yk−μ1/12} Eq. 9
m
k(9)=maximum{mk−1(2)+2μ2/1yk−μ2/12;mk−1(3)+2μ3/1yk−μ3/12} Eq. 10
m
k(10)=maximum{mk−1(4)+2μ4/1yk−μ4/12;mk−1(5)+2μ5/1yk−μ5/12} Eq. 11
m
k(11)=maximum{mk−1(6)+2μ6/1yk−μ6/12;mk−1(7)+2μ7/1yk−μ7/12} Eq. 12
m
k(12)=maximum{mk−1(8)+2μ8/1yk−μ8/12;mk−1(9)+2μ9/1yk−μ9/12} Eq. 13
m
k(13)=maximum{mk−1(10)+2μ10/1yk−μ10/12;mk−1(11)+2μ11/1yk−μ11/12} Eq. 14
m
k(14)=maximum{mk−1(12)+2μ12/1yk−μ12/12;mk−1(13)+2μ13/1yk−μ13/12} Eq. 15
m
k(15)=maximum{mk−1(14)+2μ14/1yk−μ14/12;mk−1(15)+2μ15/1yk−μ15/12} Eq. 16
When the input to the detector (GAINADJ(8:0) at the input to the whitening filter 300) has a PR4 target, then the overall target channel for the 4th order NPML detector 400 is:
(1−D2)(1−P1D−P2D2)=h0+h1D+h2D2+h3D3 +h4D4
where:
The 32 mean parameters are based upon all possible combinations of the five h-coefficients. As can be seen in TABLE I below, each of the mean parameters may be derived from the sum/difference of only three terms: 1, P1, and P2. Similarly, each of the mean-product calculations (μ0/0yk,μ0/1yk . . . μ15/1yk) may be determined by adding/subtracting three terms: yk,P1·yk and P2·yk. This greatly simplifies the number of multiplications associated with the NPML detector. As noted earlier in the section on the whitening filter 300, in a hardware implementation the mean-product terms are nine-bits in size and sign extended to 13 bits.
The VHDL for the metric calculations set forth in Equations 1-16 above may be designed into a structure that can be replicated sixteen times, illustrated in
The inputs SK(3:1) and AK are hard-coded in the VHDL for each metric block. It is assumed that the VHDL synthesizer will keep only the logic that is needed for that specific block. Thus, each YK, P1YK, and P2YK select block will be synthesized into fixed logic structures that do not change over time.
Because different drives and different media have different characteristics, the present invention allows the system 200 to adapt to the actual read-back signal in the channel 100. The LMS engine 700 determines the predictor coefficients used to for the adaptive process. The hardware architecture of the LMS engine 700 is illustrated in
A generalized diagram of the target and error generator 900 and the error filter 1000 is shown in
The logic associated with target and error generator block 900 of the LMS engine 700 is shown in
The error filter 1000 is illustrated in
Two LMS taps 1100A, 1100B (both referred to generally in
αε{0.00001544, 0.00003088, 0.00006176, 0.00012352}
A source select module 202 (
The calculations may be performed in parallel. Alternatively, the number of multiplications performed at one time may be further reduced to one if the mean-squared terms are calculated sequentially. This employs a small state machine to select at most three inputs to an adder and squaring block. The results are saved as they are calculated (REG0-REG8) and then all of the mean-squared terms (ZREG0-ZREG8) plus C1Z and C2Z are sent to the detector 400 and whitening filter 300, preferably simultaneously.
TABLE IV presents the status of the outputs of the P1 Select, P2 Select and One Select for each of the nine possible states of the State-Engine.
A second embodiment of the 16-State adaptive NPML, for an EPR4 target, will now be described. A detector module 1300 according to the second embodiment the present invention is illustrated in
The primary input to the system, GAINADJ, is a sequence of synchronized and gain-adjusted digitized samples which have been equalized to an EPR4 target in the equalizer 128. The output of the system, DATA_OUT, is the binary data associated with the output of the encoder 112. In contrast with the first embodiment, a single parameter, W1, is used to define the detection target and to determine the waveform shape at the output of the whitening filter 1400. W1 may be determined dynamically using an LMS algorithm or the may be set statically through the XREG interface. Each of the four blocks of the second embodiment will be described in detail.
A block diagram of the whitening filter 1400 is illustrated in
One pre-calculation for the NPML detector 400 is also performed by the whitening filter 1400: P1·yk 1404. Similar to the FIR filter 1402, the multiplier P1 is an eight-bit word and the multiplicand yk is a nine-bit word. The product generated P1yk(12:0) is a 19-bit number that is first truncated to nine-bits and then sign extended to 13-bits. Truncation is preferably performed with rounding.
The 16 mathematical equations above describing the 4th order maximum-likelihood detector 400 of the first embodiment may also be used in the detector module 1300 of the second embodiment of the present invention and will not be repeated here.
Similarly, the 16 metric calculations shown above for the 16-state detector 400 of the first embodiment may also be used with the second embodiment and will not be repeated here.
The target channel for the 4th order NPML detector for an EPR4 target is:
(1−D2)(1+D)(1−P1D)=h0+h1D+h2D2+h3D3+h4D4
where:
The 32 mean parameters are based upon all possible combinations of the five h-coefficients. As can be seen in TABLE V below, each of the mean parameters may be derived from the sum/difference of only four terms: 1, 2, P1, and 2P1. Similarly, each of the mean-product calculations (μ0/0yk,μ0/1yk . . . μ15/1yk) may be determined by adding/subtracting four terms: yk, P1·yk, 2yk, and 2P1·yk. This greatly simplifies the number of multiplications associated with the NPML detector. As noted earlier in the section on the whitening filter 1400, in a hardware implementation the mean-product terms are nine-bits in size and sign extended to 13 bits.
The VHDL for the metric calculations set forth in Equations 1-16 above and designed into the structures illustrated in
The inputs SK(3:1) and AK are hard-coded in the VHDL for each metric block. It is assumed that the VHDL synthesizer will keep only the logic that is needed for that specific block. Thus, each YK, P1YK, 2YK and P1YK select block will be synthesized into fixed logic structures that do not change over time.
The compare and select logic 600 (
Because different drives and different media have different characteristics, the present invention allows the system 1300 to adapt to the actual read-back signal in the channel 100. The LMS engine 1600 determines the predictor coefficients used to for the adaptive process. The hardware architecture of the LMS engine 1600 is illustrated in
A generalized diagram of the target and error generator 1800 and the error filter 1900 is shown in
The logic associated with target and error generator block 1800 of the LMS engine 1600 is shown in
The error filter 1900 is illustrated in
The LMS tap 2000 (
αε{0.00001544, 0.00003088, 0.00006176, 0.00012352}
A source select module 202 (
The calculations may be performed in parallel. Alternatively, the number of multiplications performed at one time may be further reduced to one if the mean-squared terms are calculated sequentially. This employs a small state machine to select at most two inputs to an adder and squaring block. The results are saved as they are calculated (REG0-REG8) and then all of the mean-squared terms (ZREG0-ZREG8) plus P1 are sent to the detector 400 and whitening filter 1400, preferably simultaneously.
TABLE IV presents the status of the outputs of the P1 Select, and One Select for each of the seven possible states of the State_Engine.
It is important to note that while the present invention has been described in the context of a fully functioning data processing system, those of ordinary skill in the art will appreciate that the processes of the present invention are capable of being distributed in the form of a computer storage readable medium of instructions and a variety of forms and that the present invention applies regardless of the particular type of signal bearing media actually used to carry out the distribution. Examples of computer readable storage media include recordable-type media such as a floppy disk, a hard disk drive, a RAM, and CD-ROMs.
The description of the present invention has been presented for purposes of illustration and description, but is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiment was chosen and described in order to best explain the principles of the invention, the practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated. Moreover, although described above with respect to methods and systems, the need in the art may also be met with a computer program product containing instructions for improved data detection or a method for deploying computing infrastructure comprising integrating computer readable code into a computing system for improved data detection.
The present application is related to commonly-assigned and co-pending U.S. application Ser. No. 11/054,060 (U.S. Publication No. 2006/0176982), entitled APPARATUS, SYSTEM AND METHOD FOR ASYMMETRIC MAXIMUM LIKELIHOOD DETECTION, filed on Feb. 9, 2005, which application is incorporated herein by reference in its entirety.