Robust maximum-likelihood based timing recovery

Information

  • Patent Application
  • 20060256896
  • Publication Number
    20060256896
  • Date Filed
    May 10, 2005
    19 years ago
  • Date Published
    November 16, 2006
    18 years ago
Abstract
A method of timing recovery comprises receiving an input signal, sampling the input signal to produce a plurality of signal samples, detecting the samples to produce an output signal, and controlling timing of the sampling in response to a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant. Apparatus that performs the method is also provided.
Description
FIELD OF THE INVENTION

This invention relates to signal processing, and more particularly to timing recovery in receivers of communications systems.


BACKGROUND OF THE INVENTION

In communications systems where a transmitter sends information symbols to a receiver over time, there is a fundamental requirement that the receiver and transmitter operate according to a common clock. The receiver generally knows the nominal value of the transmission frequency, but the receiver clock is never truly synchronized with the transmitter clock. Thus, the receiver needs to recover the clock associated with the received signal from the signal itself. This problem of recovering the correct timing information from the received signal is called timing recovery or clock synchronization.


A magnetic recording system may be viewed as a communications system if the write head is viewed as a transmitter and the read head is viewed as the receiver. In magnetic recording, the desire to push recording densities higher has caused today's recording devices to operate at low signal-to-noise ratios (SNR) and with more sophisticated coding algorithms, making timing recovery more challenging. An analog signal produced at the read head is sampled to produce a plurality of samples. Traditional timing recovery architectures include a timing error detector that processes the received samples to produce a quantity that is a measure of the timing phase error. This quantity is further passed through a loop filter to produce a correction signal that is used to control the sampling timing, for example by driving a sampler through a voltage controlled oscillator (VCO). The system is decision directed, i.e., the detected bits are used by the timing recovery algorithm with the assumption that they are error free. A commonly used timing error detector is the Mueller and Müller (MM) detector.


Under ideal operation, the timing phase is updated by the timing recovery algorithm in a way such that the received signal is sampled at the proper locations. However, noise in the system can sometimes cause incorrect timing updates. For example, suppose that errors jn in the timing updates accumulate over several hundreds of bits to produce a net delay of one bit. There will certainly be several detection errors during this interval, but at the end of the interval, the detected bits will become error free but delayed by one bit. The result is that the timing error detector will not detect this one-bit offset in the sampling phase because it is decision directed and operates under the assumption that the decision bits are correct. Thus, any timing recovery that uses a decision directed approach suffers from the above phenomenon called cycle slip.


More generally, the timing phase can jump behind or ahead of the correct sampling phase by an integer number of bits and cause a cycle slip. The MM method performs quite well at high SNRs where the cycle slip rates are generally very low. Then the resulting bit error rates are nearly equal to that of the perfect timing case. However, at lower SNRs, the system is plagued by cycle slips, and the bit error rate rises.


Consequently, there is a need for a timing recovery technique that can be used in magnetic recording devices and other communications devices that operate at low SNRs.


SUMMARY OF THE INVENTION

This invention provides a method of timing recovery comprising: receiving an input signal, sampling the input signal to produce a plurality of signal samples, detecting the samples to produce an output signal, and controlling timing of the sampling using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant.


In another aspect the invention provides an apparatus comprising an input for receiving an input signal, a sampler for sampling the input signal to produce a plurality of signal samples, a detector for detecting the samples to produce an output signal, and a timing estimator for controlling timing of the sampler using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant.


The invention also encompasses a method of timing recovery comprising: receiving an input signal, sampling the input signal using a free-running clock to produce a plurality of signal samples, interpolating the signal samples to produce interpolated samples, detecting the interpolated samples to produce an output signal, and controlling interpolation of the signal samples using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant.


In another aspect, the invention provides an apparatus comprising an input for receiving an input signal, a sampler sampling the input signal using a free-running clock to produce a plurality of signal samples, an interpolator for interpolating the signal samples to produce interpolated samples, a detector for detecting the interpolated samples to produce an output signal, and a timing estimator for controlling interpolation of the signal samples using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant.




BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a block diagram of the timing recovery portion of a conventional communications system.



FIG. 2 is a block diagram of the timing recovery portion of a communications system constructed in accordance with the present invention.



FIG. 3 is a schematic diagram of a Markov system.



FIG. 4 is a graph illustrating a local clock frequency based on a random walk model.



FIG. 5 is a graph showing a comparison of timing error variance for longitudinal recording.



FIG. 6 is a graph showing a comparison of timing error variance for perpendicular recording.



FIG. 7 is a graph showing a comparison of cycle slip rate (CSR) for longitudinal recording.



FIG. 8 is a graph showing a comparison of CSR for perpendicular recording.



FIG. 9 is a graph showing a comparison of timing error variance for perpendicular recording with 50% transition noise.



FIG. 10 is a graph showing a comparison of CSR for perpendicular recording with 50% transition noise.



FIG. 11 is a graph showing a comparison of timing error variance for perpendicular recording with 90% transition noise.



FIG. 12 is a graph showing a comparison of CSR for perpendicular recording with 90% transition noise.



FIG. 13 is a block diagram of a conventional communications system including interpolated timing recovery.



FIG. 14 is a block diagram of a communications system constructed in accordance with the present invention including interpolated timing recovery.




DETAILED DESCRIPTION OF THE INVENTION

Referring to the drawings, FIG. 1 is a block diagram of portions of a communications system 10 that uses conventional timing recovery. A transmitter 12 produces signals an that are transmitted over a channel 14 to a receiver 16. The transmitted signals may be subject to noise or interference, resulting in signals r(τ) being received at the receiver.


The receiver includes a front end circuit 18 that can include a low pass filter, which produces a filtered signal s(τ). A sampler, illustrated as a switch 20, samples the filtered signal to produce a plurality of samples yn that form a digital representation of the filtered signal. An equalizer 22 equalizes the samples to produce a plurality of equalized samples zn. A detector 24, which can be, for example, a Viterbi detector, processes the equalized samples to produce estimates ân2 of the samples on line 26. A timing error detector 28 receives the equalized samples zn and the estimates ân, and processes these signals using an algorithm, such as the MM algorithm, to produce a timing error signal on line 30. The timing error signal passes through a loop filter 32 and the resulting filtered timing error signal on line 34 is used to control the frequency of a voltage controlled oscillator 36, which controls the sampling frequency of the sampler such that it produces samples at times t={circumflex over (τ)}n, where {circumflex over (τ)}n is an estimate of the ideal sampling time instant τn. The timing error is the time between an actual sampling time instant and a desired sampling time instant. In this description the variable t is used as a generic variable to represent time.


This invention provides a method and apparatus for timing recovery for communications channels in general, and in one embodiment, for magnetic data storage channels, based on a maximum-likelihood (ML) estimation technique. When applied to data storage channels, a random walk model is used for the clocks associated with read and write heads that allows for imperfections such as slowly varying phase and frequency offsets of the clocks.


Bits are written on the magnetic medium by applying a magnetic field on the recording medium using the write head. The write head is controlled by a clock, which determines where the bit-transitions occur. To read these bits (at a later time), a read head senses the magnetic field on the medium and converts it into an electrical signal, which is further processed. The read process also uses a clock, which can be the same clock as the write clock. However the two clocks are to be considered non-synchronous because they operate at different times. The read channel samples the readback waveform, and processes the samples to recover the bits.


Timing recovery is a mechanism used to determine where to sample the signal. The nominal sampling frequency is known, but because no two clocks are synchronous, we need to determine exactly where to sample the signal using other means. This timing information is determined from the readback signal itself. There are two phases in (any) timing recovery method called the acquisition phase and the tracking phase. In a disc drive storage system, information is stored in sectors on a disc. Each sector includes a known (i.e., fixed) sequence of bits called the preamble, and data bits.


The acquisition phase corresponds to the first part (preamble) of the sector. The function of the preamble is to help the system acquire the correct timing information, i.e., the correct sampling phase and the correct sampling frequency.


The tracking phase follows the preamble part of the sector, and relates to the actual data bits for the user. These data bits are unknown (but will become known after the read head detects them). In the tracking phase, we make constant adjustments to the timing information to make sure that we are sampling at the right places. In the tracking phase, we use the detected bits in a feedback mode to help with the timing recovery. This is called a decision directed system.


There is fundamentally no difference between the preamble and data phases except that in the preamble phase, the algorithm uses the known preamble bits, while in the data phase, we use the detected user bits.


Timing recovery is important for the system to function without any cycle slips. The conventional method is simple and works well. But its performance degrades when the signal-to-noise ratio decreases, resulting in cycle slips. This invention combines some functions in the conventional system into a single “timing estimator” block that does the timing estimation optimally (in the maximum-likelihood (ML) sense).


The timing information is computed using a maximum-likelihood estimator. We begin with an initial guess of the timing information. This occurs even before the preamble. This may be incorrect, but in today's recording devices, the sampling phase is correct by about half a bit.


During the preamble, the timing information is updated and by the end of the preamble, we will have acquired the correct timing information (phase and frequency). Next, in the data phase, we use the decision directed approach, to constantly update the timing information.


Thus the algorithm sets an initial guess for timing information and its covariance matrix; and, for each readback sample, it updates the timing information and the covariance matrix. This updating is done efficiently and optimally.


Based on this model, estimates of the optimal sampling instants are derived. These estimates are optimal in the ML sense and also, asymptotically, in the minimum mean square error (MMSE) sense. This technique is robust and tracks changes in the clock phase and frequency extremely well.


This invention seeks to improve the system performance, specifically to reduce the cycle slip rate at low SNRs. One embodiment of the invention combines the timing error detector and the loop filter into one block and optimizes it. FIG. 2 is a block diagram of a communications system 40 that includes timing recovery performed in accordance with the present invention. A transmitter 42 produces signals an that are transmitted over a channel 44 to a receiver 46. The transmitted signals may be subject to noise and/or interference, resulting in signals r(τ) being received at the receiver. The receiver includes a front end circuit 48 that can include a low pass filter, which produces a filtered signal s(τ). A sampler, illustrated as a switch 50, samples the filtered signal to produce a plurality of samples yn that form a digital representation of the filtered signal. An equalizer 52 equalizes the samples to produce a plurality of equalized samples zn. A detector 54, which can be, for example, a Viterbi detector, processes the equalized samples to produce estimates ân of the samples on line 56. A timing error detector 58 receives the plurality of samples yn and estimates all and subjects these signals to an algorithm, to produce a timing error signal. The timing error signal is used to control the sampling frequency of the sampler, such that it produces samples at times t={circumflex over (τ)}n. The timing error signal can be used to control a voltage controlled oscillator that controls the sampling frequency.


This invention can be applied to the readback signals in a magnetic recording system. A random walk model is used for the transmitter and receiver clocks, which are referred to as the write clock and read clock respectively. The clocks can have a slowly varying relative phase and frequency offset. Expressions are derived for the sampling instants that are optimal in the maximum-likelihood (ML) sense. These estimates also coincide with the minimum mean square error (MMSE) estimates asymptotically. The timing estimation can be performed efficiently using a simple algorithm. Like the conventional timing recovery algorithm, the algorithm used in this invention is also decision directed, implying that the readback waveform is processed along with the detected bits to estimate the future sampling instants.


The ideal readback waveform from a magnetic recording medium can be modeled as follows:
r(τ)=mbmhT(τ-mT)+z(τ)(1)

where
bm=(am-am-1)2

is the transition sequence corresponding to the written bit sequence, and where {am=±1}, which represents the magnetization of the data storage medium, T is the bit-width, hT(t) is the transition response of the system, defined as the response to a transition of the magnetization from −1 to +1, and z(τ) is the additive Gaussian electronic noise with a power spectral density height of N0/2. This model ignores the effects of transition noise in the channel. The input bits are assumed to be independent and identically distributed (IID), i.e., the system is uncoded. For perpendicular recording, the transition response is
hT(t)=erf(2tln2W)whereerf(x)=2π0x-t2t(2)

is the error function and W is the pulse width defined by the limits t=±W/2, and where the derivative of the transition response is one half of its peak value. For longitudinal recording
hT(t)=11+(2t/W)2(3)

where W is the pulse width defined by the limits t=±W/2, and where the transition response is one half of its peak value. The quantity W is the “50% pulse width” and is often written as PW50. Assuming that the channel response is essentially band-limited to the frequencies, [−½T, ½T], applying a front end analog low pass filter with cut-off frequencies ±½T produces the filtered readback waveform
s(t)=mbmhT(t-mT)+w(t)(4)

where w(t) is the filtered Gaussian noise with a flat spectrum of N0/2T on [−½T, ½T]. By Shannon's sampling theorem, samples of this waveform at t=nT, nεZ provide sufficient statistics, where Z is the set of integers, i.e., { . . . ,−2,−1,0,1,2, . . . }.


In the presence of transition jitter noise in the channel, the readback signal and the output of the front end filter are given by
r(t)=mbmhT(t-mT-Δtm)+z(t)s(t)=mbmhT(t-mT-Δtm)+w(t)

where Δtm is the transition jitter noise if there is a transition at time mT. The transition noise is modeled as an independent and identically distributed (IID) truncated Gaussian random variable with variance var(Δtm)=σj2. The following definition of signal-to-noise ratio (SNR) of the channel has been adopted:
SNR=EiN0+M0Ei=W2RhT(t)2tM0=Eiσj2W2

where Ei is the energy in the derivative of the transition response hT(t) scaled by W2 to preserve the units of energy, and M0/2 is the average transition noise energy per bit, which is obtained by using a first order Taylor series approximation to separate the effect of the transition noise from the clean signal.


The model for the readback waveform presented in equation (4) assumes that the read head and write head have synchronous clocks. Unfortunately, this assumption is not true and there are several factors that cause the read and write clocks to differ from each other. For instance, variations in the disc drive spindle speeds during the read and write processes cause the relative velocity between the head and the medium to be different every time data is written or read. In fact, during the course of one sector alone, the speed is non-constant. The fundamental problem of timing recovery is to use the readback waveform to estimate the write clock, i.e., to determine where to sample the waveform.


Random walk models can be used for the clocks. Assume that both the read and write clocks are imperfect and that the times on these clocks can be described in terms of an absolute clock (which is unavailable to the read and write heads) that maintains the absolute time. Denote the absolute time by t and the corresponding times on the read and write clocks by τ and τ′ respectively, then:

τ′=fr(t)
τ′=fw(t)

where fr(t) and fw(t) are the clock mapping functions, which are continuous, monotonically increasing, invertible functions. The slopes of these functions are nearly unity, implying that their frequencies are nearly correct. Of course, if both clocks were synchronized to the absolute time, then τ=τ′=t.


The readback signal at time τ on the read clock corresponds to an absolute time t=fr−1(τ). If the write clock were perfect, i.e., τ′=t, then the actual readback waveform of equation (1) is modified as follows:
r(τ)=mbmhT(fr-1(τ)-mT)+z(τ).


However, the write clock is not perfect and the transitions are written at time instants τ′=mT where mεZ. Their corresponding absolute times are t=fw−1(mT). Therefore, the above expression can be further modified to obtain the actual readback waveform
r(τ)=mbmhT(fr-1(τ)-fw-1(mT))+z(τ).(5)


This expression is unwieldy, and can be subjected to a few simplifications. Note that for any t1 and t2, we have (fr(t1)−fr(t2))=(t1−t2)μ where μ≅1 is the average slope of fr(t) on [t1, t2]. Therefore, the dominant term in the Taylor series expansion yields

|hT(fr(t1)−fr(t2))−hT(t1−t2)≈(μ−1)(t1−t2)hT(t1−t2).


For both perpendicular and longitudinal modes of recording, the function thT(t) decays to zero quickly. Therefore,

|hT(fr(t1)−fr(t2))−hT(t1−t2)|≦C|μ−1|

where C=maxtthT(t). It is easily verified that C=0.5 for longitudinal recording and C=√{square root over (2π)}e−0.5=0.4839 for perpendicular recording. Applying this to t1=fr−1(τ) and t2=fw−1(mT) we obtain

|hT(τ−fr(fw−1(τ)))−hT(fr−1(τ)−fw−1(τ)−fw−1(mT)|≦C|μ−1|.


The error incurred by replacing hT(t1−t2) by hT(fr(t1)−fr(t2)) is in the same order of magnitude as the frequency offset in the read clock μ−1 that is usually about a fraction of a percent. This error is negligible compared to the additive in-band electronic noise in the system. Therefore, equation (5) can be approximated as
r(τ)mbmhT(τ-fr(fw-1(mT)))+z(τ).(6)


Now, at the output of the front end low pass filter, we have
s(τ)=mbmhT(τ-fr(fw-1(mT)))+w(τ)                                                 (7)mbmhT(fw(fr-1(τ))-mT)+w(τ)(8)

where the second step follows from a similar approximation as before, and w(τ) is the additive Gaussian noise band-limited to [−½T, ½T] with a flat spectrum of N0/2T. From equation (8), it is clear that the desired sampling locations with respect to the read clock are approximately τm:=τ(mT) where τ(τ′):=fr(fw−1(τ′)).


Instead of modeling fr(t) and fw(t) separately, it is convenient to model τ(τ′)=fr(fw−1(τ′)) or the read clock relative to the write clock using the following random walk model for the function τ(τ′):

τ(τ′)=τ(0)+∫0τ′μ(x)dx+∫0τ′ρ(x)dx  (9)

where ρ(x) is a white noise and μ(x) is the local clock frequency, which is also slowly varying according to a random walk model:

μ(τ′)=μ(0)+∫0τ′σ(x)dx.  (10)


The quantities ρ(x) and σ(x) are Gaussian noise processes with power spectral densities Sρ(f)=σR2 and Sσ(f)=σS2 respectively. We assume that σS is small enough so that the local relative clock frequency is approximately unity, i.e., μ(x)≅1 for several thousands of cycles.


Recall that τn=τ(nT) are the desired sampling times on the read clock. Also let μn=μ(nT). Then, these quantities are modeled by the following difference equations:

τnn−1+Tμn−1+rn  (11)
μnn−1+sn  (12)

or in vector form as the following random walk process
λn=(1T01)λn-1+υnwhereλn=(τnμn),υn=(rnsn)and(13)E[υnυmT]=δmn(σR2T+σS2T3/3σS2T2/2σS2T2/2σS2T).(14)


The variable λn is a vector consisting of the phase and frequency information at time n. λn can be called the timing information vector. The variable νn is a Gaussian noise whose statistics are given in equation (14).



FIG. 1 shows the block diagram of a conventional timing recovery system. The channel output is filtered and sampled to produce the readback signal s(τ). These samples are equalized and detected to produce estimates {ân} of the input bits. The timing error detector is controlled by the equalized samples zn and the detected data ân.


From the data processing theorem, it is clear that we can extract more information about {circumflex over (τ)}n by processing the equalizer input yn instead of the output zn. While the conventional MM method processes the equalized samples zn, the ML method (as illustrated in FIG. 2) uses the raw sequence yn. For this approach, we require knowledge of the dibit response of the channel.


The invention can also be implemented in a variant of the ML method that processes the equalized sequence zn just like the MM method. For this variant, the system of FIG. 2 would be modified to use zn and ân as the inputs of the timing estimator. This method does not require knowledge of the dibit response. Hence it is more attractive from a practical viewpoint. The mathematical analysis for the variant ML method is a straightforward modification of the original ML method.


The system of FIG. 2 combines the timing error detector, loop filter and the voltage controlled oscillator (VCO) blocks into a single block labeled timing estimator 58. As in the conventional system, we use a decision directed approach and the assumption that ann most of the time. Although a VCO is not shown, the system can be implemented using one.


In a disc drive storage system, data is written in sectors on a disc. As with the conventional method, we require a preamble at the start of each sector to ensure proper acquisition of the timing information. The preamble is a fixed bit-sequence that is used for the initial acquisition of the timing information. During the preamble part of the sector, we do not use the detected bits ân to estimate the timing information. Instead, we use the actual preamble bits, which are known to the receiver, until the timing information is obtained. At the end of the preamble, we switch to the data mode where the timing information is updated using the detected bits ân. This is the decision directed mode.


Let so(t) denote the ideal noise-free readback waveform:
so(t)=mbmhT(t-mT)=mamhD(t-mT)(15)

where
hD(t)=(hT(t)-hT(t-T))2

is the dibit response. Denote its samples at t=nT by
yno=so(n)=man-mhD(mT).(16)

The superscript “o” indicates that the readback signal is ideal and devoid of imperfections due to clock jitter or noise.


Suppose that we have an estimate {circumflex over (τ)}n of the sampling time τn, which is obtained from an earlier step of the timing recovery algorithm. The corresponding sample of the filtered readback waveform can be written in terms of the ideal readback waveform using equation (8):
yn:=s(τ^n)=mbmhT(fw(fr-1(τ^n))-mT)+w(τ^n)so(fw(fr-1(τ^n)))+w(τ^n)(17)


Now, observe that
fw(fr-1(τ^n))fw(fr-1(τ^n))+(τ^n-τn)[fw(fr-1(τ^n))t]t=τn=nT+(τ^n-τn)(18)

where we use the first order Taylor approximation, the approximation that the local clock frequencies of the read and write clock are both close to unity, and fw(fr−1n))=nT.


Combining equations (17) and (18), we obtain
yn=s(τ^n)so(nT)+(τ^n-τn)[so(t)t]t=nT+w(τ^n).(19)


Using equations (15), (16) and (19) we obtain
yn=yno+(τ^n-τn)man-m[hD(t)t]t=mT+w(τ^n)                            (20)=yno+γn(τn-τ^n)+wn(21)

where var (wn)=σw2=N0/2 T,
γn=-man-m[hD(t)t]t=mT.(22)


Since {circumflex over (τ)}n is a known quantity from a previous estimation step, we can calculate the following quantity
θn:=(yn-yno)+γnτ^n=γnτn+wn(23)

which we term as our “observation” of τn at time n. Note that, to calculate yno and γn, we need all the past and future data input bits. A workaround for this issue is to truncate the tail parts of the summations (16) and (22) and replace the actual bits by detected bits, i.e.,
ynom=-JJoa^n-mhD(mT)(24)γn-m=-JJ0a^n-m[hD(t)t]t=mT(25)

where J0 and J are sufficiently large integers. In general, larger values of J0 and J yield better approximations. However, the above expression depends on ân+J which implies an unavoidable delay of J bits between the detector and the timing estimator. Although a large value for J0 does not affect the loop delay, it does affect the complexity. Since much of the dibit response energy lies in [−2W, 2W], we choose J=F┌2W/T┐ and a more liberal choice of J0=┌5W/T┐, where ┌.┐ is the ceiling function or the smallest integer upper bound.


Ideally, we would like to use the observations {θk: k≦n} to estimate the next sampling instant τn+1. However, there is a delay in computing θn for the following reason. Suppose that the equalizer has length 2K+1 and is centered at the origin, it has K non-causal taps and hence introduces a delay of K samples in the forward link. Additionally, the detector introduces a certain delay. For instance the Viterbi detector, although a sequence detector, can be made to produce sub-optimal decisions after decoding through a depth of D bits in the trellis. This causes a delay of D bits. Thus the total delay in the timing loop is L=J+K+D, i.e., the earliest time when θn and γn can be computed is n+L. Thus, our estimation algorithm must be able to predict not just τn+1 but the future value τn+L+1 where L is the loop delay described above:

{circumflex over (τ)}n+L+1:=En+L+1k:k≦n].  (26)


Note that we can presumably improve the timing phase estimate if we use all the available information. This would translate to conditioning additionally on {yk:n<k≦n+L} in the above estimation:

{circumflex over (τ)}n+L+1:=E[τn+L+1|{θk:k≦n},{yk:n<k≦n+L}].  (27)


However, this is a hard problem because it is no longer a Gaussian estimation problem. For the purpose of this description, we consider only the estimation in equation (26), although slightly sub-optimal. The estimation is described fully below where we derive the ML estimate of the timing information and discuss its optimality.


The transition response hT(t) is needed to compute yno and γn. The transition response can be estimated from a large training sequence. These coefficients need not be known with very high precision because any estimation errors in these coefficients can be absorbed into the noise term wn, and σw can be adjusted accordingly.


Equation (23) indicates that the observation at time n is expressible as

θnnτn+wn  (28)

where γn is known and wn is an IID Gaussian noise with variance σw2. The ideal sampling instants τn and local clock frequencies μn are described by the random walk model (13). The random variables λn and θn clearly form a Markov chain whose dependence is illustrated in FIG. 3. The function of the timing estimator block is to compute the optimal estimate of τL+n+1 from the observations {θk:k≦n}. However, in the following analysis, we derive a more general expression for estimates of the current and all future values τm, m≧n from the observations θ0n:={θk:k≦n}. Since all the random variables in this problem are jointly Gaussian, the estimates can be expressed as linear functions of the observations θ0n.


A consequence of the random walk model is that at steady state (large n), the variances of τn and μn approach ∞. In other words, the prior information on these quantities vanish at steady state implying that their minimum mean square estimates (MMSE) coincide with their (unbiased) maximum-likelihood estimates (MLE). We shall focus on the MLE alone, but for convenience we write the estimates of τm and μm for m≧n given θ0n as expectations with the understanding that the steady state assumption is used, i.e.,

{circumflex over (τ)}m(n)=E[τm0n]  (29)
{circumflex over (μ)}m(n)=E[λm0n]  (30)

where E[•] denotes the expectation. The subscript m denotes the index of the quantity that we are estimating and superscript n denotes the highest index of the observation used for estimation. The above equations can be written compactly as

{circumflex over (λ)}m(n)=Em0n], m≧n.  (31)


Observe that equation (13) implies that
λm=(1T01)m-nλn+k=n+1m(1T01)m-kυk=(1(m-n)T01)λn+k=n+1m(1(m-k)T01)υk.


Now, {vk:k>n} has a zero-mean and is independent of the observations θ0n. Thus,
λ^m(n)=E[λmθ0n]=(1(m-n)T01)E[λnθ0n]=(1(m-n)T01)λ^n(n).Equivalently,(32)τ^m(n)=τ^n(n)+(m-n)Tμ^n(n)(33)μ^m(n)=μ^n(n).(34)

Therefore, it suffices to compute λn(n). The estimates of λm for m≧n are given by equation (32).


A direct computation of equation (31) is prohibitive because its computational complexity grows as n grows. Fortunately, there is a simple technique that allows us to compute these estimates recursively and efficiently. We first present a useful result.


Lemma 1: Suppose A→B→C→D is a Markov chain, then P(C|ABD)=P(C|BD). This is proved easily by the property of conditional probabilities:
P(CABD)=P(CDAB)P(DAB)=P(CDB)P(DB)=P(CBD).


Since all of the random variables in this description are Gaussian, the following is a Markov chain:

θ0n−1→{circumflex over (λ)}n-1(n−1)→λn-1→λn→θn.


The Markov property of θ0n−1→{circumflex over (λ)}n−1(n−1)→λn−1 follows because the estimated error {circumflex over (λ)}n−1(n−1)−λn−1 is uncorrelated and independent of the observations θ0n−1, while the Markov property of {circumflex over (λ)}n−1(n−1)→λn−1→λn→θn follows because vn and wn are independent of each other and of {circumflex over (λ)}n−1(n−1) which is a linear function of θ0n−1.


Consider the estimate {circumflex over (λ)}n(n) at time n. This quantity can be expressed as

θ0n−1→{circumflex over (λ)}n−1(n−1)→λn−1

because {circumflex over (λ)}n−1(n−) is a function of θ0n−1. We have already shown that θ0n−1→{circumflex over (λ)}n−1(n−1)→λn−θn is a Markov chain. Therefore, by lemma 1 we can drop the conditioning on θ0n−1 in the above expectation to obtain

{circumflex over (λ)}n(n)=Enn,{circumflex over (λ)}n−1(n−1)].  (35)


Two expressions for the estimate of {circumflex over (λ)}n(n) (the timing information) are shown above. Equation (31) says that {circumflex over (λ)}n is the expectation of λn conditioned on all the past observations: θ0n={θ0, . . . ,θn}. We use the Markov chain property to simplify this expression to (35), which is also a conditional expectation, but depends only on two quantities: the last observation θn, and the last estimate {circumflex over (λ)}n−1(n−1). Equation (35) can be computed in an efficient way unlike equation (31). This simplification does not compromise accuracy because both these expressions are equal to each other.


The quantities θn and {circumflex over (λ)}n−1(n−1) provide sufficient statistics for estimating λn and we have recast our estimation into the above simple form. At time n−1, let

{circumflex over (λ)}n−1(n−1)n−1+un−1λ

where un−1λ is the estimation error of λn−1 having covariance Kn−1:

Kn−1=E[un−1λ(un−1λ)T].  (36)


Using equations (13) and (28), we can express our sufficient observations for estimating λn+1 as
θn=γnτn+wnλ^n-1(n-1)=λn-1+un-1λ=(1-T01)(λn-υn)+un-1λ.


Therefore, our sufficient statistics are where
(θnλ^n-1(n-1))=Anλn+ηnwhereAn=(γn01-T01)andηn=(wn(1-T01)υn+un-1λ).(37)


The covariance matrix of ηn is readily computed using equations (14) and (36):
Cn:=E[ηnηnT]=(σW2000σR2T+σS2T3/3-σS2T2/20-σS2T2/2σS2T)+(000Kn-1).(38)


Finally, using equations (37), (38) and we obtain the estimate in equation (35)
λ^n(n)=E[λnθn,λ^n-1(n-1)]=(AnTCn-1An)-1AnTCn-1(θnλ^n-1(n-1)).(39)


Combining Equations (37) and (39) we obtain

{circumflex over (λ)}n(n)n+unλ

where the estimation error and its covariance at time n are

{circumflex over (λ)}n(n)n+unλ
Kn=E[unλ(unλ)T]=(AnTCn−1An)−1.  (41)


The covariance matrix in equation (41) is used for estimation in the next iteration. Assuming that the delay in the feedback loop is L, our timing estimate {circumflex over (τ)}n+L+1 is computed using equations (26) and (33):

{circumflex over (τ)}n+L+1={circumflex over (τ)}n(n)+(L+1)T{circumflex over (μ)}n(n).


The ML estimation algorithm can be summarized as follows:


1. Choose initial values for {circumflex over (λ)}−1(−1) and K−1.


2. Set {circumflex over (τ)}k=kT for k≦L.


3. For n=0, 1, 2 . . . do steps 4-8.


4. Obtain θn and γn from the timing error detector.


5. Compute Cn using equation (38).


6. Compute {circumflex over (λ)}n(n) using equation (39).


7. Compute {circumflex over (τ)} n+L+1=(1(L+1)T){circumflex over (λ)}n(n).


8. Update Kn using equation (41).


9. End


The initial value for {circumflex over (λ)}−1(−1) can be taken as
λ^-1(-1)=(τ^-1(-1)μ^-1(-1))=(-T1)

and K−1 is chosen to reflect our uncertainty of these estimates. A large value of K−1 would appear be a conservative choice since it reflects our noncommittal feeling about the initial estimates. However, this choice causes the phase and frequency updates in the first few iterations to be so large that the Taylor series approximations break down. This can yield cycle slips at the very start of the sector.


In today's magnetic recording media, the initial phase offset is generally within one half of a cycle and the frequency offset is less than 1%. Thus, we can adopt a more liberal choice such as
K-1=(T2/40010-4)(42)

or any matrix whose entries are in the same order of magnitude since the performance is not very sensitive to the precise value of K−1. Finally, a very small value for K−1 is undesirable because it makes the initial updates very sluggish and slows down the convergence rate. The result is that large frequency offsets are not tracked quickly. The ML algorithm automatically updates this covariance matrix Kn in each iteration and the covariance matrix converges to the same value independent of K−1.


Apart from its optimality, the ML estimation method has several advantages over the conventional MM method. For example, we do not need to rely on any heuristics or a trial and error approach as in the design of a loop filter in the conventional algorithm. Also, we do not need to modify any coefficients in the acquisition and tracking phases of the algorithm. For the ML method, we need to specify only an initial estimate and a covariance matrix K−1 describing the uncertainty of the timing information. Finally, the quantities σR and σS are properties of the mechanical system (head and medium combination) and can be estimated for a particular storage device.


The performance of the above algorithm can be compared with a conventional Mueller and Müller based algorithm to demonstrate that the ML algorithm converges quickly and tracks changes in the phase and frequency extremely well. It performs better than the conventional MM method in terms of bit error rates (BERs) and cycle slip rates (CSRs), as well as the variance of the steady state timing error. Variants of the ML algorithms can be applied to different signal and channel models.


The performance of the ML timing estimation algorithm for an uncoded magnetic recording channel by computer simulation can be compared with a conventional method that uses the MM timing error detector. We consider longitudinal and perpendicular modes of recording at a normalized recording density of W/T=2. In the simulations, sectors consisting of 4096 information bits and an additional 100 bits for the preamble are processed. An equalizer of length 2K+1=11 designed for a monic generalized partial response (GPR) target of length 3, and a Viterbi detector with detection depth of D=4 bits were used. In the implementations of (16) and (22) we kept only J=4 non-causal terms and J0=10 causal terms. Therefore, the net delay in the timing recovery loop is L=K+D+J=13 bits. Thus our ML estimates of the timing phase and frequency are

{circumflex over (τ)}={circumflex over (τ)}n(n−14) and {circumflex over (μ)}n={circumflex over (μ)}n(n−14).


Note that at the start of the sector (i.e., the timing acquisition part) causality is not an issue because the preamble data bits are known. Thus, during acquisition we have L=0, i.e.,

{circumflex over (τ)}n={circumflex over (τ)}n(n−1) and {circumflex over (μ)}n={circumflex over (μ)}n(n−1)

and switch to L=13 at the end of the preamble.


Although we derived our ML estimation algorithm assuming no transition jitter noise, we consider simulations of channels with and without this noise. Our definition of the effective SNR due to the electronic and transition noise is
SNR=EiN0+M0.


In our simulations, we choose T=1 without loss of generality. The actual frequency offset encountered in a real system is the range of 0.2% to 0.4% and the initial phase offset is about 0.5 of a clock pulse. In all the simulations, we choose an initial phase offset of τ0=0.5T and a frequency offset of 0.4%, i.e., μ0=1.004. We also allow the local frequency to change slowly according to the random walk model described by equations (11) and (12). The values of σR and σS control the rate at which the clock frequency drifts. FIG. 4 illustrates a typical realization of μn (the actual local frequency of the write clock relative to the read clock) generated from the random walk model for

σS=1=10−5.



FIG. 4 shows that these parameters allow for realistic changes in the frequency offset during the course of one sector. We use σRS=10−5 in our simulations.


We first consider the case with no transition noise, i.e., M0=0 for SNR=Ei/N0 in the range 16 dB to 25 dB. At 16 dB, the SNR is considered very low as it produces raw bit error rates well in excess of 10−2. We evaluate the performance of the ML and the conventional local frequency algorithms in terms of their cycle slip rate (CSR) and timing error variance. FIGS. 5 and 6 compare the normalized steady state timing error variance for the two modes of recording (longitudinal and perpendicular). FIG. 5 shows a comparison of timing error variance for longitudinal recording. FIG. 6 shows a comparison of timing error variance for perpendicular recording. The normalized timing error variance is defined as

E∥τn−{circumflex over (τ)}n2/T2

where E[•] denotes the expectation over n as well as the ensemble. These values are obtained by processing sectors that do not have cycle slips. Clearly, the ML method yields a better estimate of the timing phase and hence a lower error variance for each SNR. FIG. 7 shows a comparison of CSR for longitudinal recording. FIG. 8 shows a comparison of CSR for perpendicular recording. All these plots were obtained by simulating over 1000 sectors (each consisting of 4096 data bits and 100 preamble bits) at each SNR.


We now consider the performance of the ML timing recovery algorithm in the presence of jitter noise. We consider two cases: transition noise values of 50% and 90% in terms of the effective noise power, i.e.

M0=α(M0+N0)

where α=0.5 and α=0.9. The remaining system parameters are kept the same as in the previous case.



FIG. 9 shows a comparison of timing error variance for perpendicular recording with 50% transition noise. FIG. 10 shows a comparison of CSR for perpendicular recording with 50% transition noise. FIG. 11 shows a comparison of timing error variance for perpendicular recording with 90% transition noise. FIG. 12 shows a comparison of CSR for perpendicular recording with 90% transition noise. From the data in these figures, it is apparent that the ML algorithm works well for the channel with transition noise even though it was not specifically designed for this case. Once again, the ML method outperforms the conventional method. There are similar gains for longitudinal recording as well.


From a practical standpoint, it is desirable to eliminate the VCO in the timing recovery system completely. This is because a fully digital implementation is easier to build. A solution to this problem in the context of conventional timing recover is to use an A/D sampler triggered by a free-running clock and an interpolator to predict the samples.



FIG. 13 is a block diagram of a conventional communications system 100 including interpolated timing recovery. A transmitter 102 produces signals an that are transmitted over a channel 104 to a receiver 106. The transmitted signals may be subject to noise or interference, resulting in signals r(τ) being received at the receiver. The receiver includes a front end circuit 108 that can include a low pass filter, which produces a filtered signal s(τ). A sampler, illustrated as a switch 110 samples the filtered signal using a free-running clock. The samples are interpolated by interpolator 112 to produce a plurality of samples yn that form a digital representation of the filtered signal. An equalizer 114 equalizes the samples to produce a plurality of equalized samples zn. A detector 116, which can be, for example, a Viterbi detector, processes the equalized samples to produce estimates ân of the samples on line 118. A timing error detector 120 receives the equalized samples zn and the estimates ân, and subjects these signals to an algorithm, such as the MM algorithm, to produce a timing error signal on line 122. The timing error signal passes through a loop filter 124 and the resulting filtered timing error signal on line 126 is integrated by integrator 128, which controls the interpolator.


The sampler uses a free-running clock to obtain samples of the filtered readback waveform at its Nyquist frequency. The job of the interpolator is to compute value of the readback waveform at other sampling instants. The rest of the timing recovery is essentially equivalent to the conventional approach of FIG. 1.


Another embodiment of the present invention uses ML timing recovery in a VCO-free implementation. FIG. 14 is a block diagram of a communications system 140 constructed in accordance with the invention and including interpolated timing recovery. A transmitter 142 produces signals an that are transmitted over a channel 144 to a receiver 146. The transmitted signals may be subject to noise or interference, resulting in signals r(τ) being received at the receiver. The receiver includes a front end circuit 148 that can include a low pass filter, which produces a filtered signal s(τ). A sampler, illustrated as a switch 150, samples the filtered signal using a free-running clock. The samples are interpolated by interpolator 152 to produce a plurality of samples yn that form a digital representation of the filtered signal. An equalizer 154 equalizes the samples to produce a plurality of equalized samples zn. A detector 156, which can be, for example, a Viterbi detector, processes the equalized samples to produce estimates ân of the samples on line 158. A timing error detector 160 receives the plurality of samples yn and the estimates ân, and subjects these signals to an algorithm, to produce a timing error signal. The timing error signal is used to control the interpolator.


In a variant of the system of FIG. 14, the system of FIG. 14 can be modified to used zn and ân as the inputs of the timing estimator. This method does not require knowledge of the dibit response. Hence it is more attractive from a practical viewpoint.


Another possibility is to use the ML estimation algorithm in conjunction with persurvivor processing (PSP). In the conventional PSP based approach, the timing error detector computes an error signal for each surviving path in the trellis of the Viterbi detector. This reduces the loop delay by D bits caused by the Viterbi detector. This method owes its improved performance to its smaller loop delay but has a larger implementation complexity. The present invention can utilize an ML-PSP method with the ML estimator in place of the MM detector and the loop filter.


This invention allows for realistic changes in the clock phase and frequency caused by mechanical imperfections in the magnetic storage device such as spindle speed variation. The channel and clock models were used to derive expressions for the optimal timing information from the readback samples. These estimates for the sampling instants are optimal in the maximum-likelihood sense. These estimates also coincide with the minimum mean square sense asymptotically. The above description shows that the estimates can be implemented efficiently using a recursive method and a simple algorithm. Simulation results show that this method performs better than a conventional timing recovery scheme in terms of both steady state timing error variance and cycle slip rates.


In one embodiment, this invention provides a timing recovery scheme based on maximum-likelihood (ML) estimation in which several functional blocks of a conventional timing recovery system are combined into one block and optimized. In order to solve the problem analytically, the ML estimation is transformed into a Gaussian estimation problem by linearization using Taylor series expansions. A cost efficient solution to the linearized problem is obtained by exploiting the Markov chain relationship between the random variables in the problem. The computations of the ML timing estimate and its error covariance matrix are done recursively using their previous estimates and the convergence of the recursive algorithm parameters is robust to their initial values. The method is not limited to longitudinal and perpendicular modes of magnetic recording, but is also applicable to other recording architectures or communication systems.


The method of this invention can be implemented in the read channel of a magnetic storage device or in receivers of communications systems, using conventional hardware, such as electronic filters, oscillators and signal processors.


While the invention has been described in terms of several examples, it will be apparent to those skilled in the art that various changes can be made to the described examples without departing from the invention as set forth in the following claims.

Claims
  • 1. A method of timing recovery comprising: receiving an input signal; sampling the input signal to produce a plurality of signal samples; detecting the samples to produce an output signal; and controlling timing of the sampling using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant.
  • 2. The method of claim 1, wherein the step of controlling timing of the sampling in response to a maximum-likelihood estimation of timing error between the actual sampling time instant and the desired sampling time instant includes: specifying a covariance matrix representative of uncertainty in the sample timing; and updating the covariant matrix for each maximum-likelihood estimation of timing error.
  • 3. The method of claim 1, wherein the step of controlling timing of the sampling in response to a maximum-likelihood estimation of timing error between the actual sampling time instant and the desired sampling time instant uses the signal samples and the output signal.
  • 4. The method of claim 1, wherein the step of controlling timing of the sampling in response to a maximum-likelihood estimation of timing error between the actual sampling time instant and the desired sampling time instant uses equalized signal samples and the output signal.
  • 5. The method of claim 1, wherein the step of controlling timing of the sampling in response to a maximum-likelihood estimation of timing error between the actual sampling time instant and the desired sampling time instant uses only a last signal sample and a last output signal.
  • 6. The method of claim 1, wherein the step of controlling timing of the sampling in response to a maximum-likelihood estimation of timing error between the actual sampling time instant and the desired sampling time instant includes persurvivor processing.
  • 7. An apparatus comprising: an input for receiving an input signal; a sampler for sampling the input signal to produce a plurality of signal samples; a detector for detecting the samples to produce an output signal; and a timing estimator for controlling timing of the sampling using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant.
  • 8. The apparatus of claim 7, wherein the timing estimator specifies a covariance matrix representative of uncertainty in the sample timing; and updates the covariant matrix for each maximum-likelihood estimation of timing error.
  • 9. The apparatus of claim 7, wherein the timing estimator uses the signal samples and the output signal.
  • 10. The apparatus of claim 7, wherein the timing estimator uses equalized signal samples and the output signal.
  • 11. The apparatus of claim 7, wherein the timing estimator uses only a last signal sample and a last output signal.
  • 12. The apparatus of claim 7, wherein the timing estimator includes persurvivor processing.
  • 13. A method of timing recovery comprising: receiving an input signal; sampling the input signal using a free-running clock to produce a plurality of signal samples; interpolating the signal samples to produce interpolated samples; detecting the interpolated samples to produce an output signal; and controlling interpolation of the signal samples using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant.
  • 14. The method of claim 13, wherein the step of controlling interpolation of the signal samples using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant includes: specifying a covariance matrix representative of uncertainty in the sample timing; and updating the covariant matrix for each maximum-likelihood estimation of timing error.
  • 15. The method of claim 13, wherein the step of controlling interpolation of the signal samples using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant uses the signal samples and the output signal.
  • 16. The method of claim 13, wherein the step of controlling interpolation of the signal samples using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant uses equalized signal samples and the output signal.
  • 17. The method of claim 13, wherein the step of controlling interpolation of the signal samples using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant uses only a last signal sample and a last output signal.
  • 18. The method of claim 13, wherein the step of controlling interpolation of the signal samples using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant includes persurvivor processing.
  • 19. An apparatus comprising: an input for receiving an input signal; a sampler for sampling the input signal using a free-running clock to produce a plurality of signal samples; an interpolator for interpolating the signal samples to produce interpolated samples; a detector for detecting the interpolated samples to produce an output signal; and a timing estimator for controlling interpolation of the signal samples using a maximum-likelihood estimation of timing error between an actual sampling time instant and a desired sampling time instant.
  • 20. The apparatus of claim 19, wherein the timing estimator specifies a covariance matrix representative of uncertainty in the sample timing, and updates the covariant matrix for each maximum-likelihood estimation of timing error.
  • 21. The apparatus of claim 19, wherein the timing estimator uses the signal samples and the output signal.
  • 22. The apparatus of claim 19, wherein the timing estimator uses equalized signal samples and the output signal.
  • 23. The apparatus of claim 19, wherein the timing estimator uses only a last signal sample and a last output signal.
  • 24. The apparatus of claim 19, wherein the timing estimator includes persurvivor processing.