Method and apparatus for providing iterative timing recovery

Information

  • Patent Application
  • 20060067434
  • Publication Number
    20060067434
  • Date Filed
    September 24, 2004
    20 years ago
  • Date Published
    March 30, 2006
    18 years ago
Abstract
A method includes the steps of receiving a signal indicative of data bits, and performing per survivor processing-iterative timing recovery (PSP-ITR) on the received signal to generate probabilities of the data bits. To perform PSP-ITR on the received signal, the signal can be processed using a per survivor processing-soft decision algorithm (PSP-SDA) which jointly performs timing recovery and equalization in accordance with embodiments of the present invention. The soft decision algorithm (SDA) can be, for example, a Bahl, Cocke, Jelinek, and Raviv (BCJR) algorithm or a Soft Output Viterbi Algorithm (SOVA) modified in accordance with the concepts of the present invention such that it is configured to implement per survivor processing (PSP) to jointly perform timing recovery and equalization.
Description
FIELD OF THE INVENTION

The present invention relates generally to timing recovery. More particularly, but not by limitation, the present invention relates to timing recovery using iterative coding schemes.


BACKGROUND OF THE INVENTION

The process of synchronizing a sampler with a received analog signal is known as timing recovery. It is a crucial component in a recording system channel detector, such as magnetic recording channel detectors. The quality of synchronization has a tremendous impact on the overall performance of the channel detector. At current areal recording densities, existing timing recovery architectures perform well. However, at the higher areal densities which will be used in the future, signal energy will be lower and noise in the system will increase. Thus, the signal-to-noise ratio (SNR) will decrease.


The advent of iterative error-correction codes allows the system to operate at low SNRs with acceptable performance due to their large coding gains. This means that timing recovery must also function at low SNRs. A conventional receiver performs timing recovery and error-correction decoding separately. Specifically, conventional timing recovery ignores the presence of error-correction codes; therefore, it fails to function properly at low SNRs, and timing errors increase.


Theoretically, joint maximum-likelihood (ML) estimation of timing offsets and message bits, which will jointly perform timing recovery, equalization and decoding, is a preferred method of synchronization; however, its complexity is gigantic. Fortunately, the solution to this problem with complexity comparable to a conventional receiver has been proposed, which is realized by embedding the timing recovery step inside the turbo equalizer so as to perform their tasks jointly. From this point on, that iterative timing recovery (ITR) scheme is denoted as “NonPSP-ITR”, where “PSP” stands for per survivor processing. However, NonPSP-ITR requires a large number of turbo iterations to provide an acceptable performance when the channel experiences severe timing jitter noise.


Embodiments of the present invention provide solutions to these and other problems, and offer other advantages over the prior art.


SUMMARY OF THE INVENTION

A method of the present invention includes the steps of receiving a signal indicative of data bits, and performing per survivor processing-iterative timing recovery (PSP-ITR) on the received signal to generate probabilities of the data bits. To perform PSP-ITR on the received signal, the signal can be processed using a per survivor processing-soft decision algorithm (PSP-SDA) which jointly performs timing recovery and equalization in accordance with embodiments of the present invention. The soft decision algorithm (SDA) can be, for example, a Bahl, Cocke, Jelinek, and Raviv (BCJR) algorithm modified in accordance with the concepts of the present invention such that it is configured to implement per survivor processing (PSP) to jointly perform timing recovery and equalization. In other embodiments, the SDA is a Soft Output Viterbi Algorithm (SOVA) configured to implement PSP to jointly perform timing recovery and equalization. Still other SDAs can be used with PSP to jointly perform timing recovery and equalization in other embodiments of the present invention.


In some embodiments of the invention, the step of processing the received signal using a PSP-SDA algorithm includes the step of calculating a plurality of branch metrics, with each branch metric corresponding to a transition branch between states in a trellis. A survivor path between the states is then identified as a function of the calculated branch metrics.


In some embodiments, each state has an associated sampling phase offset used to sample the received signal. The sampling phase offsets differ between various states. In these embodiments, the step of calculating the plurality of branch metrics further includes calculating each branch metric as a function of the sampling phase offset at a starting state of the corresponding branch. The branch metrics can be calculated during both forward and backward recursions.


Other features and benefits that characterize embodiments of the present invention will be apparent upon reading the following detailed description and review of the associated drawings.




BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is an isometric view of a disc drive configured to implement the present invention.



FIG. 2-1 is a block diagram illustrating a data encoding with a conventional PR-IV channel model circuit.



FIG. 2-2 is a block diagram illustrating a conventional receiver architecture.



FIG. 2-3 is a block diagram illustrating a prior art receiver architecture which implements NonPSP-ITR.



FIG. 2-4 is a block diagram illustrating the iterative nature of timing recovery, equalization and decoding using the NonPSP-ITR.



FIG. 3 is a block diagram illustrating a PSP-based iterative timing recovery architecture or circuit.



FIG. 4 is a trellis structure demonstrating how PSP-BCJR performs during forward recursion.



FIG. 5 is a trellis structure illustrating how PSP-BCJR performs during backward recursion.



FIG. 5-1 is a table illustrating PLL gain parameters for different system conditions.



FIG. 6 is a plot illustrating performance comparisons of different timing recovery schemes with a phase offset σw/T=0.5%.



FIG. 7 is a plot illustrating performance comparisons of different timing recovery schemes with a phase offset σw/T=1%.



FIG. 8 is a plot illustrating convergence rates of different timing recovery schemes at SNR=5 dB and σw/T=1%.



FIG. 9 is a plot illustrating probability of cycle slip correction at SNR=5 dB and σw/T=1%.



FIGS. 10-1 and 10-2 are plots illustrating cycle slip correction for two different sample packets at SNR=5 dB and σw/T=1%.



FIG. 11 is a plot illustrating performance comparisons of different timing recovery schemes as a function of a σw/T's at SNR=5 dB.



FIG. 12 is a block diagram illustrating data encoding with a recording channel.



FIG. 13 is a block diagram illustrating a PSP-based iterative timing recovery architecture.



FIG. 14 illustrates plots of cycle slip correction for a longitudinal recording channel.



FIG. 15 illustrates plots of cycle slip correction for a perpendicular recording channel.



FIG. 16 is a block diagram illustrating a PSP-based iterative timing recovery architecture.



FIG. 17 is a plot illustrating BER versus σw/T for different timing recovery schemes at SNR=5 dB and media jitter of 5%.



FIG. 18 is a plot illustrating BER versus σw/T for different timing recovery schemes at SNR=10 dB and media jitter of 10%.



FIG. 19 is a plot illustrating performance comparisons after five iterations at Eb/N0=5 dB, σj/T=5%, frequency offset=0.3%, α200β200.



FIG. 20 is a plot illustrating performance comparisons after ten iterations at Eb/N0=5 dB, σj/T=5%, frequency offset=0.3%, α200β200.




DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS


FIG. 1 is an isometric view of a disc drive 100 in which embodiments of the present invention are useful. Disc drive 100 includes a housing with a base 102 and a top cover (not shown). Disc drive 100 further includes a disc pack 106, which is mounted on a spindle motor (not shown) by a disc clamp 108. Disc pack 106 includes a plurality of individual discs, which are mounted for co-rotation about central axis 109. Each disc surface has an associated disc head slider 110 which is mounted to disc drive 100 for communication with the disc surface. In the example shown in FIG. 1, sliders 110 are supported by suspensions 112 which are in turn attached to track accessing arms 114 of an actuator 116. The actuator shown in FIG. 1 is of the type known as a rotary moving coil actuator and includes a voice coil motor (VCM), shown generally at 118. Voice coil motor 118 rotates actuator 116 with its attached heads 110 about a pivot shaft 120 to position heads 110 over a desired data track along an arcuate path 122 between a disc inner diameter 124 and a disc outer diameter 126. Voice coil motor 118 is driven by servo electronics 130 based on signals generated by heads 110 and a host computer (not shown). The present invention is useful in providing timing recovery in a channel, such as a recording channel, which is represented diagrammatically in FIG. 1 at reference number 128. The recording channel can be of the type used with data storage systems such as disc drive 100, but is not limited to use with any particular type of data storage system. Disc drive 100 is intended to represent any type of data storage system, or other types of systems such as communication systems, in which the present invention is embodied and used.


As noted previously, when used in a recording or other channel, NonPSP-ITR requires a large number of turbo iterations to provide an acceptable performance when the channel experiences severe timing jitter noise. This problem can be solved by utilizing a PSP technique. Per survivor processing, or PSP, is a technique for jointly estimating a data sequence and unknown parameters, such as channel coefficients, carrier phase, and so forth. PSP has been employed in many applications including channel identification, adaptive maximum likelihood (ML) sequence detectors, and phase/carrier recovery. PSP can be applied to the development of PSP-based timing recovery implemented based on a Viterbi algorithm, which performs timing recovery and data detection jointly. Results have shown that it performs better than the conventional receiver that separates these two tasks, especially when the timing jitter is severe.


Similarly, timing recovery and equalization can be performed jointly by using a PSP technique, which can yield better performance than performing them separately. To do so, a PSP-soft decision algorithm (SDA) is provided in which the timing recovery is embedded inside the soft decision equalizer, and is used to provide a PSP-based iterative timing recovery scheme, which is referred to herein as “PSP-ITR”. PSP-ITR iteratively exchanges soft information between the PSP-SDA and an error-correction decoder. The SDA can be a Bahl, Cocke, Jelinek, and Raviv (BCJR) algorithm resulting in the PSP-SDA being a PSP-BCJR, a Soft Output Viterbi Algorithm (SOVA) resulting in the PSP-SDA being a PSP-SOVA, or other types of soft decision algorithms.


The following discussion provides a description of a new timing recovery architecture, in accordance with the present invention, which is more robust to cycle slips in the system. Without loss of generality or limitation of scope, the PR-IV channel model (a partial response channel with three target values) is used to illustrate the steps in the new algorithm. However, it is worth noting that the algorithm can be applied not only to any target response (other than PR-IV), but also to equalized channels as well. Before going into the details of the timing recovery algorithm of the present invention, the channel architecture is first introduced, and a brief explanation of a conventional approach is provided.


Channel Model


Consider the coded partial response (PR) channel model 200 shown in FIG. 2-1. The message bits {xk} are encoded by a recursive systematic convolutional (RSC) encoder 205 and then interleaved by an s-random interleaver 210 to form an interleaved sequence αk. The interleaved sequence αk with bit period T is further precoded by a 1/(1⊕D2) precoder 215 (to form a precoded set of bits bk) and modulated using modulation circuitry 220 and 225 by a PR-IV pulse h(t)=p(t)−p(t−2T), where p(t)=sin(πt/T)/(πt/T) is an ideal zero excess-bandwidth Nyquist pulse.


The readback signal, s(t), can therefore be written as
s(t)=kbkh(t-kT-τk)+n(t)Equation1

where τk is the k-th timing offset, defined as the difference between the actual and the expected arrival time of the k-th pulse, and n(t) is additive white Gaussian noise (AWGN) with two-sided power spectral density N0/2. Timing offset circuit 230 models τk as a random walk model according to Equation 2,

τk+1kwωk   Equation 2

where σw determines the severity of the timing jitter and ωk is an independent identically distributed (i.i.d.) zero-mean unit-variance Gaussian random variable. The random walk model is chosen because of its simplicity to represent a variety of channels by changing only one parameter. Perfect acquisition, i.e., τ0=0, is also assumed.


Conventional Receiver


At the front-end receiver 250 of FIG. 2-2, an ideal low-pass filter 255, whose impulse response is p(t)/T, is employed to eliminate out-of-band noise. The received analog signal, y(t), is then sampled using a sampling circuit 260 at time kT+{circumflex over (τ)}k, where {circumflex over (τ)}k is the receiver's estimate of τk (or the k-th sampling phase offset), creating
yk=y(kT+τ^k)=ibih(kT+τ^k-iT-τi)+nkEquation3

where n′k is zero-mean Gaussian random variable with variance σ2n=N0/(2T).


A conventional timing recovery practically takes the form of a phase-locked-loop (PLL) 262 where, with perfect acquisition and no frequency offset component in the system, the sampling phase offset is updated by a first-order PLL, i.e.,

{circumflex over (τ)}k+1={circumflex over (τ)}k+ξ{circumflex over (ε)}k   Equation 4

where ξ is a PLL gain parameter determining the loop bandwidth and the convergence rate, and {circumflex over (ε)}k is an estimate of the timing error εkk−{circumflex over (τ)}k. This estimate is generated by a well-known Mueller and Müller (M&M) timing error detector (TED) according to:
ɛ^k=3T16{ykr~k-1-yk-1r~k}Equation5

where the constant 3T/16 ensures the S-curve slope of Equation 5 is one at the origin, and {tilde over (r)}=E[rk|yk] is the k-th soft estimate of the channel output rkε{0,±2}, which is given by:
r~k=2sinh(syk/σn2)cosh(2yk/σn2)+2/σn2Equation6

The soft estimate is considered in this disclosure because it provides better performance than the hard estimate, which is obtained by a memory-less three-level quantization of yk.


In the conventional receiver, conventional timing recovery is followed by a turbo equalizer 265, which iteratively exchanges information between a soft-in soft-out (SISO) equalizer 270 for the precoded PR-IV channel and an error-correction SISO Forward Error-Correction (FEC) decoder 275, both based on BCJR. The iterative exchange of information between SISO equalizer 270 and SISO FEC decoder 275 uses a de-interleaver 280 and an interleaver 285, as well as summation circuitry 290 and 295, in a conventional manner.



FIG. 2-3 is a block diagram of a front end receiver 500 of the type which implements NonPSP-ITR as discussed above. Receiver 500 has similar components to receiver 250 described above, but with different timing recovery circuit 505. Timing recovery circuit 505 includes first and second PLLs 510 and 520, and an interpolation circuit 515. NonPSP-ITR is realized by embedding the timing recovery step inside the turbo equalizer so as to perform timing recovery, equalization, and error-correction decoding jointly. After the first iteration, the turbo equalizer 270 produces soft estimates {{tilde over (r)}k}, which might be more reliable than the decisions used in the previous iteration. By running the PLL again using the original readback signal with {{tilde over (r)}k}, an improved set of timing estimates {{circumflex over (τ)}newk} can be obtained. Then, the new samples can be obtained by means of interpolation according to Equation 6-1:
yknew=iyiq(kT-iT-τ^i+τ^knew)Equation6-1

where q(t) is a sinc function. This set of samples is then fed to the turbo equalizer 270. In summary, timing recovery benefits from better decisions, and the turbo equalizer benefits from better samples. The process is illustrated in FIG. 2-4 with the steps of timing recovery 555, equalization 560 and decoding 565 being iteratively performed.


Iterative Timing Recovery of the Present Invention


To obtain a new iterative timing recovery scheme based on PSP, a description is first provided of the application of PSP to develop PSP-BCJR, which jointly performs timing recovery and equalization. With PSP-BCJR, a PSP-based iterative timing recovery scheme denoted as PSP-ITR is proposed, which performs timing recovery, equalization and decoding jointly, as shown in FIG. 3. FIG. 3 illustrates a receiver 300 in accordance with the present invention which embodies this concept. After low pass filter 255, analog signal y(t) is provided to the PSP-BCJR equalizer 310 of a turbo equalizer 305. PSP-BCJR equalizer 310 and SISO FEC decoder 275 iteratively exchange information using de-interleaver 280 and an interleaver 285, as well as summation circuitry 290 and 295. In this document, a description of the operation of PSP-BCJR is provided only, because the interaction between PSP-BCJR 310 and an FEC decoder 275 is the same as performed in the turbo equalizer 265 shown in FIG. 2-2 (i.e., a PSP-BCJR block can be viewed as a BCJR equalizer). Also, it must be noted that while the present invention is described with reference to a PSP-BCJR algorithm and equalizer, the present invention applies generally to PSP-SDA algorithms and equalizers.


PSP-BCJR Algorithms


PSP-BCJR is realized by embedding the timing recovery inside the BCJR equalizer. The PSP-based timing recovery performs timing update operation at each state based on the history data obtained from the survivor path. Unfortunately, there is no such notion as a survivor path in the context of BCJR. In order to perform timing update operation inside BCJR, the concept of a virtual survivor path (or, simply, the survivor path) inside the BCJR is introduced. This survivor path can be easily obtained, once the best state transition leading to each state is determined.


PSP-BCJR has different sampling phase offsets associated with each state. Thus, the branch metrics at each stage of the trellis are calculated based on the sampling phase offset of the starting state. Since BCJR involves two recursions, namely forward and backward recursions, it is useful to perform timing update operation for both directions. The timing update operation during backward recursion will serve as refining the sampler outputs {yk}, thus resulting in an improved set of {γk}, which will be used to compute the log likelihood ratios (LLRs) of {αk}. For simplicity, some embodiments of the invention are restricted to the M&M TED algorithm when performing timing update.


Forward Recursion


Consider the trellis structure in FIG. 4, which demonstrates how PSP-BCJR of the present invention performs during forward recursion. There are 2v=4 states in this trellis, i.e., Q={a,b,c,d}, where v=2 is a memory of the precoded PR-IV channel and Q is the set of states in the trellis. Define {circumflex over (τ)}k(p) as the k-th forward sampling phase offset at state Ψk=p (p being one of states a, b, c, d) which is used to sample y(t) at time k for the state transition emanating from Ψk=p, e.g., yk(p, q)=y(kT+{circumflex over (τ)}k(p)), where yk (p, q) is the k-th sampler output associated with (p,q). The parameter yk(p,q) represents the signal obtained by going from branch q to branch p (where q and p are equal to a, b, c, d).


Consider the state transition at time k. There are two state transitions arriving at Ψk+1=c, i.e., (b,c) and (d,c). First, y(t) is sampled using the forward sampling phase offsets {circumflex over (τ)}k(b) and {circumflex over (τ)}k(d) to obtain yk(b,c) and yk(d,c), respectively. Next, γk(b,c) and γk(d,c) are computed in order to update αk+1(c). The transition metric during forward recursion can be calculated using the relationship illustrated in Equation 6-2:
γk(a,b)=exp{-12σn2yk(a,b)-r^k(a,b)2+a^k(a,b)λk2}Equation6-2


Then, the starting state is chosen that corresponds to the best state transition leading to Ψk+1=c by:
p^=argmaxp{b,d}{Pr[Ψk=p,Ψk+1=cyk(p,c),yl<k(p)]}=argmaxp{b,d}{αk(p)γk(p,c)uαk+1(u)}=argmaxp{b,d}{αk(p)γk(p,c)}Equation7

where y1<k(p) is a collection of all previous sampler outputs associated with the survivor path leading to Ψk=p, and uεQ.


Suppose (b,c) is the best state transition leading to Ψk+1=c (i.e., {circumflex over (p)}=b). The algorithm then stores the starting state and the sampler output associated with (b,c) according to Sk+1(c)={Ψk=b} and πk+1(c)=yk(b,c), respectively. Then, the next forward sampling phase offset is updated by

{circumflex over (τ)}k+1(c)={circumflex over (τ)}k(b)+ξ{circumflex over (ε)}k(b,c)   Equation 8

where {circumflex over (ε)}k(b,c) is the k-th estimated timing error associated with (b,c), which is computed using the information from Sk(b) and πk(b), i.e.,
^k(b,c)=3T16{yk(b,c)r^k-1(a,b)-yk-1(a,b)r^k(b,c)}Equation9

This {circumflex over (τ)}k+1(c) will be used to sample y(t) at time k+1 for the state transitions emanating from Ψk+1=c. This process is repeated from time k=0 to k=L+v−1.


There are many possibilities to exploit the forward sampling phase offsets in the timing update operation during backward recursion. The first example is to ignore the forward sampling phase offsets at all. Another example is to let each state in the trellis store its own forward sampling phase offset. Then, the algorithm can average the backward sampling phase offset at each state using the forward sampling phase offset associated with that state. Nonetheless, for description of an exemplary embodiment, it can be a goal to extract the best set of the forward sampling phase offsets denoted as {{circumflex over (τ)}ƒwk}, which is obtained by tracing back the survivor path that maximizes αL+V. Hence, {{circumflex over (τ)}ƒwk} is used to average the backward sampling phase offset according to a certain criterion, as shall be seen later.


Note that the reasons for averaging the backward sampling phase offsets with the forward ones are: 1) to improve a set of {γk}, and 2) to avoid a cycle slip that might occur when the backward sampling phase offsets start deviating from the forward ones.


Backward Recursion


Now consider backward recursion where the time index starts from k=L+v−1 to k=0. In order to explain how the timing update operation is performed during backward recursion, the virtual transition (or, simply, the backward transition) is introduced represented by the gray arrows as shown in FIG. 5, which explains how PSP-BCJR performs during backward recursion. Define {circumflex over (τ)}bk(q) as the k-th backward sampling phase offset at Ψk+1=q, which is employed to sample y(t) at time k during backward recursion, e.g., yk(p,q)=y(kT+{circumflex over (τ)}bk(q)). Consider the backward transition at time k. There are two backward transitions arriving at Ψk=b, which corresponds to (b,c) and (b,d). First, the algorithm samples y(t) using the backward sampling phase offsets {circumflex over (τ)}bk(c) and {circumflex over (τ)}bk(d) to obtain yk(b,c) and yk(b,d), respectively. Next, γk(b,c) and γk(b,d) are computed in order to update βk(b). The transition metric during backward recursion:
γkb(a,b)=exp{-12σn2ykb(a,b)-r^k(a,b)2+a^k(a,b)λk2}Equation9-1


Then, the starting state is chosen that corresponds to the best backward transition leading to Ψk=b by
q^=argmaxq{c,d}{Pr[Ψk=b,Ψk+1=qyk(b,q),yl>k(q)]}=argmaxq{c,d}{γk(b,q)βk+1(q)Pr[Ψk=b]uβk(u)Pr[Ψk=u]}=argmaxq{c,d}{γk(b,q)βk+1(q)}Equation10

where the third equality is obtained by ignoring all terms irrelevant to maximization, and yl>k(q) is a collection of all future sampler outputs associated with the survivor path that emanating from Ψk+1=q.


Suppose (b,c) corresponds to the best backward transition leading to Ψk=b (i.e., {circumflex over (q)}=c). The algorithm stores the starting state and the sampler output associated with (b,c) according to Sbk(b)=c and πbk(b)=yk(b,c), respectively. Then, the next backward sampling phase offset is updated by

{circumflex over (τ)}bk−1(b)={circumflex over (τ)}bk(c)+ξ{circumflex over (ε)}bk(b,c)   Equation 11

where {circumflex over (ε)}bk(b,c) is the k-th backward estimated timing error associated with (b,c), which is computed using the information from Sbk+1(c) and πbk+1(c), i.e.,
^kb(b,c)=3T16{yk(b,c)r^k+1(c,a)-yk+1(c,a)r^k(b,c)}Equation12


To avoid a cycle slip when {circumflex over (τ)}bk−1(b) starts deviating from {circumflex over (τ)}ƒwk−1, the backward sampling phase offset is averaged according to
τ^k-1b(b)={(τ^k-1b(b)+τ^k-1fw)/2ifτ^k-1b(b)-τ^k-1fw>Δτ^k-1b(b)otherwiseEquation13

where Δ is the threshold that allows {circumflex over (τ)}bk−1(b) to deviate from {circumflex over (τ)}ƒwk−1. In this document, we set Δ=0.1T to keep {{circumflex over (τ)}bk} close to {{circumflex over (τ)}ƒwk} so that the parameters {αk} and {βk} will be optimized. This {circumflex over (τ)}bk−1(b) will be used to sample y(t) at time k−1 for the backward transitions emanating from Ψk=b.


This process is performed from time k=L+v−1 to k=0. Note that when performing the backward timing update operation, it is important to assure that the S-curve slope of Equation 12 during backward recursion is positive at the origin.


Summary of a PS-BCJR Algorithm Embodiment


1) Initialize forward recursion register values α0=[10 . . . 0]


2) Forward recursion:

For k = 0,1,...L + v − 1For q = {a,b,c,d}* Consider two transitions at time k arriving at Ψk+1 = qe.g., (p,q) and (u,q)* Compute γk(p,q) and γk(u,q)* Update αk+1(q)* Choose the best state transition leading to Ψk+1 = q* Update Sk+1(q)* Update πk+1(q)* Update {circumflex over (τ)}k+1(q)EndEnd


4) Output {circumflex over (τ)}ƒw from the survivor path that maximizes αL+V


5) Initialize backward recursion register values βL+VL+V


6) Backward recursion:

For k = L + v − 1,L + v − 2,...,0For. p = {a, b, c, d}* Consider two backward transitions at time k arriving atΨk = p , e.g., (p,q) and (p,u)* Compute γk(p,q) and γk(p,u)* Update βk(p)* Choose the best backward transition leading to Ψk = p* Update Skb(p)* Update πkb(p)* Update {circumflex over (τ)}k−1b(p)End


Compute λk according to
λk=log{(p,q)Ak(+1)αk(p)γk(p,q)βk+1(q)(p,q)Ak(-1)αk(p)γk(p,q)βk+1(q)}Equation14


End


Beyond the conventional BCJR, PSP-BCJR necessitates new storage requirements for:






    • the forward/backward sampling phase offsets,

    • the starting states,

    • the sampler outputs.


      For the PSP-BCJR algorithm described above, it can be necessary to store all {πk(p)} {πbk(p)}, {Sbk(p)} and {{circumflex over (τ)}bk(p)} only for previous and current stages so as to minimize memory requirement.





It is apparent that each survivor path has its own PLL to update the sampling phase offset. Therefore, for a PR-IV channel, PSP-BCJR requires eight PLLs, i.e., one PLL for each survivor path during both forward and backward recursions.


Simulation Results


This section compares the performance of PSP-ITR with the conventional receiver and NonPSP-ITR in the precoded PR-IV channel shown in FIG. 2-2. The analysis considers a rate-8/9 system in which a block of 3636 message bits is encoded by the rate-1/2 encoder with generator polynomial
[1,1DD3D41DD4],

and then punctured to a block length of 4095 bits by retaining only every eighth parity bit. The punctured sequence passes through an s-random interleaver with s=16 to obtain an interleaved sequence of αk. Note that the PLL gain parameter, ξ, for different timing recovery schemes were optimized based on minimizing the RMS timing error σε=√{square root over (E[(τk−{circumflex over (τ)}k)2])} at per-bit SNR, Eb/N0, of 5 dB. The PLL gain parameters for different system conditions are shown in Table 1 of FIG. 5-1. Each bit-error rate (BER) point was computed using as many data sectors as possible until at least 100 sectors in error were collected at the 100-th iteration.



FIG. 6 compares BER performance of different timing recovery schemes with a phase offset σw/T=0.5%, which represents low probability of the occurrence of cycle slips. The curve labeled “Perfect timing” means the conventional receiver using {circumflex over (τ)}kk to sample y(t). Furthermore, the curve labeled “Trained PLL” represents the conventional receiver whose PLL has access to all correct decisions, thus serving as a lower bound for all timing recovery schemes that are based on PLL. As depicted in FIG. 6, PSP-ITR performs slightly better than NonPSP-ITR at the 50-th iteration, and both yield about 0.45 dB gain at BER=10−5over the conventional receiver. Note that the performance of the conventional receiver at the 50-th and the 100-th iterations is alike (not shown). In addition, PSP-ITR performs close to the system with a trained PLL and is only 0.35 dB away from the system with perfect timing at BER=10−5.


Next, let us consider the system with a phase offset a, σw/T=1%, which represents high probability of the occurrence of cycle slips. FIG. 7 shows BER performance of different timing recovery schemes with σw/T=1%. NonPSP-ITR still outperforms the conventional receiver; however, it seems to have an error floor at high BER. On the other hand, PSP-ITR provides a huge performance gain over NonPSP-ITR and starts having an error floor at low BER. Again, PSP-ITR still performs similar to the system with a trained PLL and loses approximately 0.35 dB from the system with perfect timing at BER=10−5.


The reason that PSP-ITR outperforms NonPSP-ITR when the phase offset σw/T is large is because the front-end PLL used in NonPSP-ITR does not work well compared to the PSP-based timing recovery. Additionally, PSP-ITR can automatically correct a cycle slip (without a cycle slip detection and correction technique as used in NonPSP-ITR) much more efficiently than NonPSP-ITR. In other words, PSP-ITR achieves faster convergence than NonPSP-ITR, which can be confirmed by plotting the sector-error rate (SER) versus the number of iterations in FIG. 8. The convergence rate plot illustrated in FIG. 8 is for different timing recovery schemes at SNR=5 dB and the phase offset σw/T=1%. The convergence rate of PSP-ITR takes about 30 iterations to provide a good performance. Conversely, NonPSP-ITR takes hundreds of iterations to yield a good performance (not shown).



FIG. 9 illustrates a plot of the probability of the occurrence of the cycle slip at the k-th iteration, given the occurrence of the cycle slip. The plot is of the probability of cycle slip correction at SNR=5 dB and the phase offset σw/T=1%. This plot shows how fast each timing recovery scheme can correct a cycle slip. Note that, in this experiment, a cycle slip is declared when the actual timing offset and the estimated one are 0.75T apart from each other for more than 100 consecutive bit periods. Apparently, NonPSP-ITR requires a large number of iterations in order to correct a cycle slip as opposed to PSP-ITR. This is because NonPSP-ITR can only correct a cycle slip when there is a sudden phase change in the estimated timing offsets, not in the actual ones. Note that the reason that NonPSP-ITR increases a probability of the occurrence of the cycle slip at the first 10 iterations (see FIG. 9) is because NonPSP-ITR takes a few iterations to recognize a cycle slip.


It is also worth plotting the estimated timing offset obtained from NonPSP-ITR and PSP-ITR for two different sample packets, at SNR=5 dB and phase offset σw/T=1%, as shown in FIGS. 10-1 and 10-2. As illustrated in FIG. 10-1, NonPSP-ITR takes about 200 iterations to correct a cycle slip (not shown), whereas PSP-ITR takes only one iteration to do so. Similarly, FIG. 10-2 indicates that PSP-ITR can correct a cycle slip within 5 iterations but NonPSP-ITR cannot correct a cycle slip even with 50 iterations.


In order to verify that PSP-ITR outperforms NonPSP-ITR, especially when τv/T is high, BER performance of different timing recovery schemes (with 10 iterations) as a function of σw/T's at SNR=5 dB is plotted in FIG. 11, where the PLL gain parameters were optimized for each timing recovery scheme and each σw/T. As shown in FIG. 11, the performance of both the conventional receiver and NonPSP-ITR with 10 iterations decreases dramatically as σw/T increases. This implies that the conventional receiver and NonPSP-ITR does not work well when σw/T is large. On the other hand, PSP-ITR still provides a large performance gain even when σw/T=1.5%. Furthermore, PSP-ITR performs as good as the system with a trained PLL does up to σw/T=0.6%. This suggests that PSP-ITR is much more robust to the severe timing jitter than NonPSP-ITR.


Simulation Results with an Equalized Channel


Until now, this disclosure has assumed the ideal channel model in FIG. 2-1. Now, the new iterative timing recovery algorithm of the present invention is also applied to the equalized channel 350 shown in FIG. 12. In FIG. 12, recording channel 350 is shown to include components similar to those included in channel 200 illustrated in FIG. 2-1. These components share the same reference numbers. In FIG. 12, g(t) represents the transition response of the magnetic channel implemented in circuit 355.


The transition response for a longitudinal recording channel (usually known as a Lorentzian pulse) is given by
g(t)=K1+(2tPW50)2Equation15

where K is a scaling constant and PW50 indicates the width of the Lorentzian pulse at half of its peak value. Similarly, the transition response for a perpendicular recording channel is given by
g(t)=erf(2tln2PW50)Equation16

where erf(·) is an error function which is defined by
erf(x)=2π0x-t2t,

and PW50 determines the width of the derivative of g(t) at half its maximum. The ratio, normalized density, ND=PW50/T represents the normalized recording density which defines how many data bits can be packed within the resolution unit PW50, and the dibit response (the pulse resulting from two transitions corresponding to one bit) is defined as h(t)=g(t)−g(t−T).


After convolving the transition sequence dk with the transition response g(t), electronic noise is added in the system through the SNR value definition given as
SNR=10logEtσ2Equation17

where Ei is the energy of the impulse response of the recording channel, and σ2 is the power of the electronic noise. For convenience, the impulse response of the recording channel is normalized so that Ei becomes unity.



FIG. 13 illustrates a PSP-based iterative timing recovery receiver or architecture 400 in accordance with another embodiment of the present invention. As can be seen from FIG. 13 the readback signal is first low pass filtered by LPF 255, then processed by the “PSP-BCJR With Equalizer” block or circuit 405. The difference between this block and the equalizer 310 in FIG. 3 is that PSP-BCJR 405 implements a finite impulse response (FIR) equalizer F(z) at each of its branches after adjusting the samples both during forward and backward recursion. The purpose of this equalizer is to equalize the recording channel to a target response G(z).


The plots in FIG. 14 and FIG. 15 show the cycle slip correction timing estimates as a function of time (in bit periods) for both longitudinal and perpendicular magnetic recording channels, respectively. To obtain those figures

    • ND=2 is considered for both longitudinal and perpendicular magnetic recording channels.
    • Clock jitter noise in the system is assumed to be 0.5%, media jitter noise to be 3%, and frequency offset to be 0.4% of the bit period.
    • The 3-tap General Partial Response (GPR) target and its corresponding 21-tap equalizer is designed at Bit-Error-Rate (BER) equal to 10−5 based on the uncoded channel model without any clock jitter, media jitter, and frequency offset effects. The corresponding GPR target for longitudinal channel is [1 0.0986-0.7015], and for perpendicular channel [1 1.1482 0.4751].
    • A second order timing recovery loop is used. PLL gain parameters are designed to catch the phase and frequency offsets within 256 bit periods (which is the length of the preamble within each data sector) based on a linearized PLL model assuming no noise in the system. The same PLL gain parameters are then used for both acquisition and tracking of timing information.


Looking at those FIGS. 14 and 15, it can be seen that the trend is very similar to the one in FIG. 10 which corresponds to a PR-IV channel. In other words, for equalized longitudinal and perpendicular channels, it can again be seen that the cycle slip is corrected at the end of the second iteration for the timing recovery algorithm of the present invention, while it requires tens of iteration for its counterpart in literature. Of course, such a behavior results into very similar performance plots shown for the PR-IV channel in FIG. 6, and in FIG. 7 for the equalized channel case also.


Reduction in Implementation Complexity


If one considers the proposed architecture in FIG. 3, or its version with equalizers at each branch in FIG. 13, it can be seen that these architectures get the low-pass filtered analog signal y(t) as their input, and then process this signal within the PSP-BCJR block to output channel bit estimates in the digital domain. This requires Analog-to-Digital (A/D) converters at each branch of the PSP-BCJR block. Implementing those A/D blocks in detector branches increases the implementation complexity.


The complexity of the architecture in those FIGS. can be reduced by applying the idea of interpolated timing recovery, and converting to the receiver architecture 450 shown in FIG. 16. As seen from this FIG., the analog signal y(t) is first sampled using an A/D converter 455 with a fixed sampling clock period Ts. It is sufficient to have Ts to be only 5% to 10% larger than the channel bit duration T. The output of the A/D converter 455 is the samples of the digital signal y(kTs) to be processed with the digital version of the PSP-BCJR block or equalizer 460. The digital version 460 of this block in FIG. 16 is essentially the same as the PSP-BCJR block 310 in FIG. 3 or block 405 in FIG. 13. The main difference is, block 460 implements interpolation filters in place of A/ D blocks at each of its branches.


The previous studies have shown that interpolated timing recovery, once configured correctly, results into essentially the same system performance compared with a timing loop employing hybrid A/D blocks. Thus, the architecture in FIG. 16 should also result into the same system performance as the ones illustrated in FIG. 3 or FIG. 13.


SUMMARY

In accordance with embodiments of the present invention, PSP is applied to develop PSP-BCJR (or other PSP-SDA) for performing timing recovery and equalization jointly. With PSP-BCJR, a PSP-based iterative timing recovery scheme was provided, denoted as PSP-ITR, for coded PR channels. The proposed scheme iteratively exchanges soft information between PSP-BCJR and an error-correction decoder.


Simulation results have shown that PSP-ITR outperforms NonPSP-ITR, especially when σw/T is large. This is primarily because PSP-ITR can automatically correct a cycle slip much more efficiently than NonPSP-ITR. In other words, PSP-ITR requires much less number of turbo iterations to correct a cycle slip than NonPSP-ITR. In addition, it has been observed that PSP-ITR performs similar to the system with a trained PLL at the 50-th iteration for σw/T up to 1%.


In accordance with embodiments of the present invention, a method of reducing the implementation complexity of the new PSP-based iterative timing recovery scheme is provided. The idea of interpolated timing recovery is employed to get rid of the hybrid A/D blocks within every branch of the PSP-BJCR architecture. Instead, those blocks are replaced with interpolation filters, which are simpler to implement compared to A/D blocks.


The PSP method can also be applied to Soft Output Viterbi Algorithm (SOVA) type soft output detectors, and those soft outputs can be used within the channel iteration.


Appendix A

Purpose


This Appendix further investigates the performance gain of the Per-Survivor Processing Iterative Timing Recovery (PSP-ITR) architecture provided above against the most recently proposed iterative timing recovery. In this Appendix, the most recently proposed iterative timing recovery method is again referred to as Non-PSP-ITR. Current and future magnetic recording products are taken as the base systems at low Signal-to-Noise-Ratio (SNR) regions to quantify the improvement in performance. It is worth noting that Non-PSP-ITR is not the algorithm that is implemented in current products. When needed, to quantify the performance of the timing recovery architecture implemented in current read-channel architectures, plots are labeled as “conventional receiver” to compare with PSP-ITR. The organization of this Appendix is as follows: After a brief introduction, recent investigations on quantifying timing errors in today's recording architectures are presented. Then, the spindle speed variation is taken as a case study, and a comparison is made between the different timing recovery architectures.


Introduction


Referring back to FIG. 11 described earlier, shown was the Bit-Error-Rate (BER) performance comparison of different timing recovery schemes as a function of percent phase jitter σw/T at SNR equal to 5 dB (low SNR value) for perfectly equalized PR-IV channel. FIG. 11 shows the system with only electronic noise. In order to see the effects of different noise mixes, we also compared the timing recovery architectures with 5% media jitter noise (medium media jitter noise) on top of electronic noise of 5 dB (FIG. 17), and with electronic noise of 10 dB (medium electronic noise) and the media jitter noise to be 10% (high media jitter noise) (FIG. 18). The inventors have also tried different target responses, and different magnetic recording channels (perpendicular and longitudinal recording channels with different normalized densities), and observed similar behavior.


Looking at FIGS. 11, 17 and 18, similar trends can be seen, i.e.,

    • At low phase jitter percentage values (σw/T<0.3) small performance difference between the Non-PSP-ITR and the PSP-ITR is observed.
    • When the jitter percentage value becomes high, the performance difference also increases.


Thus, the question which this Appendix addresses, i.e., “What is the performance gain of the Per-Survivor Processing Iterative Timing Recovery (PSP-ITR) architecture proposed above?” highly depends on the amount of timing errors in the system. In other words, where do we operate on σw/T axis of those plots? Is there any frequency offset in the system? If there is, what is the realistic amount of frequency offset, and how does it affect the system performance? In order to find answers to these questions, a number of resources were utilized.


Timing Errors in Magnetic Recording Architectures


The information presented here can be itemized as:

    • The clock jitter coming from crystal clock is very small (0.01%) and can be neglected.
    • The frequency offset in the system should also be considered.
    • The dominant effect, which causes the timing errors in the system, is spindle speed variations. It is specified as 0.1% of the sampling clock for write-process, and 0.1% for read-process. Thus, the overall worst case effect is 0.2%.
    • There are also other effects, which will cause timing errors in the system. For example:
      • When the head moves off-track this causes the phase of the waveform to shift because of the interaction of data at neighboring tracks.
      • Fly-height modulation will also cause phase changes.
      • The air-bearing resonance will cause sudden phase changes because of bumps on the medium.


Among the items above, some will translate into phase jitter in the system, some will be the source of frequency offset, and some will result into sudden phase offsets. Here, we will take the spindle speed variation as a case study because its effect is well quantified.


A Case Study—Effect of Spindle Speed Variation


First assume that all the spindle speed variation will be transferred into phase jitter. In other words, the value of a σw/T in FIGS. 11, 17 and 18 above will be 0.2. From those FIGS., it can be seen that the proposed PSP-ITR architecture is performing similar to Non-PSP-ITR architecture in literature.


However, for future magnetic recording architectures with higher areal densities the T value will reduce, which can result in higher σw/T values. For example, currently 80 Gbyte per platter products are available. The platter diameter is 3.5 inch with a hole of diameter 1.8 inch in the middle. This means that the area to write data is π(1.752−0.92), or 7.08 square inches. Each side of the platter is written to, thus the area becomes 14.18 square inches. Eighty 80 Gbytes or 80*8 Gbits of information is stored on that area, which translates into around 45 Gbits per square inch of areal density. For future products of say 500 Gbits per square inch eleven times more areal density will be required. Similarly, for 1Tbits per square inch twenty-two times more areal density will be required. Assuming the Bit-Aspect-Ratio (BAR) and the rotation speed of the future product to be same as today's, then one ends up with 3.32 and 4.69 times reductions in bit period (T) for 500 Gbits and 1 Tbits per square inch designs, respectively. Thus, the result is 3.32 times and 4.69 times more σw/T in the system. Then the spindle speed effect will be 0.66 and 0.938 percent of the bit period. Looking now at the plots in FIGS. 11, 17 and 18, and comparing the performances of the architectures at those points, system performance gains can be seen ranging from 1 decade to 2 decades.


The spindle speed variation is a slow process compared to sampling time of the channel. Any variations of that will be almost constant within a sector of data. Thus, rather than phase offset, most of it will be translated into frequency offset in the system. Among the other frequency-offset sources, the spindle speed takes a dominant effect. Thus, next consider all the spindle speed variation as a frequency offset in the system. For analysis sake, a frequency offset of 0.3% was assumed to be the nominal value (0.2% coming from spindle speed and 0.1% from other sources). The plots mentioned up until now don't include the frequency offset effect. New simulations with this specific offset value were run, and the results are shown in FIGS. 19 and 20. The following is observed from these FIGS:

    • Even at 0% phase jitter value σw/T, almost a decade improvement is seen with the proposed PSP-ITR architecture over the Non-PSP-ITR architecture;
    • The proposed PSP-ITR architecture is close to Trained PLL architecture performance. The Trained PLL refers to the limit which can be achieved with a PLL loop. Thus, the PSP-ITR method achieves close to the limit.
    • The PSP-ITR architecture results into a better performance with 5 iterations (FIG. 4) than the Non-PSP-ITR architecture with 10 iterations (FIG. 5). Thus, better performance is achieved with less iterations, which means reduced complexity coming from code iterations.


SUMMARY AND CONCLUSION

It has been discovered that spindle speed variation is a key parameter. Assuming the spindle speed variation contributes only to phase jitter, the benefit of the proposed algorithm is demonstrated for future high areal density products. On the other hand, assuming that the spindle speed variation contributes solely to frequency offset, the benefit of PSP-ITR can be seen even for current recording architectures.


In addition to spindle speed variations, there are also other disturbances in the system, which affect timing errors. Some of those disturbances are heads sliding to off-track, fly-height modulation, and air-bearing resonance. As PSP-ITR is more robust than the conventional algorithms implemented on the chip and the ones proposed in literature, it is submitted that PSP-ITR will also behave better in presence of those other disturbances. In conclusion, the PSP-ITR architecture can be used to increase the performance and improve the robustness of both today's and future recording products.


It is to be understood that even though numerous characteristics and advantages of various embodiments of the invention have been set forth in the foregoing description, together with details of the structure and function of various embodiments of the invention, this disclosure is illustrative only, and changes may be made in detail, especially in matters of structure and arrangement of parts within the principles of the present invention to the full extent indicated by the broad general meaning of the terms in which the appended claims are expressed. For example, the particular elements may vary depending on the particular application for the recording system while maintaining substantially the same functionality without departing from the scope and spirit of the present invention.

Claims
  • 1. A method comprising: receiving a signal indicative of data bits; and performing per survivor processing-iterative timing recovery (PSP-ITR) on the received signal to generate probabilities of the data bits.
  • 2. The method of claim 1, wherein performing PSP-ITR on the received signal further comprises processing the received signal using a per survivor processing-soft decision algorithm (PSP-SDA) which jointly performs timing recovery and equalization.
  • 3. The method of claim 2, wherein processing the received signal using a PSP-SDA algorithm comprises processing the received signal using a Bahl, Cocke, Jelinek, and Raviv (BCJR) algorithm configured to implement per survivor processing (PSP) to jointly perform timing recovery and equalization.
  • 4. The method of claim 2, wherein processing the received signal using a PSP-SDA algorithm comprises processing the received signal using a Soft Output Viterbi Algorithm (SOVA) configured to implement per survivor processing (PSP) to jointly perform timing recovery and equalization.
  • 5. The method of claim 2, wherein processing the received signal using a PSP-SDA algorithm further comprises: calculating a plurality of branch metrics, with each branch metric corresponding to a transition branch between states in a trellis; and identifying a survivor path between the states as a function of the calculated branch metrics.
  • 6. The method of claim 5, wherein each state has an associated sampling phase offset used to sample the received signal, at least some of the sampling phase offsets being different from one another, and wherein the step of calculating the plurality of branch metrics further comprises calculating each branch metric as a function of the sampling phase offset at a starting state of the corresponding branch.
  • 7. The method of claim 6, wherein calculating the plurality of branch metrics further comprises calculating a plurality of transition metrics during forward recursions.
  • 8. The method of claim 7, wherein calculating the plurality of branch metrics further comprises calculating a plurality of transition metrics during backward recursions.
  • 9. The method of claim 8, wherein performing per survivor processing-iterative timing recovery (PSP-ITR) on the received signal to generate the probabilities of the data bits further comprises generating the probabilities of the data bits as a function of the transition metrics calculated during the forward and backward recursions.
  • 10. The method of claim 5, wherein performing PSP-ITR on the received signal to generate probabilities of the data bits further comprises performing interpolated timing recovery on the received signal.
  • 11. A communications channel circuit configured to implement the method of claim 1.
  • 12. A read channel circuit configured to implement the method of claim 1.
  • 13. The method of claim 12, wherein performing interpolated timing recovery on the received signal further comprises: sampling the received signal with an analog-to-digital converter with a fixed sampling clock period to generate samples of a digital signal; processing the samples of the digital signal using the PSP-SDA, wherein the PSP-SDA processes the generated samples of the digital signal using interpolation filters in each of its branches.
  • 14. An apparatus for processing a signal indicative of data bits, the apparatus comprising: a low pass filter which receives the signal indicative of the data bits and provides as an output a filtered analog signal; and a per survivor processing-iterative timing recovery (PSP-ITR) equalizer which performs PSP-ITR on the filtered analog signal to generate probabilities of the data bits.
  • 15. The apparatus of claim 14, and further comprising a turbo equalizer including the PSP-ITR equalizer and a decoder which iteratively exchange information to generate the probabilities of the data bits.
  • 16. The apparatus of claim 15, wherein the PSP-ITR equalizer is a per survivor processing-soft decision algorithm (PSP-SDA) equalizer which jointly performs timing recovery and equalization.
  • 17. The apparatus of claim 16, wherein the PSP-SDA equalizer is a per survivor processing (PSP) Bahl, Cocke, Jelinek, and Raviv (BCJR) equalizer.
  • 18. The apparatus of claim 16, wherein the PSP-SDA equalizer is a per survivor processing (PSP) Soft Output Viterbi Algorithm (SOVA) equalizer.
  • 19. The apparatus of claim 16 and further comprising an analog-to-digital converter coupled between the low pass filter and the PSP-ITR equalizer, the analog-to digital converter sampling the filtered analog signal and generating samples of a digital signal, wherein the PSP-ITR equalizer processes the generated samples of the digital signal using interpolation filters in each of its branches.