1. FIELD OF THE INVENTION
Exemplary embodiments of the present invention relate to a clock and data recovery (CDR) apparatus with adaptive optimum CDR bandwidth estimation by using a Kalman gain extractor.
2. Discussion of the Background
The input jitter of a clock and data recovery (CDR) circuit can be modeled as the sum of accumulation jitter and periodic jitter. The periodic jitter does not accumulate over time and in general has bounded variance.
Data-dependent deterministic jitter is a subset of the periodic jitter. The accumulation jitter, on the contrary, is unbounded in nature and increases indefinitely with time; thus a CDR has to track it for bit-error-free operation.
The analogy between the Kalman filter and a bang-bang (BB) CDR is utilized for the analytical minimum bounds of the mean squared phase error of a BB CDR circuit under the condition of random phase tracking.
An exemplary embodiment of the present invention discloses a clock and data recovery (CDR) apparatus with adaptive optimum CDR bandwidth estimation by using a Kalman gain extractor comprising a clock generator configured to provide frequency locked clocks to a digitally controlled phase rotator, a bang-bang phase detector (BBPD) and a Kalman gain extractor configured to estimate an optimum Kalman gain for a loop filter connected to the BBPD and the phase rotator.
The Kalman gain extractor includes an on-chip digital loop filter and an off-chip digital processor to receive phase update information from the CDR apparatus and output the optimum Kalman gain.
The on-chip digital loop filter includes a cyclic accumulator, a gain multiplier and a phase interpolator controller.
The off-chip digital processor outputs the optimum Kalman gain obtained by extracting a standard deviation of step sizes of an accumulation jitter from power spectral density (PSD) of the phase update information.
The off-chip digital processor includes a storage register configured to store the phase update information and a fast Fourier transform (FFT) processor configured to extract the PSD of an absolute input jitter from the phase update information.
The off-chip digital processor further includes an optimum Kalman gain estimator configured to calculate the optimum Kalman gain from the PSD of an accumulation jitter.
The off-chip digital processor further includes a gain calibrator configured to compensate for variations in transition density.
The Kalman gain extractor includes a Kalman filter configured to find the optimum Kalman gain by minimizing a posterior MSE recursively.
The BBPD includes a demultiplexer modeled by parallel BBPDs with a subsequent summation block.
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention, and together with the description serve to explain the principles of the invention.
according to an exemplary embodiment of the present invention.
and σN = 0.158 UI rms according to an exemplary embodiment of the present invention.
The invention is described more fully hereinafter with reference to the accompanying drawings, in which exemplary embodiments of the invention are shown. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these exemplary embodiments are provided so that this disclosure is thorough, and will fully convey the scope of the invention to those skilled in the art. In the drawings, the size and relative sizes of layers and regions may be exaggerated for clarity. Like reference numerals in the drawings denote like elements.
Hereafter, an exemplary embodiment of the present invention will be described in detail with reference to the accompanying drawings. It is noted that the same reference numerals are used to denote the same elements throughout the drawings. In the following description of the present invention, the detailed description of known functions and configurations incorporated herein is omitted when it may make the subject matter of the present invention unclear.
The input jitter of a CDR can be modeled as the sum of the accumulation and non-accumulative period jitter. The non-accumulative period jitter may not accumulate over time and have bounded variance in general. Data-dependent deterministic jitter may be a subset of the non-accumulative jitter. The accumulation jitter, on the contrary, may be unbounded in nature and increases indefinitely with time, thus a CDR may have to track it for bit-error-free operation.
where $E[W^2]$ may be the variance of the random period jitter W, and fData may be the data rate. By taking the bilinear transformation of Equation 2 for simplicity, the following Equation 3 can be derived.
S(f) may decrease by −20 dB/decade as frequency increases.
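The −20 dB/decade slope can be checked numerically. The following Python sketch assumes a discrete-time random-walk model of the accumulation jitter, whose PSD is $\sigma_W^2 / (4\sin^2(\pi f/f_{Data}))$ when the step sequence W is normalized to a flat PSD of $\sigma_W^2$. The function names and the numeric values are hypothetical and chosen only for illustration.

```python
import math

def accumulation_jitter_psd(f, sigma_w, f_data):
    """PSD of a random walk with step std sigma_w, sampled at f_data.

    Assumed normalization: the white step sequence W has flat PSD sigma_w**2.
    """
    return sigma_w ** 2 / (4.0 * math.sin(math.pi * f / f_data) ** 2)

def slope_db_per_decade(f, sigma_w, f_data):
    """Change of the PSD, in dB, between f and 10*f."""
    s_lo = accumulation_jitter_psd(f, sigma_w, f_data)
    s_hi = accumulation_jitter_psd(10.0 * f, sigma_w, f_data)
    return 10.0 * math.log10(s_hi / s_lo)

F_DATA = 10e9    # hypothetical data rate in Hz
SIGMA_W = 0.01   # hypothetical step standard deviation in UI

slope = slope_db_per_decade(1e5, SIGMA_W, F_DATA)
print(f"slope between 100 kHz and 1 MHz: {slope:.3f} dB/decade")
```

For frequencies well below the data rate the sine is nearly linear in f, so the PSD falls as $1/f^2$, which is the −20 dB/decade behavior.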
A jitter tolerance mask may provide the information on the accumulation and random non-accumulative period jitter of a serial link.
The magnitude of S(f) can be estimated with the jitter tolerance mask since it may represent the maximum permissible jitter present in a communication link. Even though the practical jitter in a link is hardly composed of sinusoids, the jitter tolerance specification may be defined with sinusoids for testing purposes. In practice, the jitter in serial links carrying real traffic may be more like random noise.
Appropriate values for σW and σN can be estimated by matching the variances of the modeled jitter in
and 0.053 UI rms, respectively, and σN >> σW.
where σj may be the standard deviation of the relative input Gaussian jitter $\phi_j = \phi_{in} - \phi_{out}$. $\phi_{bbpd}$ can be modeled as a white random process uncorrelated with $\phi_j$ if $\sigma_j \gg \beta\theta_{pr}$. The standard deviation of $\phi_{bbpd}$ may be approximately $0.750\,\sigma_j$. In case $\sigma_j \le 0.5\,\beta\theta_{pr}$, the dynamics of a BB CDR may be merely nonlinear.
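The 0.750σj figure can be reproduced from a standard least-squares linearization of a sign detector. The Python sketch below assumes the BBPD outputs ±1 and that the relative jitter is zero-mean Gaussian; under those assumptions the linear gain is $2/(\sqrt{2\pi}\,\sigma_j)$ and the input-referred residual has a standard deviation of $\sqrt{\pi/2 - 1}\,\sigma_j \approx 0.756\,\sigma_j$. The function names are hypothetical.

```python
import math

def linearized_bbpd_gain(sigma_j):
    """Least-squares linear gain of sign(x) for zero-mean Gaussian x with std sigma_j."""
    return 2.0 / (math.sqrt(2.0 * math.pi) * sigma_j)

def input_referred_noise_std(sigma_j):
    """Std of the residual sign(x) - K*x, divided by K to refer it to the input."""
    k = linearized_bbpd_gain(sigma_j)
    residual_var = 1.0 - 2.0 / math.pi   # E[(sign(x) - K*x)^2] for Gaussian x
    return math.sqrt(residual_var) / k

sigma_j = 0.15   # hypothetical relative jitter in UI rms
ratio = input_referred_noise_std(sigma_j) / sigma_j
print(f"Kbbpd = {linearized_bbpd_gain(sigma_j):.3f} per UI, noise std = {ratio:.4f} * sigma_j")
```

The ratio does not depend on σj, which is why the quantization noise of the detector can be stated as a fixed fraction of the relative jitter.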
n-th prediction error en may be expressed as the following Equation 5.
$$ e_n = \phi_{d,n} - \phi_{out,n} $$ [Equation 5]
where φd,n and φout,n may be the n-th desired and the output clock phases, respectively. If computational latency D is neglected, for simplicity, the n+1-th prediction error en+1 may be recursively given by the following Equation 6.
$$ e_{n+1} = \phi_{d,n+1} - \phi_{out,n+1} = (1 - K_{bbpd}\beta\theta_{pr})\,e_n + W_n - K_{bbpd}\beta\theta_{pr}\,(N_n + \phi_{bbpd,n}) $$ [Equation 6]
The MSE (Mean Squared Error) of the n+1-th prediction error may be expressed as the following Equation 7.
where $E[\phi_{bbpd,n}^2] \approx 9\sigma_j^2/16$ under phase lock. Provided that the CDR bandwidth may be sufficiently large to track the accumulation jitter, $\sigma_j^2$ may be approximately $E[W^2] + E[N^2]$. By setting $E[e_{n+1}^2] = E[e_n^2] = E[e_\infty^2]$, the steady state MSE may be given by the following Equation 8.
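As an illustration of how a steady state follows from Equation 6, the Python sketch below iterates the MSE recursion that Equation 6 implies when $e_n$, $W_n$, $N_n$ and $\phi_{bbpd,n}$ are assumed to be zero-mean and mutually uncorrelated, and compares the result with the fixed point of that recursion. The closed form is derived here under that assumption and is not presented as the text of Equation 7 or Equation 8; the function names and numeric values are hypothetical.

```python
def mse_step(mse, g, var_w, var_n, var_bbpd):
    """One MSE update implied by Equation 6, with g = Kbbpd * beta * theta_pr.

    Assumes e_n, W_n, N_n and phi_bbpd,n are zero-mean and mutually uncorrelated.
    """
    return (1.0 - g) ** 2 * mse + var_w + g ** 2 * (var_n + var_bbpd)

def steady_state_mse(g, var_w, var_n, var_bbpd):
    """Fixed point of mse_step; the recursion is stable for 0 < g < 2."""
    return (var_w + g ** 2 * (var_n + var_bbpd)) / (1.0 - (1.0 - g) ** 2)

# Hypothetical operating point, all variances in UI^2.
g = 0.05
var_w, var_n = 1e-4, 1e-2
var_bbpd = 9.0 / 16.0 * (var_w + var_n)   # 9*sigma_j^2/16 with sigma_j^2 = E[W^2] + E[N^2]

mse = 0.0
for _ in range(2000):
    mse = mse_step(mse, g, var_w, var_n, var_bbpd)

print(f"iterated MSE = {mse:.6e}, fixed point = {steady_state_mse(g, var_w, var_n, var_bbpd):.6e}")
```

A small loop gain g filters N and the detector noise but lets W accumulate between corrections, while a large g does the opposite; the fixed point makes that trade-off explicit.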
The gain of the loop filter may be set to β=1. The behavioral simulation results may validate the theoretical analysis in the meaningful σN range.
High-speed digital domain CDRs typically may make parallel demultiplexed subrate phase updates due to timing constraints of digital logic blocks.
en,m may be the n-th prediction error of the m-th channel in the set of parallel BBPDs, as given by $e_{n,m} = \phi_{d,n,m} - \phi_{out,n,m}$. The time and channel indices may satisfy $-\infty < n < \infty$ and $0 < m \le M$, respectively, where M may be the level of parallelization. The linearized gain of the m-th BBPD, $K_{bbpd,m}$, may be $2/\sqrt{2\pi(m\sigma_W^2 + \sigma_N^2)}$, since the random jitter W may be accumulated for m cycles. In the case of σW << σN, this linearized gain may become insensitive to the channel index m and can be approximated as $K_{bbpd,m} \approx 2/(\sqrt{2\pi}\,\sigma_N) = K_{bbpd}$. A recursive equation for the (n+1)-th prediction error of the first channel, $e_{n+1,1}$, is given by the following Equation 9.
$$ e_{n+1,1} = e_{n,M} + W_{n,1} - \Big( M e_{n,1} + \sum_{k=2}^{M} (M+1-k)\,W_{n-1,k} + \sum_{k=1}^{M} (N_{n,k} + \phi_{bbpd,n,k}) \Big) K_{bbpd}\beta\theta_{pr} $$ [Equation 9]
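The use of a single gain Kbbpd for all M channels in Equation 9 can be sanity-checked numerically. The Python sketch below assumes the per-channel gain reads $K_{bbpd,m} = 2/\sqrt{2\pi(m\sigma_W^2 + \sigma_N^2)}$ and measures how far it departs from the common approximation $2/(\sqrt{2\pi}\,\sigma_N)$. The function names and the values of M, σW and σN are hypothetical.

```python
import math

def channel_gain(m, sigma_w, sigma_n):
    """Assumed linearized gain of the m-th parallel BBPD (W accumulated over m cycles)."""
    return 2.0 / math.sqrt(2.0 * math.pi * (m * sigma_w ** 2 + sigma_n ** 2))

def common_gain(sigma_n):
    """Channel-independent approximation, valid when sigma_w << sigma_n."""
    return 2.0 / (math.sqrt(2.0 * math.pi) * sigma_n)

M = 8                             # hypothetical level of parallelization
SIGMA_W, SIGMA_N = 0.005, 0.15    # hypothetical jitter values in UI rms

worst_relative_error = max(
    abs(channel_gain(m, SIGMA_W, SIGMA_N) / common_gain(SIGMA_N) - 1.0)
    for m in range(1, M + 1)
)
print(f"worst-case deviation over {M} channels: {100.0 * worst_relative_error:.2f} %")
```

With these values the deviation stays below one percent, which is the sense in which the gain is insensitive to the channel index.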
en,m may be related to en,1 by the following Equation 10 since the phase updates occur every M-th input signal.
$$ e_{n,m} = e_{n,1} + \sum_{k=2}^{m} W_{n-1,k} $$ [Equation 10]
The following Equation 11 may be derived by substituting Equation 10 into Equation 9,
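$$ e_{n+1,1} = e_{n,1} + \sum_{k=2}^{M} W_{n-1,k} + W_{n,1} - \Big( M e_{n,1} + \sum_{k=2}^{M} (M+1-k)\,W_{n-1,k} + \sum_{k=1}^{M} (N_{n,k} + \phi_{bbpd,n,k}) \Big) K_{bbpd}\beta\theta_{pr} $$ [Equation 11]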
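The substitution of Equation 10 into Equation 9 can be verified with arbitrary numbers, which also pins down the index conventions. In the Python sketch below, the function names, the list layout (w_prev[k - 2] holds $W_{n-1,k}$ for k = 2..M) and the random test values are assumptions made for illustration.

```python
import random

def correction(e_n1, w_prev, n_jit, phi, g):
    """Bracketed term of Equation 9 multiplied by g = Kbbpd * beta * theta_pr.

    w_prev[k - 2] holds W_{n-1,k} for k = 2..M; n_jit and phi hold M entries each.
    """
    m_par = len(n_jit)
    weighted = sum((m_par + 1 - k) * w_prev[k - 2] for k in range(2, m_par + 1))
    return (m_par * e_n1 + weighted + sum(n_jit) + sum(phi)) * g

def eq10(e_n1, w_prev, m):
    """Equation 10: error of channel m expressed through the error of channel 1."""
    return e_n1 + sum(w_prev[: m - 1])

def eq9(e_nM, e_n1, w_prev, w_n1, n_jit, phi, g):
    """Equation 9, taking the error of the last channel e_{n,M} as an input."""
    return e_nM + w_n1 - correction(e_n1, w_prev, n_jit, phi, g)

def eq11(e_n1, w_prev, w_n1, n_jit, phi, g):
    """Equation 11, written directly in terms of e_{n,1}."""
    return e_n1 + sum(w_prev) + w_n1 - correction(e_n1, w_prev, n_jit, phi, g)

rng = random.Random(0)   # fixed seed so the check is repeatable
M = 4
e_n1 = rng.gauss(0.0, 0.1)
w_prev = [rng.gauss(0.0, 0.01) for _ in range(M - 1)]
w_n1 = rng.gauss(0.0, 0.01)
n_jit = [rng.gauss(0.0, 0.15) for _ in range(M)]
phi = [rng.gauss(0.0, 0.1) for _ in range(M)]
g = 0.05

via_eq9 = eq9(eq10(e_n1, w_prev, M), e_n1, w_prev, w_n1, n_jit, phi, g)
direct = eq11(e_n1, w_prev, w_n1, n_jit, phi, g)
print(abs(via_eq9 - direct) < 1e-12)
```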
The MSE of the first channel is given by the following Equation 12.
phase lock. By defining the MSE at n+1-th clock cycle as the average MSE among M parallel channels, the following Equation 13 can be obtained.
By substituting Equation 12 into Equation 13, the following Equation 14 can be derived.
The steady state MSE may be given by the following Equation 15.
and σN = 0.158 UI rms according to an exemplary embodiment of the present invention. The MSE may increase in proportion to M since the phase update latency degrades the tracking performance.
The Kalman filter may be a discrete time minimum MSE estimator that finds the optimum Kalman gain by minimizing the posterior MSE recursively. The tracking error in a BB CDR can be minimized by incorporating the Kalman filter algorithm in selecting the optimum forward gain β. The optimum Kalman gain may achieve the optimum balance between tracking the accumulation jitter and filtering the non-accumulative period jitter.
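The recursive gain selection can be illustrated with a textbook scalar Kalman filter for a random walk observed in white noise. This is a generic model used as an analogy, not the recursion of the BB CDR itself: q stands in for the variance of the accumulation step W, r for the variance of the non-accumulative jitter N, and all names and numeric values in the Python sketch are hypothetical.

```python
import math

def kalman_gain_recursion(q, r, steps, p0=1.0):
    """Scalar Kalman recursion for a random walk (step variance q) in white noise (variance r).

    Returns the final gain and the final posterior error variance.
    """
    p = p0
    gain = 0.0
    for _ in range(steps):
        p_pred = p + q                 # predict: the random walk adds variance q
        gain = p_pred / (p_pred + r)   # gain that minimizes the posterior MSE
        p = (1.0 - gain) * p_pred      # posterior MSE after the update
    return gain, p

def steady_state_gain(q, r):
    """Closed-form limit: p_pred solves p_pred**2 - q*p_pred - q*r = 0."""
    p_pred = 0.5 * (q + math.sqrt(q * q + 4.0 * q * r))
    return p_pred / (p_pred + r)

SIGMA_W, SIGMA_N = 0.01, 0.15   # hypothetical jitter values in UI rms
q, r = SIGMA_W ** 2, SIGMA_N ** 2

gain, posterior = kalman_gain_recursion(q, r, steps=5000)
print(f"recursive gain = {gain:.5f}, closed form = {steady_state_gain(q, r):.5f}, "
      f"sigma_W / sigma_N = {SIGMA_W / SIGMA_N:.5f}")
```

In this generic model the steady-state gain approaches $\sigma_W/\sigma_N$ when $\sigma_W \ll \sigma_N$: a larger accumulation step calls for a wider loop bandwidth, and a larger non-accumulative jitter calls for more filtering.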
$B_n$ may be β at time index n. By taking the derivative of $E[e_{n+1}^2]$ in Equation 14 with respect to $B_n$, the following Equation 16 may be derived.
The optimum Kalman gain $B_n$ satisfying $dE[e_{n+1}^2]/dB_n = 0$ may be expressed as the following Equation 17.
By substituting Equation 17 into Equation 14 for simplicity, the following Equation 18 may be derived.
$$ E[e_{n+1}^2] = (1 - M B_n K_{bbpd}\theta_{pr})\,E[e_n^2] + M\,E[W_n^2] $$ [Equation 18]
Equation 17 and Equation 18 may yield the recursive procedure that constitutes the Kalman filtering algorithm. The steady state MSE may be expressed as the following Equation 19.
Equation 19 may indicate the minimum MSE bound of a BB CDR.
and D=0. The theoretical and simulated results may show close agreement, and the MSEs may be minimized when the Kalman gains are applied.
As described above, implementation non-idealities such as latency in the loop filter and quantization noise from the phase rotator may be neglected for simplicity in the analysis. Control latency, however, may degrade the tracking performance of a CDR by decreasing the closed loop phase margin. Digitally controlled phase rotators may have limited resolution for the output phase. Reduced resolution may relax the complexity of a rotator while degrading the jitter performance of a CDR.
In case the delay D in the loop filter is nonzero, Equation 11 may be modified as the following Equation 20.
$$ e_{n+1,1} = e_{n,1} + \sum_{k=2}^{M} W_{n-1,k} + W_{n,1} - \Big( M e_{n-D,1} + \sum_{k=2}^{M} (M+1-k)\,W_{n-D-1,k} + \sum_{k=1}^{M} (N_{n-D,k} + \phi_{bbpd,n-D,k}) \Big) K_{bbpd} B_n \theta_{pr} $$ [Equation 20]
According to Equation 20, Kbbpd can be approximated as $K_{bbpd} = 2q_0/(\sqrt{2\pi}\,\sigma_j)$ if $\sigma_j < B_n/K_{bbpd}$, where $q_0$ is 1/2, 1/3, and 1/5 for D=0, 1, and 2, respectively. However, in the case of $\sigma_j > B_n/K_{bbpd}$, $K_{bbpd} = 2/(\sqrt{2\pi}\,\sigma_j)$ and may be independent of the loop delay.
In order to calculate the MSE under nonzero loop delay, the correlation between en,1 and en−D,1 may be considered.
In this case,
$$ E[e_{n,1}\,e_{n-D,1}] \approx E[e_{n,1}^2] - M D\,E[W_n^2] $$ [Equation 21]
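By using Equation 21, $E[(e_{n,1} - M B_{Dn} K_{bbpd}\theta_{pr}\,e_{n-D,1})^2]$ may become the following Equation 22.
$$ E[(e_{n,1} - M B_{Dn} K_{bbpd}\theta_{pr}\,e_{n-D,1})^2] = (1 - M B_{Dn} K_{bbpd}\theta_{pr})^2\,E[e_{n,1}^2] + 2 M^2 D\,B_{Dn} K_{bbpd}\theta_{pr}\,E[W_n^2] $$ [Equation 22]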
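The step from Equation 21 to Equation 22 can be written out explicitly. The derivation below is a sketch that uses the shorthand $c = M B_{Dn} K_{bbpd}\theta_{pr}$ (introduced here for brevity) and additionally assumes stationarity under lock, that is $E[e_{n-D,1}^2] = E[e_{n,1}^2]$.

```latex
\begin{aligned}
E\big[(e_{n,1} - c\,e_{n-D,1})^2\big]
  &= E[e_{n,1}^2] - 2c\,E[e_{n,1}e_{n-D,1}] + c^2\,E[e_{n-D,1}^2] \\
  &\approx E[e_{n,1}^2] - 2c\,\big(E[e_{n,1}^2] - MD\,E[W_n^2]\big) + c^2\,E[e_{n,1}^2] \\
  &= (1 - c)^2\,E[e_{n,1}^2] + 2cMD\,E[W_n^2] \\
  &= (1 - M B_{Dn} K_{bbpd}\theta_{pr})^2\,E[e_{n,1}^2]
     + 2M^2 D\,B_{Dn} K_{bbpd}\theta_{pr}\,E[W_n^2]
\end{aligned}
```

The second line applies Equation 21 to the cross term, and the last line substitutes c back, which reproduces the $2M^2D$ coefficient of Equation 22.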
From Equation 20 and Equation 22, the recursive MSE equation with nonzero D may be expressed as the following Equation 23.
where BDn may denote the Kalman gain with loop delay. By taking a similar approach to Equation 16, Kalman gain BDn may be expressed as the following Equation 24.
The Kalman gain under control latency may be smaller than that of Equation 17, because only the low-frequency prediction error may be valid. In most cases, the tracking error may satisfy $E[e_n^2] \gg E[W^2]$ in the locked condition, and then $B_{Dn} \approx B_n$. By substituting Equation 24 into Equation 23, the following Equation 25 may be derived.
$$ E[e_{n+1}^2] = (1 - M B_{Dn} K_{bbpd}\theta_{pr})\,E[e_n^2] + (M + M^2 D\,B_{Dn} K_{bbpd}\theta_{pr})\,E[W_n^2] $$ [Equation 25]
and the steady state MSE may be the following Equation 26.
Equation 26 may represent the generalized minimum MSE bound of a BB CDR. This bound may be equal to Equation 19, when D=0.
and σN = 0.158 UI rms. The minimum MSE bound may increase in proportion to D and M.
By substituting Equation 26 into Equation 24, the optimum forward gain BDnθpr may be given by the following Equation 27.
In the case of σW << σN, $\sqrt{M^2(4D+1)\sigma_W^4 + 4\eta\sigma_W^2} \approx (5/2)\,\sigma_W\sigma_N$, and hence Equation 27 can be simplified to Equation 28.
By using a Taylor series, Equation 28 can be further simplified as given by the following Equation 29 and Equation 30.
Because a PLL is designed to track the accumulation jitter, the forward gain, which represents the bandwidth of a PLL, may have to be mainly related to the accumulation jitter; the optimum bandwidth may be approximately the standard deviation of the step size of the accumulation jitter.
When the recovered clock phase is locked to the input data, the operation of the clock path of a CDR may be similar to delta modulation. The staircase approximation of the input jitter, $\phi_{q,n}$, may be given by the following Equation 31.
$$ \phi_{q,n} = \phi_{q,n-1} + \sum_{m=1}^{M} \beta\theta_{pr}\,e_{n,m} $$ [Equation 31]
Assuming that the accumulation process starts at zero time, Equation 31 can be approximated as the following Equation 32.
$$ \phi_{q,n} = \sum_{i=1}^{n} \sum_{m=1}^{M} \beta\theta_{pr}\,e_{i,m} $$ [Equation 32]
Therefore, the accumulator output $\sum_{i=1}^{n} \sum_{m=1}^{M} \beta\,e_{i,m}$ can be considered as $\phi_{q,n}/\theta_{pr}$.
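The relation between Equation 31, Equation 32 and the accumulator output can be sketched in a few lines of Python. The sketch assumes the per-channel inputs are ±1 decisions and starts the accumulation from zero; the function names, the decision pattern and the values of β and θpr are hypothetical.

```python
def staircase_recursive(errors, beta, theta_pr):
    """Equation 31 applied cycle by cycle, starting from phi_q = 0.

    errors[i][m] holds the input of channel m + 1 in update cycle i + 1.
    """
    phi_q = 0.0
    history = []
    for row in errors:
        phi_q = phi_q + sum(beta * theta_pr * e for e in row)
        history.append(phi_q)
    return history

def staircase_double_sum(errors, beta, theta_pr, n):
    """Equation 32: double sum over the first n update cycles."""
    return sum(beta * theta_pr * e for row in errors[:n] for e in row)

def accumulator_output(errors, beta, n):
    """Accumulator output, i.e. Equation 32 without the theta_pr scale factor."""
    return sum(beta * e for row in errors[:n] for e in row)

# Hypothetical decisions for M = 4 channels over 3 update cycles.
errors = [[1, -1, 1, 1], [1, 1, -1, 1], [-1, 1, 1, 1]]
BETA = 0.5
THETA_PR = 1.0 / 64.0   # hypothetical phase rotator step in UI

history = staircase_recursive(errors, BETA, THETA_PR)
print(history[-1], staircase_double_sum(errors, BETA, THETA_PR, 3),
      accumulator_output(errors, BETA, 3) * THETA_PR)
```

All three printed values agree, so multiplying the stored accumulator codes by θpr recovers the staircase phase in UI.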
The inputs of the off-chip digital processor may be phase update information from the cyclic accumulator. The input signal may be stored in a storage register and then multiplied by a scale factor θpr for the phase domain conversion. The UI-domain PSD of the accumulated phase noise, S(f), can be obtained by using a fast Fourier transform (FFT) algorithm. The standard deviation σW can be estimated from S(f) since $\sigma_W = \sqrt{4S(f)\sin^2(\pi f/f_{Data})}$. Finally, a calibrator may multiply σW by an architecture-dependent correction factor that accounts for the resolution of the phase rotator, the data transition density, and the gain reduction caused by undersampling.
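The extraction of σW from S(f) can be sketched in Python as follows. The sketch assumes a PSD normalized so that the white step sequence W has a flat level of $\sigma_W^2$, and it models the non-accumulative jitter as an additive white floor of level $\sigma_N^2$; both the normalization and the floor model are assumptions, and the function names and values are hypothetical.

```python
import math

def sigma_w_from_psd(s_f, f, f_data):
    """sigma_W = sqrt(4 * S(f) * sin^2(pi * f / f_data))."""
    return math.sqrt(4.0 * s_f * math.sin(math.pi * f / f_data) ** 2)

def modeled_psd(f, sigma_w, sigma_n, f_data):
    """Assumed PSD: random-walk term plus a white floor from non-accumulative jitter."""
    walk = sigma_w ** 2 / (4.0 * math.sin(math.pi * f / f_data) ** 2)
    return walk + sigma_n ** 2

F_DATA = 10e9                    # hypothetical data rate in Hz
SIGMA_W, SIGMA_N = 0.01, 0.15    # hypothetical jitter values in UI rms

estimate_low = sigma_w_from_psd(modeled_psd(1e6, SIGMA_W, SIGMA_N, F_DATA), 1e6, F_DATA)
estimate_high = sigma_w_from_psd(modeled_psd(1e9, SIGMA_W, SIGMA_N, F_DATA), 1e9, F_DATA)
print(f"estimate at 1 MHz: {estimate_low:.5f} UI, estimate at 1 GHz: {estimate_high:.5f} UI")
```

At the low frequency the estimate matches σW closely, while at the high frequency the white floor dominates S(f) and inflates the estimate, so the frequency at which S(f) is read matters.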
Proper selection of f may be crucial because σW can be misinterpreted due to computational error at low frequencies and period jitter at high frequencies. The upper 3 dB corner frequency may be determined by the ratio between the variance of the period jitter and σW as given by
The PSD retrieved from the FFT may be valid for frequencies greater than $1/(N_{FFT}T_S)$, where $N_{FFT}$ and $T_S$ may be the number of points in the FFT and the sampling period, respectively. In order to eliminate the low-frequency computational error caused by the limited data storage, a frequency in excess of $10/(N_{FFT}T_S)$ may have to be chosen. Therefore, for $N_{FFT} \gg 10/(f_c T_S)$, the valid frequency range may be
The exemplary embodiments according to the present invention may be recorded in computer-readable media including program instructions to implement various operations embodied by a computer. The media may also include, alone or in combination with the program instructions, data files, data structures, and the like. The media and program instructions may be those specially designed and constructed for the purposes of the present invention, or they may be of the kind well-known and available to those having skill in the computer software arts.
It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit or scope of the invention. Thus, it is intended that the present invention cover the modifications and variations of this invention provided they come within the scope of the appended claims and their equivalents.
Number | Date | Country | |
---|---|---|---|
61617205 | Mar 2012 | US |