This invention relates to mixed-signal Integrated Circuits (ICs) and more particularly to non-linearity identification and compensation of building blocks for a mixed-signal IC.
The ever-increasing data rates of wireline and wireless communication systems poses stringent performance requirements on their mixed-signal IC sub-systems. In particular, there is a great demand for high-resolution data converters at higher sampling rates, and high-performance frequency synthesizers with stringent phase-noise and spurious performance requirements. Delta-Sigma (ΔΣ) Digital-to-Analog Converters (DACs) leverage noise shaping and high oversampling ratio to realize a high-resolution (on the order of 20-bits), which cannot be readily achieved using a conventional DAC.
Similarly, ΔΣ Fractional-N Phase Locked Loops (FN-PLLs) leverage noise shaping and high oversampling ratio to realize a very fine frequency synthesis capability (on the order of 20-bits). In both of these examples, the ΔΣ noise-shaping modulator reduces the word length (m) of its digital input signal, x[k], to a few (1-6) bits, and the large amount of quantization noise generated is shaped to a high-frequency to make the in-band noise negligible. The out-of-band quantization noise is suppressed by a dedicated low-pass filter in case of a ΔΣ DAC or by loop dynamics in case of a ΔΣ FN-PLL.
In practice, non-linearity of the circuit building blocks poses a bottleneck to realize high-performance ΔΣ DACs and ΔΣ FN-PLLs. The non-linearity causes shaped out-of-band noise to fold intermodulation products into the baseband and limits the Spurious Free Dynamic Range (SFDR) and Signal-to-Noise plus Distortion Ratio (SNDR) of ΔΣ DACs and phase noise and spurious performance of ΔΣ FN-PLLs.
Non-Linearity Impact on ΔΣ Quantization Noise Cancellation
ΔΣ DACs using a 1-bit DAC unit offer inherently high-linearity but at the expense of a large quantization error. Consequently, achieving high in-band Signal-to-Noise Ratio (SNR) requires a very large oversampling ratio. Therefore, its usage is typically limited to low-bandwidth applications like audio and sensor interfaces. Using a multi-bit DAC effectively reduces the amount of quantization noise and considerably relaxes the oversampling ratio required to achieve high-bandwidth. But this comes at the expense of high sensitivity to the mismatch between DAC unit cells.
To improve matching between DAC unit cells, thermometer coding implementation is recommended. To get a better tradeoff between linearity and area, as the number of bits in the DAC increases, DAC implementation is usually segmented into two smaller DACs: an m1-bit coarse DAC and an m2-bit fine DAC. While the original modulator output x0[k] contains shaped noise, the split coarse x1[k], and fine x2[k], signals contain non-shaped noise as well as distortion. Adding DAC1 and DAC2 outputs with the proper gain ratio (gDAC1=2m
A segmented ΔΣ DAC architecture may use a Quantization Noise Cancellation (QNC) technique to overcome this limitation. By re-quantizing the original modulator output x0[k] using a second ΔΣ modulator to drive DAC1, coarse DAC1 control x1[k] now contains noise-shaped components besides the signal. Since signal x2[k] is the difference between the input and output of the ΔΣ modulator, it represents only the shaped quantization noise and does not contain any signal components. As a result, any spectral leakage due to improper gain ratio results only into noise-shaped signal that contributes little in-band energy.
Similarly, QNC techniques may be used to improve the performance and extend the bandwidth of ΔΣ fractional-N phase-locked loops (FN-PLLs).
The fractional divider quantization noise impacts both analog and digital FN-PLLs alike. QNC techniques for digital FN-PLLs may use Time-to-Digital Converters (TDCs) and Digital-to-Time Converters (DTCs). Even though the cancellation gain path can be accurately calibrated using a Least-Mean Square (LMS) technique, each of the PFD/CP signal and DAC noise cancellation paths has different non-linear characteristics, namely pCP(x) and pDAC(X), respectively. The non-linearity of these blocks can significantly limit the overall phase noise and spurious performance of FN-PLLs. Similarly, coarse and fine DACs of a segmented ΔΣ-DAC architecture can have different non-linear characteristics, namely pDAC1(x) and pDAC2(x), and severely degrade SNDR and SFDR of the overall DAC.
The non-linearity of the signal path or noise cancellation path poses a bottleneck to realize high-performance ΔΣ DACs and ΔΣ FN-PLLs. The non-linearity causes shaped noise and tones in the high-frequency region of the multi-bit spectrum to fold intermodulation products into the baseband. The in-band distortion products cannot be readily removed by simple linear filters. Reducing system bandwidth may or may not significantly improve the SNDR and SFDR of ΔΣ DACs and phase noise and spurious performance of ΔΣ FN-PLLs. In this case, it is the non-linearity, rather than the quantization noise, that limits the performance of the system.
What is desired is adaptive non-linearity identification and compensation techniques that can be leveraged in the implementation of data converters, PLLs, and frequency synthesizers to achieve improved performance. An adaptive non-linearity identification and compensation circuit that is useful for various analog/mixed-signal/RF integrated circuits (ICs) building blocks is desired, including for clock generators, clock and data recovery (CDR), phase interpolators, voltage/current amplifiers, transimpedance amplifiers (TIAs), and power amplifiers (PAs).
The present invention relates to an improvement in mixed-signal circuits. The following description is presented to enable one of ordinary skill in the art to make and use the invention as provided in the context of a particular application and its requirements. Various modifications to the preferred embodiment will be apparent to those with skill in the art, and the general principles defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the particular embodiments shown and described but is to be accorded the widest scope consistent with the principles and novel features herein disclosed.
The inventor proposes methods and apparatus to identify and compensate the non-linearity of precision analog and mixed-signal building blocks such as ΔΣ DACs and ΔΣ FN-PLLs. In general, any arbitrary memoryless non-linearity of a given circuit block can be described as a function p(x) of its input signal x, where x is defined over a closed region x∈[−xp, +xp]. In the context of DACs, the input is a discrete-time digital signal, x[k], and the non-linearity function p(x) represents DAC Integral Non-Linearity (INL). This INL function principally defines the deviation of the DAC from its ideal linear behavior. For a perfectly linear DAC, p(x)=0 for any input x.
INL identification and compensation 140 also includes adaptive INL estimator 138 that uses adder 136 output eR[k] along with the input signal x[k] to estimate the error to control q(x) function generator 134 accordingly. INL function p(x) generator 132 represents the deviation from a linear output by a mixed-signal component, such as a DAC.
In practice, non-linearity function p(x) can vary significantly with supply and temperature variations. Therefore, techniques to describe the compensation function using factory trimming or on-power-up calibration may not be sufficient. For example, the temperature may change as the system heats up after the initial calibration. The goal here is to adaptively synthesize the non-linearity compensation function q(x) to cancel the effect of INL function p(x) accurately and robustly across Process, Supply, and Temperature (PVT) variations. To this end, the inventor's non-linearity identification and compensation scheme detects the residual error after compensation to adaptively construct the compensation function q(x). The complexity to build such adaptive non-linearity identification and compensation system can be reduced by how the inventor constructs the compensation function and how to adapt its coefficients. This construction method can be aligned with the inventor's model to represent the non-linearity function, p(x).
Non-Linearity Representation
A continuous static non-linearity is typically represented in terms of an Nth order power series given by:
where ci represents the real valued polynomial coefficients. The ci coefficients can be estimated using cross-correlation techniques, which is prohibitively expensive in terms of hardware implementation, particularly as the polynomial order increases. In practice, the implementation of this kind of polynomial modeling and estimation technique is limited to a few (2 or 3) terms only.
Alternatively, the inventor represents any arbitrary unknown static non-linear function p(x) defined over a closed region x∈[−xp, +xp] by a linear combination of a complete set of orthogonal basis functions or kernels ϕi(x):
where ci denotes the ith coefficient of the corresponding kernel. The orthogonal series method offers some interesting advantages over the power series method and allows a much simpler, but accurate, implementation of the compensation scheme as will be shown later. This is based on the underlying kernels' orthogonality properties. Kernels ϕi(x) are said to be orthogonal in the interval −xp≤x≤+xp if they satisfy:
where A is a constant. If range of x is a discrete set (i.e. M-levels quantized or digitally represented from −M/2 to +M/2−1), the complete set of orthogonal kernels consists of a finite number of M kernels:
For the rest of the disclosure, the inventor treats x as a discrete-time signal with discrete-amplitude (M values). Without loss of generality, all the upcoming equations and analysis are valid and easily applied when x has continuous amplitude (M→∞).
The compensation function q(x) is similarly represented by the orthogonal kernels. In practice, only a limited number of (N≤M)-kernels can be used in implementation of q(x):
where ĉi denotes the ith compensation coefficient of the corresponding kernel.
Optimum Compensation Coefficients
The difference eR(x)=p(x)−q(x), represents the residual compensation error due to the incomplete representation, which is minimum in the least square sense
where MSE is the residual mean square error after compensation. It is defined as:
The ĉi coefficients are chosen adaptively in the background to track changes in the non-linearity function, p(x), across PVT variations. The following analysis is presented to show what the optimum coefficient values that minimize MSE are assuming the input signal, x[k], has a uniform distribution with amplitude values from −M/2 to +M/2−1. This assumption is in-line with the type of signals the inventor encounters in ΔΣ QNC schemes. The real-time error signal can be expressed as:
eR[k]=y[k]−ŷ[k] (8)
Let {k1, k2, . . . , kL} represents L different discrete-time instances. Then, the inventor can use vector notation to describe input, output, compensation, and error signals as 1×L vectors as follows:
X=[x[k1],x[k2], . . . ,x[kL]]T (9)
Y=[y[k1],y[k2], . . . ,y[kL]]T (10)
Ŷ=[ŷ[k1],ŷ[k2], . . . ,ŷ[kL]]T (11)
ER=[eR[k1],eR[k2], . . . ,eR[kL]]T (12)
where T is the transpose operator. Similarly, the inventor can describe coefficients, ĉi, as 1×N vector, and the orthogonal kernels as N×L matrix:
Then, compensation and error vectors can be expressed as:
Ŷ=ΦĈ (15)
ER=Y−Ŷ=Y−ΦĈ (16)
The inventor defines the cost function, ε, to be minimized as:
ε=ERTER (17)
Then, by setting the derivative of E with respect to Ĉ, ∇Ĉξ, to zero, ε can be minimized at Ĉ optimal as follows:
∇Ĉξ=(−ΦT)·2(Y−ΦĈ)=0 (18)
Ĉ=(ΦTΦ)−1ΦTY (19)
The inventor can clearly see the error minimization process depends mainly on the autocorrelation matrix product ΦTΦ. For a uniformly distributed input vector, X, and because of the kernel's orthogonality (see eq. (5)), the autocorrelation matrix product ΦTΦ can be reduced to an identity matrix, I, times a constant, A:
Then, Ĉ optimal can be expressed as:
In standard format, the optimum compensation coefficients, ĉi, to minimize MSE:
In other words, optimum compensation coefficients, ĉi, represent the projection of the non-linearity function, p(x), onto the orthogonal kernels, ϕi(x):
Adaptive Non-Linearity Compensation Using LMS Technique
Orthogonality helps to avoid the prohibitively expensive matrix inversion operation (ΦTΦ)−1 of eq. (19). But still in practice finding the optimum compensation coefficients in a background manner by computing ΦTY is not obvious, as the inventor may not have signal y[k] readily available in a digital format. Alternatively, the inventor can use the residual error signal after compensation, eR[k], to adaptively construct the compensation function q(x). It may be better to search for the optimal solution iteratively using steepest-descent methods. Starting from an initial guess, Ĉ(0), and using the slope of the cost function, the inventor can improve upon it in a recursive manner until it ultimately converges to the optimal Ĉ using eq. (18) as follows:
where μ is the step size parameter. The error signal of eq. (5) can be expressed using kernels as:
The second error term is due to incomplete kernel representation (i.e. N<M). Because of kernels orthogonality (see eq. (5)), the nonorthogonal terms will average out and the expectation ΦTER, can approximately be reduced to:
This clearly shows that the iterative process will continue until the compensation coefficients, ĉi, converge to the non-linearity coefficients, ci, representing p(x) of eq. (4). For simplicity, the instantaneous approximation of variables xi[k]=ϕi(x[k]) and eR[k] are used to form a Least Mean Square (LMS) algorithm. The LMS adaptive technique was chosen because of its low computational complexity. The coefficients update equation of (24) can be re-written as:
ĉi[k+1]=ĉi[k]+μxi[k]eR[k] (27)
LMS is effective when both eR[k] and xi[k] are zero-mean signals (i.e. has no DC components).
INL identification and compensation 140 has N paths. The first path for N=1 has kernel block 141 that generates x1[k], which is multiplied by the residual error eR[k] using multiplier 171 and accumulated in accumulator 181. The accumulated output, ĉ1[k], is multiplied using multiplier 151 with kernel block 141's output, x1[k], and then summed with products from other kernel blocks 142, . . . 144 by summers 161, 162, . . . 164 to generate cancellation signal ŷ[k].
The input signal x[k] is passed though N kernel blocks 141, 142, . . . 144 to generate xi[k]=ϕi(x[k]) signals, which are then scaled using multipliers 151, 152, . . . 154 with the corresponding coefficients and aggregated to synthesize the cancellation signal ŷ[k]. Accumulators 181, 182, . . . 184 received scaled inputs from multipliers 171, 172, . . . 174 to generate the corresponding coefficients ĉ1[k], ĉ2[k], . . . ĉN[k].
Choice of Orthogonal Kernels
Conceptually, any complete orthogonal kernels set can be used to represent p(x) and construct q(x). In practice, a finite number of N kernels is used. The choice of the kernel set can have a determinate impact on the compensation MSE. Increasing N equivalently causes finer granularity in the input signal, x, to the kernels. The proper choice of the kernel set is very important to practically realize the compensation function in an efficient manner, while ensuring a small MSE that meets the target specifications.
For example, the most famous orthogonal representation, Fourier series, uses sine and cosine kernels. In practice, the implementation of sine and cosine kernels can lead to prohibitively complex implementation and can result in a very large MSE when N<<M. Other non-sinusoidal representations can be categorized as polynomial kernels and piece-wise kernels. Classical orthogonal polynomial kernels such as Laguerre polynomials, Hermite polynomials, and Jacobi polynomials (including Legendre, Chebyshev, Gegenbauer polynomials) are difficult to implement in practice, because of the numerous multiply and accumulate operations involved.
On the other hand, piece-wise kernels can offer a much simpler implementation. The simplest piece-wise kernels set is based on block pulse functions. A Block Pulse (BP) N-kernels set can be defined using:
where i=1, 2, . . . , N.
In the discrete-sense, BP kernels can be represented by the identity matrix I.
The reduced complexity of BP kernels permits increasing the number of kernels up to N=16 or N=32 in practice. Using N=16, the RMS error can be reduced to 1.29 LSB. But BP kernels suffer from a major drawback. The LMS adaptation of BP kernels can be sensitive to any offset or flicker noise added to the eR[k] signal. This is because all BP kernels, and accordingly generated xi[k] signals, have a DC component, and LMS is effective when both eR[k] and xi[k] are zero-mean signals (i.e. has no DC components) (see eq. (27)). To overcome this limitation, constant piece-wise kernels using Haar functions can be used, where only one kernel accounts for the DC component. Haar functions have three possible states 0 and ±a, where a is set to normalize each kernel's amplitude to satisfy eq. (5).
Walsh functions can also be leveraged to build the constant piece-wise kernel set, where only one kernel accounts for the DC component. Walsh functions uses only two possible states ±1 to form a complete orthonormal set. It has striking similarity with the sine-cosine functions, primarily with respect to their zero-crossing patterns. In the discrete-sense, Walsh kernels are represented by N×N matrix of Walsh Hadamard (WH) codes. In general, the WH matrix WH(n0) can be generated using the following recurrence relation:
where WH(O)=1 and n=log2 N.
Curve 216 shows that error e(x) varies from about +4 to −7, except for an initial excursion to +10 near −512. The Root Mean Square (RMS) of this error e(x), in steady-state, is about 2.50 LSB.
Similar to BP kernels, the reduced complexity of WH kernels permits increasing the number of kernels to N=16 or N=32 in practice. Using N=16, the RMS error can be reduced to 1.29 LSB. However, still the residual error is large and may not meet the target compensation accuracy.
Triangular Piece-Wise Kernels
To improve compensation, the inventor modifies WH kernels to be a linear piece-wise kernels set that uses triangular waveforms in contrast to the rectangular waveforms of the WH kernels. The inventor refers to the new set as Triangular Walsh Hadamard (TWH).
Compared to prior-art piece-wise linear approaches, which used a Simplicial Canonical Piece-Wise Linear (SCPWL) function, the proposed TWH representation offers more accurate and simpler representation. In contrast to constant-piece-wise of the aforementioned kernels, the proposed THW linear piece-wise representation considerably improves the compensation accuracy, because now each segment can be represented by a linear function, and as a result the RMS error can be reduced considerably. TWH kernels offer very simple implementation of sine/cosine like kernels.
Curve 246 shows that error e(x) varies from about +2 to −2, and there is no longer an initial excursion to a high value near −512, as was seen in
Constant Piece-Wise Kernels with Interpolation
The inventor's linear piece-wise kernels, Triangular Walsh Hadamard (TWH), considerably reduce the compensation MSE compared to piece-wise kernels such as BP and WH. However, TWH kernels have a scalability issue, because as the number of kernels increases, the implementation complexity of TWH kernels grows rapidly with numerous accumulate and multiply operations. So practically, it may not be efficient to increase the number of THW kernels beyond N=8. To break this trade-off between compensation accuracy and implementation complexity, the inventor leverages the remarkably simple constant piece-wise kernels as BP, Harr, or WH, in addition to linear interpolation to achieve the same compensation accuracy as TWH but with a much simpler and scalable implementation.
ŷ[k]=qx[k]+(qx+1[k]−qx[k])×xF[k]×2−(m-n) (30)
The interpolation process smoothes the final compensation function, q(x). Compensation function constructor 312 can use inverse matrix and dot product operations to directly construct qLUT(x) and update LUT 314 routinely to track PVT variations. Because PVT tracking is a slow process, inverse matrix computation and LUT update can be done at much slower rates than the input sample rate by adding decimation filters 301, 302, . . . 304 on the outputs of accumulators 291, 292, . . . 294 in LMS correlator 320.
For BP kernels, the kernel matrix is an identity matrix, which means its inverse is also an identity matrix, therefore coefficients can directly represent qLUT (x). For WH kernels, fast inverse WH transform (iWHT) can be used to construct qLUT(x) from coefficients, ĉi, in an efficient manner compared to direct inverse matrix and dot product operations. Compensation function constructor 312 using iWHT receives the outputs of decimation filters 301, 302, . . . 304 and generates updates to LUT 314. The fast iWHT algorithm is similar to the well-known inverse fast Fourier transform (iFFT). The fast iWHT is considerably more efficient and faster to compute than iFFT, because it only uses ±1 states in its kernel representation, so each operation can involve only addition and subtraction compared to complex and costly multiplication operations used in the iFFT computation.
Assume that function p(x) generator 132 represents the non-linear error generated by an m-bit DAC, and the range of x is a discrete set with M=2m levels. The cancellation signal ŷ[k] generated by INL identification and compensation 322 is subtracted from the error output by function p(x) generator 132 using adder 136 to generate a residual error eR[k].
INL identification and compensation 322 has N paths. The first path for N=1 has kernel block 271 that generates x1[k], which is multiplied by the residual error eR[k] using multiplier 281. The multiplier output, eLMS1[k], is accumulated by accumulator 291 and decimated by decimation filter 301 to generate the first input to compensation function constructor 312, ĉ1[k2].
The input signal x[k] is passed though N kernel blocks 271, 272, . . . 274 in kernels 318 to generate xi[k]=ϕi(x[k]) kernel signals, which are then multiplied with the residual error eR[k] using multipliers 281, 282, . . . 284 and accumulated by accumulators 291, 292, . . . 294 and decimated by decimation filters 301, 302, . . . 304 and input to compensation function constructor 312. LUT 314 is updated by compensation function constructor 312, making PVT adjustments for LUT interpolation to synthesize the cancellation signal ŷ[k].
Using interpolation and updating of LUT 314, as shown in
Curve 336 shows that error e(x) varies from about +2 to −3. The RMS of error e(x), curve 336, in steady-state is about 0.66 LSB. This is about 4 times the compensation accuracy improvement compared to WH without interpolation and about the same compensation accuracy of TWH (N=8).
Applications
In this section, the inventor presents some examples of applications for the proposed adaptive INL identification and compensation techniques.
TDC 362 outputs a digital word eTDC[k]. Digital loop filter 364 digitally filters the output of TDC 362 after subtraction of error terms by adders 368, 370. Digital loop filter 364 generates a multi-bit digital control word, DC[k], that drives Digitally Controlled Oscillator (DCO) 366 towards phase and frequency lock in the loop.
Fractional-N operation is generated by dithering MMD 108 with m1-bit xDIV[k], which is generated by truncating an m-bit (m>m1) frequency control word x[k] by using digital ΔΣ modulator 110. Adder 112 subtracts the frequency control word x[k] from the output of ΔΣ modulator 110 to generate an m2-bit quantization error that is accumulated by accumulator 114.
Output eTDC[k] of accumulator 114 is applied to gain calibrator 376, which generates the gain adjustment yG[k]. subtracted from the PLL loop by adder 368. Gain calibrator 376 uses LMS correlator 372 to correlate the accumulated quantization error, eTDC[k], with the input to digital loop filter 364, residual error eR[k], with the output of LMS correlator 372 multiplied using multiplier 374 with the accumulated quantization error, eTDC[k], to generate the gain adjustment yG[k].
INL identification and compensation 322 also receives the accumulated quantization error, eTDC[k], from accumulator 114. Kernels 318, LMS correlator 320, and compensation function constructor 312 operate on the input from accumulator 114 to generate updates to entries in LUT 314, as shown and described earlier for
LMS correlator 320 in INL identification and compensation 322 and LMS correlator 372 in gain calibrator 376 both receive residual error eR[k] that is input to digital loop filter 364. DCO 366 output OUT is fed back to Multi-Modulus Divider (MMD) 108 to generate the divided clock DIV. MMD 108 can be a ΔΣ fractional divider in the feedback path. DCO 366 generates an output signal, OUT, of frequency FOUT=(NDIV+αDIV)FREF, where NDIV is a positive integer, αDIV is a fractional value between 0 and 1, and FREF is the frequency of the reference clock signal REF. Fractional-N operation is achieved by dithering the feedback multi-modulus divider MMD 108 using ΔΣ modulator 110, where the average of the dithered signal xDIV[k] corresponds to the desired fractional factor αDIV.
The m2-bit quantization error is filtered by the low pass filtering characteristic of the PLL. However, it is common knowledge that a PLL suffers from a conflicting bandwidth tradeoff. For example, DCO phase noise suppression requires large bandwidth while a low bandwidth is needed to effectively suppress ΔΣ quantization noise error. In view of these tradeoffs, the impact of ΔΣ quantization noise error on output phase noise is reduced by cancelling it at the output of the TDC. The effectiveness of this approach greatly depends on the gain and non-linearity characteristic of the TDC.
While gain calibration using LMS correlation technique is reported in the prior-art, it is insufficient to adequately suppress the ΔΣ quantization noise error to levels that are mandated by many applications. The dynamic range of TDC 362 must at least be as large as one DCO period when the fractional divider is dithered by a first order ΔΣ modulator and several DCO periods for higher order ΔΣ modulators. Increasing TDC dynamic range severely degrades its linearity performance. The non-linearity identification and compensation circuit of
DTC 382 and DTCC 384 can be any digitally-controlled delay element, such as a digitally-controlled delay line, a voltage-controlled delay line with a DAC, a phase rotator, a phase interpolator, a multi-phase generator with phase selector, a multi-phase delay locked loop (DLL) with a programmable delay or a phase selector.
However, the effectiveness of this approach is also limited by the DTC non-linearity. This embodiment of the adaptive DTC INL identification and compensation scheme compensates DTC INL in the “time-domain” using a narrow range linear compensation (DTCC). As a result, it greatly improves the ΔΣ quantization noise cancellation accuracy, thereby enabling ultra-low noise and spurs at the synthesizer output.
Gain calibrator 376 is connected as described in
Several other embodiments are contemplated by the inventor. For example, many kinds and arrangements of analog detectors, filters, oscillators, adders, DAC's, and digital processors, function units, logic gates, and logic structures are possible. Various encodings, transforms, and alterations to data and signals may be performed at different stages, and for a variety of reasons. Functional units, blocks, and other components may be shared and used for several purposes. Various initialization and start-up procedures or circuits could be added, or power-down circuits and routines.
Some embodiments disclosed herein are applicable to any ΔΣ FN-PLL, such as analog, digital, or hybrid. FN-PLLs may be used for frequency synthesis, phase modulation and demodulation, clock generation, clock and data recovery, jitter attenuators, and phase synchronization. The output clock OUT may be encoded with data for transmission. Some embodiments presented in this disclosure are particularly suited for applications with stringent phase noise and spurious performance requirements.
Error corrections or adjustments may be added to the loop in the time domain or in the digital domain, or in a combination such as shown in
DTC 382, 384 can be any digitally-controlled-delay element, such as a digitally-controlled delay line, a phase rotator, or a phase interpolator. TDC 362 can compare the phases of REF and DIV and generate a multi-bit digital value directly or may use a charge pump to generate an analog voltage that is then converted by and ADC to a digital value.
A Digital-Signal Processor (DSP) may be used for some functions. The DSP may be part of a larger system that is controlled by a Central Processing Unit (CPU) that may have a microprocessor that controls a DSP, signal processing blocks, circuits, or other enhancements such as a pipeline to process signals. The CPU can execute instructions stored in memory to perform the operations. Inputs, outputs, and intermediate results may be stored in one or more memories. Data signals that are converted to digital values may be stored in a memory for processing by a CPU or a DSP, which may use lookup tables or a specialized processor or data pipeline to accumulate values, modulate, perform kernel operations, LMS correlation, compensation function constructor, fast iWHT, interpolation, multiplication and addition/subtraction. A general-purpose programmable DSP may be used for prototyping and development, then a faster DSP with dedicated hardware may be used for production. Purpose-built or custom DSP blocks and other hardware may be used for some or all components, while a more generic DSP may be used for other components, especially non-speed-critical blocks. Field-Programmable Gate-Array (FPGA) or other semi-custom blocks may also be used, either initially for prototypes or later for production devices. The invention may also be implemented with an Application-Specific Integrated Circuit (ASIC) design, or other implementations. A mixed-signal IC may be used that includes the PLL blocks and DSP blocks for gain calibration and INL identification and compensation. The device may be digitally re-programmable, such as to support various modes of operation, initialization, testing, different bit widths of digital signals, different operating speeds, clock speeds, division factors for the feedback divider, etc.
Many embodiments of the non-linearity identification and compensation techniques described herein are applicable in general to many analog/mixed-signal/RF Integrated Circuit (IC) building blocks including but not limited to clock generators, Clock and Data Recovery (CDR), phase interpolators, voltage/current amplifiers, Transimpedance Amplifiers (TIAs), and Power Amplifiers (PAs).
Additional components may be added at various nodes, such as resistors, capacitors, inductors, transistors, extra buffering, etc., and parasitic components may also be present. Enabling and disabling the circuit could be accomplished with additional transistors or in other ways. Pass-gate transistors or transmission gates could be added for isolation. Filters may be added. While a fast iWHT process has been described, other inverse transforms could be substituted or altered. For example, fast Fourier Transforms and inverse Fourier transforms may be used, especially during the design and prototyping phase, and later reduced in complexity for iWHT.
The kernel coefficients ĉi, can be set adaptively or non-adaptively. The coefficients can be set using pre-set values either under manual or computer or program control. At other times the coefficients may be adaptively generated using the LMS correlator. The LMS correlator can be used only on startup and later turned off after the residual error is below a certain threshold or after a pre-set period. The LMS correlator can be turned on again after a pre-determined period of time has elapsed to track Temperature and Supply Voltage variations. The LMS correlator may be turned on again after the residual error is above certain threshold. Thus, adaptive coefficient generation can be used only on startup, or when the residual error is large, with the coefficients stored in the lookup table remaining unchanged when the LMS correlator is turned off. The orthogonal kernel generator and compensation function constructor and other related blocks may also be turned off when the LMS correlator is turned off. Some PLL blocks could be implemented in the digital domain or in the analog domain or in the time domain. For example, in
In another alternative, TDC 362 contains a charge pump and an Analog-to-Digital Converter (ADC). In this alternative, TDC 362 measures the time or phase difference between rising edges of clocks REF, DIV, and activates the charge pump to charge or discharge a capacitor in TDC 362. Then the capacitor voltage is converted by the ADC to a digital value that is input to loop filter 364, or one of adders 368, 370.
In still another embodiment, TDC 362 could be PFD/CP 102 without a capacitor or ADC. Then loop filter 364 could be an analog capacitor (loop filter 104) with an ADC that converts the capacitor voltage Vc to digital control value DC[k] to control the frequency of oscillation of DCO 366. Other combinations and variations are possible.
While Least-Mean Square (LMS) has been described, other steepest-gradient methods may be substituted, such as Recursive Least squares (RLS), and modifications of LMS, RLS, or other methods.
While separate subtractors 368, 370 in series have been shown, subtractors could be in series or both adjustments could be summed first and then only the sum of the adjustment is subtracted in a single subtractor in the PLL loop.
Kernels ϕi(x) are said to be orthogonal in the interval −xp≤x≤+xp if they satisfy:
where A is a constant. Orthogonal kernels may be defined using equation (3).
The + and − inputs to adder 112 may be reversed.
The orthogonal kernel generator may generate a complete set or an incomplete set of orthogonal kernels. The orthogonal kernel generator may generate polynomials kernels, linear piece-wise kernels, or constant piece-wise kernels. Polynomials kernels may include Laguerre polynomials, Hermite polynomials, and Jacobi polynomials such as Legendre, Chebyshev, and Gegenbauer polynomials. Constant piece-wise kernels may include block-pulse (BP), Haar, and Walsh Hadamard (WH) kernels. The WH kernel set includes one kernel that has a non-zero, constant DC bias, while other WH kernels have a net zero DC bias over a range of the input signal and each can have only two output values. Linear piece-wise kernels include Triangular Walsh Hadamard (TWH) kernels. The TWH kernel set includes one kernel that has a constant DC bias, while other TWH kernels have a net zero DC bias over a range of the input signal.
The background of the invention section may contain background information about the problem or environment of the invention rather than describe prior art by others. Thus, inclusion of material in the background section is not an admission of prior art by the Applicant.
Any methods or processes described herein are machine-implemented or computer-implemented and are intended to be performed by machine, computer, or other device and are not intended to be performed solely by humans without such machine assistance. Tangible results generated may include reports or other machine-generated displays on display devices such as computer monitors, projection devices, audio-generating devices, and related media devices, and may include hardcopy printouts that are also machine-generated. Computer control of other machines is another tangible result.
Any advantages and benefits described may not apply to all embodiments of the invention. When the word “means” is recited in a claim element, Applicant intends for the claim element to fall under 35 USC Sect. 112, paragraph 6. Often a label of one or more words precedes the word “means”. The word or words preceding the word “means” is a label intended to ease referencing of claim elements and is not intended to convey a structural limitation. Such means-plus-function claims are intended to cover not only the structures described herein for performing the function and their structural equivalents, but also equivalent structures. For example, although a nail and a screw have different structures, they are equivalent structures since they both perform the function of fastening. Claims that do not use the word “means” are not intended to fall under 35 USC Sect. 112, paragraph 6. Signals are typically electronic signals but may be optical signals such as can be carried over a fiber optic line.
The foregoing description of the embodiments of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. It is intended that the scope of the invention be limited not by this detailed description, but rather by the claims appended hereto.
Number | Name | Date | Kind |
---|---|---|---|
7999623 | Lin | Aug 2011 | B2 |
8497716 | Zhang | Jul 2013 | B2 |
8791733 | Tertinek | Jul 2014 | B2 |
9246500 | Perrott | Jan 2016 | B2 |
9264211 | Jenkins | Feb 2016 | B1 |
9344271 | Dusatko | May 2016 | B1 |
9490818 | Perrott | Nov 2016 | B2 |
9548750 | Tertinek | Jan 2017 | B2 |
9780945 | Avivi | Oct 2017 | B1 |
10419007 | Gao | Sep 2019 | B2 |
20110169578 | Lucas | Jul 2011 | A1 |
20140097875 | Tertinek et al. | Apr 2014 | A1 |
Entry |
---|
R. Adams and K. Q. Nguyen, “A 113-dB SNR oversampling DAC with segmented noise-shaped scrambling,” IEEE Journal of Solid-State Circuits, vol. 33, No. 12, pp. 1871-1878, Dec. 1998. |
M. Gupta and B. Song, “A 1.8-GHz Spur-Cancelled Fractional-N Frequency Synthesizer With LMS-Based DAC Gain Calibration,” IEEE Journal of Solid-State Circuits, vol. 41, No. 12, pp. 2842-2851, Dec. 2006. |
A. Swaminathan, K. J. Wang, and I. Galton, “A wide-bandwidth 2.4 GHz ISM band fractional-N PLL with adaptive phase noise cancellation,” IEEE J. Solid-State Circuits, vol. 42, pp. 2639-2650, Dec. 2007. |
C.-M. Hsu, M. Z. Straayer, and M. H. Perrott, “A low-noise wide-BW 3.6-GHz digital ΔΣ fractional-N frequency synthesizer with a noise-shaping time-to-digital converter and quantization noise cancellation,” IEEE J. Solid-State Circuits, vol. 43, No. 12, pp. 2776-2786, Dec. 2008. |
D. Tasca, M. Zanuso, G. Marzin, S. Levantino, C. Samori, and A. L. Lacaita, “A 2.9-4.0-GHz fractional-N digital PLL with bang-bang phase detector and 560-fs rms integrated jitter at 4.5-mW power,” IEEE J. Solid-State Circuits, vol. 46, No. 12, pp. 2745-2758, Dec. 2011. |
A. Elkholy, T. Anand, W. S. Choi, A. Elshazly, and P. K. Hanumolu, “A 3.7 mW low-noise wide-bandwidth 4.5 GHz digital fractional-N PLL using time amplifier-based TDC,” IEEE J. Solid-State Circuits, vol. 50, No. 4, pp. 867-881, Apr. 2015. |
R. Raich, Hua Qian and G. T. Zhou, “Orthogonal polynomials for power amplifier modeling and predistorter design”, IEEE Transactions on Vehicular Technology, vol. 53, No. 5, pp. 1468-1479, Sep. 2004. |
J. E. Cousseau, J. L. Figueroa, S. Werner and T. I. Laakso, “Efficient Nonlinear Wiener Model Identification Using a Complex-Valued Simplicial Canonical Piecewise Linear Filter”, IEEE Transactions on Signal Processing, vol. 55, No. 5, pp. 1780-1792, May 2007. |