1. Field of the Invention
The present invention relates to a method of estimating crosstalk noise in high-speed VLSI interconnects, and more particularly to a method of using moment computations of coupled RLC-tree models with uniform distributed lines and project-based model-order reduction techniques.
2. Description of Related Art
Modern technological trends have caused interconnect modeling to have attracted considerable attention in high-speed VLSI designs. Owing to these designs with performance considerations, increasing clock frequency, shorter rising times, higher density of wires, and using low-resistivity materials, on-chip inductance effects can no longer be ignored in interconnect models. Furthermore, the importance of coupling inductance effects has grown continuously since nanometer technology has emerged over the last few years. It has been observed that crosstalk noise estimations made by considering inductance effects may yield more pessimistic results than those made without considering coupling inductance effects, as discussed in C. K. Cheng, J. Lillis, S. Lin, and N. H. Chang, Interconnect Analysis and Synthesis, John Wiley and Sons Inc., 2000. Such estimation errors follow from two main reasons: (1) more and longer wires in parallel increase the capacitive coupling, leading to large current changes on the victim nets. (2) increasing self inductance worsens overshooting spikes on aggressor nets. It may then pull the victims with them. For the above practical considerations, interconnect models shall be extended to be coupled RLC trees while considering the inductance effects.
A common means of estimating crosstalk noise is implemented by simulating circuit-level VLSI interconnects. Although the results are very accurate, the computational complexity is excessive, especially for large-scale interconnect simulations. An alternative approach, called model-order reduction methods, has recently emerged to solve the problem, as disclosed in L. T. Pillage and R. A. Rohrer, “Asymptotic waveform evaluation for timing analysis,” IEEE Trans. Computer-Aided Design, vol. 9, no. 4, pp. 352-366, 1990; P. Feldmann and R. W. Freund, “Efficient linear circuit analysis by Pade approximation via the Lanczos process,” IEEE Trans. Computer-Aided Design, vol. 14, no. 5, 1995, and A. Odabasioglu, M. Celik, and L. T. Pileggi, “PRIMA: passive reduced-order interconnect macromodeling algorithm,” IEEE Trans. Computer-Aided Design, vol. 17, no. 8, pp. 645-653, 1998. Then, rather than directly estimating the crosstalk waveform of the original interconnects, the crosstalk noise of the reduced-order system is estimated. However, the computational cost is still too high for a noise optimization problem even though model-order reduction methods have reduced the cost, as disclosed in A. Devgan, “Efficient coupled noise estimatin for on-chip interconnects,” in Porc. ICCAD, 1997, pp. 147-151; M. Kuhlmann and S. S. Sapatnekar, “Exact and efficient crosstalk estimation,” IEEE Trans. Computer-Aided Design, vol. 20, no. 7, pp. 858-866, 2001.
A consensus has emerged that of many model-order reduction techniques, the moment matching approach seems to be the most viable for estimating interconnect crosstalk noise. For computational efficiency, traditional models for estimating noise in coupled RC trees have been developed, including the one-pole model (1P) (as disclosed in A. Vittal, L. H. Chen, M. Marek-Sadowska, K. P. Wang, and S. Yang, “Crosstalk in VLSI interconnects,” IEEE Trans. Computer-Aided Design, vol. 18, pp. 1817-1824, 1999; and A. Vittal and M. Marek-Sadowska, “Crosstalk reduction for VLSI,” IEEE Trans. Computer-Aided Design, vol. 16, pp. 290-298, 1997), the modified one-pole model (M1P) (as disclosed in Q. Yu and E. S. Kuh, “Moment computation of lumped and distributed coupled RC trees with application to delay and crosstalk estimation,” Proceedings of the IEEE, vol. 89, no. 5, pp. 772-788, 2001), the two-pole model (2P) (as discussed in M. Kuhlmann and S. S. Sapatnekar, “Exact and efficient crosstalk estimation,” IEEE Trans. Computer-Aided Design, vol. 20, no. 7, pp. 858-866, 2001.; and Q. Yu and E. S. Kuh, “Moment computation of lumped and distributed coupled RC trees with application to delay and crosstalk estimation,” Proceedings of the IEEE, vol. 89, no. 5, pp. 772-788, 2001), (as disclosed in E. Acar, A. Odabasioglu, M. Celik, and L. T. Pileggi, “S2P: A stable 2-pole RC delay and coupling noise metric,” in Proc. 9th Great Lakes Symp. VLSI, Mar. 1999, pp. 60-63), and the guaranteed stable three-pole model (S3P) (as discussed in Q. Yu and E. S. Kuh, “Moment computation of lumped and distributed coupled RC trees with application to delay and crosstalk estimation,” Proceedings of the IEEE, vol. 89, no. 5, pp. 772-788, 2001). Unlike the general model-order reduction methods, the techniques simply estimate the peak value of crosstalk noise and the time at which it peaks rather than evaluating the waveform of crosstalk noise. Also, other patents U.S. Pat. No. 5,481,695, U.S. Pat. No. 5,535,133, U.S. Pat. No. 5,555,506, U.S. Pat. No. 5,568,395, U.S. Pat. No. 5,596,506, U.S. Pat. No. 6,018,623, U.S. Pat. No. 6,029,117, and U.S. Pat. No. 6,405,348 have been disclosed the techniques about the crosstalk noise estimations. However, since the interconnect crosstalk noise may have a non-monotonic response waveform, these models seem to be unsuitable for capturing the essential nature of such crosstalk noise.
Recently, the delay and noise formulae by considering self inductances and mutual inductances have been disclosed in Y Cao, X. Huang, D. Sylvester, N. Chang, and C. Hu, “A new analytical delay and noise model for on-chip RLC interconnect,” in Proc. IEDM 2000, 2000, pp. 823-826. However, their model is restricted to two parallel lines. The analytical delay and overshooting formulae for coupled RLC lines have been disclosed in M. H. Chowdhury, Y. I. Ismail, C. V. Kashyap, and B. L. Krauter, “Performance analysis of deep sub micron VLSI circuits in the presence of self and mutual inductance,” in Proc. ISCAS 2002, 2002, pp. 197-200. However, issues concerning inductive crosstalk noise analysis have still not yet been studied. Furthermore, by exploring the special nature of RLC-tree structures, recursive algorithms for computing system moments with linear order have been developed, for example, by C. L. Ratzlaff and L. T. Pillage, “RICE: rapid interconnect circuit evaluation using AWE,” IEEE Trans. Computer-Aided Design, vol. 13, no. 6, pp. 763-776, 1994 and Q. Yu and E. S. Kuh, “Exact moment matching model of transmission lines and application to interconnect delay estimation,” IEEE Trans. VLSI syst., vol. 3, no. 2, pp. 311-322, 1995, independently. Moment models of general transmission lines were presented in Q. Yu, E. S. Kuh, and T. Xue, “Moment models of general transmission lines with application to interconnect analysis and optimization,” IEEE Trans. VLSI syst., vol. 4, no. 4, pp. 477-494, 1996. However, these studies did not mention moment computations for coupled RLC trees.
The technique, “Crosstalk estimated in high-speed VLSI interconnect using coupled RLC-tree models”, which is proposed in Proc. 2002 IEEE Asia Pacific Conference on Circuits and Systems, is our initial research. Although the moment computation formulae for coupled RLC trees have been developed, the technique about efficiently constructing the crosstalk estimation model was not provided. Also, the stability of the model was still not analyzed. In addition, the line model is represented as sections of lumped RLC circuits. In order to increase accuracy of the estimation results, the number of the sections must be increase. However, this will increase the memory space for storing data and simulation time.
The present invention discloses a method for efficiently estimating crosstalk noise of high-speed VLSI interconnects. In the invention, high-speed VLSI interconnects are modeled as RLC coupled trees with uniform distributed lines. The inductive crosstalk noise waveform can be accurately estimated in an efficient manner using the linear time recursive moment computation technique in conjunction with the projection-based order reduction method. Every moment of a line is represented as a polynomial of the coordinate of the line. Although a distributed line is a infinite order system in nature, the order of each moment polynomial can be shown finite. Notably, all of the coefficients of the polynomials of moments can be calculated recursively. Recursive moment computation formulas and a moment model for distributed coupling RLC-tree models are also developed. The efficiency and the accuracy of moment computation of distributed lines can be shown that outperform those of lumped ones. Crosstalk-metric models for distributed coupled RLC trees are also established. A stable reduced-order model will be constructed implicitly using the recursive moment computation technique in conjunction with the projection-based model-order reduction method.
Crosstalk noise estimations will be made by investigating the crosstalk noise of this reduced-order network.
Further benefits and advantages of the present invention will become apparent after a careful reading of the detailed description with appropriate reference to the accompanying drawings.
by using the moments obtained in step 22. Then {circumflex over (V)}(s) is reformulated by the pole-residue form
and the resultant time-domain crosstalk noise will be {circumflex over (v)}(t)=k1ep
The dynamics of RLC coupled trees can also be represented by the following MNA formula:
where (s{circumflex over (M)}d+{circumflex over (N)}d){circumflex over (X)}d(s)=AdVn(s) are the state equations of the reduced-order model of distributed lines; Vn(s) and IL(s) represent the node voltage vector and R-L branch current vector; the matrices R, L, G, and C include lumped resistors, inductors, conductors, and capacitors; and Ad, Ai, and As are incidence matrices.
Let vector Xk=[{circumflex over (X)}d,k Vn,k IL,k]T be the kth-order moment of X(s) about s=0. If Q=└X0 X1 . . . Xq−1┘ is used as the congruence transform matrix, then the MNA matrices of the reduced-order model are obtained: {circumflex over (M)}=QTMQ and {circumflex over (N)}=QTNQ. Thus, the kth-row and the lth-column entry of {circumflex over (N)} and {circumflex over (M)} become {circumflex over (n)}k,l=Xk−1TNXl−1 and {circumflex over (m)}k,l=Xk−1TMXl−1, respectively. Since the recursive moment formula implies
NX0=B
NXi+1=−MXi, for i=0, 1, . . . q
{circumflex over (n)}k,l will be
where Vn,k−1TCVn,l−2 and IL,k−1TLIL,l−2, related to lumped circuits, can be calculated by the technique: “Crosstalk estimated in high-speed VLSI interconnect using coupled RLC-tree models”, which is proposed in Proc. 2002 IEEE Asia Pacific Conference on Circuits and Systems by H. J. Lee, C. C. Chu, and W. S. Feng. For {circumflex over (X)}d,k−1T{circumflex over (M)}d{circumflex over (X)}d,l−2, the details will be shown as follows.
Moment Computations for Coupled RLC-Tree Interconnect Models
A set of coupled RLC trees contains several individual RLC trees with capacitive and inductive couplings to each other. Each RLC tree comprises floating resistors and self inductors from the ground and capacitors connecting between nodes on the tree and the ground. A tree with a voltage source connected to its root is called an aggressor tree; by contrast, trees whose roots are grounding are called victim trees. By ignoring self inductances and mutual inductances, the conventional coupled RC-tree models are obtained. In this invention, coupled interconnects are modeled as coupled RLC trees for analyzing the crosstalk noises.
To clearly describe the complex coupled RLC-tree structures, the invention first introduces the notation. Consider a typical section of tree Ti in coupled RLC trees shown in
Moment Model of coupled RLC Lines
Let Vji(x,s), Iji(x,s), and Ic
where CjTi=Cji+Σcc
where Eqs. (5) and (6) are derived from Kirchhoff's current law and Eq. (7) is derived from Kirchhoff's voltage law.
In this invention, IC
If k=0 , zeroth-order polynomial IC
Integrating IC
Substituting Eq. (10) into Eq. (7) concludes that
The relations between the order of polynomials m and p and the order of moments k can be found from Eqs. (9) and (11). First, m=0 when k=1. Second, from the third part of Eq. (11), we have p=m+2. Finally, from Eq. (9), the order of polynomial IC
After the coefficients αs and βs are computed, the moment model of coupled RLC lines can be established. From Eq. (6), setting x=0, we obtain
Ij,ki(0)=Ij,ki(1)+Jj,ki (12)
where
representing the summation of the capacitive currents of Lineji. Moreover, Eq. (7) can be rewritten
Ej,ki and ELM
Inserting Coupled Lines into Coupled RLC Trees
This invention-uses a function, Line(nji), to indicate a line connected between nodes nji and F(nji), Line(nji)=1, or not, Line(nji)=0. Thus the kth-order current moment Ij,ki(1) at x=1 can be obtained as below:
Next, the voltage moment Vj,ki is also considered. First, the added current source induces further voltage drops through resistances and inductances at all nodes A(nji), the set of ancestor nodes of nji. Second, the voltage source Ej,ki cause voltage drops only between Vj,ki and VF(j),ki. Thus Vj,ki can be obtained as follows:
Finally, the recursive moment computation formulae can be obtained by incorporating the moment model of coupled distributed lines. From Eq. (17), the recursive relationship of the kth-order current moments is given as
Each current moment can be calculated from the leaves to the root of tree Ti. Considering the relationships between voltage moments Vj,ki and VF(j),ki,k Eq. (18) can be rewritten
The computational complexity of the proposed recursive formulas of distributed models is O(nk2 ), where n is the number of nodes in the tree. On the other hand, that of lumped models equals O(mk), where m is the number of nodes in the tree. In general, $m$ must be larger than nk so that the lumped model can be enough accurate.
Using the Moment Information to Construct Crosstalk Noise Estimation Model
For {circumflex over (X)}d,k−1T{circumflex over (M)}d{circumflex over (X)}d,l−2 in Eq. (2), Proposition 1 shows the results.
Proposition 1: The term {circumflex over (X)}d,k−1T{circumflex over (M)}d{circumflex over (X)}d,l−2 can be calculated by evaluating the contribution of each distributed line. The contributions of a line Lineji are
where bj,ni is the coefficient of the term zn in the polynomial Ij,k−1i(z)ELM
Moreover, subtle relationships between entries of {circumflex over (M)} and {circumflex over (N)}, and simplification of {circumflex over (N)} are summarized in Proposition 2.
Proposition 2: Let matrices {circumflex over (M)} and {circumflex over (N)} be the MNA matrices for the reduced-order model that are generated by the congruence transformation Q, where Q=└X0 X1 . . . Xq−1┘. Thus entries of {circumflex over (M)} and {circumflex over (N)} have the following subtle relationships:
Let V(s) and {circumflex over (V)}(s)be the step responses of the original model and the desired reduced-order model, respectively. Technique disclosed in M. Kuhlmann and S. S. Sapatnekar, “Exact and efficient crosstalk estimation,” IEEE Trans. Computer-Aided Design, vol. 20, no. 7, pp. 858-866, 2001, suggested an appropriate formula of the q -pole reduced-order model {circumflex over (V)}(s) as follows (as in step 24):
which causes the approximate crosstalk voltage {circumflex over (v)}(t) to be zero for t converging to 0 and ∞. Conventional moment matching techniques are often used to solve the unknown coefficients ai(0≦i≦q−2) and bj(1≦j≦q) by using the front 2q−1 moments {V1V2, . . . V2q−1} of the original model:
V(s)=V1+V2s+V3s2+ . . . +V2q−1s2q−2+ . . .
In step 26, Eq. (24) can also be rewritten as the pole-residue form
where pi for i=1,2, . . . ,q are poles of {circumflex over (V)}(s) and each ki is the residue corresponding to the pole pi. By applying the inverse Laplace transformation, we have
{circumflex over (v)}(t)=k1ep
In step 28, the peak value of the crosstalk waveform will occur at time t=tm where {circumflex over (v)}′(tm)=0 and {circumflex over (v)}″(tm)<0.
In the previous moment computations, the input waveform is assumed to be a step function. However, the input signal in step 10 may be with an arbitrary waveform. Let the updated v(s) be
V(s)=m1′s+m2′s2+m3′s3+m4′s4+m5′s5 (26)
For example, suppose that the input signal is a ramp function as follows:
where u(t) is a step function and 1/τ is the slope of the ramp function. Applying the Laplace transform, we have
Comparing the coefficients of Eqs. (26) and (27) concludes
To verify the accuracy of the proposed method, a coupling circuit but not limiting examples shown in
The conventional one-pole model (1P) and two-pole model (2P) and our new method with three-pole model (S3P), . . . , six-pole model (S6P) are investigated for comparison studies. Absolute and relative errors of crosstalk peak values in comparison with HSPICE simulation results are summarized in Tables 1 Among the 560 cases, model 1P has unstable poles in 48 cases and model 2P is unstable in 98 cases. Table 2 summarizes the computational times of moments and the relative errors of model S6P generated by lumped couple RLC-tree models with different number of sections per 1 mm long, to compare the efficiency and the accuracy between distributed lines and lumped ones.
From simulation results, we have the following observations:
1. The models generated by the proposed method outperform the conventional 1P and 2P models. Thus these conventional models are no longer appropriate for coupled RLC trees. Increasing the order of the reduced-order models allows the proposed models perform more accurately.
2. From the viewpoints of the absolute errors in Table1, model S3P, whose average. errors are smaller than 10%, seems acceptable for estimating crosstalk noise. However, the relative errors imply that model S3P seems not accurate as expected. Model S6P is recommended to balance computational efficiency and estimation performance.
3. The computational time of moments and the relative errors of Model S6P yielded by the proposed method are 8.27 seconds and 5.15%, respectively. Table 2 displays that it must cost more than 271.96 seconds to obtain the same relative error by using lumped coupled RLC lines. Obviously, the efficiency and the accuracy of distributed models outperform those of lumped models.