1. Field of the Invention
The present invention relates to a method of estimating crosstalk noise in high-speed VLSI interconnects, and more particularly, to a method of using moment computations of lumped coupled RLC-tree models and project-based model-order reduction techniques.
2. Description of Related Art
Modern technological trends have caused interconnect modeling to have attracted considerable attention in high-speed VLSI designs. Owing to these designs with performance considerations, increasing clock frequency, shorter rising times, higher density of wires, and using low-resistivity materials, on-chip inductance effects can no longer be ignored in interconnect models. Furthermore, the importance of coupling inductance effects has grown continuously since nanometer technology has emerged over the last few years. It has been observed that crosstalk noise estimations made by considering inductance effects may yield more pessimistic results than those made without considering coupling inductance effects, as discussed in C. K. Cheng, J. Lillis, S. Lin, and N. H. Chang, Interconnect Analysis and Synthesis, John Wiley and Sons Inc., 2000. Such estimation errors follow from two main reasons: (1) more and longer wires in parallel increase the capacitive coupling, leading to large current changes on the victim nets. (2) increasing self inductance worsens overshooting spikes on aggressor nets. It may then pull the victims with them. For the above practical considerations, interconnect models shall be extended to be coupled RLC trees while considering the inductance effects.
A common means of estimating crosstalk noise is implemented by simulating circuit-level VLSI interconnects. Although the results are very accurate, the computational complexity is excessive, especially for large-scale interconnect simulations. An alternative approach, called model-order reduction methods, has recently emerged to solve the problem, as disclosed in L. T. Pillage and R. A. Rohrer, “Asymptotic waveform evaluation for timing analysis,” IEEE Trans. Computer-Aided Design, vol. 9, no. 4, pp. 352-366, 1990; P. Feldmann and R. W. Freund, “Efficient linear circuit analysis by Pade approximation via the Lanczos process,” IEEE Trans. Computer-Aided Design, vol. 14, no. 5, 1995, and A. Odabasioglu, M. Celik, and L. T. Pileggi, “PRIMA: passive reduced-order interconnect macromodeling algorithm,” IEEE Trans. Computer-Aided Design, vol. 17, no. 8, pp. 645-653, 1998. Then, rather than directly estimating the crosstalk waveform of the original interconnects, the crosstalk noise of the reduced-order system is estimated. However, the computational cost is still too high for a noise optimization problem even though model-order reduction methods have reduced the cost, as disclosed in A. Devgan, “Efficient coupled noise estimatin for on-chip interconnects,” in Porc. ICCAD, 1997, pp. 147-151; M. Kuhlmann and S. S. Sapatnekar, “Exact and efficient crosstalk estimation,” IEEE Trans. Computer-Aided Design, vol. 20, no. 7, pp. 858-866, 2001.
A consensus has emerged that of many model-order reduction techniques, the moment matching approach seems to be the most viable for estimating interconnect crosstalk noise. For computational efficiency, traditional models for estimating noise in coupled RC trees have been developed, including the one-pole model (1P) (as disclosed in A. Vittal, L. H. Chen, M. Marek-Sadowska, K. P. Wang, and S. Yang, “Crosstalk in VLSI interconnects,” IEEE Trans. Computer-Aided Design, vol. 18, pp. 1817-1824, 1999; and A. Vittal and M. Marek-Sadowska, “Crosstalk reduction for VLSI,” IEEE Trans. Computer-Aided Design, vol. 16, pp. 290-298, 1997), the modified one-pole model (M1P) (as disclosed in Q. Yu and E. S. Kuh, “Moment computation of lumped and distributed coupled RC trees with application to delay and crosstalk estimation,” Proceedings of the IEEE, vol. 89, no. 5, pp. 772-788, 2001), the two-pole model (2P) (as discussed in M. Kuhlmann and S. S. Sapatnekar, “Exact and efficient crosstalk estimation,” IEEE Trans. Computer-Aided Design, vol. 20, no. 7, pp. 858-866, 2001; and Q. Yu and E. S. Kuh, “Moment computation of lumped and distributed coupled RC trees with application to delay and crosstalk estimation,” Proceedings of the IEEE, vol. 89, no. 5, pp. 772-788, 2001), (as disclosed in E. Acar, A. Odabasioglu, M. Celik, and L. T. Pileggi, “S2P: A stable 2-pole RC delay and coupling noise metric,” in Proc. 9th Great Lakes Symp. VLSI, March 1999, pp. 60-63), and the guaranteed stable three-pole model (S3P) (as discussed in Q. Yu and E. S. Kuh, “Moment computation of lumped and distributed coupled RC trees with application to delay and crosstalk estimation,” Proceedings of the IEEE, vol. 89, no. 5, pp. 772-788, 2001). Unlike the general model-order reduction methods, the techniques simply estimate the peak value of crosstalk noise and the time at which it peaks rather than evaluating the waveform of crosstalk noise. Also, other patents U.S. Pat. No. 5,481,695, U.S. Pat. No. 5,535,133, U.S. Pat. No. 5,555,506, U.S. Pat. No. 5,568,395, U.S. Pat. No. 5,596,506, U.S. Pat. No. 6,018,623, U.S. Pat. No. 6,029,117, and U.S. Pat. No. 6,405,348 have been disclosed the techniques about the crosstalk noise estimations. However, since the interconnect crosstalk noise may have a non-monotonic response waveform, these models seem to be unsuitable for capturing the essential nature of such crosstalk noise.
Recently, the delay and noise formulae by considering self inductances and mutual inductances have been disclosed in Y. Cao, X. Huang, D. Sylvester, N. Chang, and C. Hu, “A new analytical delay and noise model for on-chip RLC interconnect,” in Proc. IEDM 2000, 2000, pp. 823-826. However, their model is restricted to two parallel lines. The analytical delay and overshooting formulae for coupled RLC lines have been disclosed in M. H. Chowdhury, Y. I. Ismail, C. V. Kashyap, and B. L. Krauter, “Performance analysis of deep sub micron VLSI circuits in the presence of self and mutual inductance,” in Proc. ISCAS 2002, 2002, pp. 197-200. However, issues concerning inductive crosstalk noise analysis have still not yet been studied. Furthermore, by exploring the special nature of RLC-tree structures, recursive algorithms for computing system moments with linear order have been developed, for example, by C. L. Ratzlaff and L. T. Pillage, “RICE: rapid interconnect circuit evaluation using AWE,” IEEE Trans. Computer-Aided Design, vol. 13, no. 6, pp. 763-776, 1994 and Q. Yu and E. S. Kuh, “Exact moment matching model of transmission lines and application to interconnect delay estimation,” IEEE Trans. VLSI syst., vol. 3, no. 2, pp. 311-322, 1995, independently. Moment models of general transmission lines were presented in Q. Yu, E. S. Kuh, and T. Xue, “Moment models of general transmission lines with application to interconnect analysis and optimization,” IEEE Trans. VLSI syst., vol. 4, no. 4, pp. 477-494, 1996. However, these studies did not mention moment computations for coupled RLC trees.
The technique, “Crosstalk estimated in high-speed VLSI interconnect using coupled RLC-tree models”, which is proposed in Proc. 2002 IEEE Asia Pacific Conference on Circuits and Systems, is our initial research. Although the moment computation formulae for coupled RLC trees have been developed, the technique about efficiently constructing the crosstalk estimation model was not provided. Also, the stability of the model was still not analyzed.
The present invention discloses a method for efficiently estimating crosstalk noise of high-speed VLSI interconnects. In the invention, high-speed VLSI interconnects are modeled as RLC coupled trees. The inductive crosstalk noise waveform can be accurately estimated in an efficient manner using the linear time recursive moment computation technique in conjunction with the projection-based order reduction method. Recursive formulas of moment computations for coupled RC trees are derived with considering both self inductances and mutual inductances.
The present invention discloses a method for efficiently estimating crosstalk noise of high-speed VLSI interconnects. In the invention, high-speed VLSI interconnects are modeled as RLC coupled trees. The inductive crosstalk noise waveform can be accurately estimated in an efficient manner using the linear time recursive moment computation technique in conjunction with the projection-based order reduction method. Recursive formulas of moment computations for coupled RC trees are derived with considering both self inductances and mutual inductances.
Further benefits and advantages of the present invention will become apparent after a careful reading of the detailed description with appropriate reference to the accompanying drawings.
by using the moments obtained in step 22. Then {circumflex over (V)}(s) is reformulated by the pole-residue form
and the resultant time-domain crosstalk noise will be {circumflex over (v)}(t)=k1ep
The dynamics of RLC coupled trees can also be represented by the following MNA formula:
where M,NεRn×n, BεRn×m, and PεRn×p, The matrix M
contains the capacitance matrix C and the inductance matrix L; the matrix N comprises the conductance matrix G and the resistance matrix R and the incident matrix E. The matrix X(s) represents the transfer functions of the state variables. The matrix B=[b1 b2 . . . bm] indicates that the circuit has m aggressor trees and each bi(1≦i≦m) represents the contribution of the ith independent voltage source. V(s)εR(n/2)×m denotes voltages across the grounding capacitances and I(s)εR(n/2)×m contains currents flowing through R-L branches of the aggressor and victim trees. Since the number of nodes and that of R-L branches are equal, n/2 is a positive integer number. Y(s) stands for the transfer functions at the far end nodes of the victim trees chosen from X(s) by P. For simplicity, it is assumed that the circuit has only one aggressor tree and once a victim tree is concerned, m=p=1.
If we expand X(s) in power series, X(s)=ΣiXisi, the ith-order moment of X(s) about s=0, Xi, can be obtained. With the aid of Eq. (1), the recursive formula for moment Xi can be established as follows:
NX0=B
NXi+1=−MXi, for i=0, 1, . . . , q (2)
In particular, for special lumped RLC-tree structures, efficient recursive moment computation formulae are disclosed in the invention to neglect the above expensive matrix computations. The details will be shown as follows.
Moment Computations for Lumped Coupled RLC-Tree Interconnect Models
A set of coupled RLC trees contains several individual RLC trees with capacitive and inductive couplings to each other. Each RLC tree comprises floating resistors and self inductors from the ground and capacitors connecting between nodes on the tree and the ground. A lumped RLC-tree model excludes transmission lines, couplings, and resistor loops. Each transmission line should be approximated by lumped RLC circuits with a sufficiently large number of RLC segments. A tree with a voltage source connected to its root is called an aggressor tree; by contrast, trees whose roots are grounding are called victim trees. By ignoring self inductances and mutual inductances, the conventional coupled RC-tree models are obtained. In this invention, coupled interconnects are modeled as coupled RLC trees for analyzing the crosstalk noises.
To clearly describe the complex coupled RLC-tree structures, the invention first introduce the notations. Consider a typical section of tree Ti in coupled RLC trees shown in
Each root node has only one son node. Rji and Lji are the resistance and the inductance connected between nji and F(nji); Cj,0i is the capacitance connected between nji and the ground.
denotes the coupling capacitance between nji and nj
Let Vji(s) be the transfer function of the voltage at node nji and Iji(s) be the one of the current flowing through Rji. In particular, V0i(s) represents the voltage at root n0i, where V0i(s)=1 means that a voltage source connected between n0i and the ground and otherwise V0i(s)=0. By expanding Vji(s) and Iji(s) in power series, we have
Vj,ki is called the kth-order voltage moment of Vji(s) and Ij,ki is called the kth-order current moment of Iji(s). The conventional Elmore delay of nji is defined as the first-order voltage moment −Vj,1i. The aim of this section is to compute moments Vj,ki and Ij,ki for each node nji with a given order k.
For a capacitor C, owing to a capacitive current IC(s)=sCV(s), its zeroth-order current moment IC,0 is equal to zero and the kth-order (k>0) current moment IC,k=CVk−1. This implies that the capacitor is equivalent to an open circuit if k=0 or otherwise a current source. For example, consider the circuit with two grounding capacitors and a coupling capacitor in
As a result, the coupling capacitor can be interpreted as two moment current sources. With extensions to multiple coupling capacitors, each decoupled current moment model can be derived as
The kth-order current moment Ij,ki can be obtained by summing up all of the downstream kth-order capacitive current sources:
In particular, k=0 implies the zero dc current case. The Zeroth-order voltage moment Vj,0i at each node nji is equal to moment V0,0i at the root node.
In contrast to the moment model of a capacitor, a self inductor L behaves as a short circuit or a voltage source. Suppose that VL(s)=sLI(s). Then VL,0=0 and VL,k=LIk−1 for k>0. Referring to the two coupled R-L branches illustrated in
By expanding each voltage and current about s=0 and collecting the coefficients of sk, the kth-order voltage moments can be obtained:
as shown in
By repeatedly substituting Eq. (7) upstream, the kth-order (k>0) moment Vj,ki of the non-root voltage Vji(s) (j≠0) can be obtained as follows:
If the common path Ppji between nodes npi and nji is concerned for each npiεNi, the above equation can be rewritten as
As a result, the analytical formula for voltage moments at each node can be established as follows:
Now the invention considers the rules of the signs of the voltage moments for coupled RLC trees. As indicated in Q. Yu and E. S. Kuh, “Moment computation of lumped and distributed coupled RC trees with application to delay and crosstalk estimation,” Proceedings of the IEEE, vol. 89, no. 5, pp. 772-788, 2001, the sign of Vj,ki for coupled RC trees can be determined by that of Vj,k−1i and the directions of current moments through coupling capacitances. However, the signs of voltage moments for the coupled RLC-tree models will be more complex than those for coupled RC-tree models. That is, Vj,k−1i in Eq. (10) may depend not only on Vj,k−1i but also on Vj,k−2i and self/mutual inductances. The inductive components, i.e., VLC
Proposition 1: For any node on coupled RLC trees, the signs of the voltage moments have the following relationships:
The proposition will be applied to analyze the stability of the reduced-order models of coupled RLC trees.
Stability of reduced-order models can not be guaranteed for general RLC circuits, even for the simplest models (i.e., one-pole model and two-pole model). For example, an one-pole model (see A. Vittal, L. H. Chen, M. Marek-Sadowska, K. P. Wang, and S. Yang, “Crosstalk in VLSI interconnects,” IEEE Trans. Computer-Aided Design, vol. 18, pp. 1817-1824, 1999) can be represented as follows:
After simple manipulations, we have a0=V1, b1=−V2/V1, and the pole is p1=V1/V2. From Proposition 1, we have known that V1 is positive but V2 may be nonnegative due to significant inductive effects. Thus the one-pole model may be unstable.
We will derive recursive formulas for computing the kth-order moment at node nji from the kth-order moment at the corresponding node F(nji). The recursive processes will be proceeded by calculating both current moments Ij,ki and voltage moments Vj,ki recursively. First, by accumulating capacitive current sources at each level of {circumflex over (D)}(nji), the set of descendant nodes of nji, Eq. (6) can be rewritten
Thus each kth-order current moment can be calculated upstream from leaves of the coupled RLC trees to their roots by using the (k−1)st-order voltage moments in Eq. (5).
Similarly, the kth-order moment Vj,ki can be derived from Eq. (7)
As shown
Crosstalk Noise Estimations for Coupled RLC Trees
The purpose of crosstalk estimations is to solve the peak voltage value at the far end node in the victim tree efficiently and accurately. As interconnects are typically of very large size and high-order, model-order reduction is a necessity for efficient crosstalk estimations. In this section, we will utilize moment matching techniques to establish a reduced-order system first. Then we will use the reduced-order system to estimate the crosstalk metric.
In order to overcome the stability issue of the qth-order reduced-order model, here the projection-based model-order reduction algorithms are recommended. A guaranteed stable reduced-order model will be generated for crosstalk noise estimations. By applying the congruence transformation, the original n-dimensional state vector can be projected to a reduced q-dimensional one, where q<<n.
Step 18 aims to establish a stable reduced-order model
({circumflex over (N)}+s{circumflex over (M)}){circumflex over (X)}(s)={circumflex over (B)}.
The congruence transformation Q is used to project the n-dimensional original state vector to a reduced-order q-dimensional one: {circumflex over (X)}=QTX. Thus, we have the following MNA matrices for the reduced-order model:
{circumflex over (N)}=QTNQ, {circumflex over (M)}=QTMQ, {circumflex over (B)}=QTB and {circumflex over (P)}=QTP
where {circumflex over (M)},{circumflex over (N)}εRq×q and {circumflex over (X)},{circumflex over (B)},{circumflex over (P)},εRq. {circumflex over (X)}i, the ith-order moment of the reduced-order network {circumflex over (X)}(s), can also be defined. The conventional moment matching technique implies that if each moment {circumflex over (X)}i lies in the column space of Q, the first q moments of {circumflex over (X)}(s) will indeed be equal to those of X(s), as disclosed in J. M. Wang, C. C. Chu, Q. Yu, and E. S. Kuh, “On projection-based algorithms for model-order reduction of interconnects,” IEEE Trans. Circuits Syst. I, vol. 49, no. 11, pp. 1563-1585, 2002. That is, {circumflex over (X)}i={circumflex over (X)}i for i=0, 1, . . . , q−1. In particular, if each column Qi in Q is equal to the system moment Xi−1 (i.e., Q=└X0 X1 . . . Xq−1┘), each entry of {circumflex over (M)} and {circumflex over (N)} can be described as follows:
{circumflex over (m)}ij=Xi−1TMXj−1 and {circumflex over (n)}ij=Xi−1TNXj−1. (13)
Entries of {circumflex over (M)} and {circumflex over (N)} have subtle relationships, which are summarized in the following proposition.
Proposition 2: Let matrices {circumflex over (M)} and {circumflex over (N)} be the MNA matrices for the reduced-order model that are generated by the congruence transformation Q, where Q=└X0 X1 . . . Xq−1┘. Thus entries of {circumflex over (M)} and {circumflex over (N)} have the following subtle relationships: 1. {circumflex over (m)}ij=−Xi−1TNXj={circumflex over (n)}i,j+1 and 2. mij=Xj−1TMXi−1=−Xj−1TNXi=−{circumflex over (n)}j,i+1.
Therefore, except for {circumflex over (m)}kk, all other entries of {circumflex over (M)} can be obtained directly from {circumflex over (N)}. The remaining task is to calculate each entry in {circumflex over (N)} and the entry {circumflex over (m)}kk. By exploring symmetric characteristics of matrix M, the entry {circumflex over (n)}ij can be calculated by using Eq. (2). Suppose that Xj−2=[Vj−2T Ij−2T]T and MXj−2=[Vj−2TC Ij−2TL]T, where vector Vj−2 and vector Ij−2 are the (j−2)nd-order moment of V(s) and I(s). CVj−2 and LIj−2 can be calculated easily. For general cases, it can be derived that
It is worthy of mentioning that these moments are intermediates in Algorithm 1 without any additional costs. Therefore, all entries in {circumflex over (N)} and {circumflex over (m)}kk can be calculated by multiplying the corresponding moment vectors rather than constructing M and N explicitly.
Further simplifications about entries in the matrix {circumflex over (N)} are still possible. The following proposition presents this result.
Proposition 3: With the same conditions as Proposition 2, entries in the first column and the first row of matrices {circumflex over (N)} have the relationships shown as below:
By exploring symmetric characteristic of matrix M, it is straightforward to see that entries in matrix {circumflex over (N)} can also be related as follows:
For illustrational purpose, let symbol ◯ represent the entries of matrix {circumflex over (N)} that need to be calculated additionally using Algorithm 1, and symbol X denote the entries of matrix {circumflex over (N)} that can be simplified by Proposition 3 and Corollary 1. Entries in {circumflex over (N)} can be displayed as follows:
Thus the number of entries that need to be calculated in a q×q matrix {circumflex over (N)} can be counted as follows:
Let V(s) and {circumflex over (V)}(s) be the step responses of the original model and the desired reduced-order model, respectively. Technique disclosed in M. Kuhlmann and S. S. Sapatnekar, “Exact and efficient crosstalk estimation,” IEEE Trans. Computer-Aided Design, vol. 20, no. 7, pp. 858-866, 2001, suggested an appropriate formula of the q-pole reduced-order model {circumflex over (V)}(s) as follows (as in step 24):
which causes the approximate crosstalk voltage {circumflex over (v)}(t) to be zero for t converging to 0 and ∞. Conventional moment matching techniques are often used to solve the unknown coefficients ai(0≦i≦q−2) and bj(1≦j≦q) by using the front 2q−1 moments {V1, V2, . . . , V2q−1} of the original model:
V(s)=V1+V2s+V3s2+ . . . +V2q−1s2q−2+ . . . ,
In step 26, Eq. (14) can also be rewritten as the pole-residue form
where pi for i=1, 2, . . . , q are poles of {circumflex over (V)}(s) and each ki is the residue corresponding to the pole pi. By applying the inverse Laplace transformation, we have
{circumflex over (v)}(t)=k1ep
In step 28, the peak value of the crosstalk waveform will occur at time t=tm where {circumflex over (v)}′(tm)=0 and {circumflex over (v)}n(tm)<0.
In the previous moment computations, the input waveform is assumed to be a step function. However, the input signal in step 10 may be with an arbitrary waveform. Let the updated V(s) be
V(s)=m1′s+m2′s2+m3′s3+m4′s4+m5′s5 . . . . (16)
For example, suppose that the input signal is a ramp function as follows:
v(t)=t/τu(t)−t/τu(t−τ)+u(t−τ),
where u(t) is a step function and 1/τ is the slope of the ramp function.
Applying the Laplace transform, we have
Comparing the coefficients of Eqs. (16) and (17) concludes
To verify the accuracy of the proposed method, three coupling circuits but not limiting examples shown in
The conventional one-pole model (1P) and two-pole model (2P) and our new method with three-pole model (S3P), four-pole model (S4P), . . . , eight-pole model (S8P) are investigated for comparison studies. Absolute and relative errors of crosstalk peak values in comparison with HSPICE simulation results are summarized in Tables 1 and 2. The corresponding absolute relative errors of crosstalk noise peak value occurrence time are summarized in Tables 3 and 4. Each entry from row 1 to row 24 represents the average error of the cases for different lengths of net 1 and net 2 under a condition of a specific rising time and driver impedance. The entries in the last three rows of each model stand for the maximum, average, and minimum error of the 24 sets in the table. Among 1640 cases, there are 42 cases in which model 1P has unstable poles and 18 cases in which model 2P is unstable. From simulation results, we have the following observations:
1. The models generated by our method perform better than the conventional 1P and 2P models. Thus these conventional models are no longer appropriated for coupled RLC trees. With increasing order of reduced-order moldels, the proposed models perform more accurately.
2. From the viewpoints of the absolute error values in Tables 1 and 3, model S3P, whose average errors are smaller than 10%, seems acceptable to crosstalk noise estimations. However, referring to the relative error values in Tables 2 and 4, model S3P seems not accurate as expected. For balancing computational efficiency and estimation performance, the S6P model will be recommended.
3. With increasing the effective driver impedance and rising time, errors of each model seem decreasing.
Although the invention has been explained in relation to its preferred embodiment, it is to be understood that many other possible modifications and variations can be made without departing from the spirit and scope of the invention as hereinafter claimed.