The present invention relates to integrated circuit design, and more particularly to a design timing verification tool that is capable of handling the non-Gaussian variation effects to the second order for both gate and interconnect on Statistical Static Timing Analysis (SSTA) for multi-phase sequential circuit.
It has been quite well-known that due to the complexity of integrated circuits, it is not possible to apply a finite set of test vectors to verify the circuit delays for all the critical paths in the chip. Therefore, the vector-less approach or Static Timing Analysis (STA) is used widely to verify the timing of chip for each corner case with fixed gate delays. However, in today's nanometer process technology the manufacturing process variation plays a significant role in circuit delays, making SSTA increasingly important. Furthermore, interconnect part can be as dominant as the gate part in today's technology. Therefore, the issue of including process variation effect in the study of interconnect becomes extremely important.
SSTA is formalized in a way very similar to that of STA in terms of path tracing algorithm, but with quite a few differences. In STA, the gate delay is a fixed number, while in SSTA the delay is expressed as a random variable with some probability distribution function (PDF). In STA the critical paths are presented to the users based on slack which is the delay exceeding the timing limit. However, SSTA uses slack which means the probability of the occurrence of the critical path exceeding the use-specified threshold in terms of probability. In the presence of multi-phase sequential elements the timing constraints become complicated in the sense that the timing constraint is not uniform for all the paths, thus complicating the path tracing algorithms based on the breadth first search approach. Some specific algorithm in STA has been described in detail by Chang before. For example in STA the latest arrival time with respect to clock phases are stored at each node, while in SSTA the node stored accumulated probability. The path tracing procedure in terms of probability used in this invention is different from algorithm used by Visweswariah.
The process of calculating the aforementioned accumulated probability in SSTA at each node can be much more involved than that of storing the latest arrival time in STA. In SSTA, first we need to store the latest arrival time by using the max operation for all arrival times from each input of the gate with all the arrival times being expressed by random variables. Even we start with Gaussian distribution after max operation the result becomes non-Gaussian. Therefore, we need to handle max operation for Non-Gaussian distribution. If a random variable, is the linear combination of several Gaussian variables, then this random variable is Gaussian. To include non-Gaussian behavior, this random variable needs to be expressed as a sum of each Gaussian random variables to the second order. Zhan has proposed an algorithm to handle max operation with non-Gaussian distributions. The concept of edge probability can be understood as follows. For gate C with two inputs A and B, we use the notation that A is the random variable of the latest arrival time at the output of gate C for signal propagating from A to the output of gate C, and random variable B is defined similarly. For gate C with two inputs A and B the edge probability from A to output of C is Prob(A>B) meaning the probability of the latest signal at gate output coming from A by taking the integral of the PDF of Prob(A>B). Prob(B>A) then follows from 1-Prob(A>B). The accumulated probability at the given node then is evaluated by taking the largest of the products of edge probability for each gate along the path reaching this said node. We have adopted the algorithm of carrying out breadth first search to achieve these aforementioned accumulated probability for each node and then using depth first traversal to trace backward recursively to generate the critical paths with probability of occurrence greater than a specified probability threshold.
In SSTA with non-Gaussian delay distribution, the procedure of pre-characterizing the timing library to quadratic order of the Gaussian random variables is quite different from that of STA. The timing library in STA stores the gate delay as a function of gate input slope and gate output loading. In SSTA, this timing library stores the coefficients which are random variables in terms of sum of several Gaussian random variables up to quadratic order, while in STA the said coefficients are merely numbers. In evaluating the non-Gaussian delay distribution function use slope and output load as random variables up to quadratic order for the Gaussian random variables, and the coefficients from the equation also to the second order.
The process of calculating the aforementioned accumulated probability in SSTA at each node can be further complicated in the presence of interconnect. In STA the effective capacitance is already quite well-known. In this approach the gate driving the interconnect part is decomposed into two stages, the first stage is the driving gate with the effective capacitance and the output of the immediate output of the driving gate serves as the driving point of the second stage, namely the interconnect. Therefore, driving point admittance matrix and voltage transfer from the driving point to interconnect outputs are needed to calculate the delay from the input of the gate to both the immediate output of the gate and the interconnect outputs. A mathematical algorithm is presented to calculate admittance matrix and voltage transfer to the second order in powers of those Gaussian random variables due to manufacturing process variation. Subsequently, all the poles and residues for both admittance matrix and voltage transfer are also expressed to the second order in powers of Gaussian random variables. Thus, voltage wave function also expressed as random variable is obtained and delay distribution can be obtained.
In the case of cross-talk, effective capacitance approach by using the admittance matrix with poles and residues with variations up to the second order is also adopted. The effective capacitance of each gate by taking into account of the shift time of aggressor's input with respect to the victim's input are calculated and using admittance matrix with poles and residues with variations up to the second order. As to the victim delays from the immediate output of the input driver to the victim outputs, they are calculated by making use voltage transfer up to the 2nd order variation effect. Again, in SSTA, all of these delay quantities are treated as non-Gaussian random variables.
Finally, with all delay information in place, path search is continued until all critical paths in terms of probability are identified.
This invention provides a tool and a method for performing SSTA of a circuit with multi-phase sequential elements with interconnect, consisting of using path analysis including both forward bread first search and backward depth first traversal considering constraint with multi-phase sequential elements to generate critical paths in term of probability of path occurrence, considering gate and interconnect delays with non-Gaussian variation up to quadratic order in the path analysis, handling cross-talk issue with non-Gaussian variation up to quadratic order in the path analysis, and characterizing cells with non-Gaussian variation up to quadratic order.
According to another aspect, a SSTA tool according to the invention provides a method of forward breadth first search considering constraint with multi-phase sequential elements to first store clock phases of the driven gates to determine whether two signals coming from different phases of inputs of the gate should merge at the output node, and calculate the accumulated probability at each node as a vector with respect to clock phases, and edge probability matrix associated with each clock phase of input of the gate and the clock phase of output of the gate.
According to another aspect, a SSTA tool according to the invention provides a method of backward depth first traversal considering timing constraint with multi-phase sequential elements to search paths backward from the flip-flop data input with negative slack in terms of probability and make use of accumulated probabilities at node and input of the fanin gate of the said node and the edge probability from the said input to said output with certain clock phases to determine the probability of the path passing through the said input associated with clock phase and keep searching recursively.
According to another aspect, a SSTA tool according to the invention provides a method to pre-characterize the timing library of the gate delay and its output slope as random variables in terms of input slope and output loading also as random variables with the coefficients in terms of process variations to the 2nd order.
According to another aspect, a SSTA tool according to the invention provides a method of considering gate and interconnect delays with non-Gaussian variation up to quadratic orders by evaluating the input admittance matrix and voltage transfer from interconnect input to its out outputs all of which are random variables to the 2nd order of variation effects and computing gate effective capacitance and interconnect delay also in terms of 2nd order variation effect.
According to another aspect, a SSTA tool according to the invention provides a method of computing gate effective capacitance up to quadratic order of variation effect using the input admittance matrix of the interconnect driven by the gate up to quadratic order of variation effect by adopting iterative procedure including evaluating waveform parameters with 2nd order variation effect and re-evaluating capacitance utilizing driving input admittance in terms of poles and residues with 2nd order variation effects.
According to another aspect, a SSTA tool according to the invention further provides a method of evaluating the driving input admittance matrix of the interconnect by formulating these up to quadratic order of variation effect and using Arnoldi's method including variations to achieve orthonormal bases and fitting each matrix element to simple pole format with pole and residue being expressed in terms of variations up to the 2nd order.
According to another aspect, a SSTA tool according to the invention further provides a method of adopting iterative procedure in computing effective capacitance by approximating waveform with combination of linear and exponential waveforms and achieving admittance matrix and effective capacitance exactly.
According to another aspect, a SSTA tool according to the invention further provides a method of handling cross-talk issue with non-Gaussian variation up to quadratic order by iteratively calculating effective capacitance for each port and obtaining waveform at each port in terms of parameters up to 2nd order of variation.
According to another aspect, a SSTA tool according to the invention further provides a method of calculating voltage transfer using Arnoldi's method up to 2nd order of variation and fitting matrix element into simple pole format with pole and residue as random variables including up to 2nd order of variation.
According to another aspect, a SSTA tool according to the invention further provides a method of calculating voltage wave at interconnect outputs by approximating the waveform at interconnect inputs with combination of linear and exponential waveforms and achieving voltage transfer pole and residues to the 2nd order of variation.
According to another aspect, a SSTA tool according to the invention further provides a method of finding the time at which the waveform expressed as a random variable up to 2nd order of variation reaches certain value.
These and other aspects and features of the present invention will become apparent to those ordinarily skilled in the art upon review of the following description of specific embodiments of the invention in conjunction with the accompanying figures, wherein:
The present invention will now be described in detail with reference to the drawings, which are provided as illustrative examples of the invention so as to enable those skilled in the art to practice the invention. Notably, the figures and examples below are not meant to limit the scope of the present invention to a single embodiment, but other embodiments are possible by way of interchange of some or all of the described or illustrated elements. Moreover, where certain elements of the present invention can be partially or fully implemented using known components, only those portions of such known components that are necessary for an understanding of the present invention will be described, and detailed descriptions of other portions of such known components will be omitted so as not to obscure the invention. In the present specification, an embodiment showing a singular component should not be considered limiting; rather, the invention is intended to encompass other embodiments including a plurality of the same component, and vice-versa, unless explicitly stated otherwise herein. Moreover, applicants do not intend for any term in the specification or claims to be ascribed an uncommon or special meaning unless explicitly set forth as such. Further, the present invention encompasses present and future known equivalents to the known components referred to herein by way of illustration.
As set forth above, the issue of SSTA incorporating the interconnect effect has been addressed before. However, to increase accuracy of path analysis results, the physical quantities such as delay and slope of waveform in SSTA must be expressed as non-Gaussian PDF. Furthermore, to handle gate and interconnect delays correctly, these two parts must be treated together by using the well-known effective capacitance approach. Therefore, the cells and interconnect must be pre-characterized to quadratic order of the process variation effects. Besides, this problem becomes even more complicated if the circuit has multi-phase sequential elements. It is necessary to have a timing analysis engine using forward signal propagation based on breadth first search followed by backward propagation using depth first traversal to generate critical paths in terms of probability. In stark contrast, as shown in
In
As shown in
According to one aspect of the invention, new timing analysis algorithm 107108 in this tool of SSTA with uniform or non-uniform timing constraint 102 is discussed here in detail. First, some basic notation is reviewed. In SSTA, the delay or slope at the node is not a number, but expressed as a random variable with certain PDF due to process variation. If the random variable A has Gaussian distribution, then
A=a+bx
in which x is a random variable with normalized Gaussian distribution N(0,1). The PDF fx for random variable x is
fx(x)=1/√2πexp(−x2)
while the PDF fA(x) for random variable A is
fx(x)=1/√2πexp(−((x−a)/b)2)
with a and b being the nominal value and variance for A, respectively.
In reality any random variable representing physical quantities like delay, slope etc. cannot be approximated by Gaussian distribution exactly, and must be modeled by non-Gaussian distribution including the second-order effect such as
A=a+bx+cx2
Note that x can be a vector with each component representing independent process parameter. For example, transistor width and length are two independent process parameters.
In SSTA the operation MAX(X,Y) is used to store PDF of max of the two random variables X and Y. Suppose a gate has two inputs i1 and i2 and one gate output out with their latest arrival times at two inputs i1 and i2 stored as two random variables A and B, respectively. Two delays delayi1→out out and delayi2→out out are represented by C and D also as random variables. The latest arrival times in terms of random variables at out from the two inputs i1 and i2 are E and F, respectively. We have
A=a0+a1×+a2x2
B=b0+b1×+b2x2
C=c0+c1x+c2x2
D=d0+d1x+d2x2
E=(a0+c0)+(a1+c1)x+(a2+c2)x2
F=(b0+d0)+(b1+d1)x+(b2+d2)x2
MAX(E,F)=e0+e1x+e2x2
Then, MAX(E,F) is computed to get e0, e1 and e2 and it is the latest arrival time in terms of a random variable stored at the output of the gate. The computation of e0, e1 and e2 by Zhan et al. is already quite well-known, and there is no need to repeat them here. In SSTA the probability of the path is needed besides the latest arrival time which is random variable. Still using the above example of a gate with two inputs, we need to know the probability for each of the two paths, namely from i1 to out and i2 to out. This information is obtained from the calculation of MAX(E>F) and MAX(F>E) with MAX(E,F)=MAX(E>F)+MAX(F>E). The probability for E>F is obtained by integrating the PDF (Probability Density Function) of MAX(E>F) and same for F>E.
According to one aspect of the invention,
The engine of forward propagation using BFS 107 in the presence of uniform timing constraint is further illustrated by referring to
P=D>T1/√2πexp(−(y2))dy
Here y can be a vector with several process parameters.
P=D>T1/√2πexp(−(y2))dy
Continue referring to
edge_prob(e,f,P1,P1)=Prob(d1>d2)
in which e and f refer to input and output node, and the 3rd argument is the clock phase the delay at input is associated with, and the 4th argument is the clock phase the delay at output is associated with. Then, edge_prob(e,f,P2,P1)=Prob(d2>d1) follows similarly.
As accum_prob at node f, since d3 is measured from phase P1, and there are two paths from e to f with two different clock phases. The correct accum_prob 604 is obtained as follows,
accum_prob(f,P1)=max(accum_prob(e,P1)*Prob(d1>d2), accum_prob(e,P2)*Prob(d2>d1))
Similar to the case of uniform timing constraint, the forward propagation of the signals stop when the signals have reached inputs of destination flip-flops and the probability for failure at the said nodes are calculated 207.
Using the previous notation for edge_prob, we have P11=edge_prob(1,e,P1,P1).
Note that the 2nd row is (0,0) because input a does not have any delay associated with P2. Since node a does not store the longest delay with respect to P2, the longest delay with respect to P2 in node d does not merge with any delay from a, meaning the probability of the delay from d with respect to phase P2 goes to node e also with respect to P2 is 1. This explains the matrix edge probability P between d and e is
With the information of accum_prob at nodes a and d, and the edge matrix probabilities between edges a to e and d to e, the accum_prob at node e is obtained easily.
accum_prob(e,P1)=max(accum_prob(a,P1)*edge_prob(a,e,P1,P1),
accum_prob(d,P1)*edge_prob(d,e,P1,P1))=max(1*0.7,1*0.3)=0.7
accum_prob(e,P2)=1
This is how we get 0.7,1 for node e in
and edge_prob(e,f,P2,P1)=0.1. In terms of edge matrix it is
accum_prob(f,P1)=max(accum_prob(e,P1)*edge_prob(e,f,P1,P1),
accum_prob(e,P2)*edge_prob(e,f,P2,P1))=max(0.7*0.9,1*0.1)=0.63
accum_prob(f,P1)=0.63
accum_prob(f,P2)=0
In
This invention presents an algorithm 108 for backward propagation to generate all critical paths as shown in
x=edge_prob(e,f,Q,P)
y=accum_prob(e,Q)
z=accum_prob(f,P)
Since the most critical paths passing through node f have the probability cur_prob, then the probability for the path passing through e with clock phase Q to node f with clock phase P is 805
new_prob=cur_prob*y*x/z
By comparing new_prob with user specified threshold probability, it can be determined whether the path search can go beyond this node e. If so, then the process is done recursively 807. If not then the next clock phase Q of node e will be accessed and the same process continues 808. Then, when the clock phases of node e are exhausted, the next input of gate F is accessed 809 and so on until all critical paths with probability greater than user specified threshold probability.
Referring back to
Referring back to
X=a+bx+cy+dx2+eye+fxy
in which x and y refer to random variables for transistor width and transistor length, respectively assumed to have normalized Gaussian distribution N(0,1). If X is for cell output slew time, then the coefficients a, b, c etc. depend on the cell input slew time and output loading which are also random variables. With this understanding, the procedure of pre-characterizing cell in the presence of variation effect is described as follows. The purpose is to characterize, say output slew S as a function of input slew, output loading and transistor width, length into
Sout=a(Sin,Lout)+b(Sin,Lout)W+c(Sin,Lout)W+d(Sin,Lout)W2+e(Sin,Lout)W2+f(Sin,Lout)WL
The delay of the cell from cell input to cell output denoted by Din→out is discussed similarly, and there is no need to repeat it here. Since we use N(1,0) for x any y representing random variables for width W and length L, in the modeling W in fact is (W−μ)/σ assuming N(μ,σ) is the Gaussian PDF for W, and L follows the same. In the sampling for Sin and Lout 902, choosing one sample point and then do sampling for W and L 903, to get coefficient a, b, c, d, e, f at one chosen point for Sin and Lout 904. By doing this way, the data for all the sampling points for the coefficients a, b, c, d, e and f, we therefore can fit each of them into relation 905
a(Sin,Lout)=ƒ(i)Y(i)
Y(1)=Sin Y(2)=Lout*Sin Y(3)=(Lout)3
Y(4)=Lout Y(5)=1
All f g and h are empirical fitting coefficient.
The explicit formula is given as follows
Sout=A+Bx+xtCx
where x=(x1,x2, . . . xn)t is 1×n vector representing n independent process parameters with normalized Gaussian PDF N(1,0), B is 1×n vector, and C is a n×n symmetric matrix and A is a scalar, which is the constant term. We have
A=Σi=15a(i)Y(i)
Bm=Σi=15b(m,i)Y(i)
Cmn=Σi=15c(m,n,i)Y(i)
in which Bm is the mth component of vector B, while Cmn is element of matrix C at row m and column n.
The SSTA engine is further complicated by the variation effects due to the RLC interconnect by referring to 104 in
(G+sC)x=b
In MNA there are two types of elements, type 1 elements are nodal voltages and type 2 elements are branch currents respectively. The matrix G consists of time independent elements, while matrix C handles time dependent elements such as capacitors. The vector b contains nonzero external current at the ports and zero for the remaining components. By separating the ports from the remaining nodes, the above equation is further partitioned into
Assuming there are m ports and n internal nodes in RLC circuit, here xp are the m port node voltages and xi include n internal node voltages and l inductor currents. GP is m×m matrix for m ports, so is CP. GI and CI are (l+n)×(l+n) matrices representing n internal nodes and l inductors. Both GC and CC are (l+n)×m connection conductance and susceptance matrices, respectively. The right hand side is a (m+l+n)×m matrix with bP as being the external current at the m port nodes. The admittance Y(s) is defined as
Y(s)xP(s)=bP(s)
Through some calculation by eliminating xi, by following Kerns formulation without considering variations Y(s) is obtained as follows,
Y(s)=GP+sCP−(GC+sCC)T(GI+sCI)−1(GC+sCC)
Kerns and Yang have pointed out that by using Cholesky factorization GI can be transformed into unit matrix. Since this part will not affect the later discussions, it is omitted for the sake of brevity.
Our purpose is to calculate Y(s) as a function of variation parameters to the second order and express each matrix component of Y(s) in terms of pole and its residue to the second order effect due to variations. The conductance and susceptibility matrices can be formulated in powers of variation parameters. For example, resistance being assumed as a Gaussian random variable expressed as
R=R0+R1a
Similarly capacitance C has Gaussian distribution like
C=C0+C1b
Here R0 and C0 are nominal values and a and b are two independent variation parameters for resistance and capacitance, respectively 1003. By using these, it is straightforward to evaluate G and C matrices in powers of a and b. We have
GI=GI,0+V
V=Σi=1NGIiVi
Here Vi stands for ith random variable up to the second order. For example if there are two random parameters a and b, then Vi can be one of random variables a, b, ab, a2, and b2 in which a and b are independent process parameter with normalized Gaussian distribution N(0,1). The nominal value of GI is GI0 and GIi is the coefficient matrix of random variable yi. The matrices CP, GP, CP, GC and CC are all defined similarly. To simplify the calculation the congruence transform is carried out by using
After the congruence transform XTGX and XTCX the connection susceptibility matrix CC becomes zero and Y(s) still remains the same. It is noted that the above mentioned X has V dependency. In the calculation we need to preserve up to V2 term since V contains 1st order of random variable and we need computation to the 2nd order. For example,
GI−1=(GI,0+V)−1=(I+GI,0−1V)−1GI,0−1=GI,0−1−GI,0−1VGI,0−1+GI,0−1VGI,0−1V GI0−1
Note that the square terms for the random variables in V doesn't contribute in V2 since we only need up to the 2nd order term of random variable. Therefore, X can be calculated to be
X=X0+V
V=Σi=1NXivi
Where X0 is the nominal part of X, V is the variation part consisting N variational terms up to the second order of each independent variation parameter and Xi is the coefficient of each variation term vi.
For the sake of brevity of notation we stick to the same formula for Y(s) after congruence transform by X with the understanding that Cc is zero and the remaining matrices still up to the 2nd order of random variables but with different matrix coefficients. The admittance matrix Y(s) can then be written as
Y(s)=GP+sCP−(GC)T(I+sGI−1CI)−1(GI−1 GC)
The next step is to use Arnoldi's procedure in finding the orthonormal bases [W0,W1, . . . Wq−1] for the Krylov space Kr(GI−1 CI, GI−1 GC, q) for matching q moments of the multiport admittance. We start with WI, which is GI−1 GC and followed by QR decomposition if the circuit has more than one port. It is worth noting that in QR process, we need to get vector divided by its norm. Here we use a simple example to illustrate this concept. Assuming there is one random parameter x, a vector with two components up to the second order of x is something like
The norm is obtained by first taking the square
(4+2x+3x2)2+(3+1x+2x2)2=(16+16x+28x2)+(9+6x+13x2)=25+22x+41x2
Then square root is in the format of (a+bx+cx2), and we easily get 5+2.2x+7.232x2. As to W2, W3 etc. they are obtained through standard procedure of Gram-Schmidt orthogonalization. We end up with the Arnoldi vector Wi (i=0, 1 . . . q−1) with variation effect as follows
Wi=Wi0+M
M=Σm=1NWi,mvm
Here Wi0 is the nominal part of ith vector Wi, and Wi,m is the coefficient of the mth random parameter vm which may contain second order of independent random variable. Set W=[W0 W1 . . . Wq−1] which is an orthonormal matrix spanning the Krylov space Kr(GI−1 CI, GI−1 GC, q). To preserve passivity of the circuit, the congruence transform of G and C by W is 1004
{tilde over (G)}I=WTGIW
{tilde over (C)}I=WTCIW
{tilde over (G)}C=WTGC
Y(s)=GP+sCP−({tilde over (G)}C)T({tilde over (G)}I+s{tilde over (C)}I)−1({tilde over (G)}C)
Now we are in the position to calculate poles and residues with variation terms. Y(s) is rewritten as follows,
Y(s)=GP+sCP−({tilde over (G)}C)T(I+s{tilde over (G)}I−1{tilde over (C)}I)−1({tilde over (G)}I−1{tilde over (G)}C)
A={tilde over (G)}I−1{tilde over (C)}I=A0+V
V=Σi=1NViyi
B={tilde over (G)}I−1{tilde over (G)}C=B0+Y
Y=Σi=1NYiyi
C=({tilde over (G)}C)T=C0+Z
Z=Σi−1NZiyi
Note that A is the reduced matrix by Arnoldi method by considering variation effects up to the 2nd order of random parameters. Here we have 1005
calculated to the 2nd order of random parameters. The variation parts V, Y, and Z contain second order terms, so the quadratic terms like sV sV only contain 2nd order terms. Since A0 is the reduced matrix without variation, this matrix can be diagonalized by using congruence transform U consisting of eigenvectors of A0. For the sake of brevity of notation, A0 is diagonalized matrix containing the eigenvalues of the original A0. The matrix V after the congruence transform U cannot be diagonalized. The right hand matrices B0=UT B0, Y=UT Y and the left hand matrices C0=UC0, Z=UZ. By simple observation for each matrix element of Y(s) it can have the form of simple, double, and cubic pole etc. However, we need to express simple pole and residue with its variation terms. Here we adopt the method by expanding pole and residue form in powers of variation parameters and compare with the exact solution of Y(s) to obtain the coefficients of variation terms for pole and residue. The procedure can be illustrated by the following example. Assuming there is only one random parameter x with the pole and residue form can be shown as follows 1006
The purpose is to find coefficients for random parameter x for the pole −α and residue β. By comparing with the exact result from Y(s) in powers of x and 1/(s+α) we can obtain a and c from x term and b and d from x2 term. Note that in this example 1/(s+α)3 term for x2 term in fact is not needed, since the unknown a in the coefficient a2β has already been obtained in the other terms.
Referring back to
As shown in
in which Vi is he initial node value of the interconnect, and i(s) is the Laplace transform of the current i(t) at the said driving node of RLC interconnect. We require that
Here −1 is the inverse Laplace transform and V(s) is the Laplace transform of V(t) as already defined 1104. The value of Ceff is achieved by using the above formula iteratively. The initial choice of Ceff can be chosen to be the total routing capacitance 1103. The above formula is evaluated as follows. To make notation simple, δ and γ are used in replacement of δk and γk without loss of generality. We have
Making use of
We obtain 1105
Note that in the above formulae, the pole γ and residue δ refer to one of the poles and residues, and the summation with respect to all poles and residues are implied. The constant α then is independent of the summation of each pair of pole and residue. All the constants have variation terms to the second order. The coefficients b and c, and delays tD and tx are also random variables calculated by using pre-characterized timing library with variations from input slew time and output loading based on Ceff during the process of iteration until the final Ceff is obtained. Therefore, with all of terms such as poles, residues, tx and tD being calculated to the 2nd order of random parameters, the term Q(t) up to the 2nd order of random parameters can also be achieved. For example, in calculating Q(t=tD), we have one term like eγ tD with γ=γnom+X and tD=tD,nom+Y in which X and Y include variation terms up to 2nd order. Then, eγt(t=tD)=eγnomtD,nom (1+XtD,nom+Yγnom+XY tD,nomγnom+X2 tD,nom2+Y2γnom2+2 XYtD,nomγnom). By combining the coefficients of the same random parameter with the same order, eγt(t=tD) becomes eγnomtD,nom Z where Z contain variation terms up to 2nd order. The calculation is straightforward but very tedious, so the final formula up to the 2nd order is not given here. Eventually we obtain Q(t=tD)=Q0+W with W being the variation term up to 2nd order of variation effects. Using 1106
Ceff is reevaluated 1107, and using new Ceff, which is a random variable up to 2nd order of random parameter, to obtain new constants such as tD, tx, a, b, c and achieve new Q(t) accordingly. This process is iterated until convergence 1108.
Referring back to
A11(s)V1(s)+A12(s)V2(s)=i1(s)
A21(s)V1(s)+A22(s)V2(s)=i2(s)
in which A11 is the matrix element of 2×2 admittance matrix, V1(s)(i1(s)) and V2(s) (i2(s)) are voltages(currents) in s domain at victim and aggressor node. The purpose is to iteratively 1303 find the effective capacitances Ceff,1 and Ceff,2 at victim and aggressor nodes 1 and 2, respectively. Initial values for Ceff,1 and Ceff,2 are given first. Then the waveforms in terms of tx,i and tD,i at node i (i=1 and 2) are evaluated with tx and tD being defined similar to the those in the case of one port. Here all of the constants Ceff,i, tx,i, tD,i (i=0,1) obviously are random variables up to 2nd order of random parameters. We have 1304
∫0t−1(ii(s))dt=Q1
=Q11+Q12=∫0t−1(A11(s)V1(s))dt+∫0t
−1(A12(s)V2(s))dt
Note that waveforms V1(t) and V2(t) in time domain at nodes 1 and 2 have different starting points. The integration range from 0 to tD1 are with respect to the starting point of waveform at node 1. We therefore need to transform 0 and tD1 to the coordinate system used by V2(t). For example the waveform V1(t) starts from, say T1, from origin, while waveform at node 2 starts from T2 as shown in
Ceff,2 is obtained similarly. Through tedious calculation all of the above constants are random variable with nominal part and variation part up to the 2nd order of random parameter, thus Ceff,1 and Ceff,2 are also calculated to the 2nd power in random parameters. The process is iterated until convergence 1307 is reached. In cases of more than one aggressors, we need to find the switching tomes for each aggressor with respect to the victim in order to obtain maximum delay from victim input to victim output. This procedure is similar to that in the case of STA and not repeated here. The discussion on victim glitch is exactly the same as that in the case of calculating maximum delay of the victim except setting voltage a constant at the input of the victim driver. Therefore, there is no further discussion on glitch in cross-talk.
Referring back to
Vout(s)=T(s)Vin(s)
in which Vout (s) is a m×1 matrix, T(s) is m×n matrix and Vin (s) is n×1 matrix with each matrix element being a random variable up to the 2nd order of variation effects. Both admittance matrix and voltage transfer are treated under the same formulation. By using the same Arnoldi's method 1502 in obtaining W=[W0 W1 . . . Wq−1] which is an orthonormal matrix spanning the Krylov space Kr(GI−1CI, GI−1 GC, q) and making the congruence transform as in the case of calculating admittance matrix, we achieve 1503
Ñ=WTN
T(s)=−({tilde over (N)})T(I+s{tilde over (G)}I−1{tilde over (C)}I)−1({tilde over (G)}I−1{tilde over (G)}C)
here N being defined as n×m matrix if the circuit has n outputs and m internal nodes and N initially prior to congruence transform by W, assuming CC is zero without loss of generality, is
Nij=1 if ith input=jth internal node
By comparing T(s) with Y(s)
Y(s)=GP+sCP−({tilde over (G)}C)T(I+s{tilde over (G)}I−1{tilde over (C)}I)−1({tilde over (G)}I−1{tilde over (G)}C)
as obtained before, we see that GP+s CP is not contained in T(s) and {tilde over (G)}C in Y(s) is replaced by Ñ in T(s). Thus, the matrix element of T(s) does not include α+βs. Similar to the handling of admittance matrix the voltage transfer can be expanded to the 2nd order of variation terms, and then fitted into simple pole and residue including variation terms to obtain poles and residues to the second order of variations. Without loss of generality we assume Vout (s) is a 3×1 matrix, T(s) is 3×2 matrix and Vin (s) is 2×1 matrix, Vout,1(t) and Vout,2(t) refer to waveform at output nodes of victim and aggressor, respectively. For output 1, we have
Vout,1(t)=−1(T11(s)Vin,1(s))+
−1(T12(s)Vin,2(s))
Vin,1 (t) and Vin,2 (t) are defined similarly as in the discussion of effective capacitance for the case of one port with the combination of three regions including quadratic, linear and exponential forms. In the actual implementation the quadratic part can be neglected without suffering too much inaccuracy. In this approximation Vin,1 (t) stays at initial constant value Vi, which is either Vdd or 0 depending on fall or rise transition at RLC input, for the period of time denoted by t0 before the waveform starts changing. Vin,1 (t) and Vin,2 (t) have their own t0, meaning they have different switching times. Using the aforementioned formula as follows,
Vin,1(t)=a+b(t−tx)
The starting point t0 when Vin,1 (t) changes from initial value Vi is
The waveform at the RLC input is as follows 1504,
It is emphasized that all of t, t0 and tD are measured from the origin which is the starting point when the input of the driver starts changing. We have
The notation for summing all pairs of residue and pole is omitted. In actual calculation it is neat to change t coordinate from to and tD becomes tD−t0, and then replace t by t−t0. We have 1505
For t<tD
For t>tD
After the waveform at the output node of the victim is obtained, the next step is to find the time tout,1 at which the waveform of the victim Vout,1(t) is at 50% of Vdd, then the delay from the input of RLC input, say node 1 which is the input of the victim, to the output node can be obtained as 1506,
Delay=tout,1−tD,1
Note that Delay is a random variable up to the 2nd order variation effect. Knowing that all the constants such as t0, tD, f, d, b in fact are random variables expressed by random parameters up to the second order, the waveform at output node is also a random variable up to the 2nd order effect. We use an example as shown in
f(t,α)=f0(t)+f1(t)α+f2(t)α2
The purpose is to find the random variable t being expressed as 1602
t=tn+Δt=tn+xα+yα2
f(t,α)=0.5 Vdd
The function f(t,α) is further expanded in power of Δt 1603
We therefore obtain tn, x and y as by solving the following equations sequentially 1604
In conclusion, the issue of SSTA in multi-phase sequential circuit with cross-talk in consideration of process variations up to the 2nd order is not addressed well. There are several difficulties. First, in multi-phase sequential circuit timing constraint can be non-uniform and complex. This invention provides a novel method to utilize breadth first traversal and special algorithm to store longest arrival time as random variable and accumulated probability at each node with respect to clock phases and generates edge probability matrix from input to output of the gate associated with stored input and output clock phases, then followed by backward search to get critical paths in terms of probability. Secondly, the accuracy of delay based on Gaussian distribution is questionable. This invention solves the problem by adopting Non-Gaussian behavior and provides a method to pre-characterize the timing library for the gate output delay and slope as random variables as a function input slope and output loading up to the 2nd order of variation. Thirdly, the issue of interconnect including cross-talk needs to be handled by considering non-Gaussian behavior. This invention provides a novel way to calculate admittance matrix and voltage transfer up to 2nd order of process variation and fitted the results into simple pole format to obtain poles and residues for each matrix element up to 2nd order of variation. Effective capacitance and interconnect delay can be calculated up to 2nd order of variation by using poles and residues including up to second order variation effects of both admittance matrix and voltage transfer, accordingly. The invention further provides a method for cross-talk with multiple ports to evaluate effective capacitances at the ports individually and waveforms at the ports are then used to calculate delay expressed as random variable containing 2nd order variation terms at victim outputs. With all of these in place, SSTA is used to identify critical paths in terms of probability in an accurate manner.
It should be noted that a timing verification tool according to the invention may have one or more of the above-described capabilities in any combination, and any of these novel capabilities can be combined with conventional or other novel timing verification tools.
Accordingly, although the present invention has been particularly described with reference to the preferred embodiments thereof, it should be readily apparent to those of ordinary skill in the art that changes and modifications in the form and details may be made without departing from the spirit and scope of the invention. It is intended that the appended claims encompass such changes and modifications.
Number | Name | Date | Kind |
---|---|---|---|
5680332 | Raimi | Oct 1997 | A |
7086023 | Visweswariah | Aug 2006 | B2 |
7890915 | Celik | Feb 2011 | B2 |
7900175 | Chang | Mar 2011 | B2 |
8244491 | Zhang | Aug 2012 | B1 |
20040044510 | Zolotov | Mar 2004 | A1 |
20070277134 | Zhang | Nov 2007 | A1 |
Entry |
---|
Agarwal et al., Circuit optimization using statistical static timing analysis, Conference Paper, DOI: 10.1109/DAC.2005.193825 Source: IEEE Xplore, Jul. 2005. |
Chang et al., Statistical timing analysis under spatial correlations, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (vol. 24, Issue: 9, Sep. 2005). |
Lizheng Zhang, Jun Shao, Charlie Chungping Chen Non-Gaussian Statistical Parameter Modeling for SSTA with Confidence Interval Analysis ISPD 2006, Apr. 9-12, 2006, pp. 33-38. |
Jessica Qian, Satyamurthy Pullela, and Lawrence Pillage, Modeling the “Effective Capacitance” for the RC Interconnect of CMOS Gates, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 13, No. 12,pp. 1526-1535 Dec. 1994. |
Kevin J. Kerns and Andrew T. Yang, Stable and Efficient Reduction of Large, Multiport RC Networks by Pole Analysis via Congruence Transformations, Proc. DAC 1996, pp. 280-285. |
Kevin J. Kerns and Andrew T. Yang, Preservation of Passivity During RLC Network Reduction via Split Congruence Transformations, Stable and Efficient Reduction of Large, Multiport RC Networks by Pole Analysis via Congruence Transformations, Proc. DAC 1997, pp. 34-39. |
Altan Odabasiouglu, Mustafa Celik, PRIMA: Pssive Reduced-Order Interconnect Macromodeling Algorithm, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 17, No. 8,pp. 645-654 Aug. 1998. |
Ying Liu, Lawrence T. pillegi and Andrzej J. Strojwas, Model Order-Reduction of RC(L) Interconnect including Variational Analysis, Proc. DAC 1999, pp. 201-206. |
Sani R. Nassif, Modeling and Analysis of Manufacturing Variations, Proc. CICC 2001, pp. 223-228. |
C. Visweswariah, K. Ravindran, k. kalafala, S.G. Walker, S. Narayan First-Order Incremental Block-Based Statistical Timing Analysis, Proc. DAC 2004, pp. 331-336. |
Yaping Zhan, Andrzej J. Strojwas, Xin Li, Lawrence T. Pileggi, Correlation-Aware Statistical Timing Analysis with Non-Gaussian Delay Distribution, Proc. DAC 2005, pp. 77-82. |
Soroush Abbaspour, Hanif Fatemi, Massoud Pedram, Parameterized Block-Based Non-Gaussian Statistical Gate Timing Analysis, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 26, No. 8,pp. 1495-1508 Aug. 2007. |
Anirudh Devgan, Chandramouli Kashyap, Block-based Static Timing Analysis With Uncertainty, Proc. ICCAD 2003, pp. 607-614. |
Number | Date | Country | |
---|---|---|---|
20150199462 A1 | Jul 2015 | US |
Number | Date | Country | |
---|---|---|---|
61927740 | Jan 2014 | US |