This application claims priorities of Taiwanese Patent Application No. 101116185, filed on May 7, 2012, and No. 101128031, filed on Aug. 3, 2012.
1. Field of the Invention
The present invention relates to a method of implementing timing engineering change order (ECO), more particularly to a method of implementing timing ECO that takes into consideration smoothness of circuit paths.
2. Description of the Related Art
Since complexity in modern integrated circuit (IC) design has grown rapidly, some design failures that are hard to be detected may not be found until later stages of IC design, such as, after circuit layout, or even after chip fabrication. In order to correct the late-found design failures, it has become a trend for a current IC design house to not trace the failure back to early stages, and to adopt the metal-only engineering change order instead. The reason is that, in relevant researches, photomasks for a transistor layer is much more expensive than photomasks for a metal layer. Therefore, adopting metal-only ECO and maintaining the photomasks for the transistor layer unchanged is a cost saving way for failure correction.
To facilitate metal-only ECO, a placement tool is adopted to insert spare cells in a circuit. Once a design failure is detected, by selecting and rewiring appropriate spare cells, the design failure may be corrected. Typically, a circuit design is usually required to undergo many ECO runs. Therefore, how to save spare cells during the ECO runs is an important issue.
ECO may be classified into functional ECO and timing ECO. The functional ECO is used to correct functional errors and/or revise specification. The timing ECO is used to remedy signal imperfection and fix timing violations by gate sizing and/or buffer insertion.
In general, slack or delay of a gate is used to measure timing criticality of the gate. However, neither the slack nor the delay can reflect the timing criticality well, such that the timing violation may not be fixed effectively. For this reason, how to develop a method of implementing timing ECO, which is capable of determining timing criticality of gates precisely and fixing timing violation with less spare cells, is an object of the present invention.
Therefore, an objective of the present invention is to provide a method of implementing timing engineering change order (ECO) effectively.
Accordingly, the method of implementing timing ECO in a circuit that includes a plurality of gates and that is provided with a plurality of spare cells, according to the present invention, comprises the steps of:
(A) performing a static timing analysis on the circuit so as to determine at least one timing violating path of the circuit, each timing violating path including a sequence of the gates such that from each of its gates there is a wire connecting to the next gate in the sequence, and having negative edge slacks, which are slacks of edges that represent wires between the gates;
(B) decomposing each of the at least one timing violating path into at least one violating path segment having the same edge slack;
(C) determining a smooth curve from each of the at least one timing violating path, and determining a plurality of reference points that are evenly distributed along the smooth curve and that correspond respectively to the gates on the timing violating path;
(D) computing a fixability parameter of each of the gates on the violating path segment, wherein the fixability parameter is associated with a smoothness parameter of the respective gate, and the smoothness parameter is associated with a distance between the respective gate and the corresponding reference point;
(E) extracting at least one gate from the gates on the violating path segment, according to the fixability parameters of the gates on the violating path segment, to serve as at least one extracted gate;
(F) selecting one of the spare cells that is adapted for improving slack of a corresponding one of the at least one extracted gate as a selected spare cell, and disposing the selected spare cell on the violating path segment; and
(G) rewiring the circuit so as to apply the selected spare cell.
An effect of the present invention resides in that, the fixability parameter of each of the gates is computed using the smooth curve so as to take into consideration the influence of geometrical smoothness of a path on timing criticality, such that the timing criticality of the gates may be determined precisely.
Other features and advantages of the present invention will become apparent in the following detailed description of the four preferred embodiments with reference to the accompanying drawings, of which:
Referring to
First, step S01 is to perform a static timing analysis on the circuit so as to determine at least one timing violating path of the circuit. The timing violating path includes a sequence of the gates such that from each of its gates there is a wire connecting to the next gate in the sequence, and has negative edge slacks, which are slacks of edges that represent wires between the gates. Definitions associated with the timing violating path are explained hereinafter. In this embodiment, all of the timing violating paths of the circuit are determined in the static timing analysis, such that all timing violations in the circuit may be thus fixed.
An arrival time A(i) of an output signal of the gate giεG is computed as
A required time R(i) of the gate giεG is computed as
wherein R(i,k) is an edge required time of the edge e(i,k).
Edge slack S(i,j) is slack of the edge e(i,j) that is contributed from the gate gj back to the gate gi, and that is computed as
S(i,j)=R(i,j)−A(i).
Gate slack S(i) is slack of the gate g(i), and is computed as
Alternatively, the gate slack S(i) can be computed as
S(i)=R(i)−A(i).
Based on the aforementioned definitions, a timing violating path P can be defined as a path with
S(i,j)<0,∀e(i,j)εP.
An example is given in
After all timing violating paths of the circuit are determined in step S01, step S02 is to decompose each timing violating path into at least one violating path segment having the same edge slack. The violating path segment is a segment of the timing violating path that includes the most consecutive edges having the same negative edge slack. A circuit on the left side of
Subsequently, step S03 is to compute a fixability parameter Tx(i) of each of the gates on the violating path segment. The fixability parameter Tx(i) is associated with a smoothness parameter Ts(i), a flexibility parameter Tf(i), and a path sharing parameter Th(i) of the respective gate g(i). The fixability parameter Tx(i) is expressed as
Tx(i)=(Tf(i)+Ts(i))Th(i),∀giεG
The fixability parameter Tx(i) is used to indicate timing criticality of the respective gate g(i). In this embodiment, a higher value of the fixability parameter Tx(i) represents higher timing criticality of the respective gate g(i). Computations related to the smoothness parameter Ts(i), the flexibility parameter Tf(i), and the path sharing parameter Th(i) are explained hereinafter.
The flexibility parameter Tf(i) reflects the room for timing improvement, and is associated with a wire loading impact for the respective gate Tl(i) and a slack difference of the respective gate Td(i). The wire loading impact Tl(i) is associated with a difference between gate delays of the respective gate g(i) with and without wire loading (D(i) and D0(i)), that is
Tl(i)=D(i)−D0(i),∀giεG.
The wire loading impact Tl(i) corresponds to the maximum improvement that can be obtained by wire loading reduction. The slack difference Td(i) is associated with a difference between the worst one and the second worst one of the edge slacks of the respective gate g(i) to fanout gates of the respective gate g(i), that is
The slack difference Td(i) gives an upper bound of the impact on slack when the delay of the respective gate g(i) is improved. Preferably, the flexibility parameter Tf(i) is associated with the smaller one of the wire loading impact Tl(i) and the slack difference Td(i), i.e.
Tf(i)=min(Tl(i),Td(i)).
A path with good timing is desired to be geometrically smooth. The smoothness parameter Ts(i) is related to geometrical smoothness of the violating path segment. To calculate the smoothness parameter Ts(i), first of all, a smooth curve is determined from the timing violating path. The smooth curve is utilized as a baseline to measure the smoothness of the timing violating path. In this embodiment, the smooth curve is a Bézier curve, and the gates on the timing violating path are used as control points of the Bézier curve. Given as set of n+1 gates g0, g1, . . . , gn on the timing violating path, the corresponding Bézier curve C(t) is given by a weighted sum of the control points
wherein Bin(t) is a Bernstein polynomial and tε[0,1]. Referring to
A subsequent step for calculating the smoothness parameter Ts(i) is to determine a plurality of reference points that are evenly distributed along the smooth curve and that correspond respectively to the gates on the timing violating path. The timing violating path and the curve C in
A wirelength between each of the gates on the violating path and the corresponding reference point is considered as a wirelength penalty of the respective gate. Therefore, the smoothness parameter of the respective gate is defined as a difference between a gate delay related to the distance between the respective gate and the corresponding reference point DB(i) and a gate delay related to zero wire loading D0(i). The smoothness parameter Ts(i) is represented as
Ts(i)=DB(i)−D0(i),∀giεG.
It is noted that a Bézier curve has the following properties. First, a Bézier curve must be completely contained inside a convex hull of control points of the Bézier curve (i.e., the Bézier polygon). Referring to
Moreover, referring to
Further, a Bézier curve has a property of high predictability. If a position of a control point of the Bézier curve is adjusted, a shape of the Bézier curve changes accordingly in a predictable manner. Referring to
The path sharing parameter Th(i) is defined as a number of timing violating paths passing through the same respective gate. Since the more timing violating paths pass through a gate, the more timing of the timing violating paths may be improved at the same time when timing of the gate is improved, the path sharing parameter Th(i) may thus effectively measure timing criticality of a gate.
After computing the fixability parameter Tx(i) of each of the gates on the violating path segment in step S03, step S04 is to extract one gate from the gates on the violating path segment, according to the fixability parameters of the gates on the violating path segment, to serve as an extracted gate. In this embodiment, the gate having the highest fixability is extracted to serve as the extracted gate.
Step S05 is to select one of the spare cells that is adapted for improving slack of the extracted gate as a selected spare cell. In this embodiment, the selected spare cell is to be selected from at least one spare cell candidate. The at least one spare cell candidate is determined from the spare cell that is adjacent to the extracted gate. Preferably, the at least one spare cell candidate is the spare cell located within a convex hull of fanout gates of the extracted gate and the gates on the timing violating path. For example, referring to
In this embodiment, step S05 is regarded as a problem of minimum weight perfect matching. Referring to a bipartite graph B=(GC,GS,EB) illustrated in
S′(ci|Sj)=R(gci)−A(gsj).
The aforementioned equation of slack may be estimated by means of the shielding effect and the loading dominance. A detailed description related to the shielding effect and the loading dominance is disclosed in “ECO timing optimization using spare cells and technology remapping” published in Proc. IEEE/ACM Int'l Conf. on Computer-Aided Design (ICCAD). The article is referred to as prior art DCP hereinafter.
In this embodiment, the weight w(gci,gsj) is represented as
When selecting an adequate spare cell, the spare cell that is able to make the edge slack of the violating path segment equal to zero has the highest priority for selection (i.e, w(gci,gsj)=0), the spare cell that is able to make the edge slack of the violating path segment equal to a positive value has the second highest priority for selection (i.e., 0<w(gci,gsj)≦1), and the spare cell that is able to make the edge slack of the violating path segment equal to a negative value has the least priority for selection (i.e., 1<w(gci,gsj)≦2). In this way, cost efficiency of the timing ECO may be promoted. It is noted that, in calculating the weight w(gci,gsj), the spare cell candidates that cause more timing violations after matching are ignored. In other words, only spare cell candidates that are able to improve slacks are considered when calculating the weight w(gci,gsj).
The selected spare cell is disposed on the violating path segment corresponding to the extracted gate. More specifically, the selected spare cell is used to replace the extracted gate (i.e., gate sizing). Alternatively, the selected spare cell is inserted in the violating path segment corresponding to the extracted gate (i.e., buffer insertion).
Subsequently, step S06 is to compose the violating path segments. When composing the violating path segments, a propagation property is utilized. The propagation property means that if a gate delay of a gate on a violating path segment P(i,j) is improved (decreased) by t, t≦min{Td(w)|gwεP(i,j)}, each edge slack of the violating path segment will also be improved (increased) by t. It is noted that if the gate is passed by multiple violating path segments, all the edge slacks of the violating path segments passing through the gate should be improved (increased) by t. Four cases for composing the violating path segments are illustrated in
Referring to
After composition of the timing violating path segments, step S06 is further to determine whether each of the extracted gates of a respective one of the violating path segments is a redundant gate, and when it is determined that the extracted gate is the redundant gate, disposition of the selected spare cell, which corresponds to the redundant gate, on the violating path segment corresponding to the redundant gate is cancelled. The redundant gate is the extracted gate whose slack is fixed to be not smaller than zero when another one of the violating path segments is disposed with the corresponding selected spare cell. For example, referring to
Subsequently, step S07 is to perform the static timing analysis once again on the circuit, and determine whether there is any timing violating path of the circuit. The method proceeds to step S02 if it is determined in step S07 that there is the timing violating path, and proceeds to step S08 if otherwise.
Finally, S08 is to rewire the circuit so as to apply the selected spare cell (i.e., rewire design), and the procedures for the first preferred embodiment of the method end.
A second preferred embodiment of the method of implementing timing ECO according to the present invention is substantially similar to the first preferred embodiment, and differs in the configurations that, in step S03, the fixability parameter Tx(i) is further associated with a spare-cell availability parameter Ta(i) of the respective gate. In this embodiment, the fixability parameter Tx(i) is expressed as
Tx(i)=(Tf(i)+Ts(i))Th(i)Ta(i),∀giεG
The spare-cell availability parameter Ta(i) is associated with a number of the spare cells in the vicinity of the respective gate. The timing ECO cannot be implemented when there is no spare cell adjacent to the extracted gate. Therefore, the spare-cell availability parameter Ta(i) is a scale factor corresponding to an impact of available spare cells. In this embodiment, the spare-cell availability parameter Ta(i) is defined as
wherein α is a user-specified parameter ε(0,1], and ns(i) is a number of the spare cells in a bounding box formed by fanout and fanin gates of the respective gate. When ns(i)=0, α is used to reduce the spare-cell availability parameter Ta(i) so as to reduce the fixability parameter Tx(i) accordingly. A circuit in
A third preferred embodiment of the method according to the present invention is similar to the second preferred embodiment, and differs in the configurations that, in step S04, a number of the gates to be extracted from the gates on the violating path segment is further considered. That is, the extraction process is extended from one gate per segment to multiple gates per segment.
Specifically, step S04 includes the following sub-steps.
A first sub-step is to sort the gates on the violating path segment in a decreasing order of the fixability parameters. The sorted gates are denoted as c1, c2, . . . .
A second sub-step is to calculate a slack improvement bound parameter Tm(j) of each of the gates on the violating path segment, wherein the slack improvement bound parameter Tm(j) is a maximum improvement on slack available for the respective gate by disposing any one of the spare cell candidates sk on the violating path segment. Assuming that the edge slack of the violating path segment is Si, the slack improvement bound parameter Tm(j) is represented as
A third sub-step is to calculate a number n of the gates on the violating path segment to be extracted for serving as the at least one extracted gate, wherein the number n thus calculated is a minimum required number of the slack improvement bound parameters Tm(j), such that a sum of the slack improvement bound parameters Tm(j) in the order the gates are sorted is not smaller than the edge slack Si of the violating path segment. The number n may be obtained by the following relationship:
A fourth sub-step is to extract at least one gate from the gates on the violating path segment, according to the fixability parameter Tx(i) and the number n calculated in the third sub-step, to serve as the at least one extracted gate. A violating path segment P(1,4) in
A fourth preferred embodiment of the method according to the present invention is similar to the first preferred embodiment, and differs in the configurations that the fixability parameter Tx(i) is only associated with the smoothness parameter Ts(i), that is
Tx(i)=Ts(i),∀giεG.
Relevant experimental data resulting from experiments using the first to fourth preferred embodiments of the method of implementing timing ECO are illustrated hereinafter, and are compared with two conventional techniques. One of the conventional techniques is the aforementioned prior art DCP, the other one is a technique disclosed in an article “Redundant-wires-aware ECO timing and mask-cost optimization” published in Proc. IEEE/ACM Int'l Conf. on Computer-Aided Design (ICCAD), and is referred to as prior art ILP hereinafter. The experiments were conducted with five industrial benchmark circuits which were also adopted in the prior arts DCP and ILP. The statistics of these circuits are summarized in Table I, and include the benchmark name (Circuit names), the number of gates in each design (Gate count), the number of available spare cells (#Spare cell), the clock period (Cycle), the number of timing violating paths (#Violating paths), the maximum number of gates on one timing violating path (#Max path), the total number of gates passed by the timing violating paths (#Gate passed), the worst negative slack (WNS), and the total negative slack (TNS).
Table II provides the comparison among the prior arts DCP and ILP and the third preferred embodiment of the present invention on total negative slack (TNS) and run time. Table II also lists the number of resulting spare cells (#Resulting spares), the number of used buffers (#Used buf.), the number of released buffers/inverters (#Released buf/inv), the number of iterations (#Ite.), the maximum number of timing violating paths shared (Max #path shared), and the number of violating path segments (#Path segments). It is evident from Table II that the third preferred embodiment is capable of making the total negative slack of each of the five circuits equal to zero, and has efficiency 122.7 and 229.14 times higher than the prior arts DCP and ILP, respectively. It is noted that, compared to the number of gates passed by all timing violating paths (#Gate passed) in Table I, the number of violating path segments (#Path segments) in Table II is much smaller. Therefore, the third preferred embodiment of the present invention is capable of significantly reducing complexity in the timing ECO, so as to promote the efficiency for implementing the timing ECO. Moreover, the number of iterations (#Ite.) in Table II is small, implying that the present invention may fix timing violations of a circuit effectively.
Table III summarizes the comparison of run time, the number of resulting spare cells (#Resulting spares), the number of used buffers (#Used buf.), the number of released buffers/inverters (#Released buf/inv), and the number of iterations (#Ite.) among the first, second and third preferred embodiments. It is evident from Table III that the second preferred embodiment consumes fewer buffers than the first preferred embodiment, and the third preferred embodiment consumes even zero buffers, such that the third preferred embodiment has the best cost efficiency. Moreover, the number of iterations (#Ite.) of the third preferred embodiment is fewer than those of the first and second preferred embodiments, and run time of the third preferred embodiment is 3.32 and 4.09 times faster than those of the first and second preferred embodiments, implying that the third preferred embodiment has the best efficiency.
Table IV illustrates the experimental data associated with the fourth preferred embodiment. Compared with Table II, while the fourth preferred embodiment is also capable of making the total negative slack of each of the five circuits equal to zero, the number of iterations of the fourth preferred embodiment is slightly more than that of the third preferred embodiment.
To sum up, the method of implementing timing ECO according to the present invention, by virtue of the fixability parameter which is associated with the smoothness parameter, is capable of precisely determining timing criticality of a gate. Moreover, by virtue of the spare-cell availability parameter and the number of the gates on the violating path segment to be extracted for serving as the extracted gates, the present invention may implement the timing ECO with fewer spare cells. Last but not the least, by decomposing the timing violating path into at least one violating path segment, the present invention may promote efficiency for the timing ECO.
While the present invention has been described in connection with what are considered the most practical embodiments, it is understood that this invention is not limited to the disclosed embodiments but is intended to cover various arrangements included within the spirit and scope of the broadest interpretation so as to encompass all such modifications and equivalent arrangements.
Number | Date | Country | Kind |
---|---|---|---|
101116185 A | May 2012 | TW | national |
101128031 A | Aug 2012 | TW | national |
Number | Name | Date | Kind |
---|---|---|---|
7072815 | Chaudhary et al. | Jul 2006 | B1 |
7137093 | Harn | Nov 2006 | B2 |
7707530 | Alpert et al. | Apr 2010 | B2 |
20030005398 | Cho et al. | Jan 2003 | A1 |
20080052652 | Walian et al. | Feb 2008 | A1 |
Entry |
---|
Kuan-Hsien Ho, Jie-Hong R. Jiang, and Yao-Wen Chang, TRECO: Dynamic Technology Remapping for Timing Engineering Change Orders, Proceedings of the 2010 Asia and South Pacific Design Automation Conference (2010), pp. 331-336 (6 pgs.). |
Yen-Pin Chen, Jia-Wei Fang, and Yao-Wen Chang, ECO Timing Optimization Using Spare Cells, IEEE/ACM International Conference on Computer-Aided Design, 2007, ICCAD 2007. (6 pgs.). |
Hua-Yu Chang, Iris-Hui-Ru Jiang, and Yao-Wen Chang, Timing ECO Optimization via Bézier Curve Smoothing and Fixability Identification, Proceedings of the International Conference on Computer-Aided Design, 2011, pp. 742-746 (5 pgs.). |
Shao-Yun Fang, Tzuo-Fan Chien and Yao-Wen Chang, Redundant-Wires-Aware ECO Timing and Mask Cost Optimization, Proceedings of the International Conference on Computer-Aided Design, 2010, pp. 381-336 (6 pgs.). |
Number | Date | Country | |
---|---|---|---|
20130298097 A1 | Nov 2013 | US |