Timing optimization in presence of interconnect delays

Information

  • Patent Grant
  • 6553338
  • Patent Number
    6,553,338
  • Date Filed
    Tuesday, April 27, 1999
    25 years ago
  • Date Issued
    Tuesday, April 22, 2003
    21 years ago
Abstract
A strategy for optimal buffering in the case of an infinitely long wire buffered with an arbitrary number of equally spaced single-size buffers is presented. A simple but efficient technique is proposed using this to choose a buffer size and determine a good inter-buffering distance up front, thus enabling fast, efficient buffer insertion. The analysis also allows representing delays of long wires as a simple function of the length and buffer and wire widths. Based on this, a novel constant wire delay approach is proposed where the proposed wire delay model is used for fairly accurate prediction of wire delays early in the design process and these predictions are later met via buffer insertion and wire sizing.
Description




BACKGROUND OF THE INVENTION




1. Field of the Invention




The present invention is directed to digital logic design systems. More particularly, the invention is directed. to automated digital logic synthesis and placement systems.




2. Background of the Related Art




Prior art computer aided design (CAD) systems for the design of integrated circuits and the like assist in the design thereof by providing a user with a set of software tools running on a digital computer. In the prior art, the process of designing an integrated circuit on a typical CAD system was done in several discrete steps using different software tools.




First, a schematic diagram of the integrated circuit is entered interactively to produce a digital representation of the integrated circuit elements and their interconnections. This representation may initially be in a hardware description language such as Verilog and then translated into a register transfer level (RTL) description in terms of pre-designed functional blocks, such as memories and registers. This may take the form of a data structure called a net list.




Next, a logic compiler receives the net list and, using a component database, puts all of the information necessary for layout, verification and simulation into object files whose formats are optimized specifically for those functions.




Afterwards, a logic verifier checks the schematic for design errors, such as multiple outputs connected together, overloaded signal paths, etc., and generates error indications if any such design problems exist. In many cases, the IC designer improperly connected or improperly placed a physical item within one or more cells. In this case, these errors are flagged to the IC designer so that the layout cells may be fixed so that the layout cells perform their proper logical operation. Also, the verification process checks the hand-laid-out cells to determine if a plurality of design rules have been observed. Design rules are provided to integrated circuit designers to ensure that a part can be manufactured with greater yield. Most design rules include hundreds of parameters and, for example, include pitch between metal lines, spacing between diffusion regions in the substrate, sizes of conductive regions to ensure proper contacting without electrical short circuiting, minimum widths of conductive regions, pad sizes, and the like. If a design rules violation is identified, this violation is flagged to the IC designer so that the IC designer can properly correct the cells so that the cells are in accordance with the design rules.




Then, using a simulator the user of the CAD system prepares a list of vectors representing real input values to be applied to the simulation model of the integrated circuit. This representation is translated into a form which is best suited to simulation. This representation of the integrated circuit is then operated upon by the simulator which produces numerical outputs analogous to the response of a real circuit with the same inputs applied. By viewing the simulation results, the user may then determine if the represented circuit will perform correctly when it is constructed. If not, he or she may re-edit the schematic of the integrated circuit, re-compile and re-simulate. This process is performed iteratively until the user is satisfied that the design of the integrated circuit is correct.




Then, the human IC designer presents as input to a logic synthesis tool a cell library and a behavioral model. The behavioral circuit model is typically a file in memory which looks very similar to a computer program. The behavioral circuit model contains instructions which define logically the operation of the integrated circuit. The logic synthesis tool receives as input the instructions from the behavioral circuit model and the library cells from the library. The synthesis tool maps the instructions from the behavioral circuit model to one or more logic cells from the library to transform the behavioral circuit model to a gate schematic net list of interconnected cells. A gate schematic net list is a data base having interconnected logic cells which perform a logical function in accordance with the behavioral circuit model instructions. Once the gate schematic net list is formed, it is provided to a place and route tool.




The place and route tool is used to access the gate schematic net list and the library cells to position the cells of the gate schematic net list in a two-dimensional format within a surface area of an integrated circuit die perimeter. The output of the place and route step is a two-dimensional physical design file which indicates the layout interconnection and two-dimensional IC physical arrangements of all gates/cells within the gate schematic net list.




Interconnect delay is a major concern in deep sub-micron technologies for two reasons: as the technologies scale, the feature sizes are shrinking. The gate resistance and capacitance scales better with this shrinking than the interconnect resistance and capacitance. Moreover, although the local interconnect lengths are shrinking, lengths of global interconnect and long wires are increasing as chip sizes increase. As the interconnect delay is proportional to the square of the wire length, this has a significant impact on the over-all timing of the chip. Buffer insertion and wire sizing are the two main tools to manage interconnect delay. Among these, buffer insertion offers the possibility of reducing cross talk hazards due to signal regeneration (apart from any cross talk reduction obtained by offsetting inverters).




In the deep sub-micron technologies, the contribution of interconnect delay can constitute as much as 80% of the total delay. Interconnect delay is proportional to the square of the wire length. As a result, long wires are a major concern in designing high performance chips since the long wire lengths increase quadratically with technology scaling. Buffer insertion and wire sizing are the two main tools to manage interconnect delay.




Apart from the wire length, the interconnect delay is a function of various technology parameters, net topology, source and sink parameters etc. The exact formulation of the interconnect delay and an optimal buffering scheme to minimize it can be very complex. Since many of these parameters are dependent on specific net instances, a complex analysis/buffering strategy would have limited practical use when buffering a design with tens of thousands of long wires.




SUMMARY OF THE INVENTION




The present invention has been made with the above problems of the prior art in mind, and a first object of the present invention is to provide a wire delay model which allows accurate prediction of wire delays in the presence of long wires. The delay estimates account for the effect of buffering on the delay. The model is exact under some assumptions and is a good heuristic approximation for more general cases.




It is a further object of the present invention to provide a method which derives the optimal buffering distance and the optimal buffer size in long wires under some assumptions. These formulae are also good heuristic approximations for more general cases.




It is another object of the present invention to provide a method using delay models which allows placement algorithms to account for the effect of buffering of long wires on delay and capacitance without actually inserting buffers in the design.




It is a still further object of the present invention to provide a method in which design independence of the optimal buffering distance and the optimal buffer size can be used to efficiently buffer long wires. This strategy can also be applied in a placement-optimization flow to implement the delays and capacitances estimated by the proposed delay model during placement.




It is a still further object of the present invention to provide a delay model in which the wire delay is a function of the wire length and wire width and spacing. This formulation can be used to trade-off length versus wire width and/or wire spacing to achieve a target delay.




It is yet another object of the present invention to provide a delay model that can be used to estimate wire delays at any stages in a placement-optimization flow. This delay can then be held constant throughout the rest of the flow by varying wire width and spacing as the wire length changes.




It is a still further object of the present invention to provide a delay model that can be used to implement a desired wire delay by wire width, spacing or length changes to meet a target stage delay (delay of driving gate+wire delay). Given a fixed placement, this can be used to balance fast/slow gate delays in the circuit. This can increase the flexibility of gate sizing algorithms.




It is yet another object of the present invention to provide an algorithm which selects target wire delays in a design. The algorithm can be applied in conjunction with a gate sizing/gate delay selection algorithm to trade-off gate and wire delays and the associated area/routing congestion. These target delays are then met by varying the wire widths and spacing. If no combination of wire delays can satisfy the timing constraints, the circuit cannot meet timing for the given selection of gate sizes. This can be used as a timing checkpoint in a placement flow.




It is another object of the present invention to provide a buffering algorithm which combines optimal buffering distance results with previous works on buffer insertion using dynamic programming.




It is another object of the present invention to provide a method in which constant wire delay placement-optimization flow uses the above results and algorithms to hold wire delay constant.




It is another object of the present invention to provide a technique in which the delay of an optimally buffered wire can be modeled using just library RC parameters for any given wire size and spacing.




It is a still further object of the present invention to provide such a technique in which very large circuit designs can be buffered and sized/spaced efficiently and which allows modeling the effect of buffering without actually inserting buffers in the design.




It is a further object of the present invention to provide a technique in which an increase in wire delay can be made linear in length, as opposed to quadratic for an unoptimized wire, via buffering.




It is yet another object of the present invention to provide a technique in which an increase in wire delay can be made independent of its length, as opposed to the quadratic dependency of an unoptimized wire, via buffering and wire sizing/spacing.




The above objects are achieved according to an aspect of the present invention by providing a strategy for optimal buffering in the case of a long wire buffered with an arbitrary number of equally spaced single-size buffers. A simple but efficient technique is proposed using this to choose a buffer size and determine a good inter-buffering distance up front, thus enabling fast, efficient buffer insertion. The analysis also allows representing delays of long wires as a simple function of the length and buffer and wire widths. Based on this, a novel constant wire delay approach is proposed where the proposed wire delay model is used for fairly accurate prediction of wire delays early in the design process and these predictions are later met via buffer insertion and wire sizing.




The present invention provides a buffering strategy that is optimal in the case of an infinitely long wire buffered with an arbitrary number of equally spaced single-size buffers. In practice, this technique can be used to choose a buffer size and determine a good inter-buffering distance up front, thus enabling fast, efficient buffer insertion.




The delay due to long wires can be represented as a linear function of the length. Based on this, a novel constant wire delay approach can be developed where the proposed wire delay model is used for fairly accurate prediction of wire delays early in the design process and these predictions are later met via buffer insertion and wire sizing.




The present invention initially treats the above problem in the case of an infinitely long wire buffered with an arbitrary number of equally spaced single-size buffers. A simple but efficient technique is proposed using this to choose a buffer size and determine a good inter-buffering distance up front, thus enabling fast, efficient buffer insertion. The analysis presented here also allows representing delays of long wires as a simple function of the length and buffer and wire widths. Based on this, a novel constant wire delay approach is proposed where the proposed wire delay model is used for fairly accurate prediction of wire delays early in the design process and these predictions are later met via buffer insertion and wire sizing.











BRIEF DESCRIPTION OF THE DRAWINGS




These and other objects, features, and advantages of the present invention are better understood by reading the following detailed description of the preferred embodiment, taken in conjunction with the accompanying drawings, in which:





FIG. 1

is a schematic diagram of a gate and its load;





FIG. 2

is a schematic diagram of a gate driven by an input and driving a load;





FIG. 3

is a schematic diagram of a buffered 2-pin net; and





FIG. 4

is a schematic diagram of a buffered long wire.











DETAILED DESCRIPTION OF THE PRESENTLY PREFERRED EXEMPLARY EMBODIMENT




As is known in the art, the theory of logical effort represents the delay D


gate


through a simple logic gate


10


as shown in

FIG. 1

, driving a capacitive load


20


as:








D




gate


=τ(


g




gate




h




gate




+p




gate


)  (1)






where g


gate


is the logical effort of the gate, h


gate


is the electrical effort and p


gate


is the intrinsic delay of the gate. τ is a technology constant defined as the delay of an ideal inverter with no intrinsic delay, driving another ideal inverter.




The logical effort g


gate


represents the computational complexity of the gate and measures how much weaker it is in current drive compared to an ideal inverter with the same input capacitance. It compares the characteristic time constant (product of the output resistance and input capacitance) of a gate with that of an inverter. Logical effort is a function of the topology of the transistor interconnections but not of transistor sizes, and is given by:










g
gate

=



R
gate_min



C
gate_min




R
inv_min



C
inv_min







(
2
)













where gate_min refers to the minimum sized gate and inv_min to the minimum sized inverter.




The electrical effort h


gate


is the ratio of the load capacitance to input capacitance


30


and does depend on the transistor sizes. h


gate


is given by:










h
gate

=


C
load


C
in






(
3
)













where C


load


is the load capacitance


20


of the gate in the given network, and C


in


the input capacitance


30


.




The parasitic delay p


gate


occurs primarily due to source/drain diffusion capacitance C


diff













min


at the output of the minimum sized gate. It depends on the layout geometry, but is independent of the sizes:










p
gate

=



R
gate_min



C
diff_min




R
inv_min



C
inv_min







(
4
)













Now, consider a stage in the circuit with the source gate


40


and sink gate


50


connected by a 2-pin net


60


as shown in FIG.


2


. An estimate of the stage delay from the input pin of the source gate


40


to the input pin of the sink gate


50


can be obtained by combining the source gate delay (as given by the theory of logical effort as described above) with wire delay (as computed by the Elmore delay model as is known in the art). Then, if the net


60


is not buffered, the delay D


0


is given by:










D
0

=


τ






(



g
so



C
si


+



C
w


L


C
so


+

p
so


)


+


R
w



L


(




C
w


L

2

+

C
si


)








(
5
)













where L is the length of the net


60


, R


w


and C


w


are the per unit length interconnect resistance and capacitance, R


so


and C


so


are the source driver


40


resistance and input capacitance, C


si


is the sink


50


input capacitance, g


so


is the logical effort of the source


40


, p


so


is the parasitic delay of the source


40


and τ is the technology constant. Note that the delay due to the source driver


40


resistance charging the sink input


50


capacitance is included in the logical effort delay and is hence omitted from the Elmore delay terms.




In the case when the net has one buffer


70


inserted halfway between the source


40


and the sink


50


as shown in

FIG. 7

, the delay D


1


is given by adding the two stage delays from the source to the buffer and from the buffer to the sink:










D
1

=


τ
(



g
so



C
b


+




C
w


L

2


C
so


+

p
so


)

+




R
w


L

2



(




C
w


L

4

+

C
b


)


+

τ
(




C
si

+



C
w


L

2



C
b


+

p
b


)

+




R
w


L

2



(




C
w


L

4

+

C
si


)







(
6
)













where R


b


and C


b


are the buffer driver resistance and input capacitance and p


b


the intrinsic delay of the buffer


70


.




Similarly, in the case of a net


60


with k equally spaced same-size buffers


70


as shown in

FIG. 4

, the total stage delay from the source


40


to the sink


50


is given by the sum of k+1 stage delays:










D
k

=


τ
(



g
so


C

+




C
w


L

2


C
so


+

p
so


)

+


R
w




L
k



(




C
w



L
k


2

+

C
b


)



+




i
=
1


k
-
1








τ






(

1
+



C
w



L
k



C
b


+

p
b


)



+


R
w




L
k



(




C
w



L
k


2

+

C
b


)



+


R
w




L
k



(




C
w



L
k


2

+

C
si


)



+

τ






(

1
+



C
w



L
k



C
b


+

p
b


)



p
b







(
7
)







D
k

=


τ






(



g
so



C
b


+



C
w



L
k



C
so


+

p
so


)


+

k





τ






(

1
+



C
w



L
k



C
b


+

p
b


)


+



R
w



C
w



L
k


2

+


kR
w



C
b



L
k


+


R
w



C
si



L
k







(
8
)













where L


k


=L/(k+1) is the distance between buffers


70


.




Since the wire delay varies with the square of the wire length, buffer insertion can reduce the wire delay by dividing the delay. of one long wire segment in to two or more smaller segments. However, inserting a buffer


70


adds the buffer delay to the over-all stage delay. Thus, there is a trade-off between the number of buffers


70


inserted and the over-all stage delay. Let w


opt


be the ratio of the widths of an optimally sized buffer for a given stage and the minimum sized buffer and k


opt


be the optimal number of buffers


70


for a given stage. Clearly, k


opt


depends on the size of the buffer


70


being inserted since the gate delay of the buffer


70


and wire delays of the segments both depend on R


b


and C


b


.




Now, consider the above 2-pin net


60


with an arbitrary number of equally spaced same-sized buffers


70


, where the sink


50


and source


40


are sized such that the sink


50


input capacitance and the source


40


driving resistance are the same as the buffer


70


input capacitance and driving resistance. Let L, R


0


, C


0


, τ, p


b


, R


w


and C


w


be the parameters defined above. Also, let the number of buffers


70


inserted on the net be k, k=0, 1 . . . , k≧0. Let R


so


, C


so


, C


si


, g


so


and p


so


be defined as before. Further, it is given that R


b


=R


so


, C


b


=C


si


, g


so


=1.




From Equation (1),










D
k

=


τ






(



g
so



C
b


+



C
w



L
k



C
so


+

p
so


)


+

k





τ






(

1
+



C
w



L
k



C
b


+

p
b


)


+



R
w



C
w



L
k


2

+


kR
w



C
b



L
k


+


R
w



C
si



L
k







(
9
)













Rearranging,
















D
k

=



(

k
+
1

)






τ






(

1
+



C
w



L
k



C
b


+

p
b


)


+



R
w



C
w



L
k


2

+


(

k
+
1

)



R
w



C
b



L
k












D
k


=



(

k
+
1

)



τ


(

1
+



C
w



L
k



C
b


+

p
b


)



+



R
w



C
w



L
2



2


(

k
+
1

)



+


(



R
b



C
w


+


R
w



C
b



)


L

















D
k




k



=



τ


(

1
+

p
b


)


-




R
w



C
w



L
2



2



(

k
+
1

)

2








and









2



D
k





k
2





=




R
w



C
w



L
2




(

k
+
1

)

3


>
0









(
10
)













k


opt


, the number of buffers


70


for which the delay is minimized is then given by:













D
k




k


=


0


k
opt


=

L






R
w



C
w



2





τ






(

1
+

p
b


)



-
1








(
11
)













The optimal inter-buffer spacing is given by:










L
kopt

=



2





τ






(

1
+

p
b


)




R
w



C
w








(
12
)













The optimal delay is given by:










D
k

=

L


(



2


R
w



C
w


τ






(

1
+

p
b


)



+


τ






C
w



C
b



)






(
13
)













Since R


0


and C


0


are the driver resistance and input capacitance of a minimum size buffer, if w is the ratio of the widths of the buffer


70


used here and the minimum size buffer, R


b


=R


0


/w and C


b


=wC


0


. The delay then is:










D
k

=



L


(



2


R
w



C
w


τ






(

1
+

p
b


)



+


τ






C
w



wC
0


+


wR
w



C
0



)











D
k




w



=



R
w

-



τ






C
w




w
2



C
0








and















2



D
k





w
2





=



τ






C
w



w
3


>
0







(
14
)













The optimal size of the buffer


70


is then given by:














D
k




w


=
0

,


w
opt





τ






C
w




R
w



C
0
2









(
15
)













Since k


opt


is independent of w and w


opt


is independent of k, the above two-variable analysis is consistent. Thus, the delay from the input of the source


40


to the input of the sink


50


under the logical effort and Elmore delay model is minimized when the number of buffers


70


inserted is







k
opt

=

L





R
w



C
w



2


τ


(

1
+

p
b


)
















and each buffer


70


is







w
opt

=



τ






C
w




R
w



C
0
2














times larger than the minimum sized buffer.




Since the above formula for optimal delay applies to k


opt


≧0, i.e., L≧L


kopt


, the wire delay of a 2-pin net


60


with above buffering constraints can be modeled as (from (1), and by substituting Equation (15) in Equation (13):










D
k

=



τ


(

1
+



C
w


L



C
b

+

p
b




)


+


R
b



C
w


L

+


R
w



C
b


L

+




R
w



C
w



L
2


2






L


<

L
kopt






(
16
)







D
k

=


L




R
w



C
w





(



2


τ


(

1
+

p
b


)




+

2


C
0



τ



)






L



L
kopt






(
17
)













From the above derivation, one can see that the delay model in Equations (16) and (17). is exact for 2-pin nets


60


with an arbitrary number of equally spaced same-sized buffers


70


and with the sink


50


and source


40


sized to be the same as the buffer


70


input capacitance and driving resistance. Further, the delay model in Equations (16) and (17) is continuous and differentiable everywhere in L for 2-pin nets


60


with an arbitrary number of equally spaced same-sized buffers


70


and with the sink


50


and source


40


sized to be the same as the buffer


70


input capacitance and driving resistance. This can be seen as follows.




First, D


k


is clearly continuous and differentiable everywhere in L except at L


kopt


. For L=L


kopt


,












lim

L


L

kopt
-










D
k


=


τ


(



g
so









C
b

+


C
w


L



C
so



+

p
so


)


+



R
w



C
w



L
2


2

+


R
w



C
si


L















For






R
b


=

R
so


,


C
b

=

C
si


,


g
so

=
1






(
18
)












lim

L


L

kopt
-










D
k


=






τ


(

1
+



C
w


L


C
b


+

p
b


)


+



R
w



C
w



L
2


2

+


R
w



C
b


L








=





L


(



τ


(

1
+

p
b


)


L

+



R
w



C
w


L

2

+


τ






C
w



C
b


+


R
w



C
b



)



















lim

L


L

kopt
-










D
k



=





L


(



2


R
w



C
w



τ


(

1
+

p
b


)




+


τ






C
w



C
b


+


R
w



C
b



)








=







D
k



(

L
kopt

)


=


lim

L


L

kopt
+






D
k











(
19
)













D


k


is thus continuous at L=L


kopt


.




Also,











lim

L


L

kopt
-









D
k




L



=




τ






C
w



C
b


+


R
W



C
w


L

+


R
w



C
b



=





2


R
w



C
w



τ


(

1
+

p
b


)




+


τ






C
w



C
b


+


R
w



C
b










lim

L


L

kopt
-









D
k




L




=






D
k




L




(

L
kopt

)


=


lim

L


L

kopt
+









D
k




L










(
20
)













Therefore, D


k


is both continuous and differentiable everywhere in L. Even when R


b


≠R


so


, C


b


≠C


si


, or g


so


≠1, similar results can be derived under the assumption that the 2-pin net


60


is infinitely long.




Further, consider a 2-pin net


60


with the source


40


and sink


50


connected by an infinitely long wire


60


with an arbitrary number of equally spaced same-sized buffers


70


. Let L, R


0


, C


0


, τ, p


b


, R


w


and C


w


be the parameters defined as before. Then the delay from the input of the source


40


to the input of the sink


50


under the logical effort and Elmore delay model is minimized when the number of buffers


70


inserted is








k
opt

=

L





R
w



C
w



2


τ


(

1
+

p
b


)







,










and each buffer


70


is







w
opt

=



τ






C
w




R
w



C
0
2














times larger than the minimum sized buffer.




Let the number of buffers


70


inserted on the net be k, k=0, 1 . . . , k≧0. From (1),










D
k

=


τ


(



g
so









C
b

+


C
w



L
k




C
so



+

p
so


)


+

k






τ


(

1
+



C
k



L
k



C
b


+

p
b


)



+



R
w



C
w



LL
k


2

+


R
w



C
b



L
k


+


R
w



C
si



L
k







(
21
)













Rearranging,










D
k

=


k






τ


(

1
+

p
b


)



+


(



τ






C
w



C
b


+


R
w



C
b



)


L






k

k
+
1



+




R
w



C
w



L
2


2







1

k
+
1



+

τ


(



g
so








C
b


C
so



+

p
so


)


+


(



τ






C
w



C
so


+


R
w



C
si



)


L






k

k
+
1








(
22
)











D
k




k


=






τ


(

1
+

p
b


)


+


(



τ






C
w



C
b


+


R
w



C
b



)


L






k


(

k
+
1

)

2



-




R
w



C
w



L
2


2







1


(

k
+
1

)

2



-


(



τ






C
w



C
so


+


R
w



C
si



)


L






1


(

k
+
1

)

2













For









D
k




k



=




0

,





(
23
)






k
=






R
w



C
w



L
2


+

2


(



τ






C
w



C
so


+


R
w



C
si



)


L

-

2


(



τ






C
w



C
b


+


R
w



C
b



)


L



2


τ


(

1
+

p
b


)





-
1





(
24
)








lim

L









k

=


L





R
w



C
w



2


τ


(

1
+

p
b


)






-
1





(
25
)













Since k


→∞


as L


→∞


,












lim

L









D
k




k



=







τ


(

1
+

p
b


)


-




R
w



C
w



L
2



2



(

k
+
1

)

2








and







lim

L













2



D
k





k
2






=




R
w



C
w



L
2




(

k
+
1

)

3


>
0






(
26
)













k


opt


, the number of buffers


70


for which the delay is minimized is then given by:











lim

L










k
opt


=


L





R
w



C
w



2


τ


(

1
+

p
b


)






-
1





(
27
)













The optimal inter-buffer spacing is given by:











lim

L










L
kopt


=



2


τ


(

1
+

p
b


)





R
w



C
w








(
28
)













And the optimal delay is given by:










lim






D
k


=

L


(



2


R
w



C
w



τ


(

1
+

p
b


)




+


τ






C
w



C
b


+


R
w



C
b



)






(
29
)













Similar to the above, since R


0


and C


0


are the driver resistance and input capacitance of a minimum size buffer, if w is the ratio of the widths of the buffer


70


used here and the minimum size buffer, R


b


=R


0


/w and C


b


=wC


0


. The delay then is:






&AutoLeftMatch;









lim

L










D
k


=





L


(



2


R
w



C
w



τ


(

1
+

p
b


)




+


τ






C
w



wC
0


+


wR
w



C
0



)
















lim

L













D
k




w




=


-


τ






C
w




w
2



C
0




+


R
w



C
0






and




















lim


L













2



D
k





w
2




=


-






τ






C
w



2


w
3



C
0




>
0









(
30
)














The optimal size of the buffer


70


is then given by:











lim

L






w
opt


=



τ






C
w



C
0
2







(
31
)













The delay model proposed earlier can hence also be applied to nets with arbitrary sink and source under the restriction of infinite length.




From the above, one can see that Equations 16 and 17 are exact for 2-pin nets


60


with the source


40


and sink


50


connected by an infinitely long wire


60


with an arbitrary number of equally spaced same-sized buffers


60


. For finite length wires


60


, the estimated delay approaches the exact delay as the length approaches infinity. Also, the delay model in Equations 16 and 17 is continuous and differentiable everywhere in L for 2-pin nets with the source


40


and sink


50


connected by an infinitely long wire


60


with an arbitrary number of equally spaced same-sized buffers


70


. Finally, the above can also be demonstrated using the traditional linear delay model (which models the delay through a gate as a linear function of its load) in conjunction with the Elmore delay model.




The delay model proposed in Equations (16) and (17) allows accurate prediction of wire delays in presence of long wires


60


. The delay estimates account for the effect of buffering on the delay. The model is exact under some assumptions and is a good heuristic approximation for the more general cases. The delay model framework also provides results on the optimal buffering scheme for such wires


60


. This enables a placement-optimization flow where the initial placement of cells can be performed without actually buffering long wires


60


. Using this delay model, a timing-driven placement program can account for the wire delays without actually buffering long wires


60


(via the predictive delay formula) and an area/capacitance-driven placement program can account for capacitance shielding due to long wires


60


(via the optimal buffering distance formula). The actual delay and capacitance estimates are met later by buffering long wires


60


using the optimal buffering distance formula. This is a very powerful application of this model, since the placement changes cell locations and hence the wire lengths. Thus, adding buffers


70


explicitly at every iteration of a placement program can bias the placement algorithm, while not inserting them would yield overly pessimistic delay/capacitance estimates if traditional wire delay models are used.




From the delay model in Equations (16) and (17), the delay of long wires


60


is a function of their length and the per unit length wire resistance. Let the resistance and capacitance per unit length of a minimum width/spacing wire be r


w


and c


w


respectively. Further, let the area and lateral capacitance per unit length be c


a


and c


l


(for typical deep sub-micron technologies, c


a


<<c


l


). Then, the resistance R


w


and capacitance C


w


per unit length of a wire with width w


w


and spacing s


w


are R


w


=r


w


/w


w


and C


w


=c


a


w


w


+c


l


/(w


w


s


w


) Thus,










RC
w

=




r
w



c
a


+



r
w



c
l




w
w



s
w








r
w



c
l




w
w



s
w








(
32
)













Substituting these Equations (16) and (17), the overall delay for long wires


60


is given by









D
=

L





r
w



c
a


+



r
w



c
l




w
w



s
w







(



2


τ


(

1
+

p
b


)




+

2


C
0



τ



)






(
33
)







D
k

=


L



w
w



s
w






(



2


τ


(

1
+

p
b


)




r
w



c
l



+

2


C
0




τ






r
w



c
l





)






(
34
)













From Equations (33) and (34), the wire delay can be held constant in presence of a 100% increase in wire length by doubling the width and the spacing of the wire. This observation can be used to predict and control the wire delay over a wide range of lengths during a placement-optimization flow.




From Equations (33) and (34) wire delay can be held constant in presence of length changes by changing the width and spacing of the wire. This can be used to assign wire sizes and widths and to compute delay budgets as follows: given a placement, the lengths of all wires are fixed. At this point, the wire delays are computed first using the wire length from the placement, minimum width and spacing and the library RC data. Given the cell delays and the computed wire delays, critical paths are identified. Wires on critical paths are than sized up/spaced out to reduce their delay and to bring the overall design slack to zero—this could be done by a simulated annealing algorithm as described in, e.g., Kirkpatrick et. al, “Optimization by Simulated Annealing,” Science, vol. 220, no. 4598, pp. 671-680, 1983, or some greedy assignment algorithm which sizes up wires with significant delays.




The maximum allowable wire widths/sizes may be limited by technology or manufacturing constraints. Given these size and spacing assignments, wire delays are computed and stored as budgets. These delays are held constant through the flow. Subsequent wire sizings and spacings can be done on each wire individually to continue to meet these budgets in spite of any length changes (due to placement changes) and as more information like neighborhood information (during routing) is made available.




Wire delay and capacitance estimates based on placement information (e.g. Manhattan distance between pins) can be significantly different from statistical wireload models. As a result, the gate and wire delays can change significantly once placement is performed. This may change the arrival and requires times of signals in the circuit and a netlist optimized with wireload models may no longer be the optimal netlist when placement based wire models are applied. This provides a potential of re-optimizing the circuit by selectively resynthesizing parts of the network. While there are many resynthesis techniques known in the literature and resynthesis can be applied at any time, the main problem with a generic application of resynthesis is that the placement is invalidated. We propose a restructuring approach which applies the resynthesis technique similar to the ones outlined in, e.g., Singh et al., ICCAD-88, pp. 282-285, with the difference that it is applied on a placed netlist. The timing analysis is performed using the wire delay models outlined herein. Furthermore, cell location history is preserved as Boolean transformations are performed and is used to place the resulting set of cells in the same location as the cells from which they were derived. Some of these Boolean transformations involve collapsing multiple cells and reconnecting them or re-implementing them with different cells. To ensure that no new long wires are created in this process and that placement information is not changed significantly, only cells in a local placement area are resynthesized at a time.




Given a design to be placed and routed, an overall process flow may go as follows:




calculate wire delay/buffering model parameters from library data;




placement;




update timing;




restructure circuit based on wire delays;




assign wire delays;




buffering and wire sizing/spacing;




incremental placement;




resizing/respacing wires to maintain delays; and




routing wires with desired sizing and spacing.




Preferably, the placement technique is performed as disclosed in the United States Patent Application to Savoj et al. filed on Apr. 27, 1999, U.S. patent application Ser. No. 09/300,666 incorporated herein by reference.




If there are other optimization steps, e.g., to optimize cell delays, the buffering, placement, wire sizing and spacing steps can be iterated. Preferably, this iteration includes iteration of assigned delays to account for timing changes due to the other optimization steps. Further, while the constant wire delay methodology does not perform iterations, in general one or more steps between updating timing and routing wires can be iterated. While more iterations may not always yield a better design and in fact may lead to convergence problems, one can come up with heuristic criteria for convergence to decide when to stop iterating.




Also, a crosstalk-sensitive routing program can vary size and spacing as neighboring wiring information becomes available to continue to meet the delay budgets set in the assign wire delays.




The above description of the preferred embodiment of the present invention has been given for purposes of illustration only, and the invention is not so limited. Modification and variations thereof will become readily apparent to those skilled in the art, and these too are within the scope of the invention. Thus, the present invention is limited only by the scope of the appended claims.



Claims
  • 1. A method of buffering a long wire comprising:using a computer to receive an electronic representation of a circuit including a long wire, the long wire having resistance and capacitance per unit length of Rw and Cw, respectively, and a length of L; using the computer to determine a number kopt of buffers, each buffer having an intrinsic delay of pb, to be used to buffer the long wire, to be kopt=L⁢Rw⁢Cw2⁢ ⁢τ⁡(1+pb)where τ is the intrinsic delay of an ideal inverter; and using the computer to modify the electronic representation based on kopt.
  • 2. A method of buffering a long wire comprising:using a computer to receive an electronic representation of a circuit including a long wire, the long wire having resistance and capacitance per unit length of Rw and Cw, respectively, and a length of L; using a computer to determine a size wopt of buffers to be used to buffer the long wire, to be wopt=τ⁢ ⁢CwRw⁢C02where τ is the intrinsic delay of an ideal inverter and C0 is the input capacitance of a minimum size buffer; and using the computer to modify the electronic representation based on wopt.
  • 3. A method of buffering a long wire comprising:using a computer to receive an electronic representation of a circuit including a long wire, the long wire having resistance and capacitance per unit length of Rw and Cw, respectively, and a length of L; using the computer to determine a spacing Lkopt between buffers to be used to buffer the long wire, each of the buffers having an intrinsic delay of pb, to be Lkopt=2⁢τ⁢(1+pb)Rw⁢Cwwhere τ is the intrinsic delay of an ideal inverter; and using the computer to modify the electronic representation based on Lkopt.
  • 4. A method of calculating delay in a long wire comprising:using a computer to receive an electronic representation of a circuit including a long wire, the long wire having resistance and capacitance per unit length of Rw and Cw, respectively, and a length of L; and using the computer to determine a delay Dk of the long wire when a length of the long wire is less than an optimal spacing of buffers in the long wire, to be Dk=τ⁢ ⁢(1+Cw⁢LCb+pb)+Rb⁢Cw⁢L+Rw⁢Cb⁢L+Rw⁢Cw⁢L22where Rb and Cb are resistance and capacitance, respectively, of buffers later used to buffer the long wire; pb an intrinsic delay of the buffers; and τ the intrinsic delay of an ideal inverter.
  • 5. The method of claim 4, wherein the-optimal spacing Lkopt is Lkopt=2⁢τ⁢(1+pb)Rw⁢Cw.
  • 6. A method of calculating delay in a long wire comprising:using a computer to receive al electronic representation of a circuit including a long wire, the long wire having resistance and capacitance per unit length of Rw and Cw, respectively, and a length of L; using the computer to determine a delay Dk of the long wire when a length of the long wire is greater than an optimal spacing of buffers in the long wire, to be Dk=L{square root over (RwCw)}({square root over (2τ(1+pb))}+2C0{square root over (τ)}) where pb is an intrinsic delay of the buffers and τ the intrinsic delay of an ideal inverter.
  • 7. The method of claim 6, wherein the optimal spacing Lkopt is Lkopt=2⁢τ⁢(1+pb)Rw⁢Cw.
US Referenced Citations (4)
Number Name Date Kind
5838580 Srivatsa Nov 1998 A
6002268 Sasaki et al. Dec 1999 A
6009253 Srivatsa et al. Dec 1999 A
6117182 Alpert et al. Sep 2000 A
Non-Patent Literature Citations (5)
Entry
Pullela et al. “Post processing of clock trees via wiresizing and buffering for robust design”, IEEE transactions on Computer aided design of Integrated circuits and systems Jun. 1996.*
Alpert et al., “Wire segmenting for improved buffer insertion”, Design automation Confeerence, Jun. 1997.*
Venkat, K., “Generalized Delay Optimization of Resistive Interconnections through an Extension of Logic Effort,” (Proc.) 1993 IEEE Intl Symposium on Circuits and Systems, Chicago, IL, 306 May 1993; pp. 2106-2109.
Dhar, S.; Franklin, M.A.; “Optimum Buffer Circuits for Driving Long Uniform Lines,” IEEE Journal of Solid-State Circuits, 1/91, vol. 26, No. 1, pp. 32-40.
Nekili, M.; Savaria, Y., “Optimal Methods of Driving Interconnections in VLSI Circuits,” 1992 IEEE Intl Symposium on Circuits and Systems, San Diego, CA, May 10-13 1992, pp. 21-24.