1. Field of the Invention
The present invention generally relates to the fabrication and design of semiconductor chips and integrated circuits, and more specifically to routing tools which predict wire congestion.
2. Description of the Related Art
Integrated circuits are used for a wide variety of electronic applications, from simple devices such as wristwatches, to the most complex computer systems. A microelectronic integrated circuit (IC) chip can generally be thought of as a collection of logic cells with electrical interconnections between the cells, formed on a semiconductor substrate (e.g., silicon). An IC may include a very large number of cells and require complicated connections between the cells. A cell is a group of one or more circuit elements such as transistors, capacitors, resistors, inductors, and other basic circuit elements grouped to perform a logic function. Cell types include, for example, core cells, scan cells and input/output (I/O) cells. Each of the cells of an IC may have one or more pins, each of which in turn may be connected to one or more other pins of the IC by wires. The wires connecting the pins of the IC are also formed on the surface of the chip. For more complex designs, there are typically at least four distinct layers of conducting media available for routing, such as a polysilicon layer and three metal layers (metal-1, metal-2, and metal-3). The polysilicon layer, metal-1, metal-2, and metal-3 are all used for vertical and/or horizontal routing.
An IC chip is fabricated by first conceiving the logical circuit description, and then converting that logical description into a physical description, or geometric layout. This process is usually carried out using a “netlist,” which is a record of all of the nets, or interconnections, between the cell pins. A layout typically consists of a set of planar geometric shapes in several layers. The layout is then checked to ensure that it meets all of the design requirements, particularly timing requirements. The result is a set of design files known as an intermediate form that describes the layout. The design files are then converted into pattern generator files that are used to produce patterns called masks by an optical or electron beam pattern generator. During fabrication, these masks are used to pattern a silicon wafer using a sequence of photolithographic steps. The component formation requires very exacting details about geometric patterns and separation between them. The process of converting the specifications of an electrical circuit into a layout is called the physical design.
Cell placement in semiconductor fabrication involves a determination of where particular cells should optimally (or near-optimally) be located on the surface of an integrated circuit device. Due to the large number of components and the details required by the fabrication process for very large scale integrated (VLSI) devices, physical design is not practical without the aid of computers. As a result, most phases of physical design extensively use computer-aided design (CAD) tools, and many phases have already been partially or fully automated. Automation of the physical design process has increased the level of integration, reduced turnaround time and enhanced chip performance. Several different programming languages have been created for electronic design automation (EDA), including Verilog, VHDL and TDML.
Physical synthesis is prominent in the automated design of integrated circuits such as high performance processors and application specific integrated circuits (ASICs). Physical synthesis is the process of concurrently optimizing placement, timing, power consumption, crosstalk effects and the like in an integrated circuit design. This comprehensive approach helps to eliminate iterations between circuit analysis and place-and-route. Physical synthesis has the ability to repower gates, insert buffers, clone gates, etc., so the area of logic in the design remains fluid. However, physical synthesis can take days to complete.
Routability is a key factor when performing floorplanning or trying to close on timing via physical synthesis. A designer can expend considerable effort trying to get the design into a good state in terms of timing and signal integrity, only to subsequently find that it is unroutable. Ideally, the designer should be able to invoke a snapshot routability analysis that allows him or her to understand the routability issues involved from making floorplanning or optimization decisions.
During physical synthesis, wire congestion may be examined as part of the routing process. Circuit designers have devised various routing tools to provide reliable congestion information when designing the circuit, including empirical models, global routers, and probabilistic analysis. Among these, probabilistic routing congestion analysis is particularly efficient, since it avoids actually performing global routing. Instead, for a given placement, it examines the set of nets in the design and uses probability theory to compute the expected congestion for each routing tile.
In one probabilistic analysis algorithm, all possible pin-to-pin routes within the bounding box of the pins are considered, and each route is assigned an equal usage probability. This approach invariably produces biased congestion towards the middle of the bounding box instead of the periphery. Since routers usually try to minimize the insertion of vias, the periphery of the bounding box actually has more congestion than the interior, so this approach can lead to unsatisfactory results.
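By way of illustration only, the bias of the equal-probability model can be seen by counting routes at bucket granularity. The following Python sketch assumes that only shortest (monotone staircase) routes between opposite corners of the bounding box are considered; the function name and the bucket-level simplification are assumptions made for this example and are not taken from any particular prior-art tool.

```python
from fractions import Fraction
from math import comb


def equal_probability_usage(rows, cols):
    """Return, for each bucket of a rows x cols bounding box, the fraction of
    shortest routes from bucket (0, 0) to bucket (rows-1, cols-1) that pass
    through it, with every route taken to be equally likely (an assumption)."""
    total = comb(rows + cols - 2, rows - 1)  # number of monotone routes
    usage = []
    for r in range(rows):
        row = []
        for c in range(cols):
            # Routes from the start bucket to (r, c) ...
            before = comb(r + c, r)
            # ... times routes from (r, c) to the end bucket.
            after = comb((rows - 1 - r) + (cols - 1 - c), rows - 1 - r)
            row.append(Fraction(before * after, total))
        usage.append(row)
    return usage


usage = equal_probability_usage(3, 3)
print(usage[1][1], usage[0][2])
```

For a 3×3 bounding box under these assumptions, the center bucket is crossed by two thirds of the routes, whereas each of the two corner buckets that contain no pin is crossed by only one sixth.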
In another approach, the probabilistic analysis depends on the different types (shapes) of the routes. Every net is classified into one of four different categories: short nets, flat nets, L-shaped nets, and Z-shaped nets. These types of nets are illustrated in
While this approach leads to better probabilistic routing usage along the boundary of a net's bounding box, it still does not adequately address the problem of wiring blockages. Before global routing occurs, several requirements may stake claim to wiring resources which then become fixed for global routing. These requirements include local wiring on the bottom layers for the internal pin connections of a gate, power grids on multiple layers, pre-routed clock wires, planned buses or datapaths, and hierarchical logic, memory, or proprietary (IP) blocks. Those features may already have been completely routed; even if not, their routes may be hidden from the top-level routing congestion map. The corresponding bins are unlikely to block 100% of the routing resources since generally there will be some routing resources allocated on the top layers.
Wiring blockage for a given bin can be complete or partial. Complete blockages can be handled by simply omitting the bin from the possible paths. In practice, however, blockages with absolutely no available tracks are rarely seen, and previous approaches fail to realistically model that essentially every tile of a routing congestion map is neither completely empty nor completely full. There is almost always some amount of wiring blockage that a global router will take into account, yet probabilistic routers do not take this reality into account. In conventional probabilistic analysis, if there are partial blockages in some bins, the usage of these bins is not changed at all. Rather, a simple model is applied in which the number of blocked tracks is subtracted from the capacity of the bin. A global router, however, is more likely to route a net in a lower congestion region than in a higher one.
In light of the foregoing, it would be desirable to devise an improved probabilistic method of predicting wire congestion which provides a practical approach to handling partial wiring blockages. It would be further advantageous if the method could improve the complexity of the probabilistic analysis while still maintaining quality routing solutions.
It is therefore one object of the present invention to provide an improved method of predicting wiring congestion when modeling routes of a net of an integrated circuit design.
It is another object of the present invention to provide such a method which takes into consideration partial wiring blockages in the net.
It is yet another object of the present invention to provide fast and accurate routing congestion estimation using a probabilistic metric that is more efficient than a global router but still achieves comparable results.
The foregoing objects are achieved in a method of estimating routing congestion between pins in a net of an integrated circuit design, by establishing one or more potential routes between the pins which pass through buckets in the net (each bucket having a set of wiring tracks), assigning a probabilistic usage to each bucket based on any partial blockage of the wiring tracks in each bucket, and computing routing congestion for each bucket using its probabilistic usage. When the net is a two-pin net that is a part of a larger multi-pin net, a tree is constructed to bridge the two-pin net to another pin of the multi-pin net. The routing congestion for each bucket is computed as a ratio of the bucket usage to bucket capacity. For L-shaped routes (having at least one bend in a bucket), the probabilistic usage is proportional to a scale factor α which is a ratio of a minimum number of available wiring tracks for a given route to a sum of minimum numbers of available wiring tracks for all possible routes. For Z-shaped routes (having at least two bends in two respective buckets), the probabilistic usage is equal to a ratio of a minimum capacity of a given route to a sum of minimum capacities of all routes having an associated orientation with the given route. In particular, the minimum capacity F(n) of the given route n is
F(n)=min{F(u1)·F(dn)/FR(d1), . . . , F(un)·F(dn)/FR(dn), F(dn), F(en)·F(dn)/FL(dn), . . . , F(eQ)·F(dn)/FL(dQ)},
where Q is the number of potential routes, d is one of a plurality of central span portions of the potential routes, u is one of a first plurality of edge portions of the potential routes that lie on a first side of the central span portions, e is one of a second plurality of edge portions of the potential routes that lie on a second side of the central span portions, F(un) is the capacity associated with edge un, F(dn) is the capacity associated with edge dn, F(en) is the capacity associated with edge en, FL(dn) is the total capacities of all central spans d to the left of span dn having an associated orientation with the given route, and FR(dn) is the total capacities of all central spans d to the right of span dn having the associated orientation with the given route. Assignment of the usage values may entail the creation of a temporary usage map of the net buckets with an initial value of zero usage in every temporary usage map bucket, thereafter storing usage values in corresponding buckets of the temporary usage map, and deriving a final usage map from the temporary usage map.
The above as well as additional objectives, features, and advantages of the present invention will become apparent in the following detailed written description.
The present invention may be better understood, and its numerous objects, features, and advantages made apparent to those skilled in the art by referencing the accompanying drawings.
The use of the same reference symbols in different drawings indicates similar or identical items.
With reference now to the figures, and in particular with reference to
CPU 12, ROM 14 and DRAM 16 are also coupled to a peripheral component interconnect (PCI) local bus 20 using a PCI host bridge 22. PCI host bridge 22 provides a low latency path through which processor 12 may access PCI devices mapped anywhere within bus memory or I/O address spaces. PCI host bridge 22 also provides a high bandwidth path to allow the PCI devices to access DRAM 16. Attached to PCI local bus 20 are a local area network (LAN) adapter 24, a small computer system interface (SCSI) adapter 26, an expansion bus bridge 28, an audio adapter 30, and a graphics adapter 32. LAN adapter 24 may be used to connect computer system 10 to an external computer network 34, such as the Internet. A small computer system interface (SCSI) adapter 26 is used to control high-speed SCSI disk drive 36. Disk drive 36 stores the program instructions and data in a more permanent state, including the program which embodies the present invention as explained further below. Expansion bus bridge 28 is used to couple an industry standard architecture (ISA) expansion bus 38 to PCI local bus 20. As shown, several user input devices are connected to ISA bus 38, including a keyboard 40, a microphone 42, and a graphical pointing device (mouse) 44. Other devices may also be attached to ISA bus 38, such as a CD-ROM drive 46. Audio adapter 30 controls audio output to a speaker 48, and graphics adapter 32 controls visual output to a display monitor 50, to allow the user to carry out the integrated circuit design as taught herein.
While the illustrative implementation provides the program instructions embodying the present invention on disk drive 36, those skilled in the art will appreciate that the invention can be embodied in a program product utilizing other computer-readable media, including transmission media. The program instructions may be written in the C++ programming language for an AIX environment.
Computer system 10 carries out program instructions for an interconnect optimization process which predicts wire congestion using a novel method that includes consideration of partial wiring blockages. Accordingly, a program embodying the invention may include conventional aspects of various placement and timing tools, and these details will become apparent to those skilled in the art upon reference to this disclosure.
In one embodiment, computer system 10 divides the layout of the integrated circuit (IC) chip into rows and columns of buckets (tiles or bins). As illustrated in
Multi-pin nets are broken up into sets of two-pin nets by constructing either a minimum spanning tree (MST) or rectilinear Steiner tree (RST) to bridge the pins. The results can reflect pin-to-pin, pin-to-Steiner, or Steiner-to-Steiner congestion.
Uv(i,j)=ta/H,
Uv(i+k,j)=bb/H, and
Uv(i+m,j)=1, for 0<m<k.
The wire traveling through the bucket will occupy a full vertical track.
The short horizontal route between a and b in the vertical flat net 66 could occur in any of the buckets. In the illustrative implementation, it is deemed more likely to occur in a bucket with less partial routing blockage. In particular, the horizontal usage Uh of each bucket may be made proportional to the available routing capacity
Uh(i+m,j)=[|xa−xb|·Ah(i+m,j)]/(W·S), for 0≦m≦k,
where S=Σ_{m=0}^{k} Ah(i+m,j). If one assumes equal bucket capacities, this formula reduces to
Uh(i+m,j)=|xa−xb|/(W·hf), for 0≦m≦k.
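The flat-net formulas above may be sketched in Python as follows. This is a minimal illustration only: the function name, the argument names, and the list-based representation of the column of buckets are assumptions, and ta, bb, |xa−xb|, W and H are simply taken as inputs with the meanings used above.

```python
def vertical_flat_net_usage(avail_h, dx, ta, bb, W, H):
    """Usage of the buckets (i+m, j), m = 0..k, spanned by a vertical flat net.

    avail_h -- available horizontal tracks Ah(i+m, j) for m = 0..k
    dx      -- horizontal pin offset |xa - xb|
    ta, bb  -- vertical wire lengths inside the two end buckets
    W, H    -- bucket width and height
    Returns (Uv, Uh) as two lists indexed by m.
    """
    k = len(avail_h) - 1
    if k < 1:
        raise ValueError("a flat net spans at least two buckets")
    total = sum(avail_h)  # S in the formula above
    if total == 0:
        raise ValueError("no horizontal track is available in any bucket")

    # Vertical usage: partial in the two end buckets, one full track between.
    uv = [1.0] * (k + 1)
    uv[0] = ta / H
    uv[k] = bb / H

    # Horizontal usage: the short jog is spread in proportion to free tracks.
    uh = [dx * a / (W * total) for a in avail_h]
    return uv, uh


uv, uh = vertical_flat_net_usage([10, 0, 10], dx=2, ta=3, bb=1, W=4, H=4)
print(uv, uh)
```

In this sketch a bucket whose horizontal tracks are all blocked receives no horizontal usage, and the horizontal usages always sum to |xa−xb|/W regardless of how the blockage is distributed.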
L-shaped nets with wf>1 and hf>1 have at least one bend. An L-shaped route generally has two possible configurations 68a and 68b as shown in
ShA=min_{0≦n≦l} Ah(i,j+n), and
SvA=min_{0≦m≦k} Av(i+m,j+l).
The number of available tracks ShB and SvB at the horizontal and vertical directions for L-shaped route B are derived similarly. The scale factor α is then defined as
α=min(ShA, SvA)/[min(ShA, SvA)+min(ShB, SvB)].
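As a minimal sketch of the scale factor α, the following Python function takes the available tracks along the horizontal and vertical legs of routes A and B as plain lists. The function and argument names are hypothetical, and the handling of the case in which both routes are fully blocked (raising an error) is an assumption of this example.

```python
from fractions import Fraction


def l_shape_scale_factor(ah_a, av_a, ah_b, av_b):
    """Return alpha, the share of L-shaped usage assigned to route A.

    ah_a, av_a -- available tracks in the buckets on the horizontal and
                  vertical legs of route A (integers)
    ah_b, av_b -- the same for route B
    """
    s_a = min(min(ah_a), min(av_a))  # min(ShA, SvA): bottleneck of route A
    s_b = min(min(ah_b), min(av_b))  # min(ShB, SvB): bottleneck of route B
    if s_a + s_b == 0:
        raise ValueError("both L-shaped routes are completely blocked")
    return Fraction(s_a, s_a + s_b)


alpha = l_shape_scale_factor([10, 8], [6, 9], [4, 10], [10, 10])
print(alpha, 1 - alpha)
```

Route A then carries the fraction α of the L-shaped usage and route B the remaining 1−α, so the route with the tighter bottleneck receives proportionally less usage.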
The horizontal and vertical usage of each bucket on route A in
Uh(i,j)=ra/W
Uh(i,j+l)=lb/W
Uv(i,j+l)=ta/H
Uv(i+k,j+l)=bb/H
Uh(i,j+n)=1, for 0<n<l
Uv(i+m,j+l)=1, for 0<m<k.
L-shaped nets with wf=hf=2 are different from other L-shaped nets since no Z-shape is possible. However, if both wf and hf are greater than 2, there can be two different Z-shaped orientations, denoted horizontal and vertical, based on the orientation of the center span of the Z-shapes. A generalized vertical Z-shaped net 70 is illustrated in
F(un)=min(Ah(i,j+n−1), Ah(i,j+n)),
F(dn)=min_{0≦m≦k} Av(i+m,j+n), and
F(en)=min(Ah(i+k,j+n), Ah(i+k,j+n+1)).
Given Q possible routes denoted as R(n)={u1, . . . , un, dn, en, . . . , eQ} for n=1, . . . , Q, a vertical Z-shaped net routed by a global router must be one of shape R(n). The present invention utilizes the probability that a vertical Z-shaped net is routed with the shape R(n), denoted as P(n). The value for P(n) must satisfy two properties: 0≦P(n)≦1, for n=1, . . . , Q; and Σ_{n=1}^{Q} P(n)=1. If these probabilities are already known, then the usage of each bucket can be derived as follows.
For the leftmost bucket in the bottom row, the horizontal and the vertical cost are
Uh(i,j)=ra/W,
Uv(i,j)=0.
For the other buckets in the bottom row, the horizontal usage consists of two terms. The first term is for the case in which the vertical segment d(n) on route R(n) will start in this bucket. The horizontal usage in that case is 0.5 since the bend would on average be in the middle of the bucket. The other term is for the case in which d(n) is at the right of the bucket, and the usage is 1. A bucket has vertical usage only if the bend occurs in that bucket. The total vertical usage (ta/H) is spread over the candidate buckets. Therefore, the usages can be defined as
Uh(i,j+n)=P(n)/2+PR(n), and
Uv(i,j+n)=ta·P(n)/H, for 1≦n≦l,
where PR(n)=Σ_{m=n+1}^{Q} P(m).
For the top row, the derivation is similar. Horizontal bucket usage consists of two terms, one for the case wherein the bend occurs in that bucket, and one for the case wherein the bend occurs to its left. Vertical usage is spread over the buckets. The horizontal and vertical cost for these buckets are
The horizontal and vertical usage for bucket (i+m,j+n) in the center of the netbox are
Uv(i+m,j+n)=P(n), and
Uh(i+m,j+n)=0,
where 0<m<k, and 0≦n≦l.
The total “Z-usage” can be found using another scale factor β, which is a function of the total capacities of vertically and horizontally oriented Z-shapes, Sv and Sh. In the illustrative implementation, the horizontal and vertical Z-usages are scaled with β=Sh/(Sh+Sv) and 1−β=Sv/(Sh+Sv), respectively, as was done with scale factor α for the L-shapes, and then summed.
As mentioned in the Background section, the conventional approach is to assign each route an equal usage probability, i.e., P(n)=1/Q for every route, n=1, . . . , Q, and β=(wf−2)/(wf+hf−4). This assumption is valid only when there is no partial blockage. For example, in the routing graph 74 of
P(n)=F(n)/Sv,
where F(n) is the minimum capacity of route R(n) and Sv=Σ_{n=1}^{Q} F(n) is the total of the minimum capacities of all the vertical Z-shaped routes. The minimum capacity F(n) is related to the capacities of every edge on route R(n). However, for every edge um (for m=1, . . . , Q) its capacity is shared by Q−m+1 routes, and for every edge em (for m=1, . . . , Q) its capacity is shared by m routes. Therefore, for one specific route R(n), it is hard to know the exact share of the capacities of edges um and em that is contributed to R(n). If the capacities of all u and e edges are infinite, F(n) is equal to the capacity of the unique edge dn of each route. In other cases, for each route R(n), n=1, . . . , Q, the capacity of every edge um (for m=1, . . . , n) on route R(n) is redistributed according to the ratio between the capacity of unique edge dn of R(n) and the total capacities of all unique edges of routes sharing edge um. The capacity of every edge em (for m=n, . . . , Q) is redistributed in a similar way. After deriving the new capacity of every edge on route R(n), the minimum capacity F(n) can be easily computed.
The total vertical capacities of all edges d to the left and right of edge dn, including edge dn itself, are denoted as FL(dn) and FR(dn) for n=1, . . . , Q, and are defined as
FL(dn)=Σ_{m=1}^{n} F(dm), and
FR(dn)=Σ_{m=n}^{Q} F(dm).
The minimum capacity F(n) of each route R(n) (for n=1, . . . , Q) can be computed as
F(n)=min{F(u1)·F(dn)/FR(d1), . . . , F(un)·F(dn)/FR(dn), F(dn), F(en)·F(dn)/FL(dn), . . . , F(eQ)·F(dn)/FL(dQ)},
which can be rewritten as
F(n)=F(dn)·min{Ku(n), 1, Ke(n)},
where
Ku(n)=min{F(u1)/FR(d1), . . . , F(un)/FR(dn)}, and
Ke(n)=min{F(en)/FL(dn), . . . , F(eQ)/FL(dQ)}.
Similar analysis can be done for horizontal Z-shaped routes, and Sh and β can be derived accordingly. When there are no partial blockages and the capacities of all buckets are the same, this model degenerates into the aforementioned conventional approach for P(n).
Returning to
F(1)=20·min(20/45, 1, 1, 20/25, 20/45)=400/45,
F(2)=5·min(20/45, 20/25, 1, 20/25, 20/45)=100/45, and
F(3)=400/45.
These capacities result in usage probabilities of P(1)=4/9, P(2)=1/9 and P(3)=4/9. Compared to the prior art approach of assigning equal probabilities to each route, the probability of path R(2) is reduced by 2/9, which is evenly distributed to R(1) and R(3). If 20 two-pin nets are routed in this example where all lower left pins are in the same bucket, and all upper right pins are in the same bucket, then the vertical probabilistic congestion of d1, d2 and d3 would be 4/9·20/20=4/9, 1/9·20/5=4/9, and 4/9·20/20=4/9, respectively. Comparable congestion of d1, d2 and d3 derived from the conventional method would be 1/3, 4/3 and 1/3, respectively. The present invention thus predicts that the congestion for all three routes is equal. This outcome is more likely to occur, since this solution leaves even space for each route and predicts that the router is less likely to route this Z-shaped net along the second route.
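The computation of F(n) and P(n) in this example can be checked with a short Python sketch. The function below is illustrative only: its name and list-based interface are assumptions, it requires integer capacities and strictly positive central-span capacities (completely blocked spans are assumed to have been omitted beforehand), and the edge capacities of 20 for every u and e edge are inferred from the arithmetic of the example rather than read from the figure. Running sums and running minima are used so that all values are obtained in time linear in Q.

```python
from fractions import Fraction


def z_route_probabilities(f_u, f_d, f_e):
    """Return (F, P) for Q vertical Z-shaped routes.

    f_u, f_d, f_e -- integer capacities F(u_n), F(d_n), F(e_n), n = 1..Q
    """
    q = len(f_d)
    if q == 0 or len(f_u) != q or len(f_e) != q:
        raise ValueError("capacity lists must be non-empty and of equal length")
    if min(f_d) <= 0:
        raise ValueError("central spans must have positive capacity")

    # FL(d_n): capacity of spans d_1..d_n; FR(d_n): capacity of spans d_n..d_Q.
    fl, running = [], 0
    for cap in f_d:
        running += cap
        fl.append(running)
    fr = [running - fl[n] + f_d[n] for n in range(q)]

    # Ku(n) = min over m <= n of F(u_m)/FR(d_m), as a running prefix minimum.
    ku, best = [], None
    for m in range(q):
        ratio = Fraction(f_u[m], fr[m])
        best = ratio if best is None else min(best, ratio)
        ku.append(best)

    # Ke(n) = min over m >= n of F(e_m)/FL(d_m), as a running suffix minimum.
    ke, best = [None] * q, None
    for m in range(q - 1, -1, -1):
        ratio = Fraction(f_e[m], fl[m])
        best = ratio if best is None else min(best, ratio)
        ke[m] = best

    # F(n) = F(d_n) * min{Ku(n), 1, Ke(n)} and P(n) = F(n) / Sv.
    f = [Fraction(f_d[n]) * min(ku[n], 1, ke[n]) for n in range(q)]
    s_v = sum(f)
    if s_v == 0:
        raise ValueError("all routes are completely blocked")
    return f, [x / s_v for x in f]


f, p = z_route_probabilities([20, 20, 20], [20, 5, 20], [20, 20, 20])
print(f, p)
```

With these inputs the sketch yields F(1)=F(3)=400/45 and F(2)=100/45, and hence P(1)=P(3)=4/9 and P(2)=1/9, in agreement with the values stated above. With equal capacities on every edge it returns P(n)=1/Q for every route.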
Another example is illustrated in
It is useful to decide what the relative probabilities of L-shapes versus Z-shapes should be, since the analysis must assume that a certain number are L-shaped and a certain number are Z-shaped. The probability of taking an L-route over a Z-route can be expressed as a parameter γ=#netsL/(#netsL+#netsZ). The value for γ can be chosen based on previous design experience, i.e., how many routes are optimally routed or fixed by the designer. Actual routing results can be examined to determine a reasonable percentage. The combined probabilistic usages are ULZ=γUL+(1−γ)UZ.
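The blending of L-shaped and Z-shaped usage can be sketched in Python as follows. The function name and the representation of a usage map as a list of rows are assumptions made for illustration; the value of γ used in the call is arbitrary and is not a recommended setting.

```python
def combine_usage(usage_l, usage_z, gamma):
    """Blend two usage maps bucket by bucket: ULZ = gamma*UL + (1 - gamma)*UZ.

    usage_l, usage_z -- usage maps of equal shape, given as lists of rows
    gamma            -- assumed fraction of nets routed as L-shapes, in [0, 1]
    """
    if not 0 <= gamma <= 1:
        raise ValueError("gamma must lie between 0 and 1")
    return [
        [gamma * u_l + (1 - gamma) * u_z for u_l, u_z in zip(row_l, row_z)]
        for row_l, row_z in zip(usage_l, usage_z)
    ]


print(combine_usage([[1.0, 0.0]], [[0.0, 1.0]], gamma=0.75))
```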
The following algorithm can be used to predict the congestion map of a given set N of nets in accordance with the foregoing:
It is easy to see that for short nets, flat nets and L-shaped nets the algorithm takes linear time with respect to the larger of the numbers of horizontal and vertical buckets that the net spans, max(wf, hf). For Z-shaped nets, it is also easy to prove that with dynamic programming it takes O(max(wf, hf)) time in the bounding box to compute all of the FL(dn), FR(dn), Ku(n), Ke(n), F(n), P(n), PL(n) and PR(n) values for both vertical and horizontal Z-shaped nets, e.g., FL(dn)=FL(dn−1)+F(dn) (the time-complexity value O indicates how the running time of the algorithm scales with the size of the input). It also takes O(max(wf, hf)) time to update the usage value of the top and bottom rows for the vertical Z-shaped net, and the left and right columns for the horizontal Z-shaped net. However, it still takes O(wf·hf) time to update the usage value for all other buckets, which results in the complexity O(#nets·#buckets) as proposed in the prior art. This complexity can, however, be improved. Using a vertical Z-shaped net as an example, the horizontal usages for all center buckets are zero. The vertical usage for every bucket on an edge dn is P(n), as given above. Instead of updating these buckets explicitly, for all buckets in one column, a temporary usage map may be created with an initial value of zero in every bucket. A positive value is then stored before bucket (i,j+n), e.g., P(n), and a negative value is stored after bucket (i+k−1, j+n), e.g., −P(n). After obtaining this temporary map, the usage of each bucket in the final map can be derived by scanning from the first bucket in the temporary map and summing up all usage values in the temporary map that occur before this bucket. Experimental results show that predicted congestion according to the present invention matches quite well with the real congestion seen by a global router.
The invention may be further understood with reference to the flow chart of
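The temporary usage map described above behaves like a per-column difference array. The following Python sketch is one possible realization under stated assumptions: the function name, the tuple layout of a span, and the choice to place the positive marker at the first covered row and the negative marker one row past the last covered row are conventions of this example, not details taken from the description.

```python
def apply_column_spans(n_rows, n_cols, spans):
    """Accumulate vertical usage for many column spans with O(1) work per span.

    spans -- iterable of (col, first_row, last_row, value); each span adds
             `value` to every bucket in rows first_row..last_row of `col`
    Returns the final usage map as a list of rows.
    """
    # Temporary map, zero everywhere; one extra row holds end markers that
    # fall just past the last row of the grid.
    temp = [[0.0] * n_cols for _ in range(n_rows + 1)]
    for col, first_row, last_row, value in spans:
        temp[first_row][col] += value     # usage starts here
        temp[last_row + 1][col] -= value  # usage stops after last_row

    # One scan per column: the running sum of the markers is the usage.
    final = [[0.0] * n_cols for _ in range(n_rows)]
    for col in range(n_cols):
        running = 0.0
        for row in range(n_rows):
            running += temp[row][col]
            final[row][col] = running
    return final


usage = apply_column_spans(6, 3, [(1, 1, 3, 0.5), (1, 2, 4, 0.25)])
print([row[1] for row in usage])
```

Each central span of each net thus costs two updates, and the full grid is scanned only once after all nets have been processed, rather than once per net.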
Although the invention has been described with reference to specific embodiments, this description is not meant to be construed in a limiting sense. Various modifications of the disclosed embodiments, as well as alternative embodiments of the invention, will become apparent to persons skilled in the art upon reference to the description of the invention. For example, while the invention has been described in the context of particular shaped nets, it could be implemented for other shapes as well. It is therefore contemplated that such modifications can be made without departing from the spirit or scope of the present invention as defined in the appended claims.
Number | Date | Country | |
---|---|---|---|
20060156266 A1 | Jul 2006 | US |