Particular embodiments generally relate to electronic design automation (EDA) tools and more specifically to clock tree synthesis. A clock tree distributes a clock signal from a source node to a set of sink nodes within an integrated circuit design. The clock tree may include a number of levels of clock tree repeaters that fan the clock signal out to different sink pins. The primary objective in clock tree design is to ensure that the clock signal arrives at all of the sink pins at the same time. The skew in a clock tree is the maximum difference in the arrival time of the clock signal at the sink pins. A clock tree synthesis (CTS) tool is used to generate a clock tree with good clock skew.
Particular embodiments generally relate to clock tree synthesis considering multiple timing variation parameters (corners and modes). In one embodiment, a method for building a clock tree for an integrated circuit design is provided. The clock tree may include a clock tree root node and a plurality of clock tree nodes that couple to sink pins for circuit elements of the integrated circuit design. The clock tree nodes may be arranged to distribute the clock signal to the sink pins. In synthesizing the clock tree, the sink pins may be clustered into one or more clusters. Clock tree nodes may be placed for the clock tree to distribute the clock signal to the one or more clusters. Timing information is determined to measure the clock signal delay from the root to the sink pins in the one or more clusters based on the placement of clock tree nodes. Different sets of timing information may be determined based on different sets of clock tree timing variation parameters.
A plurality of CTS metric values are measured for the one or more clusters. For example, the clock skew values are measured for different sets of timing information for the different sets of clock tree timing variation parameters. The clock tree is then optimized based on the clock skew values measured for the different sets of timing information. For example, the placement of the clock tree nodes or the sink pins included in the one or more clusters may be adjusted and new clock skew values are determined for different clock tree timing variation parameters. Particular embodiments balance whether clock skew is improved across the clock tree timing variation parameters. For example, the process makes sure that if clock skew is improved for one timing scenario clock skew is not significantly worsened for another timing scenario (a timing scenario includes a mode and corner). The clock tree may be adjusted to optimize the clock skew. This process continues as clock tree nodes are placed in the design to generate the clock tree.
A further understanding of the nature and the advantages of particular embodiments disclosed herein may be realized by reference of the remaining portions of the specification and the attached drawings.
CTS tool 102 may be found on a computing device 104, such as a personal computer, laptop computer, workstation, or other computing device. In one embodiment, CTS tool 102 may include software stored on a computer-readable storage media that may be read and executed by one or more processors of the computing device to perform clock tree synthesis.
CTS tool 102 receives a design, such as an integrated circuit (IC) design, and can perform clock tree synthesis for the design. Clock tree synthesis includes building a clock tree to distribute a clock signal to sink pins of devices in the IC design. In building the clock tree, CTS 102 may use timing information for different sets of clock tree timing variation parameters. The variation parameters may be different parameters for multiple process corners and/or multiple modes of operation. Using these parameters, different sets of timing information may be determined and used to build an optimal clock tree.
A corner may be conditions for voltage, temperature, or other manufacturing parameters. The corner may model process variations that may occur during manufacturing of the integrated circuit design. The corner may also model variations in operating environment for the circuit that manifests itself as different voltage and temperature conditions. In one example, a number of process corners may be provided, such as 9 different process corners. Depending on the corner, timing delays may differ.
A mode of operation may be different modes that the integrated circuit design may operate in. For example, each mode may operate differently and cause different timing information to be determined. For example, the modes may include a test mode, functional mode, stand-by mode, powered on mode, etc. These are different modes in which a client may cause the integrated circuit design to operate. For example, a computer that is using a chip including the IC design may be in a stand-by mode and the circuit operates in the stand-by mode. Depending on the mode, timing delays may differ.
CTS tool 102 may take into account different sets of clock tree timing variation parameters in determining the placement of clock tree nodes in a clock tree. In one embodiment, clock tree nodes may be buffers or inverters. Clock tree nodes may also be other logic elements that can be used to fan out a clock signal.
CTS tool 102 may place clock tree nodes for sink pins of devices to be clocked. For example, CTS tool 102 synthesizes a clock tree for delivering a clock signal to a number of clocked devices, such as registers, latches, flip-flops, etc., that are clocked by the same clock signal. Each of the clocked devices may include sink pins in which clock tree nodes are connected. A hierarchy of clock tree nodes may be provided to fan the clock signal out from a root node to the sink pins.
CTS tool 102 determines the placement and fan-out of the clock tree nodes during clock tree synthesis. In determining the placement and fan-out a CTS metric is optimized based on different sets of clock tree timing variation parameters. A CTS metric may be a metric that can be altered or varied when a clock tree is being synthesized based on timing information. For example, clock skew is discussed as being optimized. Also, other CTS metrics are also optimized, such as area, power, insertion delays, etc.
The different sets of variation parameters yield different timing information for the clock tree. In one example, when optimizing clock skew using one corner, how the clock skew is affected for other corners is also analyzed. Thus, if the clock tree is adjusted to improve skew for one corner, CTS tool 102 balances whether clock skew for another corner is significantly worsened. This is an iterative process in which balancing clock skew for multiple corners may be performed in synthesizing the clock tree. The timing information for all corners and modes is considered simultaneously or concurrently. For example, multiple iterative runs may not be run where one corner or mode is considered, and then another mode or corner is considered. Rather, timing information for all corners and modes are considered simultaneously. Accordingly, multi-corner process information and/or multi-mode process information allow synthesis of a clock tree that balances the clock tree synthesis over multi-corners or multi-modes.
A clock tree synthesis conventionally generated the clock tree using one corner or one mode. For example, the clock skew may be optimized based on conditions for one corner. Also, one mode of operation for the circuit may also be taken into account when optimizing the clock tree. Due to different variations in processing the integrated circuit, optimizing based on one corner may not be optimal if different conditions result during processing. Also, circuits are configured to operate in different modes and only taking into account one mode may not result in an optimal clock tree.
In step 304, CTS tool 102 places one or more clock tree nodes 204 to propagate a clock signal from root node 202 to pins 206. In one example, a bottom-up approach may be used where positions of clock tree nodes for lower levels of the hierarchy are determined first and then positions for clock tree nodes at higher levels are then determined until the root node is reached. Although a bottom-up approach is described, a top-down approach may also be used. In the top-down approach, CTS tool 102 may place higher levels of clock tree nodes first and then position lower levels thereafter.
In step 306, CTS tool 102 determines different sets of timing information for the different sets of clock tree timing variation parameters. For example, timing information from root node 202 to pins 206 is determined for multiple modes and/or multiple corners based on the placement of clock tree nodes 204.
In step 308, a CTS metric for pins 206 is determined for the different sets of timing information. For example, clock skew may be different depending on the corner that is used. Also, depending on the mode, different clock skew may result. Accordingly, CTS tool 102 determines clock skew based on multiple factors that may result in different timing information.
In step 310, CTS tool 102 optimizes the CTS metric based on the different sets of timing information. The optimization may take into account the different sets of timing information simultaneously. For example, it is determined if the placement of nodes is considered optimal considered the sets of timing information. One set of timing information for a corner is not considered and then another set in series. Rather, the sets are considered together.
The placement of the nodes may be adjusted many times. This may be an iterative process where placement of the clock tree nodes may be adjusted and/or pins 206 in clusters may be adjusted. Other adjustments may also be appreciated. This process will be described in more detail below. Generally, synthesis of clock tree 200 may be iteratively adjusted to determine if the CTS metric is improved. For example, the placement of clock tree node 206 may be changed and clock skew may be measured using the different sets of clock tree timing variation parameters. If clock skew is improved for one corner but worsens clock skew in another corner, then the adjustment may not be beneficial. However, if it is determined that clock skew improves for one corner and does not worsen it for other corners, then the adjustment may be positive. A balancing is performed to improve clock skew over multiple corners and/or modes.
The process for synthesizing the clock tree will be described in more detail now. The process described uses a bottom-up approach. Although this approach is described, it will be understood that other approaches may be used, such as a top-down approach.
In step 404, CTS tool 102 places clock tree nodes for each of the clusters. In one example, clock tree nodes 204 may be placed in substantially the middle of the clusters of pins 206. Also, other positions may be appreciated.
A first level of clock tree nodes 204-1-204-4 is placed in clusters 502-2-502-4. These nodes may be placed such that the length from nodes 204 to pins 206 in a cluster is substantially uniform.
Referring back to
In step 408, CTS tool 102 measures the delay from root node 202 to pins 206 through the placed clock tree nodes 204 for each set of timing information determined. The delay may be measured using the timing information that is determined for multiple corners or modes of operation. The delays in all the corner and modes are computed together and used together to make decisions involving placement or fan-out of the nodes.
In step 410, the skew for the different sets of timing information is determined. For example, the maximum and minimum clock skew may be determined for clusters 502. This may be the largest clock skew and the smallest clock skew. Accordingly, CTS tool 102 determines the maximum clock skew for multiple corners and/or multiple modes. For example, the maximum clock skew may be determined for each corner or mode or the maximum clock skew is determined taking all of the corners and/or modes into account. The clock skew information may vary depending on the corner or mode used. For example, different variations in the processing that each corner includes may cause different timing information to be determined. Thus, clock skew may differ for different corners.
In step 412, when clock skew for all corners has been determined, CTS tool 102 optimizes the clock skew based on the information for different sets of timing information. The optimization, which is described in more detail below, may alter the synthesis of the clock tree to optimize the clock skew. For example, pins in clusters 502 may be moved to other clusters 502 or a new cluster may be created. Also, placement of clock tree nodes 204 may be moved. When these adjustments are made to the clock tree, the clock skew is again measured across different sets of clock tree timing variation parameters. Thus, for example, it can be determined if the adjustment improves clock skew in one corner, but may worsen clock skew in another corner. This is an iterative process that can be performed to balance an improvement in clock skew across different sets of clock tree timing variation parameters.
In step 414, when skew has been optimized, CTS tool 102 may move to place another level of clock tree nodes 204 for clock tree 200. For example, a layer up in the clock tree hierarchy may now be placed. In moving to a new level of clock tree 200, clusters 502 that already have been formed may be used to form bigger clusters 506. For example, referring to
Multiple clock tree nodes may be placed on the same level as clock tree node 204-4. The clock skew may then be measured from root node 202 to pins 206 through clock tree node 204-4. The above process of optimizing the skew across multiple different sets of clock tree timing variation parameters may also be performed. This process may continue until the entire clock tree is synthesized.
The optimization of clock tree 200 will now be described in more detail.
In step 604, CTS tool 102 adjusts clock tree 200. For example, cost metrics may be used to determine how to adjust clock tree 200. In one example, for the maximum delay cluster, the delay may depend on the clock tree node being used and the total load that the clock tree node is driving, such as the number of pins 206 being driven. If some pins are removed from cluster 302, the delay may be reduced. These pins may be pushed into another cluster or used to form a new cluster.
For the minimum delay, CTS tool 102 may increase the load on the clock-tree node (buffer) by either detaching some pins/nodes from some nearby node, and attaching it to this minimum delay node, or by changing the placement of this node to add more “interconnect”/wiring load seen by this node.
Also, the type of clock tree node may be changed to adjust the clock skew. For example, different types of clock tree nodes may provide different delays. Further, the position of the placement of the clock tree node may be changed. For example, by changing the clock tree node, the distance between pins 206 and clock tree node 204 adjusted and skew may be changed. Other changes may also be made to clock tree 200.
In step 606, CTS tool 102 measures the changes in the clock skew across different sets of clock tree timing variation parameters. For example, when clock tree 200 is adjusted, the skew may be affect timing in multiple process corners. For example, the clock skew may be improved in one corner, such as a maximum clock skew may be reduced. However, for conditions associated with a second corner, the clock skew may be increased, which may be an undesirable result. Accordingly, the change in clock skew is measured for multiple corners.
In step 608, CTS tool 102 balances the measured changes for the different sets of clock tree timing variation parameters. For example, if the adjustment provides a net positive in change for skew across multiple corners, then the change adjustment may be considered better. Step 610 determines if clock tree 200 should be adjusted again. For example, the process may be iterative and it may be determined that more improvements may be made. The prior adjustment may be discarded if the clock skew is considered worsened across multiple corners and/or modes or a further adjustment may be determined to the refine the prior adjustment. Accordingly, the process may reiterate to step 604.
If it is determined that clock tree 200 should not be adjusted again, in step 612, the process may move to another level of clock tree 200. The process may reiterate to step 602 where the process is repeated when clock tree nodes for the next level are placed.
Accordingly, CTS tool 102 uses different sets of clock tree timing parameters. Different sets of timing information can be determined and allows a balancing of the clock tree over the different sets of clock tree timing parameters. Thus, when one set of parameters is used, it can be determined how the change to a clock tree affects another set of parameters. This allows more efficient synthesis of the clock tree.
Having a tool that considered multi-corner or multi-mode information is useful because variation problems increase with different designs. For example, having mixed variation threshold (VT) threshold designs—Low VT and High VT cells may cause variations among corners. Also, temperature inversion problems due to the small geometries of the wires cause the worst case R values may now occur at low temperatures vs. high temperatures. Designs are becoming very complex physically with cores and macros now taking up over 40-50% of the physical area of a chip, which leads to physical differences in paths so the paths vary differently: a path through a standard cell area may vary differently vs. a path with very long wires going to Macros or in macro channels. These factors all cause variations that CTS tool 102 takes into account when synthesizing a clock tree.
Although the description has been described with respect to particular embodiments thereof, these particular embodiments are merely illustrative, and not restrictive. Although clock skew is discussed, other CTS metrics may be optimized.
Any suitable programming language can be used to implement the routines of particular embodiments including C, C++, Java, assembly language, etc. Different programming techniques can be employed such as procedural or object oriented. The routines can execute on a single processing device or multiple processors. Although the steps, operations, or computations may be presented in a specific order, this order may be changed in different particular embodiments. In some particular embodiments, multiple steps shown as sequential in this specification can be performed at the same time.
A “computer-readable medium” for purposes of particular embodiments may be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, system, or device. The computer readable medium can be, by way of example only but not by limitation, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, system, device, propagation medium, or computer memory. Particular embodiments can be implemented in the form of control logic in software or hardware or a combination of both. The control logic, when executed by one or more processors, may be operable to perform that which is described in particular embodiments.
Particular embodiments may be implemented by using a programmed general purpose digital computer, by using application specific integrated circuits, programmable logic devices, field programmable gate arrays, optical, chemical, biological, quantum or nanoengineered systems, components and mechanisms may be used. In general, the functions of particular embodiments can be achieved by any means as is known in the art. Distributed, networked systems, components, and/or circuits can be used. Communication, or transfer, of data may be wired, wireless, or by any other means.
It will also be appreciated that one or more of the elements depicted in the drawings/figures can also be implemented in a more separated or integrated manner, or even removed or rendered as inoperable in certain cases, as is useful in accordance with a particular application. It is also within the spirit and scope to implement a program or code that can be stored in a machine-readable medium to permit a computer to perform any of the methods described above.
As used in the description herein and throughout the claims that follow, “a”, “an”, and “the” includes plural references unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise.
Thus, while particular embodiments have been described herein, a latitude of modification, various changes and substitutions are intended in the foregoing disclosures, and it will be appreciated that in some instances some features of particular embodiments will be employed without a corresponding use of other features without departing from the scope and spirit as set forth. Therefore, many modifications may be made to adapt a particular situation or material to the essential scope and spirit.
This application is a continuation of U.S. patent application Ser. No. 13/274,276, filed Oct. 14, 2011 and entitled “MULTI-MODE MULTI-CORNER CLOCKTREE SYNTHESIS”, which is a continuation-in-part of U.S. patent application Ser. No. 12/036,191, filed Feb. 22, 2008 and entitled “MULTI-MODE MULTI-CORNER CLOCKTREE SYNTHESIS,” and which is also a continuation-in-part of U.S. patent application Ser. No. 12/026,755 filed on Feb. 6, 2008, and entitled “CLOCK TREE SYNTHESIS GRAPHICAL USER INTERFACE”, all of which are hereby incorporated by reference herein in their entirety.
Number | Name | Date | Kind |
---|---|---|---|
5617426 | Koenemann et al. | Apr 1997 | A |
6204713 | Adams et al. | Mar 2001 | B1 |
6298468 | Zhen | Oct 2001 | B1 |
6407756 | Sontag et al. | Jun 2002 | B1 |
6460166 | Reddy et al. | Oct 2002 | B1 |
6530065 | McDonald et al. | Mar 2003 | B1 |
6782519 | Chang et al. | Aug 2004 | B2 |
6836877 | Dupenloup | Dec 2004 | B1 |
6904572 | Igarashi | Jun 2005 | B2 |
6909311 | Foley et al. | Jun 2005 | B2 |
6944840 | Sasaki et al. | Sep 2005 | B2 |
7020854 | Killian et al. | Mar 2006 | B2 |
7039891 | Tetelbaum | May 2006 | B2 |
7042269 | Kao | May 2006 | B2 |
7051310 | Tsao | May 2006 | B2 |
7191417 | Luo et al. | Mar 2007 | B1 |
7216322 | Lai et al. | May 2007 | B2 |
7225421 | Migatz et al. | May 2007 | B2 |
7275235 | Molinari et al. | Sep 2007 | B2 |
7299433 | Clement et al. | Nov 2007 | B2 |
7334209 | Roberts et al. | Feb 2008 | B1 |
7406672 | Fukuda | Jul 2008 | B2 |
7418689 | Habitz et al. | Aug 2008 | B2 |
7479825 | Komoda | Jan 2009 | B2 |
7486130 | Overs | Feb 2009 | B2 |
7512923 | Tsuchiya | Mar 2009 | B2 |
7519927 | Hryckowian et al. | Apr 2009 | B1 |
7523430 | Patel | Apr 2009 | B1 |
7526666 | Soni | Apr 2009 | B1 |
7555740 | Buck et al. | Jun 2009 | B2 |
7584443 | Govig et al. | Sep 2009 | B1 |
7624364 | Albrecht et al. | Nov 2009 | B2 |
7689957 | Shenoy | Mar 2010 | B2 |
7810061 | Minonne et al. | Oct 2010 | B2 |
7930652 | Wood | Apr 2011 | B2 |
7996804 | Nikitin et al. | Aug 2011 | B2 |
20020045995 | Shimazaki et al. | Apr 2002 | A1 |
20030048122 | Kazi | Mar 2003 | A1 |
20030058280 | Molinari et al. | Mar 2003 | A1 |
20030135836 | Chang et al. | Jul 2003 | A1 |
20030167451 | Igarashi | Sep 2003 | A1 |
20040225984 | Tsao | Nov 2004 | A1 |
20050050497 | Tetelbaum | Mar 2005 | A1 |
20060053395 | Lai et al. | Mar 2006 | A1 |
20060064658 | Minonne et al. | Mar 2006 | A1 |
20060132186 | Bourgin | Jun 2006 | A1 |
20060190899 | Migatz et al. | Aug 2006 | A1 |
20060248488 | Habitz | Nov 2006 | A1 |
20070094627 | Komoda | Apr 2007 | A1 |
20070106970 | Fukuda | May 2007 | A1 |
20070136708 | Overs et al. | Jun 2007 | A1 |
20070171733 | Wood | Jul 2007 | A1 |
20070220465 | Tsuchiya | Sep 2007 | A1 |
20070288875 | Eakins et al. | Dec 2007 | A1 |
20080168412 | Cheon | Jul 2008 | A1 |
20090064067 | Liu et al. | Mar 2009 | A1 |
20090070714 | Shenoy | Mar 2009 | A1 |
20090179680 | Reuland et al. | Jul 2009 | A1 |
20090187873 | Nikitin et al. | Jul 2009 | A1 |
20090199143 | Sclotman et al. | Aug 2009 | A1 |
20090217225 | Sunder et al. | Aug 2009 | A1 |
20090254874 | Bose | Oct 2009 | A1 |
20100049481 | Walker et al. | Feb 2010 | A1 |
20120110538 | Shih | May 2012 | A1 |
20120240091 | Sunder et al. | Sep 2012 | A1 |
Entry |
---|
International Search Report and Written Opinion dated Apr. 13, 2009 from International Patent Application No. PCT/US2009/034583, 8 pp. |
International Search Report and Written Opinion dated Mar. 13, 2009 from International Patent Application No. PCT/US2009/033203, 6 pp. |
Number | Date | Country | |
---|---|---|---|
20160203251 A1 | Jul 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13274276 | Oct 2011 | US |
Child | 15076991 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 12036191 | Feb 2008 | US |
Child | 13274276 | US | |
Parent | 12026755 | Feb 2008 | US |
Child | 12036191 | US |