METHOD OF REDUCING PEAK POWER CONSUMPTION IN AN INTEGRATED CIRCUIT SYSTEM

Information

  • Patent Application
  • 20080270965
  • Publication Number
    20080270965
  • Date Filed
    April 24, 2007
    17 years ago
  • Date Published
    October 30, 2008
    16 years ago
Abstract
A method that utilizes connectivity and/or timing information among a plurality of design partitions of an circuit system to create a clock system that reduces peak power consumption across the system. The method includes sorting the design partitions according to a connectivity model, a timing model, or both, and assigning interleaved clock signals as a function of the design partition ordering. The clock system is created as a function of the interleaved clock signals.
Description
FIELD OF THE INVENTION

The present invention generally relates to the field of integrated circuit design. In particular, the present invention is directed to a method of reducing peak power consumption in an integrated circuit system.


BACKGROUND

Peak power consumption, frequently expressed in terms of IR drop, is a growing concern in modern integrated circuit designs, such as, for example, designs using deep sub-micron technology. Peak power consumption can affect the performance of the integrated circuit, the robustness of the power grid, and the design time needed to close timing. In some cases, designs must be re-worked late in the design cycle to improve the robustness of weak points in the power grid. Presently, low power design methods are used to combat the power consumption problem. These methods reduce the number of switching events in the design, the overall capacitance of the paths, and the drive strength of standard cells to the minimum needed to close timing. Other methods, on the other hand, create dynamic power grids that change density based on the specific demand of the integrated circuit design.


SUMMARY OF THE DISCLOSURE

In one embodiment, a method of designing a clock system for a plurality of functional blocks, is provided. The method includes the steps of receiving a plurality of design partitions; generating a connectivity model or a timing criticality model or both of said connectivity model and said timing criticality model; sorting said plurality of design partitions based on said connectivity model or a timing criticality model or both of said connectivity model and said timing criticality model so as to generate sorted design partitions having a sorted order; and assigning a plurality of interleaved clock signals to said sorted design partitions according to said sorted order.


In another embodiment, a machine-readable medium containing machine-readable instructions for performing a method of designing a clock system for a plurality of functional components, is provided. The machine-readable instructions include a first set of machine-readable instructions for receiving a plurality of design partitions; a second set of machine-readable instructions for generating a connectivity model or a timing criticality model or both of said connectivity model and said timing criticality model; a third set of machine-readable instructions for sorting said plurality of design partitions based on said connectivity model or a timing criticality model or both of said connectivity model and said timing criticality model so as to generate sorted design partitions having a sorted order; and a fourth set of machine-readable instructions for assigning a plurality of interleaved clock signals to said sorted design partitions according to said sorted order.


In still another embodiment, an integrated circuit system is provided. The integrated circuit includes a plurality of functional blocks distributed among a plurality of design partitions and interconnected by a plurality of connection arcs; and a clock system. The clock system includes clock interleaving circuitry for interleaving a plurality of clock signals corresponding respectively to said plurality of design partitions, said clock interleaving circuitry configured as a function of a connectivity model of said plurality of connecting arcs or a timing model of said plurality of connecting arcs or both of said connectivity model and said timing model; and a plurality of timing paths connected to ones of said plurality of functional blocks in each of said plurality of design partitions so as to provide said plurality of clock signals to said plurality of functional blocks.





BRIEF DESCRIPTION OF THE DRAWINGS

For the purpose of illustrating the invention, the drawings show aspects of one or more embodiments of the invention. However, it should be understood that the present invention is not limited to the precise arrangements and instrumentalities shown in the drawings, wherein:



FIG. 1 is a high-level schematic diagram of an integrated circuit (IC) system having a clock system designed in accordance with the present disclosure;



FIG. 2 is an exemplary timing diagram of clock signals within the clock system of FIG. 1;



FIG. 3 is a flow diagram illustrating a clock system design method that may be used to design a clock system, such as clock system of FIG. 1;



FIG. 4 is a connectivity/timing diagram for a plurality of design partition blocks of an exemplary IC system design;



FIG. 5 is a connectivity chart corresponding to the connectivity/timing diagram of FIG. 4; and



FIG. 6 is a timing chart corresponding to the connectivity/timing diagram of FIG. 4.





DETAILED DESCRIPTION

Referring to the drawings, FIG. 1 illustrates an example 100 of an integrated circuit (IC) system that includes a clock circuit 104 made in accordance with the present invention. Clock circuit 104 ultimately generates a plurality of interleaved clock signals 108A-C that are transmitted to a plurality of design partitions, in this example partitions 112A-C, via a plurality of timing paths, here timing paths 116A-C. As those skilled in the art will appreciate, depending on the application IC system 100 may be located within a single chip, e.g., in the case of a microprocessor, application-specific integrated circuit (ASIC), system-on-chip (SOC), etc. or may comprise a plurality of chips and/or other discrete components, e.g., circuit board level busses, etc. Design partitions 112A-C may be selected and/or otherwise result from the design process for the IC system at issue. Consequently, each design partition, such as each partition 112A-C, can contain one or more functional components 120A-F as is suitable/necessary for the IC system under consideration. Examples of a functional component include, but are not limited to, a functional logic block, a memory, a processor, a computer chip, a communications bus (integrated and discrete), an ASIC and component thereof, an SOC and component thereof, a discrete IC component (e.g., a logic gate), and any combination thereof. That said, those ordinarily skilled in the art will readily recognize that the particular function(s) that system 100 provides is/are not material to the understanding of the present invention. Therefore, these functions are not described herein.


As elaborated more below, IC system 100 is particularly designed to cause functional components 120A-F of differing design partition 112A-C to receive clock signals 108A-C at differing times. In this example, this is accomplished by providing clock circuit 104 with an interleave circuit 124 that receives an input clock signal 128 from a clock signal source, here a phase-locked loop 132, and outputs interleaved clock signals 108A-C. Interleaved clock signals (e.g., clock signals 108A-C) are signals used to define time references for coordinating movement of data within a system (such as IC system 100) and to account for delays in the propagation of data signals (not illustrated) throughout the system. Examples of a clock interleave circuit suitable for use as interleave circuit 124 or other interleave circuit, include, without limitation, a static interleave circuit in which delays are set during design, a programmable interleave circuit in which delays are programmable after manufacturing, and any combination thereof. The clock signal source may be an on-chip or an off-chip source, depending on the application at issue.


Referring to FIG. 2, and also to FIG. 1, FIG. 2 illustrates an exemplary timing diagram 200 for clock signals 108A-C and corresponding input clock signal 128 used by interleave circuit 124 to generate interleaved clock signals 108A-C. As illustrated in FIG. 2, interleaved clock signals 108A-C are clock signals derived from input clock signal 128 so as to have corresponding respective delays (including a delay of zero, which occurs in the case of clock signal 108A) and, optionally, modified waveforms relative to the waveform of the input clock signal. The staggered timings of interleaved clock signals 108A-C are designed to reduce peak power consumption by organizing design partitions 112A-C as a function of connectivity and/or timing considerations for communications interconnections (represented in FIG. 1 by interconnections 136A-E) as between the functional components 120A-F. Examples of various methods of determining the staggered timings of interleaved clock signals 108A-C and like clock signals, provided to partitions 112A-C and like partitions are described below. Depending on timing considerations, none, some or all of interconnections 136A-E may include one or more delay elements, such as delay element 140A-C for controllably causing the propagating of data through such interconnections to be delayed so as to effect the proper operation of the IC system, here IC system 100. Each delay element 140A-C may be any suitable delay element.


Referring next to FIG. 3, and also to FIG. 1, FIG. 3 illustrates an example 300 of a clock system design method that may be used to design a clock system of an IC system, such as clock system 104 of IC system 100 (FIG. 1). At step 304, an IC system design 308 (such as a design of IC system 100 of FIG. 1) is input into method 300. The IC system design may be considered to comprise a plurality of design partitions 312 (design partitions 112A-C in the example of FIG. 1). Design partitions 312 may be identified using any one or more of a variety of methods, such as identifying one or more of design partitions 312 based on a design hierarchy, identifying such partition(s) based on clock gating domain, and identifying such partition(s) based on a specified number of connections (e.g., connections 136A-E). In one specific example of method 300, design partitions 312 are identified so as to reduce the number of interconnections (e.g., interconnections 136A-E of FIG. 1) between the design partitions (e.g., partitions 112A-C of FIG. 1) while maximizing the number of interconnections within each partition, for example, between/among the functional components within that partition.


Once design 308 has been input, one or more clock system design models are applied to the design. Generally, a clock system design model is a tool for evaluating a specified aspect, or group of specified aspects, of a design. Examples of a clock system design model, include, without limitation, a connectivity model and a timing criticality model. A connectivity model may be used to determine connectivity priorities as a function of the number of data/communications connections between design partitions, here design partitions 312. A timing criticality model may be used to determine timing priorities as a function of the timing characteristics of the data/communications connections between design partitions. Further details of connectivity and timing criticality models are described below with reference to FIGS. 4-5.


Referring first to FIG. 4, this figure illustrates a connectivity/timing diagram 400 of an IC system design that includes a plurality of design partitions 404A-C interconnected with one another via connectivity arcs 408A-F. Each connectivity arc 408A-F simply represents a possible (though perhaps not actual) providing of information from a corresponding first design partition 404A-C (a/k/a a “source”) to a corresponding second one of design partitions (a/k/a a “sink”). Here, the directionality of the providing of information is represented by corresponding respective arrowheads. Since for the model it is initially assumed that each design partition 404A-C can provide information to each other design partition, this bi-directionality is represented by two connectivity arcs (such as connectivity arcs 408B, 408D in the case of design partitions 404A-B) that extend between each possible pair of design partitions. While connectivity arcs 408A-F represent potential communications connections, actual connections are represented by numerals 412A-F that correspond respectively to the number of actual connections that a given first design partition 404A-C makes with a given second one of design partitions. The number of connections is essentially driven by the functionality of the IC system represented in the design, the selection of design partitions, the function(s) of the functional blocks contained in the partitions, and the need for such blocks to communicate with one another.


Referring next to FIG. 5, and also to FIG. 4, FIG. 5 illustrates a connectivity chart 500 corresponding to connectivity/timing diagram 400 of FIG. 4. Connectivity chart 500 simply presents the connectivity information for a given design (or portion thereof) in tabular form. As readily seen from FIGS. 5 and 4, this connectivity information includes numerals 412A-F that represent the number of actual connections between the corresponding respective source and sink design partitions 404A-C.


Referring back to FIG. 4, each connection arc 408A-F also includes a timing margin information 416A-F. A timing margin is a timing characteristic for the corresponding respective connection arc 408A-F between design partitions 404A-C. Examples of a timing margin information 416A-F include, without limitation, late mode (LM) margins and early mode (EM) margins, and any combination thereof. A late mode margin (a/k/a “best case mode”) represents the optimal operation of an integrated circuit system, wherein no variations in temperature, manufacturing and voltage are taken into consideration. An early mode margin (a/k/a “worst case mode”) represents the slowest operation of an integrated circuit system, wherein the variations in temperature, manufacturing and voltage are considered. Late mode and early mode margins can be determined for connections 408A-F, for example, using a conventional static timing tool, such as the Einstimer static timing tool used by International Business Machines, Armonk, N.Y.


Referring now to FIG. 6, and also to FIG. 4, FIG. 6 illustrates a timing chart 600 corresponding to connectivity/timing diagram 400 of FIG. 4. Timing chart 600 presents timing priorities for the various connection arcs. In this example, each timing priority is the minimum value of the early mode and late mode margins for each connection arc 408A-F. In other embodiments, the timing priorities may be determined in another manner. For example, the timing priorities may be determined by the equation −LM+EM.


Referring back to FIG. 3, as mentioned above after design 308 has been entered into method 300, the method includes applying one, the other, or both of connectivity and timing models to the design. Correspondingly, method 300 may include steps 316, 320 for determining which one(s) of the connectivity and timing models will form the basis of the clock system design. More specifically, step 316 may determine whether or not the clock system design will be connectivity based, and step 320 may determine whether or not the clock system design will be timing based. As will be readily appreciated, each of steps 316, 320 may provide an answer in the affirmative, meaning that the clock system design may be both connectivity and timing based. This option is described in more detail below.


If at step 316 it is determined that clock system design is to be at least partially connectivity based, method 300 proceeds to step 324 at which the connectivity priorities are determined. This step prioritizes design partitions 312 according to their connectivity. In one example, design partitions 312 are prioritized so as to indicate the order in which each design partition receives a clock signal according to the number of connections in the corresponding respective connection arcs. For example, referring to connectivity chart 500 of FIG. 5, design partitions 404A-C may be organized into a structured priority list according to the number of connections. Such a structure priority list appears in Table I, below. The basis for this approach is the proposition that the greater the number of connections, the higher the priority of the corresponding connection arc 408A-F (FIG. 4) should be.









TABLE I







Ordered Structure List









Connection Arc
Source
Sink





408C
C
B


408F
A
C


408D
A
B


408B
B
A


408E
B
C









In this example, the first listed connection arc in Table I, i.e., connection arc 408C (FIG. 4) between source 404C and sink 404B, has the highest priority because the connection has the highest number of connections, here 22. The second listed connection arc in Table I, i.e., connection arc 408F (FIG. 4) between source 404A and sink 404C, has the next highest priority because the connection has the next highest number of connections, i.e., 7. The remaining three connection arcs listed in Table I, i.e., connection arcs 408B, 408D-E (connection arc 408A has no connections) follow in the same manner.


Whether or not it was determined at step 316 that the clock timing design should be at least partially connectivity based, method 300 proceeds to step 320 where it is determined whether or not the clock timing design should be at least partially timing based. If so, method 300 proceeds to step 328 at which timing priorities are determined according to a timing criticality model. In one example, using timing chart 600 of FIG. 6 as an example, the timing priorities in timing chart 600 may be used to create an ordered structure list that organizes connectivity arcs 404A-F (FIG. 4) from highest to lowest priority based on the timing characteristics of the connections between design partitions. Such an ordered structure list is illustrated below in Table II.









TABLE II







Ordered Structure List









Connection Arc
Source
Sink





412C
C
B


408A
C
A


408B
B
A


408E
B
C


408D
A
B


408F
A
C









In this example, the first listed connection arc in Table II, i.e., connection arc 412C (FIG. 4) between source 404C and sink 404B, has the highest priority because this connection arc has the largest minimum timing margin value, here 2. The second listed connection arc in Table II, i.e., connection arc 408A between source C and sink A, has the next highest priority because this connection arc has one of the next largest minimum timing margin value, i.e., −1. It is recognized that any one of connection arcs 408A, 408B, 408E could be listed second, third, and fourth in Table II because they all have the same minimum timing margin value of −1. The ordering of connection arcs 408A, 408B, 408E may be arbitrary or may be based on one or more other criteria, such as physical distance between the corresponding respective sources and sinks and the ordering from the corresponding ordered connectivity list, among others.


Once the connectivity priority order and/or timing priority orders have been determined, respectively, at steps 324, 328, method 300 may proceed to step 332 at which design partitions 312 are sorted. This step 332 may include applying one or more weights to the sorted partitions 312. For example, if only a connectivity ordering was performed or if only a timing ordering was performed, then a weight of 1 may be applied to the sorted partitions 312. However, if both orderings were performed, each ordering may be weighted with a selected weight that produces a desired clock system design. Such weights may be applied according to the equation CWeight*CP+TWeight*TP, wherein CWeight is a selected weight for the connectivity-based ordering, CP is the connectivity priority, TWeight is a selected weight for the timing-based ordering, and TP is the timing priority. CWeight and TWeight may be values pre-selected and unchangeable for a particular instantiation of method 300 or, alternatively, may be changeable, for example, by a user of software implementing the method.


Once the connectivity arcs have been ordered and/or weighted, at step 336, a clock circuit design, such as the design of clock circuit 104 of FIG. 1, is generated so as to contain as many interleaved clock signals (e.g., clock signals 108A-C) as there are design partitions (e.g., design partitions 112A-C). In one example, the interleaved clock signals are determined by dividing the period of the clock by the number of design partitions (e.g., design partitions 112A-C). In another example, the interleaved clock signals are determined by selecting a constant value and offsetting the clock signal for each design partition (e.g. design partitions 112A-C) by that constant value. It is permissible, however, that the first design partition may have an offset value equal to zero.


Various functions of a method of designing a clock circuitry, such as method 300 of FIG. 3, can be implemented in software that can be stored on any suitable machine-readable memory(ies), including electronic storage media (such as RAMs, ROMs, flash memories), optical storage media (such as CDs, DVDs, BDs (i.e., Blu-Ray disks), holographic disks, etc.), magnetic storage media (hard disks, floppy disks, tapes, etc.), among others. As those skilled in the art will readily appreciate, a method of the present disclosure may be implemented in any of a wide variety of ways, depending on the type of environment in which the software is designed to be run. Since the various environments and ways of implementing software in these environments are well known, it is not necessary to provide any examples herein for skilled artisans to appreciate the broad scope of the present disclosure.


Exemplary embodiments have been disclosed above and illustrated in the accompanying drawings. It will be understood by those skilled in the art that various changes, omissions and additions may be made to that which is specifically disclosed herein without departing from the spirit and scope of the present invention.

Claims
  • 1. A method of designing a clock system for a plurality of functional blocks, comprising: receiving a plurality of design partitions;generating a connectivity model or a timing criticality model or both of said connectivity model and said timing criticality model;sorting said plurality of design partitions based on said connectivity model or a timing criticality model or both of said connectivity model and said timing criticality model so as to generate sorted design partitions having a sorted order; andassigning a plurality of interleaved clock signals to said sorted design partitions according to said sorted order.
  • 2. The method of claim 1, further comprising generating a clock system design as a function of said plurality of interleaved clock signals.
  • 3. The method of claim 1, further comprising partitioning the plurality of functional blocks into said plurality of design partitions.
  • 4. The method of claim 3, wherein said partitioning includes maximizing a first number of interconnections within each of said plurality of design partitions and minimizing a second number of interconnections between pairs of said plurality of design partitions.
  • 5. The method of claim 1, wherein said receiving of said plurality of design partitions consists of receiving a plurality of design partitions of a single-chip system.
  • 6. The method of claim 1, wherein said receiving of said plurality of design partitions consists of receiving a plurality of design partitions from across a multi-chip system.
  • 7. The method of claim 1, wherein said generating of said connectivity model, said timing model, or both of said connectivity model and said timing model comprises generating both of said connectivity model and said timing model, the method further comprising applying weights to said sorted design partitions corresponding to desired influences of each of said connectivity model and said timing model.
  • 8. The method of claim 1, wherein said generating of said connectivity model includes generating an ordered connectivity list of connectivity arcs extending between corresponding respective pairs of said plurality of design partitions.
  • 9. The method of claim 1, wherein said generating of said connectivity model includes generating a connectivity chart of connectivity arcs extending between corresponding respective pairs of said plurality of design partitions.
  • 10. The method of claim 1, wherein said generating of said timing model includes generating an ordered timing list of connectivity arcs extending between corresponding respective pairs of said plurality of design partitions.
  • 11. The method of claim 1, wherein said generating of said timing model includes generating a timing chart of connectivity arcs extending between corresponding respective pairs of said plurality of design partitions.
  • 12. A machine-readable medium containing machine-readable instructions for performing a method of designing a clock system for a plurality of functional components, said machine-readable instructions comprising: a first set of machine-readable instructions for receiving a plurality of design partitions;a second set of machine-readable instructions for generating a connectivity model or a timing criticality model or both of said connectivity model and said timing criticality model;a third set of machine-readable instructions for sorting said plurality of design partitions based on said connectivity model or a timing criticality model or both of said connectivity model and said timing criticality model so as to generate sorted design partitions having a sorted order; anda fourth set of machine-readable instructions for assigning a plurality of interleaved clock signals to said sorted design partitions according to said sorted order.
  • 13. The machine-readable medium of claim 10, further comprising a fifth set of machine-readable instructions for generating a clock system design as a function of said plurality of interleaved clock signals.
  • 14. The machine-readable medium of claim 10, further comprising a sixth set of machine-readable instructions for partitioning the plurality of functional components into said plurality of design partitions.
  • 15. The machine-readable medium of claim 12, wherein said sixth set of machine-executable instructions includes machine-executable instructions for maximizing a first number of interconnections within each of said plurality of design partitions and minimizing a second number of interconnections between pairs of said plurality of design partitions.
  • 16. The machine-readable medium of claim 10, wherein said generating of said connectivity model, said timing model, or both of said connectivity model and said timing model comprises generating both of said connectivity model and said timing model, the machine-executable instructions further comprising a seventh set of machine-executable instructions for applying weights to said sorted design partitions corresponding to desired influences of each of said connectivity model and said timing model.
  • 17. An integrated circuit system, comprising: a plurality of functional blocks distributed among a plurality of design partitions and interconnected by a plurality of connection arcs; anda clock system, comprising: clock interleaving circuitry for interleaving a plurality of clock signals corresponding respectively to said plurality of design partitions, said clock interleaving circuitry configured as a function of a connectivity model of said plurality of connecting arcs or a timing model of said plurality of connecting arcs or both of said connectivity model and said timing model; anda plurality of timing paths connected to ones of said plurality of functional blocks in each of said plurality of design partitions so as to provide said plurality of clock signals to said plurality of functional blocks.
  • 18. The integrated circuit system of claim 17, wherein all of said plurality of functional blocks are contained on a single integrated circuit chip.
  • 19. The integrated circuit system of claim 17, wherein said plurality of functional blocks are distributed across a plurality of integrated circuit chips.
  • 20. The integrated circuit system of claim 17, wherein said clock interleaving circuitry is configured as a function of said connectivity model and said timing model and weights applied to each of said connectivity model and said timing model.