The present invention relates generally to timing performance analysis, and more particularly to timing performance analysis for an integrated circuit comprising an embedded device.
The process for producing an integrated circuit comprises many steps. Conventionally, a logic design is followed by a circuit design, which is followed by a layout design. With respect to the circuit design and layout portion, once circuits for an integrated circuit have been designed, such designs are converted to a physical representation known as a “circuit layout” or “layout.” Layout is exceptionally important to developing a working design as it affects many aspects, including, but not limited to, signal noise, signal time delay, resistance, cell area, and parasitic effect.
Once a circuit is designed and laid out, it is often simulated to ensure performance criteria are met, including, but not limited to, signal timing. This type of analysis is difficult at the outset, and is made more difficult by an embedded design. An embedded design or embedded circuit is conventionally designed separately from an integrated circuit in which it is embedded. Sometimes this embedded circuit is referred to an intellectual property (IP) core or embedded core. This is because the information to build and test such an embedded circuit is provided from one company to another.
An IP core may have a certain maximum timing performance for input and output. For example, a microprocessor will have certain maximum timing performance for input and output of data and other information to a memory, or more particularly, a memory controller. In personal computer manufacture, operation of memory, or more particular memory modules, is specified for a bus “speed,” such as 33 MHz, 66 MHz, and so on. Presently, the Rambus Signaling Level road map is for a memory to processor bus frequency of 1.2 GHz. However, processors presently operate at speeds in excess of 1.2 GHz, and thus processors must be slowed down for communicating with memory. Moreover, memory is speed graded, and conventionally slower memory costs less than faster memory.
However, there is not de facto standard bus interface for an embedded microprocessor. Accordingly, glue or gasket logic and/or interconnects are used to couple an embedded microprocessor to a host device, such as a programmable logic device. Programmable logic devices exist as a well-known type of integrated circuits that may be programmed by a user to perform specified logic functions. There are different types of programmable logic devices, such as programmable logic arrays (PLAs) and complex programmable logic devices (CPLDs). One type of programmable logic devices, called a field programmable gate array (FPGA), is very popular because of a superior combination of capacity, flexibility and cost.
Accordingly, it would be desirable and useful to provide method and apparatus for timing performance analysis for an embedded device.
An aspect of the present invention is a method for performing a timing analysis for a core device to be embedded in a host integrated circuit. Clock-to-output timing information is obtained for the core device. Setup and hold timing information and delay timing information is determined for a portion of the host integrated circuit. The clock-to-output timing information, the setup and hold timing information and the delay timing information is associated with respective signals, and a path time delay for each of the respective signals is calculated.
An aspect of the present invention is a method for performing a timing analysis for a core device in a host integrated circuit. Setup and hold timing information is obtained for the core device. Clock-to-output timing information and delay timing information is determined for a portion of the host integrated circuit. The clock-to-output timing information, the setup and hold timing information and the delay timing information is associated with respective signals, and a path time delay for each of the respective signals is calculated.
An aspect of the present invention is a method for determining timing performance. Clock-to-output times for a processor core are obtained. Static timing analysis is used to determine timing data for a memory controller. Setup and hold times are obtained from the timing data for the memory controller. A programmatic representation of logic and interconnects for coupling the memory controller and the processor core is provided. The programmatic representation of logic and interconnects are simulated to obtain delay times. The delay times, the setup and hold times and the clock-to-output times are used as inputs to a spreadsheet, and path times are determined from the spreadsheet.
An aspect of the present invention is a method for determining timing performance. Setup and hold times for a processor core are obtained. Static timing analysis is used to determine timing data for a memory controller. Clock-to-output times are obtained from the timing data for the memory controller. A programmatic representation of logic and interconnects for coupling the memory controller and the processor core is provided. The programmatic representation of logic and interconnects is simulated to obtain delay times. The delay times, the setup and hold times and the clock-to-output times are provided as input to a spreadsheet, and path times are determined from the spreadsheet.
So that the manner in which the above recited features, advantages and objects of the present invention are attained and can be understood in detail, a more particular description of the invention, briefly summarized above, may be had by reference to the embodiments thereof which are illustrated in the appended drawings.
It is to be noted, however, that the appended drawings illustrate only typical embodiments of this invention and are therefore not to be considered limiting of its scope, for the present invention may admit to other equally effective embodiments.
In the following description, numerous specific details are set forth to provide a more thorough understanding of the present invention. However, it will be apparent to one of skill in the art that the present invention may be practiced without one or more of these specific details. In other instances, well-known features have not been described in order to avoid obscuring the present invention.
Referring to
There are two signal paths to and from embedded core 110, namely input path 113 and output path 114. Clock signal 109 is provided to embedded core 110 and OCM 120. Each signal path 113, 114 represents provisioning of data, control and address information to and from embedded core 110. Accordingly, each signal path 113, 114 represents more than one signal path. Notably, a maximum time allowed without down grading for speed is:
Tpath=1/fmin (1)
where fmin represents a minimum acceptable operating frequency for a system. Notably, fmin may be set equivalent to a maximum operating frequency for transferring information to and from embedded core 110. Thus, embedded core 110 is used to determine at least an initial value of fmin. Because there is more than one signal, each signal will have an associated Tpath. However, though each Tpath delay may be the same for two or more signals it may also be different, depending on routing circuitry and the like. Thus, Tpath is evaluated for each signal to determine Tpath for a system. However, as the analysis for each signal is the same, only one input and one output signal is described in the exemplary timing diagram of
Referring to
Additionally, embedded core 110 comprises one or more setup and hold times. So, a setup and hold time for an incoming signal to embedded core 110 must be met before a next triggering edge 109-2 of clock signal 109. Embedded core input signal 133 is equivalent to GL&I output signal 132, as indicated by each having transition 132-1. However, embedded core input signal 133 is used in
It should be understood that for this embodiment Tpath is to be less than one period of clock signal 109 to ensure integrated circuit 100 may operate at fmin of embedded core 110. In other words, it may be a goal to have clock signal 109 with a frequency of fmin. This is important because embedded core 110 may be provided as a “hard macro,” namely, a fixed layout formed with a set minimum lithographic feature size. In other words, if embedded core 110 may not be changed, then operating at an optimum frequency of embedded core 110 is a set target operating speed. Notably, though the embodiment described herein is a single data rate, the present invention may be used with double data rate timing.
Output path 114 has delays similar to those of input path 113. Accordingly, embedded core 110 provides core output signal 121 delayed by a clock-to-output delay 124 as measured from a triggering edge 109-1 of clock signal 109 to transition 121-1 of output signal 121. Measured from transition 121-1 to transition 122-1 is GL&I delay 125 of GL&I output signal 122 due to G-logic and/or interconnect 115-A. OCM input signal 123 is equivalent to GL&I output signal 122, as indicated by each having transition 122-1. However, OCM input signal 123 is used in
Conventionally, embedded core 110 is provided with performance data including setup and hold times and clock-to-output times. These times may be provided in a known format, such as Standard Delay Format (SDF). Based on the assumption that setup and hold times and clock-to-output times are provided or determined, such as from simulation or testing prior to embedding, for embedded core 110, flow diagrams of
Referring to
At step 301, clock-to-output times are obtained for embedded core 110. At step 302, a static timing analysis is done on OCM 120. This static timing analysis is done by simulation at a transistor-level, and may be done with a product called PathMill from Synopsis of Mountain View, Calif. At step 303, data from step 302 is used to determine respective setup and hold times for signals to be inputted to OCM 120.
At step 304, a programmatic representation of gasket logic and interconnects 115-A is provided. Such a representation may be done in Verilog or VHDL, for example. At step 305, this programmatic representation is taken down from a logic level to something closer to a physical or transistor level, as such with HSpice or like program simulation, and simulated to get delays associated with signals passing through gasket logic and interconnects 115-A.
At step 306, outputs from steps 303 and 305, namely, setup and hold times for OCM 120 and signal delays for gasket logic and interconnects 115-A, respectively, are associated with clock-to-output times for embedded core 110 from step 301. This association may be done using a spreadsheet, a database and the like. For example, assuming data_out—1 from embedded core 110 is under consideration, then a spreadsheet association may look something like that shown in Table I,
where all values are expressed in units of time, such as picoseconds for example.
At step 307, critical paths are identified by totaling C-O delay, GL&I delay and Setup/Hold time to provide a total time for each signal traveling along output path 114. Accordingly, a total time, Ti, is determined for each signal on output path 114 going from embedded core 110 to OCM 120.
At step 308, Ti is compared to Tpath. For example, it may be determined whether Tpath is greater than or equal to Ti for each signal. Notably, it should be appreciated that if Ti was equal to Tpath, then there would be “critical” timing. Accordingly, at step 308, such a check may be for Tpath greater than Ti to avoid critical timing. Moreover, to ensure a margin of error, T′path, which is approximately 1 to 10 percent, for example, less than Tpath, may be used at step 308 for comparison with Ti. For purposes of clarity, the remainder of
Alternatively, timing performance analysis process 300 may end at step 307. This is because a largest value of times Ti may be determined, and frequency of operation of output path 114 of embedded core 110 may be set from there.
However, assuming either or both OCM 120 and gasket logic and interconnects 115-A may be modified, if any Ti is greater than Tpath, then at step 310 circuitry from either or both OCM 120 and gasket logic and interconnects 115-A is modified to reduce time associated with identified critical paths, namely, signal paths producing Ti's greater than Tpath. In response to modification of circuitry at step 310, layout for such modified circuitry is made at step 311 and circuitry values associated therewith including, but not limited to, resistance, capacitance, among others both actual and parasitic, are extracted. Modified circuitry and associated circuitry values are fed back at steps 302 and 304, as applicable. For example, if no change results in OCM 120 to modification to gasket logic and interconnect 115-A, then there is nothing to feedback, and vise versa with respect to change to OCM 120 resulting in no change to gasket logic and interconnect 115-A. Of course, modification may be made to both OCM 120 and gasket logic and interconnect 115-A resulting in feedback for both.
Timing performance analysis process 300 may continue, until at step 308 each Ti is less than or equal to Tpath, in which event timing performance analysis process 300 ends at step 309. Notably, timing performance analysis process 300 works with embedded core 110 formed with a lithography of a first minimum dimension and OCM 120/gasket logic and interconnects 115-A form with a lithography of a second minimum dimension different than the first minimum dimension. So, for example, embedded core 110 may be formed using 0.13 micron lithography and OCM 120/gasket logic and interconnects 115-A may be formed using 0.18 micron lithography.
Referring to
At step 401, setup and hold times are obtained for embedded core 110. At step 402, a static timing analysis is done on OCM 120. This static timing analysis is done by simulation at a transistor-level, and may be done with a product called PathMill from Synopsis of Mountain View, Calif. At step 403, data from step 402 is used to determine respective clock-to-output times for signals to be outputted from OCM 120.
At step 404, a programmatic representation of gasket logic and interconnects 115-B is provided. Such a representation may be done in Verilog or VHDL, for example. At step 405, this programmatic representation is taken down from a logic level to something closer to a physical or transistor level, as such with HSpice or like program simulation, and simulated to get delays associated with signals passing through gasket logic and interconnects 115-B.
At step 406, outputs from steps 403 and 405, namely, clock-to-output times for OCM 120 and signal delays for gasket logic and interconnects 115-B, respectively, are associated with setup and hold times for embedded core 110 from step 401. This association may be done using a spreadsheet, a database and the like. For example, assuming data_in—1 to embedded core 110 is under consideration, then a spreadsheet association may look something like that shown in Table II,
where all values are expressed in units of time, such as picoseconds for example.
At step 407, critical paths are identified by totaling C-O delay, GL&I delay and Setup/Hold time to provide a total time for each signal traveling along input path 113. Accordingly, a total time, Ti, is determined for each signal on input path 113 going from OCM 120 to embedded core 110.
At step 408, Ti is compared to Tpath. For example, it may be determined whether Tpath is greater than or equal to Ti for each signal. Notably, it should be appreciated that if Ti was equal to Tpath then there would be “critical” timing. Accordingly, at step 408, such a check may be for Tpath greater than Ti to avoid critical timing. Moreover, to ensure a margin of error, Tpath, which is approximately 1 to 10 percent, for example, less than Tpath, may be used at step 408 for comparison with Ti. For purposes of clarity, the remainder of
Alternatively, timing performance analysis process 400 may end at step 407. This is because a largest value of times Ti may be determined, and frequency of operation of input path 113 of embedded core 110 may be set from there.
However, assuming either or both OCM 120 and gasket logic and interconnects 115-B may be modified, if any Ti is greater than Tpath, then at step 410 circuitry from either or both OCM 120 and gasket logic and interconnects 115-B is modified to reduce time associated with identified critical paths, namely, signal paths producing Ti's greater than Tpath. In response to modification of circuitry at step 410, layout for such modified circuitry is made at step 411 and circuitry values associated therewith including, but not limited to, resistance, capacitance, and inductance, among others both actual and parasitic, are extracted. Modified circuitry and associated circuitry values are fed back at steps 402 and 404, as applicable. For example, if no change results in OCM 120 to modification to gasket logic and interconnect 115-B, then there is nothing to feedback, and vise versa with respect to change to OCM 120 resulting in no change to gasket logic and interconnect 115-B. Of course, modification may be made to both OCM 120 and gasket logic and interconnect 115-B resulting in feedback for both.
Timing performance analysis process 400 may continue, until at step 408 each Ti is less than or equal to Tpath, in which event timing performance analysis process 400 ends at step 409. Notably, timing performance analysis process 400 works with embedded core 110 formed with a lithography of a first minimum dimension and OCM 120/gasket logic and interconnects 115-B form with a lithography of a second minimum dimension different than the first minimum dimension. So, for example, embedded core 110 may be formed using 0.13 micron lithography and OCM 120/gasket logic and interconnects 115-B may be formed using 0.18 micron lithography.
While foregoing is directed to the preferred embodiment of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow. For example, though the present invention is described in terms of an FPGA and embedded processor core, it should be understood that constructs other than an FPGA and an embedded processor core may be used, including, but not limited to, combinations formed of a programmable logic device and at least one of a memory, an Application Specific Integrated Circuit, an Application Specific Standard Product, a Digital Signal Processor, a microprocessor, a microcontroller, and the like.
All trademarks are the respective property of their owners.
Number | Name | Date | Kind |
---|---|---|---|
4758985 | Carter | Jul 1988 | A |
4855669 | Mahoney | Aug 1989 | A |
5072418 | Boutaud et al. | Dec 1991 | A |
5142625 | Nakai | Aug 1992 | A |
RE34363 | Freeman | Aug 1993 | E |
5274570 | Izumi et al. | Dec 1993 | A |
5311114 | Sambamurthy et al. | May 1994 | A |
5339262 | Rostoker et al. | Aug 1994 | A |
5347181 | Ashby et al. | Sep 1994 | A |
5361373 | Gilson | Nov 1994 | A |
5381524 | Lewis et al. | Jan 1995 | A |
5457410 | Ting | Oct 1995 | A |
5473267 | Stansfield | Dec 1995 | A |
5500943 | Ho et al. | Mar 1996 | A |
5504738 | Sambamurthy et al. | Apr 1996 | A |
5521837 | Frankle et al. | May 1996 | A |
5535223 | Horstmann et al. | Jul 1996 | A |
5537601 | Kimura et al. | Jul 1996 | A |
5543640 | Sutherland et al. | Aug 1996 | A |
5550782 | Cliff et al. | Aug 1996 | A |
5552722 | Kean | Sep 1996 | A |
5572717 | Pedersen | Nov 1996 | A |
5574930 | Halverson, Jr. et al. | Nov 1996 | A |
5574942 | Colwell et al. | Nov 1996 | A |
5581745 | Muraoka | Dec 1996 | A |
5600845 | Gilson | Feb 1997 | A |
5610833 | Chang et al. | Mar 1997 | A |
5652904 | Trimberger | Jul 1997 | A |
5671355 | Collins | Sep 1997 | A |
5705938 | Kean | Jan 1998 | A |
5724557 | McBean, Sr. | Mar 1998 | A |
5732250 | Bates et al. | Mar 1998 | A |
5737631 | Trimberger | Apr 1998 | A |
5740404 | Baji | Apr 1998 | A |
5742179 | Sasaki | Apr 1998 | A |
5742180 | DeHon et al. | Apr 1998 | A |
5748979 | Trimberger | May 1998 | A |
5752035 | Trimberger | May 1998 | A |
5760607 | Leeds et al. | Jun 1998 | A |
5790435 | Lewis et al. | Aug 1998 | A |
5809517 | Shimura | Sep 1998 | A |
5835405 | Tsui et al. | Nov 1998 | A |
5874834 | New | Feb 1999 | A |
5889788 | Pressly et al. | Mar 1999 | A |
5892961 | Trimberger | Apr 1999 | A |
5903472 | Barrientos | May 1999 | A |
5910899 | Barrientos | Jun 1999 | A |
5914616 | Young et al. | Jun 1999 | A |
5914902 | Lawrence et al. | Jun 1999 | A |
5933023 | Young | Aug 1999 | A |
5970254 | Cooke et al. | Oct 1999 | A |
6011407 | New | Jan 2000 | A |
6020755 | Andrews et al. | Feb 2000 | A |
6026481 | New et al. | Feb 2000 | A |
6096091 | Hartmann | Aug 2000 | A |
6154051 | Nguyen et al. | Nov 2000 | A |
6163166 | Bielby et al. | Dec 2000 | A |
6167364 | Stellenberg et al. | Dec 2000 | A |
6172990 | Deb et al. | Jan 2001 | B1 |
6178541 | Joly et al. | Jan 2001 | B1 |
6181163 | Agrawal et al. | Jan 2001 | B1 |
6211697 | Lien et al. | Apr 2001 | B1 |
6242945 | New | Jun 2001 | B1 |
6272451 | Mason et al. | Aug 2001 | B1 |
6279045 | Muthujumaraswathy et al. | Aug 2001 | B1 |
6282627 | Wong et al. | Aug 2001 | B1 |
6301696 | Lien et al. | Oct 2001 | B1 |
6343207 | Hessel et al. | Jan 2002 | B1 |
6353331 | Shimanek | Mar 2002 | B1 |
6356987 | Aulas | Mar 2002 | B1 |
6389558 | Herrmann et al. | May 2002 | B1 |
6434735 | Watkins | Aug 2002 | B1 |
6453450 | Walter | Sep 2002 | B1 |
6460172 | Farre et al. | Oct 2002 | B1 |
6467009 | Winegarden et al. | Oct 2002 | B1 |
6483342 | Britton et al. | Nov 2002 | B2 |
6507942 | Calderone et al. | Jan 2003 | B1 |
6510548 | Squires | Jan 2003 | B1 |
6518787 | Allegrucci et al. | Feb 2003 | B1 |
6519753 | Ang | Feb 2003 | B1 |
6522167 | Ansari et al. | Feb 2003 | B1 |
6532572 | Tetelbaum | Mar 2003 | B1 |
6539508 | Patrie et al. | Mar 2003 | B1 |
6541991 | Hornchek et al. | Apr 2003 | B1 |
6578174 | Zizzo | Jun 2003 | B2 |
6587995 | Duboc et al. | Jul 2003 | B1 |
6588006 | Watkins | Jul 2003 | B1 |
6601227 | Trimberger | Jul 2003 | B1 |
6604228 | Patel et al. | Aug 2003 | B1 |
6611905 | Grundon et al. | Aug 2003 | B1 |
6611951 | Tetelbaum et al. | Aug 2003 | B1 |
6662285 | Douglass et al. | Dec 2003 | B1 |
6675272 | Ware et al. | Jan 2004 | B2 |
6686769 | Nguyen et al. | Feb 2004 | B1 |
6781407 | Schultz | Aug 2004 | B2 |
6798239 | Douglass et al. | Sep 2004 | B2 |
6871331 | Bloom et al. | Mar 2005 | B1 |
6886092 | Douglass et al. | Apr 2005 | B1 |
7093240 | Rodi et al. | Aug 2006 | B1 |
20010049812 | Chan et al. | Dec 2001 | A1 |
20030062922 | Douglas et al. | Apr 2003 | A1 |
20030097541 | Abrosimov et al. | May 2003 | A1 |
Number | Date | Country |
---|---|---|
0315275 | Oct 1989 | EP |
1 235 351 | Aug 2002 | EP |
WO 93 25968 | Dec 1993 | WO |
0 905 906 | Mar 1999 | WO |
Number | Date | Country | |
---|---|---|---|
Parent | 10084515 | Feb 2002 | US |
Child | 11144523 | US |