The present disclosure relates to electronic circuit systems, and more particularly, to temperature control systems and methods for integrated circuits.
In an electronic integrated circuit, the temperature of the integrated circuit is related to the frequency of data and/or clock signals in the integrated circuit. Some users of integrated circuits (ICs), such as field programmable gate arrays (FPGAs), have stringent use requirements, including the amount of time the ICs will be in use and the maximum temperatures that the ICs will generate while in use. Manufacturers of ICs often do not know in advance which of their ICs will experience the maximum environmental stresses (e.g., the highest temperatures for the longest periods of time), while being used in specific customer applications.
In many types of integrated circuits (ICs), such as field programmable gate arrays (FPGAs), it may be important to accurately detect hotspots within the ICs. Detecting hotspots within an FPGA is challenging, because the core logic region of an FPGA typically does not contain thermal sensors. Without accurate temperature measurements in an FPGA, it is difficult to determine if the temperature of a hotspot exceeds a desired temperature threshold. Therefore, it would be desirable to determine the temperatures of hotspots in various regions of an IC, such as an FPGA, so that corrective action can be taken when the optimal temperatures for these regions are exceeded.
Because the core logic region of an FPGA is modular, an irregular structure, such as a thermal sensor, is not well-suited to be placed into the core logic region of an FPGA. On the other hand, placing thermal sensors in the core logic region of an FPGA in a regular pattern is unnecessary, because the thermal sensors would consume a significant amount of die area and may not be needed by all potential FPGA users. In addition, placing temperature sensing diodes into the core logic region would suffer from additional limitations, including requiring an analog-to-digital converter (ADC) for each set of diodes. Each ADC may have a limited distance support that could cause overhead if a significant number of temperature sensing diodes are required by a user design.
One or more specific examples are described below. In an effort to provide a concise description of these examples, not all features of an actual implementation are described in the specification. It should be appreciated that in the development of any such actual implementation, as in any engineering or design project, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which may vary from one implementation to another. Moreover, it should be appreciated that such a development effort might be complex and time consuming, but would nevertheless be a routine undertaking of design, fabrication, and manufacture for those of ordinary skill having the benefit of this disclosure.
This disclosure discusses circuit systems that can be implemented in integrated circuit devices, including programmable logic devices such as field programmable gate arrays (FPGAs). As discussed herein, circuit systems may use hard logic and soft logic of an FPGA. As used herein, “hard logic” generally refers to circuits in an integrated circuit device (e.g., a programmable logic integrated circuit) that are not programmable by an end user. The circuits in the integrated circuit device that are programmable by the end user are considered “soft logic.”
According to some examples disclosed herein, logic circuits in an integrated circuit (IC) are configured to function as temperature sensors that sense the temperatures in different regions of the IC. The IC may be a programmable logic IC or another type of IC. The temperature sensing logic circuits may be, for example, soft logic circuits, hard logic circuits, or a combination thereof. Each of the logic circuits generates an output that indicates the temperature of one of the regions in the IC. Each of the logic circuits includes transistors having timing characteristics that change with the temperature of the logic circuit. For example, the delay of the logic circuit may become faster when the temperature of the logic circuit increases.
A temperature management controller circuit detects temperature changes in the regions of the IC based on the outputs of the temperature sensing logic circuits. The temperature management controller circuit is able to detect the temperature changes in the regions of the IC based on the performance changes indicated by the temperature sensing logic circuits. The temperature sensing logic circuits may indicate hotspots in the IC. The temperature management controller circuit may compare the temperatures in regions of the IC to one or more temperature thresholds based on the outputs of the temperature sensing logic circuits. In response to one or more of the temperature sensing logic circuits indicating that the temperature in one or more regions of the IC has reached or exceeded one or more of the temperature thresholds, the temperature management controller circuit may initiate corrective actions, for example, powering down the IC or throttling down switching activity in the IC, such as disabling clock or data signal inputs to the IC. The temperature management controller circuit may be implemented, for example, using soft logic in a core logic region of the IC. The corrective actions may occur on the IC die or at the circuit board or package level. For example, the corrective actions may include enabling clock gating in the IC or power cycling the IC.
Each of the frequency counter circuit 101 and the comparator circuit 102 of Figure (
The frequency counter circuit 101 may, for example, have a minimum frequency (e.g., of 100 MHz). The COUNT value is provided in one or more signals from the frequency counter circuit 101 to the comparator circuit 102. The comparator circuit 102 compares the COUNT value to predetermined values that correspond to different temperatures to generate an output OUT. The predetermined values are determined (e.g., by a designer) by calibrating the frequency counter circuit 101 across different temperatures. The predetermined values may vary between integrated circuits (ICs) due to process variations between the ICs. The comparator circuit 102 causes the output OUT to have a value that indicates the temperature of the temperature sensing logic circuit 100 based on the comparison between the COUNT value and the predetermined values that correspond to different temperatures. The value of the output OUT of comparator circuit 102 can be used to determine if the temperature sensing logic circuit 100 is in a hotspot of the IC.
Table 1 below shows examples of the COUNT values generated by counter circuit 101 and the corresponding temperatures and values for the output OUT of comparator circuit 102 that are generated in response to the respective COUNT values in the respective rows using a first set of the predetermined values for counter circuit 101. The examples shown in Table 1 are derived using a reference frequency of 1 megahertz (MHz) for signal FREF.
The temperature sensing logic circuit 100 can achieve greater accuracy in the temperature determination using a lower reference frequency for signal FREF. For example, signal FREF may have a frequency of 100 kilohertz (kHz) to increase the granularity of the output OUT of the comparator circuit 102. Table 2 below shows additional examples of the COUNT values generated by counter circuit 101 and corresponding temperatures and values for the output OUT of comparator circuit 102 that are generated in response to the respective COUNT values in the respective rows using a second set of the predetermined values for counter circuit 101. The examples in Table 2 below are derived using a reference frequency of 100 kHz for signal FREF.
In operation 201 of
In operations 206-208, the CAD tools perform various functions to mitigate the hotspots in the user design for the IC. For example, in operation 206, the CAD tools may perform input/output assignment to reduce the temperature of one or more of the hotspots. Operation 206 may, for example, be performed at the external circuit board level or internally in the IC to assign or re-assign input/output signals to different external terminals (e.g., pads or bumps) of the IC to reduce the temperatures of one or more of the hotspots in the IC. In operation 207, the CAD tools may perform clock signal gating. The clock signal gating performed in operation 207 may, for example, involve gating off (i.e., blocking) one or more clock signals at selected locations in the IC to reduce the temperatures of one or more of the hotspots in the IC. In operation 208, the CAD tools may perform clock signal switch over. The clock signal switch over of operation 208 may, for example, be performed by reducing the frequency of one or more clock signals at selected locations in the user design to reduce the temperatures of one or more of the hotspots in the IC. The user design for the IC is then complete at operation 209.
In operation 402, hotspot definitions are provided to the CAD tools. The hotspot definitions may include, for example, a prediction of hotspots in the user design, one or more temperature thresholds that indicate definitions of hotspots in the user design, or user input indicating potential locations of hotspots in the user design. In decision operation 403, the CAD tools determine if hotspots in the user design for the IC have been identified based on the hotspot definitions received in operation 402 and based on the temperature dependent analysis data generated in operation 401. If no hotspots are identified by the CAD tools in operation 403, then the CAD tools determine no change is to be made to the user design for the IC in operation 404.
If at least one hotspot is identified by the CAD tools in operation 403, the CAD tools generate a new compilation of the user design for the IC in operation 405 that reduces the temperatures of the one or more hotspots. In operation 405, the CAD tools generate a new compilation of the user design that includes a new synthesis, a new placement, and a new routing of the user design on the IC. The new compilation of the user design generated in operation 405 may, for example, eliminate the one or more hotspots identified in operation 403 by reducing the energy consumption in these regions of the user design to reduce the temperatures of the one or more hotspots below a temperature threshold.
If the IC is a programmable logic IC, such as an FPGA, a new configuration bitstream is generated for the IC based on the new compilation of the user design generated in operation 405. The new configuration bitstream is then provided to the IC. In operation 406, the programmable logic IC is then reconfigured with the new configuration bitstream to mitigate the hotspots in the user design. In the reconfigured IC generated in operation 406, the hotspots that were identified in the IC in operation 403 are either eliminated, or the temperatures of these hotspots is significantly reduced (e.g., below one or more temperature thresholds).
In decision operation 602, the temperature management controller 502 can, for example, determine when the temperature indicated by the output of the temperature sensor circuit 501 has reached or exceeded a temperature threshold that is indicative of a hotspot. In response to the temperature management controller 502 determining that the temperature indicated by the output of the temperature sensor circuit 501 has reached or exceeded a temperature threshold in operation 602, the temperature management controller 502 can activate various temperature mitigation functions in the IC to reduce the temperature of the hotspot. For example, in response to the temperature management controller 502 determining that the temperature indicated by the output of the temperature sensor circuit 501 has reached or exceeded a temperature threshold, the temperature management controller 502 may activate the functionality of one or more of the clock switch-over circuit 503, the clock gating circuit 504, or the power management controller circuit 505 to reduce the temperature of the hotspot. If the temperature management controller 502 does not detect a high temperature in the output of the temperature sensor circuit 501 in operation 602, the user design continues to run on the IC without changes in operation 601.
Temperature management controller 502 may activate the functionality of clock switch-over circuit 503, clock gating circuit 504, or power management controller circuit 505 by sending activation signals to these circuits through the paths shown by arrows in
In operation 604, temperature management controller 502 sends activation signals to the clock switch-over circuit 503. In response to receiving the activation signals from temperature management controller 502, clock switch-over circuit 503 changes the frequency (e.g., decreases the frequency) of one or more clock signals to reduce the temperature of the hotspot in operation 604. For example, clock switch-over circuit 503 may decrease the frequency of one or more clock signals that clock 10 circuitry 506. IO circuitry 506 may transmit output signals OUT to other devices outside of the IC of
In decision operation 605, the temperature management controller 502 determines if the temperature indicated by the output of the temperature sensor circuit 501 continues to be equal to or greater than the temperature threshold that is indicative of a hotspot. If operations 603-604 have not decreased the temperature of the hotspot below the temperature threshold, the temperature management controller 502 may then change the assignment of input/output (IO) signals to different external terminals (e.g., pins, pads, or bumps) of the IC in operation 606. For example, the temperature management controller 502 may transfer input and/or output signals from external terminals in a hotspot of the IC to external terminals in a cooler region of the IC in operation 606. If the temperature management controller 502 does not detect a high temperature in the output of the temperature sensor circuit 501 in operation 605, the user design continues to run on the IC without changes in operation 601.
In operation 607, temperature management controller 502 sends activation signals to the power management controller circuit 505. In response to receiving the activation signals from temperature management controller 502, power management controller circuit 505 controls load balancing within an integrated circuit (IC) package or circuit board that includes the IC of
The temperature control and mitigation system 500 may activate the power control signals PWRCTL in response to any one or more of the temperature sensors 711-715 indicating a hotspot in IC 700. The power control signals PWRCTL are provided from IC 700 to board management control system 710. In response to board management control system 710 sensing that the power control signals PWRCTL have been activated, board management control system 710 may activate device power control signals DPC. In response to the device power control signals DPC being activated by the board management control system 710, IC 700 may power off and then power on again (i.e., perform a power cycle). Board management control system 710 may power cycle other ICs in the IC package or on the circuit board using the power control signals DPC.
The temperature sensors, such as temperature sensors 711-715, can also be used in performance tuning for critical timing paths in the IC 700. At locations in the critical timing paths where the temperatures indicated by the temperature sensors are not at a temperature threshold indicative of a hotspot, IR drop (i.e., voltage drop) can be reduced, which causes the critical timing paths to run at a higher voltage and to have a higher performance as a result. Critical timing path areas in the IC can also be designed to have lower logic circuit utilization to improve performance.
Programmable logic IC 10 contains memory elements 20 that can be loaded with configuration data using pads 14 and input-output circuitry 12. Once loaded, the memory elements 20 may each provide a corresponding static control output signal that controls the state of an associated logic component in programmable logic circuitry 18. Typically, the memory element output signals are used to control the gates of field-effect transistors. In the context of programmable logic integrated circuits, the memory elements 20 store configuration data and are sometimes referred to as configuration random-access memory (CRAM) cells.
In general, software and data for performing any of the functions disclosed herein may be stored in non-transitory computer readable storage media. Non-transitory computer readable storage media is tangible computer readable storage media that stores data for a significant period of time, as opposed to media that only transmits propagating electrical signals (e.g., wires). The software code may sometimes be referred to as software, data, program instructions, instructions, or code. The non-transitory computer readable storage media may include computer memory chips, non-volatile memory such as non-volatile random-access memory (NVRAM), one or more hard drives (e.g., magnetic drives or solid state drives), one or more removable flash drives or other removable media, compact discs (CDs), digital versatile discs (DVDs), Blu-ray discs (BDs), other optical media, and floppy diskettes, tapes, or any other suitable memory or storage device(s).
The following are additional examples. Example 1 is an integrated circuit system comprising: a temperature sensor circuit that generates an output indicative of a temperature in an integrated circuit in the integrated circuit system; a temperature management controller circuit that compares the temperature indicated by the output of the temperature sensor circuit to a temperature threshold; and temperature reduction circuitry that controls the temperature in the integrated circuit, wherein the temperature management controller circuit causes the temperature reduction circuitry to reduce the temperature in response to the temperature indicated by the output of the temperature sensor circuit exceeding the temperature threshold.
In Example 2, the integrated circuit system of Example 1 may optionally include, wherein the temperature sensor circuit comprises a frequency counter circuit that generates a count value in response to a periodic signal and a comparator circuit that compares the count value with predetermined values corresponding to temperatures to generate the output indicative of the temperature.
In Example 3, the integrated circuit system of any one of Examples 1-2 may optionally include, wherein the temperature sensor circuit comprises programmable logic circuits, and wherein the temperature management controller circuit causes the temperature reduction circuitry to reduce voltage drop at a location in a critical timing path in response to the temperature indicated by the output of the temperature sensor circuit being less than the temperature threshold.
In Example 4, the integrated circuit system of any one of Examples 1-3 may optionally include, wherein the temperature sensor circuit comprises non-programmable logic circuits.
In Example 5, the integrated circuit system of any one of Examples 1˜4 may optionally include, wherein the temperature sensor circuit detects performance changes in the integrated circuit system to determine a change in the temperature.
In Example 6, the integrated circuit system of any one of Examples 1-5 may optionally include, wherein the temperature sensor circuit causes the output to be indicative of a hotspot in the integrated circuit in response to detecting a frequency that exceeds a frequency threshold.
In Example 7, the integrated circuit system of any one of Examples 1-6 may optionally include, wherein the temperature reduction circuitry comprises a clock switch-over circuit, and wherein the temperature management controller circuit causes the clock switch-over circuit to reduce a frequency of a clock signal in the integrated circuit in response to the temperature indicated by the output of the temperature sensor circuit exceeding the temperature threshold.
In Example 8, the integrated circuit system of any one of Examples 1-7 may optionally include, wherein the temperature reduction circuitry comprises a clock gating circuit, and wherein the temperature management controller circuit causes the clock gating circuit to block a clock signal from being provided to at least one circuit in the integrated circuit in response to the temperature indicated by the output of the temperature sensor circuit exceeding the temperature threshold.
In Example 9, the integrated circuit system of any one of Examples 1-8 may optionally include, wherein the temperature reduction circuitry comprises a power management controller circuit, and wherein the temperature management controller circuit causes the power management controller circuit to reduce power consumption in multiple integrated circuits in the integrated circuit system in response to the temperature indicated by the output of the temperature sensor circuit exceeding the temperature threshold.
Example 10 is a method for controlling a temperature in an integrated circuit, the method comprising: sensing the temperature using a temperature sensor circuit; comparing the temperature indicated by the temperature sensor circuit to a temperature threshold using a temperature management controller circuit; and causing temperature reduction circuitry to reduce the temperature in at least a portion of the integrated circuit in response to the temperature management controller circuit detecting that the temperature has reached the temperature threshold.
In Example 11, the method of Example 10 may optionally include, wherein sensing the temperature using the temperature sensor circuit comprises generating a count value using a frequency counter circuit in response to a reference periodic signal, and comparing the count value generated by the frequency counter circuit to pre-simulated data indicative of multiple temperatures to determine the temperature of a hotspot using a comparator circuit.
In Example 12, the method of any one of Examples 10-11 may optionally include, wherein sensing the temperature using the temperature sensor circuit comprises causing an output of the temperature sensor circuit to be indicative of a hotspot in the integrated circuit in response to detecting a frequency of a signal in the integrated circuit that exceeds a frequency threshold.
In Example 13, the method of any one of Examples 10-12 may optionally include, wherein causing the temperature reduction circuitry to reduce the temperature comprises causing the temperature reduction circuitry to reduce voltage drop at a location in a critical timing path in response to the temperature indicated by the temperature sensor circuit being less than the temperature threshold.
In Example 14, the method of any one of Examples 10-13 may optionally include, wherein causing the temperature reduction circuitry to reduce the temperature comprises reducing a frequency of a clock signal in the integrated circuit using a clock switch-over circuit in response to the temperature indicated by the temperature sensor circuit reaching the temperature threshold.
In Example 15, the method of any one of Examples 10-14 may optionally include, wherein causing the temperature reduction circuitry to reduce the temperature comprises blocking a clock signal from being provided to at least one circuit in the integrated circuit using a clock gating circuit in response to the temperature indicated by the temperature sensor circuit reaching the temperature threshold.
In Example 16, the method of any one of Examples 10-15 may optionally include, wherein causing the temperature reduction circuitry to reduce the temperature comprises reducing utilization of logic circuitry in the integrated circuit in response to the temperature indicated by the temperature sensor circuit reaching the temperature threshold.
Example 17 is a non-transitory computer readable storage medium comprising instructions stored thereon for causing a computer to execute a method for generating a circuit design for an integrated circuit using a circuit design tool, the method comprising: performing compilation and simulation of the circuit design for the integrated circuit using the circuit design tool; calculating energy consumption in regions of the circuit design using the circuit design tool; identifying hotspots in the circuit design using the circuit design tool based on the energy consumption in the regions of the circuit design; and placing temperature sensor circuits in the regions of the circuit design identified as the hotspots.
In Example 18, the non-transitory computer readable storage medium of Example 17 may optionally include, wherein the method further comprises: assigning input signals or output signals to selected external terminals of the integrated circuit to reduce a temperature of at least one of the hotspots in response to an output of one of the temperature sensor circuits.
In Example 19, the non-transitory computer readable storage medium of any one of Examples 17-18 may optionally include, wherein the method further comprises: reducing a frequency of a clock signal in the integrated circuit using a clock switch-over circuit to reduce a temperature of at least one of the hotspots in response to an output of one of the temperature sensor circuits.
In Example 20, the non-transitory computer readable storage medium of any one of Examples 17-19 may optionally include, wherein identifying the hotspots in the circuit design comprises identifying the hotspots based on definitions of the hotspots, and wherein the method further comprises: generating a new compilation of the circuit design for the integrated circuit that reduces a temperature of at least one of the hotspots by relocating logic and resource allocation away from high temperature regions of the circuit design to lower temperature regions of the circuit design.
It will be recognized by one skilled in the art, that the examples disclosed herein may be practiced without some or all of these specific details. In other instances, well-known operations have not been described in detail in order not to obscure the present examples. It should be appreciated that the examples disclosed herein can be implemented in numerous ways, such as a process, an apparatus, a system, a device, or a method on a computer readable medium.
The foregoing description of the examples has been presented for the purpose of illustration. The foregoing description is not intended to be exhaustive or to be limiting to the examples disclosed herein. In some instances, features of the examples can be employed without a corresponding use of other features as set forth. Many modifications, substitutions, and variations are possible in light of the above teachings.