1. Field of the Invention
The present invention generally relates to cooling systems. More specifically, the present invention is directed to the detection of faulty central processing unit (CPU) heat sink coupling during system power-up.
2. Related Art
With the increase in heat dissipation and the reduction in overall form factors, thermal management of CPUs and other types of integrated circuits has become an increasingly important element of electronic product design.
Heat sinks are devices that enhance heat dissipation from a hot surface, usually the packaging of a heat generating component such as a CPU or other integrated circuit, to a cooling fluid, usually air. Heat sinks typically include a large number of fins to increase the surface area that is in direct contact with air. This allows more heat to be dissipated to control the component operating temperature. Some newer heat sinks also include vapor chambers that are configured to use an internal liquid-to-vapor phase change to spread heat efficiently to air-cooled fins.
The surface of a CPU or heat sink is never entirely flat; if a heat sink is placed directly on a CPU, there will be tiny air gaps between the two. Since air conducts heat poorly, these gaps have a negative effect on heat transfer to the heat sink. Therefore, a heat sink compound with a high thermal conductivity is generally used to fill these gaps, and thus improve heat conductivity between the CPU and heat sink.
In some cases, the heat sink compound must be precisely applied during the attachment of a heat sink to a CPU, and cannot be applied in the field. As such, if the heat sink is removed from the CPU and the heat sink compound is the least bit disturbed, a new heat sink must be supplied, increasing the duration of repair.
A faulty or improperly mounted heat sink will not sufficiently cool a CPU and will likely result in the overheating of the CPU at the most inopportune time. There is a need, therefore, for a technique for predicting this type of fault prior to the overheating of a CPU.
The present invention is directed to the detection of faulty CPU heat sink coupling during system power-up.
A first aspect of the present invention is directed to a method for detecting faulty central processing unit (CPU) heat sink coupling during system power-up, comprising: monitoring a slope of a CPU temperature rise from initial system power-up; determining if the slope of the CPU temperature rise exceeds an expected value; and in the case that the slope of the CPU temperature rise exceeds the expected value, indicating an existence of a possible fault (PFA) related to a heat sink coupled to the CPU.
A second aspect of the present invention is directed to a system for detecting faulty central processing unit (CPU) heat sink coupling during system power-up, comprising: a system for monitoring a slope of a CPU temperature rise from initial system power-up; a system for determining if the slope of the CPU temperature rise exceeds an expected value; and a system for indicating an existence of a possible fault (PFA) related to a heat sink coupled to the CPU, in the case that the slope of the CPU temperature rise exceeds the expected value.
The illustrative aspects of the present invention are designed to solve the problems herein described and other problems not discussed.
These and other features of this invention will be more readily understood from the following detailed description of the various aspects of the invention taken in conjunction with the accompanying drawings.
The drawings are merely schematic representations, not intended to portray specific parameters of the invention. The drawings are intended to depict only typical embodiments of the invention, and therefore should not be considered as limiting the scope of the invention. In the drawings, like numbering represents like elements.
Under known conditions (air flow, ambient temperature, initial CPU temperature at power-on, etc.), a CPU 12 with a properly mounted heat sink 14 will have a predictable temperature rise during initial system power-up. In accordance with the present invention, the slope of the CPU temperature rise after initial system power-up is monitored in step S1. The temperature of the CPU 12 can be obtained, for example, using one or more CPU temperature sensors 16 mounted directly on the CPU 12 or underneath the CPU 12 on a motherboard. Some CPUs have temperature sensors integrated into their silicon. Other techniques for obtaining CPU temperature readings are also possible.
If the slope of the CPU temperature rise exceeds the expected norm 18 as determined in step S2, it is likely that there is a problem with the heat sink 14 or its interface 20 with the CPU 12. The steeper the slope, the more likely there is a fault related to the heat sink 14. A marginally steep slope with a subsequent over-temperature fault could also indicate a heat sink coupling problem. If the slope of the CPU temperature rise exceeds the expected norm 18, the existence of a possible fault (PFA) is indicated in step S4. The expected norm 18 of the slope of the CPU temperature rise can be determined by monitoring the temperature of one or more of the same combination of CPU 12 and heat sink 14, which are operating normally, or in any other suitable manner. If the slope of the CPU temperature rise does not exceed the expected norm 18 (step S2), and system power-up has been completed (step S3), the process ends. Otherwise flow passes back to step S1.
Upon the indication of a possible fault (PFA) associated with the heat sink 14 in step S4, further tests can be run in step S5 to obtain a more precise reading of the slope of the CPU temperature rise. Testing can be provided, for example, via the system basic input/output system (BIOS), power-on self test (POST) BIOS, baseboard management controller (BMC), etc., or in any other suitable manner. Sufficient data can be collected during such tests to determine in step S6 whether to flag 22 the PFA of the heat sink 14 in step S7. The output of other types of sensors (e.g., airflow, ambient temperature, etc.) can be collected to help determine whether to flag 22 the PFA of the heat sink 14.
If an abnormal slope is detected, the BIOS can delay passing control to the operating system (OS) and run a routine to maximize the temperature of the CPU 12. This type of max load routine can be run until the slope of the CPU temperature rise stabilizes or the temperature of the CPU 12 rises outside of specification.
Flagging a PFA of the heat sink 14 early during power-up provides many advantages, including, for example:
An illustrative temperature plot of a first CPU (CPU0) with a heat sink and a second CPU (CPU1) with a heat sink, both operating as expected, is depicted in
The present invention or portions thereof can be implemented on any now known or later developed computer system that is capable of executing computer program code. The computer program code can be provided on a computer-readable medium or provided in any other suitable manner.
The foregoing description of the embodiments of this invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and many modifications and variations are possible.
Number | Name | Date | Kind |
---|---|---|---|
5491610 | Mok et al. | Feb 1996 | A |
6349269 | Wallace, Jr. | Feb 2002 | B1 |
6577146 | Gamache et al. | Jun 2003 | B2 |
6842718 | Byrd et al. | Jan 2005 | B2 |
7141953 | Cohen et al. | Nov 2006 | B2 |
7275248 | Nishida | Sep 2007 | B2 |
7574321 | Kernahan et al. | Aug 2009 | B2 |
7607828 | Beier et al. | Oct 2009 | B2 |
20020087904 | Cai | Jul 2002 | A1 |
20030063437 | Kurihara | Apr 2003 | A1 |
20040006721 | Getz et al. | Jan 2004 | A1 |
20050068053 | Conti et al. | Mar 2005 | A1 |
20060122740 | Law et al. | Jun 2006 | A1 |
20070097620 | Leech et al. | May 2007 | A1 |
20080155288 | Cai | Jun 2008 | A1 |
20080215901 | Beard | Sep 2008 | A1 |
Number | Date | Country |
---|---|---|
6038544 | Feb 1994 | JP |
8174895 | Jul 1996 | JP |
Number | Date | Country | |
---|---|---|---|
20080154536 A1 | Jun 2008 | US |