Computer cooling solutions can allow for the removal of waste heat produced by computer equipment, which can help keep such equipment within certain operating temperature limits. Certain computer equipment components, such as integrated circuits, CPUs, chipset, graphics cards, and hard disk drives, are especially susceptible to temporary malfunction or permanent failure if overheated. Such components are often designed to minimize heat generation. Likewise, some computer operating systems are designed to reduce power consumption and related heat generation. Moreover, certain computer systems rely on one or more dedicated cooling solutions to remove unwanted heat. For example, some computer systems rely on fans, heat sinks, and related cooling devices to reduce temperature by actively exhausting hot air.
The following discussion is directed to various examples of the disclosure. Although one or more of these examples may be preferred, the examples disclosed herein should not be interpreted, or otherwise used, as limiting the scope of the disclosure, including the claims. In addition, the following description has broad application, and the discussion of any example is meant only to be descriptive of that example, and not intended to intimate that the scope of the disclosure, including the claims, is limited to that example. Throughout the present disclosure, the terms “a” and “an” are intended to denote at least one of a particular element. In addition, as used herein, the term “includes” means includes but not limited to, the term “including” means including but not limited to. The term “based on” means based at least in part on.
There are ever-increasing demands for more IT data capacity and faster access for both consumer and enterprise markets. These demands have led to the development of new computing architectures that can more effectively and efficiently manage massive amounts of data. For example, new technologies have emerged that allow for the integration of multiple chips, semiconductor dies, and discrete components into a single package, such as a Multi-Chip Module (MCM). Thermal management is important to successfully implementing such architectures—and especially so in data center environments.
Challenges related to thermal management of such technologies can, for example, include: (1) different temperature limits between component types within a package (e.g., in situations where a memory chip case temperature specification is lower than a temperature specification for a main processor or controller die), (2) situations in which there is a strong thermal cross-talk between different components, and (3) in situations where vertical stacking of components (e.g., memory devices) leads to magnification of hot spot temperatures. Each of these challenges may lead to designs that seek to lower an effective case temperature (Tc) specification with an increase in thermal design power levels (Ptotal). The above challenges may be amplified in situations where data centers are operated at high inlet air temperatures (Ta) or use warm water for waste heat reuse.
Certain implementations of the present disclosure are directed to addressing the above challenges. In some implementations, an MCM is described that includes: (1) a heat sink; (2) a circuit board; (3) a first chip secured to a first location on the circuit board; (4) a first vapor chamber thermally coupled to the first chip to pass heat generated by the first chip to the heat sink; (5) a second chip secured to a second location on the circuit board; and (6) a second vapor chamber thermally coupled to the second chip to pass heat generated by the second chip to the heat sink. In some implementations, a portion of the second vapor chamber is positioned between a portion of the first vapor chamber and the heat sink and the first vapor chamber is substantially thermally insulated from the second vapor chamber.
This arrangement of vapor chambers and other implementations described herein can provide one or more of the following advantages: (1) reduced thermal cross-talk between components, which can reduce a total thermal resistance for the MCM; (2) increased temperature margins for components; (3) improved thermal resistance under the same available physical space; and (4) lower air flow used compared to current cooling solutions, which may result in a reduction on the total product power consumption and related energy savings. Other advantages of implementations presented herein will be apparent upon review of the description and figures.
As provided herein, certain implementations of the present disclosure are directed to designs that can reduce thermal-cross talk between components of MCM 100 (e.g., first chip 106 and second chip 108). The term “Multi-Chip Module” or “MCM” as used herein can, for example, refer generally to an electronic assembly where multiple integrated circuits, semiconductor dies, and/or other discrete components are integrated, usually onto a unifying substrate. Such an MCM can, for example, be treated as if it were a single component (e.g., as though it were a larger Integrated Circuit (IC)). Suitable electronic assemblies can, for example, refer to packages including conductor terminals or “pins”). In suitable contexts, the term “MCM” can also refer to related industry terms, such as “hybrid” or “hybrid integrated circuit.” MCMs can, for example, be used with certain processors, graphic processing units (GPUs), non-volatile memory DIMM devices, gaming consoles, portable storage devices (e.g., USB drives, memory cards, etc.), etc. In some implementations, an MCM may rely on a different layout architecture, which can depend on its application and physical limitations. Such layout architectures can, for example, be a 2D architecture (e.g., tiled horizontally or stacked vertically) or 2.5D/3D architectures (e.g., tiled horizontally and stacked vertically).
It is appreciated that one or more aspects described herein can be applied to other suitable electronic components or assemblies other than MCMs. For example, in some implementations, aspects can be applied to a heat generating component that is not in the form of a “chip.” In such an implementation, first vapor chamber 110 and second vapor chamber 112 can comprise a vapor chamber system for cooling heat generating components of circuit board 104. In some implementations, first vapor chamber 110 is to pass heat generated by a first heat generating component of circuit board 104 to heat sink 102. In such an implementation, second vapor chamber 112 is to pass heat generated by a second heat generating component of circuit board 104 to heat sink 102. In some implementation of such a system, a portion of second vapor chamber 112 is positioned between a portion of first vapor chamber 110 and heat sink 102 and the portion of second vapor chamber 112 is thermally insulated from the portion of first vapor chamber 110.
As provided above, MCM 100 includes a heat sink 102 that receives heat generated by first chip 106 via first vapor chamber 110 and receives heat generated by second chip 108 via second vapor chamber 112. The term “heat sink” as used herein can, for example, refer to a passive heat exchanger that transfers heat generated by an electronic device to a cooling medium (e.g., air or a liquid coolant), where it is dissipated away from the device. This can, for example, allow regulation of the device's temperature. One or more heat sinks of the present disclosure can be designed to maximize a surface area in contact with the cooling medium surrounding it. One or more heat sinks of the present disclosure can be made of copper, aluminum, and/or another suitable material.
In some implementations, heat sink 102 can, for example, include heat sink fins.
In some implementations, such as the implementation depicted in
As provided above, MCM 100 includes a circuit board 104 to which the first chip 106 and second chip 108 secured at respective locations on circuit board 104. As used herein, the term “circuit board” can, for example, refer to a printed circuit board (PCB) that mechanically supports and electrically connects electronic components using conductive tracks, pads and other features. Such a circuit board can, for example, rely on copper sheet etchings that are laminated onto a non-conductive substrate. Components (e.g. capacitors, resistors or active devices) can, for example, be soldered on such a PCB. It is appreciated that certain suitable PCBs can, for example, include components embedded in the substrate.
The term “chip” as used herein can, for example, refer to an integrated circuit or monolithic integrated circuit and can also be referred to as an “IC” or “microchip”. Such a chip can, for example, be in the form of a set of electronic circuits on a small flat piece of semiconductor material (e.g., silicon). First chip 106 can, for example, be in the form of a Central Processing Unit (CPU) and second chip 108 can, for example, be in the form of a memory chip. As used herein, the terms “Central Processing Unit’ or “CPU” can, for example, refer to electronic circuitry within a computer that carries out instructions of a computer program by performing arithmetic, logical, control and input/output (I/O) operations specified by the instructions. The terms can, for example, refer to a processing unit and control unit (CU) as distinguished from main memory and I/O circuitry. Such a CPU can, for example, be in the form of a microprocessor on a single integrated circuit (IC) chip. It is appreciated that an IC chip that contains a CPU may also contain memory, peripheral interfaces, and other components of a computer. Such an integrated device can, for example, be referred to as a microcontrollers or systems on a chip (SoC), or MCM. As used herein, the term “memory chip” can, for example, refer to an IC used to store data or process code. The IC can, for example, include capacitors and transistors and can, for example, hold memory temporarily (e.g., through random access memory (RAM) or permanently (e.g., through read only memory (ROM)).
As provided above, MCM 100 includes a first vapor chamber 110 that is thermally coupled to first chip 106 to pass heat generated by first chip 106 to heat sink 102. MCM 100 also includes a second vapor chamber 112 of MCM 100 that is thermally coupled to second chip 108 to pass heat generated by second chip 108 to heat sink 102. The term “vapor chamber” as used herein can, for example, refer to an arrangement that attempts to maximize the use of surface area available from a heat sink. In certain vapor chambers, a liquid (e.g., water) can evaporate on a “powered component side” of the chamber. The resulting vapor can spread uniformly on the other side of the chamber, which may be referred to as the “condenser side” of the chamber. The vapor can condense to water on the condenser side, which may be self-recirculated under surface tension force within the vapor chamber.
In some implementations, first vapor chamber 110 can include a suitable first type of liquid or gas selected to accommodate thermal characteristics of first chip 106. In some implementations, liquid within first vapor chamber 110 is water, methanol, or acetone. In some implementations, second vapor chamber 112 can include a suitable second type of liquid or gas selected to accommodate different thermal characteristics of second chip 108. It is appreciated that in some implementations, a liquid or gas of first vapor chamber 110 may be the same as a liquid or gas of second vapor chamber 112. In some implementations, a volume of first vapor chamber 110 is sized to accommodate thermal characteristics of first chip 106. Likewise, in some implementations, a volume of second vapor chamber 112 is sized to accommodate different thermal characteristics of second chip 108. It is appreciated that in some implementations, the volume of first vapor chamber 110 is the same as the volume of second vapor chamber 112.
As depicted in the example implementation of
In some implementations, MCM 100 includes an insulation layer between first vapor chamber 110 and second vapor chamber 112 to substantially thermally insulate first vapor chamber 110 from second vapor chamber 112.
Various example implementations for the present disclosure will now be described. It is appreciated that these examples may include or refer to certain aspects of other implementations described herein (and vice-versa), but are not intended to be limiting towards other implementations described herein. Moreover, it is appreciated that certain aspects of these implementations may be applied to other implementations described herein.
In some implementations, an evaporative cooling solution can address thermal challenges of MCMs (e.g., MCM 100) and similar electronic equipment. The solution can, for example, incorporate the design of multiple vapor cavities to redirect heat generated by electronic components into specific areas for controlled heat extraction. This can, for example, result in improved thermal management of devices inside multi-chip packages by reducing thermal cross-talk between components and may further result in a reduction of the effective thermal resistance obtain for this type of cooling solution, which has the potential for energy savings by lowering the power consumption required to cool these devices.
When certain electronic devices are positioned close together such that they share common heat transfer paths there is a thermal interaction between them where the heat generated from the higher power level devices affects the temperature conditions of lowered-power ones.
We can observe from the equation above that the final temperature of each device (TMi) can be expressed as the ambient temperature (Ta) plus the temperature contribution of each of the neighbor devices and itself. This contribution can be referred to as “thermal influence”. For this test case, we can arrange this thermal influence into three main components: 1) the thermal influence of the CPU (main heat source), 2) the thermal influence of all the neighbor memory chips, and 3) its own thermal influence.
We then apply this “cross-talk” methodology to investigate how the cooling solutions available in the market perform for MCMs. Multiple thermal simulations, using computational fluid dynamic (CFD) modeling, were performed with a heat sink placed on top of the test case MCM (see
When a typical vapor chamber (VC) device is inserted as part of heat sink base we can obtain a significant case temperature reduction on most of the chips (and in particular the CPU) as shown in table 1 below:
This can be achieved due to the benefits of the vapor chamber to improve the heat spreading across the heat sink and maximize the utilization of the surface area available. However, because of this reduction in the spreading resistance, some of the memory chips (M5 to M8) experience an increase in temperature which relates to an increase in the thermal influence of the CPU as shown in the table below. In other words, the thermal cross-talk between components on a MCM increases when current vapor chamber devices are used.
As depicted in
CFD simulations were performed based on this new cooling solution and the results were compared with results of other solutions including a solid base solution and a single vapor chamber solution. The results show a dramatic reduction in the thermal influence of the CPU on the memory devices (see Table 3 below) and a reduction in the final temperature of all the devices inside the MCM (see Table 4 below). As demonstrated by these experimental results, the solutions of the present disclosure offer an improved cooling solution for multi-chip modules due to its ability to reduce the thermal cross-talk between devices in the module.
Electronic device 132 includes multiple packages 134 having a first heat generating component 136, a second heat generating component 138, a first sealed vapor chamber 140, and a second sealed vapor chamber 142. The interior of packages 134 are not depicted in
First sealed vapor chamber 140 can, for example, be partially filled with a first liquid, the first liquid to vaporize in response to heat generated by first heat generating component 136. Likewise, second sealed vapor chamber 142 can, for example, be partially filled with a second liquid, the second liquid to vaporize in response to heat generated by second heat generating component 138. In this implementation, first vapor chamber 110 partially overlaps second vapor chamber 112 and is sized to accommodate an expected thermal load of first heat generating component 136 that is larger than an expected thermal load of second heat generating component 138. Such a vapor chamber system can, for example, include an insulating layer (e.g., layer 118) sandwiched between first vapor chamber 110 and second vapor chamber 112 to thermally insulate first vapor chamber 110 from second vapor chamber 112.
While certain implementations have been shown and described above, various changes in form and details may be made. For example, some features that have been described in relation to one implementation and/or process can be related to other implementations. In other words, processes, features, components, and/or properties described in relation to one implementation can be useful in other implementations. Furthermore, it should be appreciated that the systems and methods described herein can include various combinations and/or sub-combinations of the components and/or features of the different implementations described. Thus, features described with reference to one or more implementations can be combined with other implementations described herein. As used herein, “a” or “a number of” something can refer to one or more such things. For example, “a number of widgets” can refer to one or more widgets.