The present disclosure relates to methods, apparatus, and products for multiple chip module direct attach cooling.
According to embodiments of the present disclosure, various methods, apparatus and products for multiple chip module direct attach cooling are described herein. In some aspects, multiple chip module direct attach cooling includes an apparatus that includes a manifold body including a plurality of heat transfer structures therein. The plurality of heat transfer structures are configured to be thermally coupled to a plurality of heat generating components. The apparatus further includes a plurality of flow directing structures within the manifold body. The plurality of flow directing structures configured to direct a first portion of a fluid flow within the manifold body to a first heat transfer structure of the plurality of heat transfer structures and a second portion of the fluid flow within the manifold body to a second heat transfer structure of the plurality of heat transfer structures.
Computer systems typically include a combination of hardware and software components, application programs, operating systems, processors, buses, memory, input/output devices, and so on. As advances in semiconductor processing and computer architecture push the performance of the computer higher and higher, more sophisticated computer software has evolved to take advantage of the higher performance of the hardware, resulting in computer systems today that are much more powerful than just a few years ago. With the increase in performance of computer system hardware, the need to remove heat from heat generating components such as processors, for example central processing units (CPUs) and graphics processing units (GPUs), is becoming increasingly more important. Thermal cooling solutions are usually employed to assist in heat removal from processors. Often one or more cooling elements are thermally coupled to the heat generating component, such as processor cores, to dissipate heat generated by the component. Such cooling elements include, for example, cold plates, heat sinks, fluid cooling systems (e.g., water cooling systems), vapor chambers, heat pipes, fans, and the like to conduct heat to dissipate the heat out of the computing device. However, existing solutions may not adequately cool components of a computing system.
Compliant cold plate technology has been utilized as a thermal cooling solution for large die packages such as large Single Chip Modules (SCMs), Dual Chip modules (DCMs), and Multi-Chip Modules (MCMs). MCMs often include both high performance cores and low performance cores. Compliant cold plates include a compressible layer that drives the compliant cold plate to conform to dimensional differences between the cold plate and the heat generating component being cooled. Compliance is important for system reliability and consistent thermal performance across large die (e.g., SCM, DCM, and MCM) packages.
Current Dual Chip modules are typically cooled using compliant direct attach cooling in which compliant cold plates are directly attached to the chip modules to be cooled. The cold plates for these modules can be implemented in multiple ways including options which flow coolant across (e.g., perpendicular to) a row of high performance cores. One conventional option is to utilize standard sawn channels in the cold plate to flow coolant in this way. Alternately, the coolant may flow through mesh structures within the cold plate. The arrangement of high performance cores in rows rather than some other more thermally preferable arrangement such as a checkerboard pattern is a requirement of some base chip designs which cannot be changed without significantly affecting system performance. Some emerging MCM designs, such as Quad Chip Module (QCM) designs utilize core configurations which place chips with these rows of cores next to each other but with the cores of neighboring chips oriented orthogonally with respect to each other. With coolant flow in one direction, coolant will flow along the core row for some of the chips, resulting in large preheat and substantially elevated downstream core temperatures which may be unacceptable in product implementation. QCM designs typically have areas to be cooled that are large in both directions compared to DCM designs where one dimension is much larger than the other. Compliant direct attach utilizes flexibility in the cold plate to create a mechanically reliable thermal solution. Sawn channel type cold plates are much stiffer along the channel direction. As a result, increasing this dimension as would be required for a QCM may be problematic for reliability as well as thermal performance.
Various embodiments described herein provide for cold plates with high performance heat transfer structures providing cooling zones over the cores of a chip and utilize lower performing structures elsewhere to simultaneously cool low power areas and route coolant to high performance zones such that cooling flows perpendicular to core rows in these areas.
In one or more embodiments, a manifold body or other container is provided that includes a number of heat transfer structures within the manifold body that are configured to be thermally coupled to a number of heat generating components of a multiple chip module. A heat transfer structure is a structure configured to transfer heat from the heat generating components into a coolant. In one or more embodiments, this coolant is flowing within the manifold body. In particular embodiments, the manifold body is configured to be directly coupled, generally with a thermal interface material (TIM), to the heat generating components. In particular embodiments, the manifold body is configured to contain a flow of heat transfer fluid to facilitate cooling of electronic components. In a particular embodiment, the manifold body forms a cold plate. In particular embodiments, one or more of the heat generating components are chips such as processors, memory, or other electronic components. In particular embodiments, one or more of the heat generating components are arranged orthogonally with respect to another of the heat generating components. A number of flow directing structures (or flow blockers) are provided within the manifold body configured to direct a first portion of a fluid flow within the manifold body to a first heat transfer structure of the number of heat transfer structures and a second portion of the fluid flow within the manifold body to a second heat transfer structure of the number of heat transfer structures. In some implementations, the fluid flow within the manifold body is split and directed to higher heat load portions of the chips of the multiple chip module (e.g., processor cores) before being directed to lower heat load portions of the chips. For example, in particular embodiments, plenums and other non-high-performance zones are configured to direct flow where it is desired while providing adequate cooling for low power chip areas within lower performance mesh or an open plenum.
In some implementations, the manifold body is configured to be compliant under a distributed pressure load when thermally coupled to the heat generating components to allow the manifold body to conform to the top surfaces of the chips of the multiple chip module to increase thermal transfer. In particular embodiments, the flow directing structures are constructed in a mesh type technology either as somewhat rigid walls or as structures that have a high permeability (e.g., with small holes) but remain somewhat flexible. In some implementations, the heat transfer structures are configured to distribute a mechanical load from a first side of the manifold body to a second side of the manifold body. In particular implementations, one or more of the heat transfer structures may have a different heat transfer characteristics and/or a different pressure drop characteristic than others of the heat transfer structures.
With reference now to
In the particular embodiment shown in
The heat transfer structures 304A-304H are configured to remove a portion of heat from the high heat load areas (e.g., core areas) of the heat generating components 102A-102D and transfer the portion of heat to the heat transfer fluid. In a particular embodiment, one or more of the heat transfer structures 304A-304H has a different heat transfer characteristic than one or more of the other heat transfer structures 304A-304H. In another particular embodiment, one or more of the heat transfer structures 304A-304H has a different pressure drop characteristic than one or more of the other heat transfer structures 304A-304H.
The manifold body 302 includes a manifold inlet 306 for receiving an input of fluid into the manifold body 302, and a manifold outlet 308 for outputting the fluid. In the example illustrated in
After the first portion of the flow 312 receives heat from the first heat generating component 102A, a portion of the flow directing structures 310 further direct the first portion of the flow to the fifth heat transfer structure 304E and the sixth heat transfer structure 304F of the third heat generating component 102C. After the second portion of the flow 312 receives heat from the second heat generating component 102B, a portion of the flow directing structures 310 further direct the second portion of the flow to the seventh heat transfer structure 304G and the eighth heat transfer structure 304H of the fourth heat generating component 102D. In particular embodiments, one or more of the flow directing structures 310 further includes one or more openings in a wall of the flow direction structure 310 to allow a portion of heat transfer fluid to pass through to prevent or mitigate stagnation of the flow of the heat transfer fluid.
After removing a portion of heat from each of the first heat generating component 102A, the second heat generating component 102B, the third heat generating component 102C, and the fourth heat generating component 102D, the flow 312 is then directed to the manifold outlet 308. In one or more embodiments, the heat transfer fluid from the manifold outlet 308 is transferred to a cooling component, such as a radiator, to remove a portion of the heat transferred to the heat transfer fluid, and the heat transfer fluid is returned to the manifold inlet 306.
The heat transfer structures 504A-504H are configured to remove a portion of heat from the high heat load areas (e.g., core areas) of the heat generating components 102A-102D and transfer the portion of heat to the heat transfer fluid. In a particular embodiment, one or more of the heat transfer structures 504A-504H has a different heat transfer characteristic than one or more of the other heat transfer structures 504A-504H. In another particular embodiment, one or more of the heat transfer structures 504A-504H has a different pressure drop characteristic than one or more of the other heat transfer structures 504A-504H.
The manifold body 502 includes a manifold inlet 506 for receiving an input of fluid into the manifold body 502, and a manifold outlet 508 for outputting the fluid. In the example illustrated in
After the first portion of the flow 512 receives heat from the first heat generating component 102A, a portion of the flow directing structures 510 further direct the first portion of the flow to the fifth heat transfer structure 504E and the sixth heat transfer structure 504F of the third heat generating component 102C. After the second portion of the flow 512 receives heat from the second heat generating component 102B, a portion of the flow directing structures 510 further direct the second portion of the flow to the seventh heat transfer structure 504G and the eighth heat transfer structure 504H of the fourth heat generating component 102D. In particular embodiments, one or more of the flow directing structures 510 further includes one or more openings in a wall of the flow directing structure 510 to allow a portion of heat transfer fluid to pass through to prevent or mitigate stagnation of the flow of the heat transfer fluid.
After removing a portion of heat from each of the first heat generating component 102A, the second heat generating component 102B, the third heat generating component 102C, and the fourth heat generating component 102D, the flow 512 is then directed to the manifold outlet 508. In one or more embodiments, the heat transfer fluid from the manifold outlet 508 is pumped to a cooling component, such as a radiator, to remove a portion of the heat transferred to the heat transfer fluid, and the heat transfer fluid is returned to the manifold inlet 506.
The process 600 further includes providing 604 a plurality of flow directing structures within the manifold body. The plurality of flow directing structures are configured to direct a first portion of a fluid flow within the manifold body to a first heat transfer structure of the plurality of heat transfer structures and a second portion of the fluid flow within the manifold body to a second heat transfer structure of the plurality of heat transfer structures. In an embodiment, at least one of the plurality of heat transfer structures is configured to be coupled to a first portion of at least one of the plurality of heat generating components. In a particular embodiment, the first portion comprises a higher heat load area of the at least one of the plurality of heat generating components.
Various aspects of the present disclosure are described by narrative text, flowcharts, block diagrams of component cooling device embodiments. With respect to any flowcharts, depending upon the technology involved, the operations can be performed in a different order than what is shown in a given flowchart. For example, again depending upon the technology involved, two operations shown in successive flowchart blocks may be performed in reverse order, as a single integrated step, concurrently, or in a manner at least partially overlapping in time.
The descriptions of the various embodiments of the present disclosure have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.