High performance computing may require a plurality of high bandwidth memory (“HBM”) dies. HBM may provide greater bandwidth while using less power as compared to other types of memory. As the performance requirements of packages increase, additional HBM dies may be necessary to provide the bandwidth and capacity for the increased performance. However, the number of HBM dies integrated in a package may be limited due to space constraints of the package, power constraints of the package, and/or the thermal constraints of the package.
The technology generally relates to high bandwidth memory (HBM) and optical connectivity stacking. Disclosed systems and methods herein allow for 3D-stacking of HBM dies that are interconnected with an optical interface in a manner that allows for high performance computing. The HBM optics module package may include one or more HBM dies, HBM chiplets, cooling dies, and optical chiplets. The optical chiplets may be configured to connect the HBM optics module package to one or more optical fibers that form an optical link with one or more other components of the package.
In accordance with aspects of the disclosure, a package may include one or more high-bandwidth memory (HBM) dies, an optical chiplet connected to one or more optical fibers and a cooling die located between the one or more HBM dies and the optical chiplet, wherein the cooling die contains an inlet for a cooling material, an outlet for a cooling material, one or more channels configured to transport the cooling material, and one or more vias for providing interface connections between the one or more HBM dies and the optical chiplet.
In accordance with another aspect, the cooling die may have a first surface oriented toward the optical chiplet and a second surface oriented toward the one or more HBM dies, and wherein each of the one or more vias comprises an aperture extending between the first surface and the second surface.
In accordance with yet another aspect, the one or more vias may be a plurality of vias distributed within a region of the cooling die, and the one or more channels may be configured to transport the cooling material between the vias within the region.
In accordance with another aspect, the vias may be distributed in a plurality of rows within the region and wherein the one or more channels are configured to transport the cooling material between the plurality of rows.
In accordance with still another aspect, the one or more HBM dies may be a plurality of HBM dies arranged in a stack having a footprint relative to a substrate, and wherein the cooling die may be configured so that it does not extend beyond the footprint of the HBM dies.
In accordance with another aspect, the cooling die may be a first surface oriented toward the optical chiplet, wherein the optical chiplet is configured to have a footprint that is smaller than the first surface of the cooling die so that a first region of the first surface is exposed relative to the optical chiplet, and wherein the first region contains the inlet and the outlet.
In accordance with yet another aspect, the flow of the cooling material may be controlled based on an operating temperature of HBM dies.
In accordance with another aspect, the one or more channels are peripherally located around the cooling die.
In accordance with still another aspect, the one or more channels may further comprise a plurality of microchannels.
In accordance with yet another aspect, the optical chiplet may be configured to be in communication with one or more application-specific integrated circuits (ASICs) via the one or more optical fibers.
In accordance with aspects of the disclosure, a system may include a first package; and a second package optically coupled to the first package, the second package may have one or more high-bandwidth memory (HBM) dies, an optical chiplet connected to one or more optical fibers, and a cooling die located between the one or more HBM dies and the optical chiplet, wherein the cooling die contains an inlet for a cooling material, an outlet for a cooling material, one or more channels configured to transport the cooling material, and one or more vias for providing interface connections between the one or more HBM dies and the optical chiplet, wherein the optical chiplet is configured to connect the second package to the first package via the one or more optical fibers.
Disclosed systems and methods herein provide an optics module package that allows for 3D-stacking of dies that are interconnected with an optical interface in a manner that allows for high performance computing. The dies may be, for example, high bandwidth memory (HBM) or other dies. The optics module package may include one or more HBM dies, HBM chiplets, cooling dies, and optical chiplets. The optical chiplet may be configured to connect the HBM optics module package to another package via an optical link that is established over one or more optical fibers.
As the performance requirements of computing packages increase, more HBM dies may be required to provide the memory bandwidth and capacity for the increased performance. For example, the increase in application specific integrated circuit (ASIC) performance requirement may limit how many HBM dies can be placed on the ASIC package. The available package size or the ASIC footprint may limit how many HBM dies are placed on the package. The package size may allocate a predetermined amount of space for HBM dies based on the other components that also have to be connected to the package. Therefore, the amount of space for HBM dies may be limited without enlarging the package. The number of HBM dies placed on the package may, additionally or alternatively, be limited based on the power and thermal constraints of the package. For example, including additional HBM dies on the package may cause an increase in thermal output. The increase in thermal output may exceed the thermal constraints of the package and, therefore, may damage components within the package and/or the package itself.
The optical links between the HBM optics module package and the ASIC package may allow for the HBM optics modules package and, therefore, the HBM dies to be positioned significantly further from an ASIC die as compared to when HBM dies are integrated within an ASIC package. As the connection between the HBM optics module package and ASIC package is an optical connection, little to no performance degradation results, even with the increased distance between the HBM dies and the ASIC package. The HBM optics module package may be configured to allow for high bandwidth and power, while maintaining thermal control for the package components.
In order to reduce the footprint of the HBM optics module on a substrate, the HBM dies may be stacked onto another, and the optical components may be positioned on top of the stack of HBM dies.
A plurality of dies 104a-g, such as HBM dies, may be stacked onto base die 102, and each die may be electrically connected to adjacent dies so as to allow for HBM electrical signals to be passed through the stack. While the dies of
DRAM optical interface die 106 may be placed at the top of the HBM stack. The DRAM optical interface die 106 may be configured to pass electrical signals from the HBM stack to an optical chiplet 120. DRAM optical interface die 106 may use one or more standard protocols, such as UCIe/CXL or one or more proprietary protocols that are designed for particular memory/optical systems. A cooling die 110 may be positioned between the DRAM optical interface die 106 and the optical chiplet 120. Cooling die 110 may be configured to contain a plurality of electrical vias 112, through which the electrical signals between the DRAM optical interface die 106 and optical chiplet may pass. For example, each via 112 may be a through-silicon via (TSV) that extends up from DRAM optical interface die 106 to come into electrical contact with the optical chiplet 120.
The optical chiplet 120 may be configured to convert the signals received from the HBM stack into an optical signal that can be transmitted over one or more optical fibers 140. The same optical chiplet 120 may also be configured to receive optical signals that can be converted to electrical signals for transmission to the stack of HBM dies. The optical chiplet 120 may be part of a light-bundle optical interconnect. The optical chiplet 120 may include a plurality of components, such as an interface 122, a frame 124, one or more optical collectors 126, and micro-LEDs 128. The interface 122 may be configured to receive the electrical signals that are passed through vias 112 from DRAM optical interface die 106. Interface 112 can be configured to convert the received electrical signal to an optical signal that can be transmitted by the micro-LEDs 128 and one or more collectors 126 in accordance with a light beam induced current protocol. As provided in
The optical fiber 140 may connect from the optical chiplet 120 to a separate component. For example, the optical fiber channels 144 may connect the optical chiplet 120 to a package 150, such as an ASIC package. By coupling the optical fiber 140 to the optical chiplet 120, the HBM optics module package 100 may be disaggregated from package 150. In this regard, the optical fiber 140 may extend the channel reach between package 100 and the ASIC package as compared to an electrical connection between the HBM die(s) and the ASIC package.
With regard to cooling die 110, it may be connected to a coolant feed 114 that is used to circulate a cooling material, such as water, throughout the cooling die 110. Cooling die 110 may be configured to use any effective cooling material and flow rates to maintain a desirable temperature. For example, cooling die 110 may be configured to provide a cooling material and flow rate that is capable of maintaining a target temperature of approximately 65° C., with spikes of temperature up to approximately 105° C. Optical chiplet 120 may be configured so that its footprint is smaller than the top surface 111 of cooling die 112, and the coolant feed may be connected to the top surface 111 of cooling die 110. The vias 112 may be configured to travel along a plurality of apertures that extend between the top surface 111 and the bottom surface 113 of cooling die 110.
The inlet temperature and inlet flow of the coolant can be controlled so as to control the amount of heat that is captured by the coolant and to control the temperature throughout the cooling die 110. In the configuration shown in
The stacked HBM dies 104 can be HBM DRAM dies that can transfer data with the DRAM optical interface die 106 at 200-400 MHz or more. The DRAM optical interface die 106 and the optical chiplet 420 may transfer data between 8 Gbps to 16 Gbps or more. The base die may be powered by a power source of 60-100 W or more. By cooling package 100 in the manner disclosed herein, package 100 may achieve higher connectivity speeds and operate more efficiently under various power ranges.
Each component of the optics module package 400 may be connected to one or more substrates, such as circuit boards, within the optics module package 400. For instance, the components may be connected via one or more sockets to the main substrate 130 or sub-substrates connected to the main substrate. In some instances, each component may be removably connected to a substrate. In such a configuration, each component may be replaced with an upgraded component. In some examples, each component may be replaced if the component fails.
The HBM chiplet 504 may include a HBM physical interface (“PHY”) 506, a HBM controller 508, an adaptor 510, a die-to-die (“D2D”) interface 512, and a chip manager (“CM”), design for testing (“DFT”), and general purpose input/output (“GPIO”) 514.
The HBM PHY 506 may, for example, receive commands from the HBM controller 508 and transmit the commands to the HBM die(s) 502. The HBM controller 108 may, for example, optimize the memory traffic and improve the overall performance of the HBM chiplet 504, HBM die(s) 102, and/or HBM optics module package 400. The adaptor 510 may allow for independent testing of the components on the HBM optics module package 400. The independent testing may be performed via one or more external interfaces.
The CM 414 may configure and manage the HBM chiplet 504, HBM die(s) 102, and/or the HBM optics module package 400.
The HBM chiplet 504 may be connected to both the HBM dies(s) 102 and the optical chiplet 420. The optical chiplet 420 may be connected to the HBM chiplet 504 via a D2D interface 518, 512. For example, the D2D 518 of the HBM chiplet 512 may communicate with a D2D 518 of the optical chiplet 420. The D2D interface 512, 518 may be a high-bandwidth interface (“HBI”).
The optical chiplet 420 may include a D2D 518, one or more optical fibers 544, and a load balancing (“LB”) physical interface (“PHY”) 530. The optical fibers 544 may connect from the optical chiplet 420 to a separate component. For example, the optical fibers 544 may connect the optics chiplet 420 to an ASIC package. The LB PHY 530 may be configured to control the data being transmitted to and/or received from an ASIC package.
The cooling die can be configured so that the coolant flows through one or more channels in the layer (608). As discussed above, the cooling die may be configured so that the channels traverse regions adjacent to one or more vias within the cooling die. Data may be transferred through the vias of the cooling die while coolant is flowing through the channels of the cooling die. At 610, coolant may be removed from the cooling die at an outlet location. As shown in
Unless otherwise stated, the foregoing alternative examples are not mutually exclusive, but may be implemented in various combinations to achieve unique advantages. As these and other variations and combinations of the features discussed above can be utilized without departing from the subject matter defined by the claims, the foregoing description should be taken by way of illustration rather than by way of limitation of the subject matter defined by the claims. In addition, the provision of the examples described herein, as well as clauses phrased as “such as,” “including” and the like, should not be interpreted as limiting the subject matter of the claims to the specific examples. Further, the same reference numbers in different drawings can identify the same or similar elements.