Universal network-attached memory architecture

Information

  • Patent Grant
  • 12058874
  • Patent Number
    12,058,874
  • Date Filed
    Monday, December 4, 2023
    a year ago
  • Date Issued
    Tuesday, August 6, 2024
    4 months ago
Abstract
Semiconductor devices, packaging architectures and associated methods are disclosed. In one embodiment, a multi-chip module (MCM) is disclosed. The MCM includes a common substrate and a first integrated circuit (IC) chiplet disposed on the common substrate. The first IC chiplet includes at least one processing element. A communications fabric switchably couples to the at least one processing element. A peripheral gearbox chiplet (PGC) includes a first port having a second memory-agnostic interface coupled to the first memory-agnostic interface of the first IC chiplet. The PGC includes a second port having a memory interface of a first type and interface conversion circuitry disposed between the second memory-agnostic interface and the memory interface of the first type.
Description
TECHNICAL FIELD

The disclosure herein relates to semiconductor devices, packaging and associated methods.


BACKGROUND

As integrated circuit (IC) chips such as system on chips (SoCs) become larger, the yields realized in manufacturing the chips become smaller. Decreasing yields for larger chips increases overall costs for chip manufacturers. To address the yield problem, chiplet architectures have been proposed that favor a modular approach to SoCs. The solution employs smaller sub-processing chips, each containing a well-defined subset of functionality. Chiplets thus allow for dividing a complex design, such as a high-end processor or networking chip, into several small die instead of one large monolithic die.


When accessing memory, traditional chiplet architectures often employ relatively large and complex on-chip memory interfaces for transferring data between the chiplet and a specific memory type. In some situations, this may result in constraining the chiplet's usage for a variety of applications due to memory type inflexibility.





BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the disclosure are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings and in which like reference numerals refer to similar elements and in which:



FIG. 1 illustrates a high-level top plan view of a multi-chip module (MCM), including a first integrated circuit (IC) chiplet that is coupled to a memory chiplet via a peripheral gearbox chiplet (PGC).



FIG. 2 illustrates one embodiment of the network on chip (NoC) circuitry of FIG. 1.



FIG. 3 illustrates a top plan view of one detailed embodiment of the MCM of FIG. 1 that employs high bandwidth memory (HBM) chiplets.



FIGS. 4A-4C illustrate side views of the first IC chiplet and one of the HBM memory chiplets of the MCM of FIG. 3.



FIG. 5 illustrates a further embodiment of an MCM architecture that is similar to the MCM of FIG. 1, and employing double-data rate (DDR) memory chiplets.



FIG. 6 illustrates an expanded embodiment of an expanded MCM topology for DDR memory chiplets similar to that of FIG. 5.



FIG. 7 illustrates one embodiment of a hybrid MCM topology that is similar to FIG. 1, and employing multiple types of memory chiplets.



FIG. 8 illustrates a further embodiment of an MCM topology that is similar to FIG. 5, and employing multiple gearbox chiplets.





DETAILED DESCRIPTION

Semiconductor devices, packaging architectures and associated methods are disclosed. In one embodiment, a multi-chip module (MCM) is disclosed. The MCM includes a common substrate and a first integrated circuit (IC) chiplet disposed on the common substrate. The first IC chiplet includes at least one processing element. A communications fabric integrated into the first IC chiplet switchably couples to the at least one processing element. A peripheral gearbox chiplet (PGC) includes a first port having a second memory-agnostic interface coupled to the first memory-agnostic interface of the first IC chiplet. The PGC includes a second port having a memory interface of a first type and interface conversion circuitry disposed between the second memory-agnostic interface and the memory interface of the first type. By employing a memory-agnostic interface on the first IC chiplet, and enabling memory-related communications to be handled via the communications fabric, specific memory interfaces may be utilized in the off-chip PGC, expanding the available applications for the first IC chiplet. For one embodiment, the communications fabric takes the form of network-on-chip (NoC) circuitry. For some embodiments, the memory-agnostic interface includes a NoC memory access controller (MAC) circuit and a universal die-to-die interface. In some embodiments, the NoC circuit enables communications between the first IC chiplet and the PGC via a packet protocol.


Throughout the disclosure provided herein, the term multi-chip module (MCM) is used to represent a semiconductor device that incorporates multiple semiconductor die or sub-packages in a single unitary package. An MCM may also be referred to as a system in a chip (SiP). With reference to FIG. 1, a multi-chip module (MCM) is shown, generally designated 100. For one embodiment, the MCM includes a package substrate 102 that serves as a common substrate for a first integrated circuit (IC) chiplet 104, a peripheral gearbox chiplet (PGC) 106 and, for some embodiments, a memory chiplet 108. For some situations, off-chip memory 130 (memory located off of the MCM 100) may be coupled to the PGC 106 in addition to, or as an alternative to the memory chiplet 108. Depending on the application, the various chiplets may be interconnected in a manner that allows for use of a relatively inexpensive non-silicon or organic substrate as the package substrate 102. The use of a non-silicon common substrate 102 avoids size and signaling constraints typically associated with silicon-based substrates. This allows the substrate 102 to be larger, incorporate a more relaxed bump pitch for external interface contacts, and provide low-loss traces. Other embodiments may employ other substrate materials such as silicon-based substrates and/or ceramic or glass-based substrates.


With continued reference to FIG. 1, the first IC chiplet 104 is mounted to the package substrate 102 and may take the form of a computer processing unit (CPU), graphics processing unit (GPU), tensor processing unit (TPU), artificial intelligence (AI) processing circuitry, field-programmable gate array (FPGA) circuitry or other form of host chiplet with a need to access memory. For one embodiment, the first IC chiplet 104 includes at least one processing element 110, such as one or more processor cores. An on-chip communications fabric 112 enables the at least one processing element 110 to communicate on-chip with other various locations throughout the chip, while also providing a way to communicate with the memory chiplet 108 or off-chip memory 130 via a packet-based signaling protocol.


For one embodiment, the on-chip communications fabric 112 comprises network-on-chip (NoC) circuitry. The NoC circuitry 112 generally serves as a form of network router or switch that allows the at least one processing unit 110 to communicate with other on-chip elements or nodes, such as at 114, and further allows the first IC chiplet 104 to cooperate with other NoC circuits that may be disposed in various other IC chiplets or memory chiplets disposed on the MCM 100. Thus, the NoC circuitry 112 is generally capable of transferring and/or receiving data and/or control signals via a packet-switched protocol to any other nodes within the MCM 100 that also have NoC interface circuitry.



FIG. 2 illustrates one specific embodiment of the NoC circuitry 112 of FIG. 1. The NoC circuitry 112 includes input buffer circuitry 202 that receives data and/or control signals from a separate NoC circuit associated with another IC chiplet or node on the MCM 100. Depending on how many separate edge interfaces, or ports, are employed by the first IC chiplet 104, the input buffer circuitry 202 may include two (corresponding to, for example, “east” and “west” ports such as those shown in FIG. 2), three, or four queues (“N INPUT”, “S INPUT”, “E INPUT” OR “W INPUT”) to temporarily store signals received from the multiple ports. The input buffer circuitry 202 feeds a crossbar switch 204 that is controlled by a control unit 206 in cooperation with a scheduler or arbiter 208. Output buffer circuitry 210 couples to the crossbar switch 204 to receive data/control signals from the data/control signals from the input buffer circuitry 202 for transfer to a selected output port/interface (“N OUTPUT”, “S OUTPUT”, “E OUTPUT” OR “W OUTPUT”).


Referring back to FIG. 1, for one embodiment, the first IC chiplet 104 includes a memory-agnostic interface 115 that is coupled to the NoC circuitry 112. In one specific embodiment, the memory-agnostic interface 115 includes at least one NoC Memory access controller (MAC) port 116 that transfers information directly with the NoC circuitry 112 via a packet-based protocol. Incoming information received off-chip is converted from a die-to-die communications protocol into the NoC-based packet-protocol by the NoC MAC port 116. The NoC MAC port 116 also allows for converting NoC-based packets generated on-chip into the die-to-die packetized protocol for transfer to another chiplet, such as the PGC 106. In some situations, such as when half-duplex signaling is employed by the PGC 106, the NoC MAC port may detect and control link availability and usage.


Further referring to FIG. 1, the first IC chiplet 104 also includes a memory-agnostic die-to-die (D2D) interface port 118, such as a Universal Chiplet Interconnect express (UCIe) interface or Universal Memory Interface (UMI), that is disposed on an edge of the first IC chiplet 104 and proximate the NoC MAC 116. The D2D interface port 118 transfers the converted die-to-die information associated with the NoC packets via the die-to-die transfer protocol. The NoC MAC 116 and the D2D interface port 118 thus cooperate to transfer data, command and address information that would be transferred by a memory-specific interface, but in a memory-agnostic way. As a result, the first IC chiplet 104 is unencumbered by any specific type of memory interface, and can be applied in a variety of applications that may use a variety of memory types. Various embodiments illustrating the memory-agnostic features of the first IC chiplet 104 are more fully described below.


With continued reference to FIG. 1, while incorporating the NoC MAC port 116 and the D2D interface port 118 provides the first IC chiplet 104 with the memory-agnostic on-chip interface 115 for handling communications between the first IC chiplet 104 and the memory chiplet 108, the use of the NoC MAC port 116 and the D2D interface port 118 also enhances the potential memory-related bandwidth capabilities of the first IC chiplet 104. This is because a given typical memory-specific interface often exhibits an overall footprint that is much larger than the combined footprint achievable with the cooperative NoC MAC port 116 and the D2D interface port 118. Using a memory-agnostic interface such as that described above, each edge of the first IC chiplet 104 can employ a larger number of memory-agnostic ports/interfaces than could be achieved via a standard memory-specific interface, thereby resulting in an increase in memory bandwidth during operation.


Further referring to FIG. 1, for one embodiment, the PGC 106 is mounted to the package substrate 102 and includes a second D2D interface 120 that couples to the D2D interface port 118 of the first IC chiplet 104 via an appropriate set of traces 124 formed in the package substrate 102 or complementary to the package substrate 102, such as through use of a silicon bridge or other form of interconnect with an enhanced trace density. For some embodiments, a second NoC MAC port 122 is also included on the PGC 106. A memory interface of a specific memory type and associated memory controller 126 of the specific memory type (such as DDR, GDDR, LPDDR, HBM, etc.) is incorporated on the PGC 106 to allow for direct memory communications with the memory chiplet 108 of the certain memory type. The memory controller of the specific memory type is also referred to herein as a memory access controller or MAC, and generally coordinates memory transfer operations between the memory chiplet 108 (and/or memory 130) and the first IC chiplet 104. Conversion circuitry 128 disposed in the PGC 106 converts the aggregated non-packetized information provided by the memory-specific interface 126 into a packet format that is compliant with the second D2D interface 122. The conversion circuitry 128 also performs a reverse conversion that transforms the information received from the first IC chiplet 104 in the packetized die-to-die interface protocol into the non-packetized specific memory interface protocol for transfer by the memory-specific interface and controller 126. The PGC 106 thus allows for the memory-specific memory interface and controller 126 to be disposed off of the first IC chiplet 104.


With continued reference to FIG. 1, depending on the application, the memory chiplet 108 may take one of a variety of types. Exemplary memory types may include, for example, double data rate (DDR) dynamic random access memory (DRAM) and related low-power and graphics DDR variants (LPDDR and GDDR), high-bandwidth memory (HBM), compute express link (CXL) memory, and various non-volatile memory variants, to name but a few. Additionally, based on the memory type being employed, the associated memory chiplet 108 may be disposed on the MCM 100 (via a packaged 2.5D or 3.0D configuration with the PGC 106), or off the MCM 100. For some embodiments, at least one memory chiplet 108 may be disposed on the MCM 100, while one or more other memory chips 130 of the same or different memory type may be disposed off of the MCM 100.


While the MCM 100 of FIG. 1 is shown with a single IC chiplet 104, a single PGC 106, and a single memory chiplet/chip 108, the number of memory chiplets 108 that may be available for use with the first IC chiplet 104 may be scaled in a variety of ways, such as through efficient use of the available beachfront area along the edges of the first IC chiplet 104 to allow for multiple PGC 106 and memory chiplet 108 couplings to the first IC chiplet 104. Such embodiments are presented below in further detail.



FIG. 3 illustrates one specific embodiment of an MCM, generally designated 300, that employs the generic concepts of the MCM 100 of FIG. 1 in a high-bandwidth memory (HBM) context. A package substrate 302 mounts a host or processor chiplet 304 and multiple HBM memory devices 306. The host chiplet 304 includes NoC circuitry 308 that couples to multiple memory-agnostic interface ports 310, that include multiple NoC MAC ports and corresponding UCIe ports that are disposed along the edges of the host chiplet 304.


Further referring to FIG. 3, each HBM memory device 306 includes HBM memory 314 that is paired with peripheral gearbox circuitry 316 that corresponds to the circuitry employed by the peripheral gearbox chiplet 106 of FIG. 1. The peripheral gearbox circuitry 316 thus includes a UCIe (or any other universal D2D PHY, such as UMI) interface to interact with the UCIe portions of the memory-agnostic interface 310, and a NoC MAC to control the communication. For some embodiments, the peripheral gearbox circuitry 316 also includes an HBM memory access controller MAC and interface to communicate with the HBM memory 314. Various embodiments of the HBM memory device 306 are shown in FIGS. 4A-4C, and are described more fully below.


With continued reference to FIG. 3, for one embodiment, multiple photonic chiplets 318 communicate with a subset of the memory-agnostic interface ports 310 of the host chiplet 304. Each of the photonic chiplets 318 includes logic circuitry 320, similar to the peripheral gearbox circuitry 316, that employs a UCIe interface to interact with the UCIe portions of the memory-agnostic interface 310, and a NoC MAC to control the communication. A photonic IC chip 322 couples to the logic circuitry 320 and communicates with other devices or MCMs (not shown). In some embodiments, the photonic ICs connect to off-chip memory chips or system to communicate memory traffic.



FIG. 4A illustrates a cross-sectional view of a portion of the MCM 300, showing one embodiment of one of the HBM memory devices 306 coupled to the host chiplet 304 via a die-to-die link 402 formed in the package substrate 302. The HBM memory device 306 integrates a base die 404 and an HBM memory stack 406 with a peripheral gearbox chiplet 408 in a horizontal manner using a 2.5D packaging technique, such as one that uses a silicon interposer, wafer-level fanout (WLFO) process, or the like. The peripheral gearbox chiplet 408 includes a NoC MAC 410, and an HBM die-to-die (D2D) interface coupled with an HBM memory controller or MAC 412.



FIG. 4B illustrates a variation of the HBM memory device 306, where the HBM memory stack 406 and the base die 404 are vertically integrated with the peripheral gearbox chiplet 408 in a 3.0 packaging technique, and vertically interconnected using through-silicon via (TSV) technology.


As a further alternative, and referring now to FIG. 4C, if the HBM base die is fabricated by a more advanced logic process, the HBM memory access controller MAC, the D2D interface, and the NoC MAC may be formed in the HBM base die, such as at 414. As a result, the HBM base die in essence becomes a peripheral gearbox die with additional HBM-related functionality. For some embodiments, where the HBM base die and the peripheral gearbox die are combined into one die, then the HBM die-to-die interface may be substituted with a more universal die-to-die interface, such as UCIe.


While HBM memory works well for high-bandwidth applications, in certain circumstances, such as for high-capacity situations, double-data rate (DDR) dual inline memory modules (DIMMs) may be the preferred memory of choice for pairing with the MCM 100 (FIG. 1). FIG. 5 illustrates an MCM, generally designated 500, that employs a host or processor chiplet 502 with peripherally-disposed memory-agnostic ports 504 that each include a NOC MAC and a D2D interface such as UCIe. Like the previously-described embodiments, the peripheral ports 504 allow for off-chip memory-agnostic communications between one or more processing elements (not shown) of the host chiplet 502 and other chiplets via a NoC communications fabric (not shown).


Further referring to FIG. 5, a plurality of peripheral gearbox chiplets (PGCs) 506 are disposed on a package substrate, along with the host chiplet 502, proximate a corresponding port 504 of the host chiplet 502. For one embodiment, each of the PGCs 506 include a host interface 508 that includes a NoC MAC and a D2D interface such as UCIe to communicate with each of the memory-agnostic interfaces 504 of the host chiplet 502. Each PGC 506 also includes a memory-specific interface and memory-specific controller circuit 510 that is of a DDR or graphics DDR (GDDR) or low-power DDR (LPDDR) type. For one embodiment, each memory interface and controller circuit 510 is coupled to a DDR or GDDR or LPDDR memory 512 via an appropriate memory bus 514. Using multiple memory interface and controller circuits 510 on the MCM 500 allows for memory capacity expansion that is enhanced by the more efficient beachfront usage by the host chiplet 502 by employing the memory-agnostic interfaces 504 to handle memory access communications.



FIG. 6 illustrates an MCM, generally designated 600, that is similar to the MCM 500 of FIG. 5 in that it includes a host chiplet 602 that includes NoC circuitry 604. The NoC circuitry 604 couples to multiple PGCs 606 via corresponding memory-agnostic interfaces 608 for carrying out memory-related communications. Each PGC 606 includes universal D2D PHY like a UCIe or UMI interface 610 for coupling to a corresponding memory-agnostic interface 608. In an effort to increase memory capacity over the embodiment of FIG. 5, the PGCs 606 include SerDes interface circuitry as a die-to-die interface in addition to a DDR MAC 612.


Further referring to FIG. 6, the Serdes interface circuitry 612 of each PGC 606 couples to a high-speed serial link 614 that uses fewer signaling paths than a full-width memory interface. As a result, for a given MCM beachfront, such as at 616, a larger number of DDR DIMM modules, such as at 618 and 620, may be coupled to the MCM 600. For one embodiment, pairs of DIMMs, such as 618 and 620 are coupled to a common buffer 622 to form a buffer-on-board unit 624.



FIG. 7 illustrates a hybrid MCM 700 that employs different varieties of memory to accomplish correspondingly different goals for different applications. A first host chiplet 702 with a NoC communications fabric (not shown) incorporates multiple memory-agnostic interfaces 704 for coupling to corresponding PGCs. A first type of PGC 708, similar to the PGC 408 shown in FIG. 4A, and specifically configured for HBM memory applications, includes interface circuitry containing a NoC MAC in combination with a universal D2D PHY like a UCIe or UMI interface for coupling to the memory-agnostic interface 704, and an HBM memory access controller MAC and PHY for communicating with an HBM memory device 710. A second type of PGC 712, specifically configured for DDR DIMM memory applications, includes interface circuitry 714 including a NoC MAC with a UCIe interface for coupling to a second memory-agnostic interface 704, and a LPDDR/GDDR/DDR memory access controller MAC and PHY 718 for communicating with remote memory, such as multiple LPDDR/GDDR/DDR memory, or pooled memory associated with a compute express link (CXL) memory architecture.


Further referring to FIG. 7, for one embodiment, the MCM 700 provides high-bandwidth and low-latency connectivity to other computing systems through one or more photonic chiplets 720. Each photonic chiplet 720 includes a die-to-die interface 722 for coupling to a corresponding die-to-die interface 724 on the host chiplet 702, and a photonic IC chip 725. For one embodiment, each of the die-to-die interfaces 722 and 724 include an interface adaptor or NoC MAC. For the embodiment shown in FIG. 7, the host chiplet 702 couples to a second host chiplet 726 via a plurality of die-to-die interfaces 728. The second host chiplet 726 includes similar features as the host chiplet 702, and couples to multiple PGC's to scale the total memory bandwidth by a factor of two. This is but one example of a myriad of combinations that may take advantage of the use of memory-agnostic interfaces 704 in the host chiplets 702 and 726.



FIG. 8 illustrates a further embodiment of an MCM topology, generally designated 800, that incorporates many of the features shown in the embodiments described above, and illustrating in further detail how beachfront bandwidth efficiency for a first IC chiplet 802 may be improved through use of memory-agnostic interfaces, such as at 804, to access memory, such as at 820. The first IC chiplet 802 is disposed on a substrate 803 and includes a first edge that defines a beachfront 810 that is of a finite dimension for coupling to the multiple memory-agnostic interfaces 804. Each of the memory-agnostic interfaces 804 includes a NoC MAC and a die-to-die interface, such as a UCIe or UMI interface. The memory-agnostic interface 804 communicates with a first PGC 812 via a set of links 814. For one embodiment, a length L1 of the memory-agnostic interface 804 occupies a relatively small portion of the beachfront 810 of the first IC chiplet 802.


Further referring to FIG. 8, each PGC 812 includes a die-to-die interface 816 that generally matches the die-to-die interface of the memory-agnostic interface 804. Each PGC 812 also includes a memory-specific interface and memory-specific controller circuit 818 that is of a DDR or graphics DDR (GDDR) or low-power DDR (LPDDR) type. For one embodiment, each memory interface and controller circuit 818 is coupled to a DDR or GDDR or LPDDR memory 820 via an appropriate memory bus 822.


With continued reference to FIG. 8, generally, the memory-specific interface and memory-specific controller circuit 818 exhibits a length L2 that is significantly larger than the length L1 of the memory-agnostic interface 804. Consequently, disaggregating multiple memory-specific interface and memory-specific controller circuits 818 off of the first IC chiplet 802 and into multiple PGCs 812, and employing smaller and more space-efficient memory-agnostic interfaces on the beachfront 806 of the first IC chip 802, allows for accessing more memory resources that correspondingly increases a total memory bandwidth by a factor of at least L2/L1.


When received within a computer system via one or more computer-readable media, such data and/or instruction-based expressions of the above described circuits may be processed by a processing entity (e.g., one or more processors) within the computer system in conjunction with execution of one or more other computer programs including, without limitation, net-list generation programs, place and route programs and the like, to generate a representation or image of a physical manifestation of such circuits. Such representation or image may thereafter be used in device fabrication, for example, by enabling generation of one or more masks that are used to form various components of the circuits in a device fabrication process.


In the foregoing description and in the accompanying drawings, specific terminology and drawing symbols have been set forth to provide a thorough understanding of the present disclosure. In some instances, the terminology and symbols may imply specific details that are not required to practice embodiments of the disclosure. For example, any of the specific numbers of bits, signal path widths, signaling or operating frequencies, component circuits or devices and the like may be different from those described above in alternative embodiments. Also, the interconnection between circuit elements or circuit blocks shown or described as multi-conductor signal links may alternatively be single-conductor signal links, and single conductor signal links may alternatively be multi-conductor signal links. Signals and signaling paths shown or described as being single-ended may also be differential, and vice-versa. Similarly, signals described or depicted as having active-high or active-low logic levels may have opposite logic levels in alternative embodiments. Component circuitry within integrated circuit devices may be implemented using metal oxide semiconductor (MOS) technology, bipolar technology or any other technology in which logical and analog circuits may be implemented. With respect to terminology, a signal is said to be “asserted” when the signal is driven to a low or high logic state (or charged to a high logic state or discharged to a low logic state) to indicate a particular condition. Conversely, a signal is said to be “deasserted” to indicate that the signal is driven (or charged or discharged) to a state other than the asserted state (including a high or low logic state, or the floating state that may occur when the signal driving circuit is transitioned to a high impedance condition, such as an open drain or open collector condition). A signal driving circuit is said to “output” a signal to a signal receiving circuit when the signal driving circuit asserts (or deasserts, if explicitly stated or indicated by context) the signal on a signal line coupled between the signal driving and signal receiving circuits. A signal line is said to be “activated” when a signal is asserted on the signal line, and “deactivated” when the signal is deasserted. Additionally, the prefix symbol “/” attached to signal names indicates that the signal is an active low signal (i.e., the asserted state is a logic low state). A line over a signal name (e.g., ‘custom character’) is also used to indicate an active low signal. The term “coupled” is used herein to express a direct connection as well as a connection through one or more intervening circuits or structures. Integrated circuit device “programming” may include, for example and without limitation, loading a control value into a register or other storage circuit within the device in response to a host instruction and thus controlling an operational aspect of the device, establishing a device configuration or controlling an operational aspect of the device through a one-time programming operation (e.g., blowing fuses within a configuration circuit during device production), and/or connecting one or more selected pins or other contact structures of the device to reference voltage lines (also referred to as strapping) to establish a particular device configuration or operation aspect of the device. The term “exemplary” is used to express an example, not a preference or requirement.


While aspects of the disclosure have been described with reference to specific embodiments thereof, it will be evident that various modifications and changes may be made thereto without departing from the broader spirit and scope of the disclosure. For example, features or aspects of any of the embodiments may be applied, at least where practicable, in combination with any other of the embodiments or in place of counterpart features or aspects thereof. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.

Claims
  • 1. A chiplet-based multi-chip module (MCM), comprising: a package substrate;a first integrated circuit (IC) chiplet disposed on the package substrate and comprising: at least one processing element;a communications fabric switchably coupled to the at least one processing element; anda first memory-agnostic interface to transfer memory requests from the at least one processing element via the communications fabric;a peripheral gearbox chiplet comprising: a first port comprising a second memory-agnostic interface coupled to the first memory-agnostic interface of the first IC chiplet;a second port comprising a memory interface of a first memory type; andinterface conversion circuitry disposed between the second memory-agnostic interface and the memory interface of the first memory type, the interface conversion circuitry comprising a memory controller of the first memory type.
  • 2. The chiplet-based MCM of claim 1, wherein: the communications fabric comprises a network-on-chip (NoC) circuit.
  • 3. The chiplet-based MCM of claim 2, wherein: the first memory-agnostic interface comprises a NoC media access controller (MAC).
  • 4. The chiplet-based MCM of claim 3, wherein: the first memory-agnostic interface further comprises a first memory-agnostic die-to-die interface that cooperates with the NOC MAC.
  • 5. The chiplet-based MCM of claim 4, wherein: the first memory-agnostic die-to-die interface comprises a universal chiplet interconnect express (UCIe) or Universal Memory Interface (UMI) interface.
  • 6. The chiplet-based MCM of claim 1, further comprising: a memory chiplet of the first memory type coupled to the second port of the peripheral gearbox chiplet and disposed on the package substrate.
  • 7. The chiplet-based MCM of claim 6, wherein: the memory chiplet of the first memory type is coupled to the second port of the peripheral gearbox chiplet in a 2.5D packaging configuration.
  • 8. The chiplet-based MCM of claim 6, wherein: the memory chiplet of the first memory type is coupled to the second port of the peripheral gearbox chiplet in a 3.0D packaging configuration.
  • 9. The chiplet-based MCM of claim 8, wherein: the memory interface of the first memory type comprises a high-bandwidth memory (HBM) memory interface; andthe memory chiplet comprises an HBM chiplet.
  • 10. The chiplet-based MCM of claim 1, wherein: the memory interface of the first memory type comprises one of a double-data rate (DDR) memory interface, low-power double data rate (LPDDR) memory interface, a graphics double data rate (GDDR) memory interface or a compute express link (CXL) memory interface.
  • 11. An integrated circuit (IC) chiplet apparatus, comprising: a peripheral gearbox chiplet comprising: a first port comprising a first memory-agnostic interface to communicate memory-related information with a second memory-agnostic interface of a first IC chiplet, the first memory-agnostic interface comprising a network-on-chip (NoC) media access controller (MAC) to communicate with a communications fabric of the first IC chiplet, anda first memory-agnostic die-to-die interface that cooperates with the NOC MAC;a second port comprising a memory interface of a first memory type; andinterface conversion circuitry disposed between the first memory-agnostic interface of the first port and the memory interface of the first memory type, the interface conversion circuitry comprising a memory controller of the first memory type.
  • 12. The IC chiplet apparatus of claim 11, wherein: the first memory-agnostic interface exhibits a first lateral dimension; andwherein the memory interface of the first memory type exhibits a second lateral dimension that is larger than the first lateral dimension.
  • 13. The IC chiplet apparatus of claim 12, wherein: the first memory-agnostic die-to-die interface comprises a universal chiplet interconnect express (UCIe) or Universal Memory Interface (UMI) interface.
  • 14. The IC chiplet apparatus of claim 11, further comprising: a memory chiplet of the first memory type coupled to the second port of the peripheral gearbox chiplet.
  • 15. The IC chiplet apparatus of claim 14, wherein: the memory chiplet of the first memory type comprises a high bandwidth memory (HBM) chiplet.
  • 16. The IC chiplet apparatus of claim 15, wherein: the memory chiplet of the first memory type is coupled to the second port of the peripheral gearbox chiplet in a 2.5D packaging configuration.
  • 17. The IC chiplet apparatus of claim 15, wherein: the memory chiplet of the first memory type is coupled to the second port of the peripheral gearbox chiplet in a 3.0D packaging configuration.
  • 18. A chiplet-based multi-chip module (MCM), comprising: a package substrate;a first integrated circuit (IC) chiplet disposed on the package substrate and comprising: at least one processing element;a communications fabric switchably coupled to the at least one processing element;a first memory-agnostic interface to transfer first memory requests from the at least one processing element via the communications fabric; anda second memory-agnostic interface to transfer second memory requests from the at least one processing element via the communications fabric;a first peripheral gearbox chiplet comprising: a first port comprising a second memory-agnostic interface coupled to the first memory-agnostic interface of the first IC chiplet;a second port comprising a first memory interface;first interface conversion circuitry disposed between the second memory-agnostic interface and the first memory interface; anda second peripheral gearbox chiplet comprising: a third port comprising a third memory-agnostic interface coupled to the second memory-agnostic interface of the first IC chiplet;a fourth port comprising a second memory interface; andsecond interface conversion circuitry disposed between the third memory-agnostic interface and the second memory interface.
  • 19. The chiplet-based MCM of claim 18, wherein: the first memory interface comprises a high bandwidth memory (HBM) interface; andthe second memory interface comprises one of a double-data rate (DDR) memory interface, a low-power double data rate (LPDDR) memory interface, a graphics double data rate (GDDR) memory interface or a compute express link (CXL) memory interface.
  • 20. The chiplet-based MCM of claim 18, wherein: the communications fabric comprises a network-on-chip (NoC) circuit.
  • 21. The chiplet-based MCM of claim 18, wherein: the first memory-agnostic interface and the second memory-agnostic interface collectively occupy a beachfront length portion associated with the first IC chiplet; andwherein the second port comprising the first memory interface and the fourth port comprising the second memory interface collectively occupy a length that is larger than the beachfront length portion.
  • 22. The chiplet-based MCM of claim 18, wherein: the first memory interface is of a first type; andthe second memory interface is of a second type that is different than the first type.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Non-Provisional that claims priority to U.S. Provisional Application No. 63/435,540, filed Dec. 27, 2022, entitled UNIVERSAL NETWORK-ATTACHED MEMORY (UNAM), and U.S. Provisional Application No. 63/543,517, filed Oct. 11, 2023, entitled UNIVERSAL MEMORY INTERFACE (UMI) WITH HALF-DUPLEX BIDIRECTIONAL D2D & C2C PHYS FOR PACKET-BASED MEMORY TRAFFIC TRANSFER, all of which are incorporated herein by reference in their entirety.

US Referenced Citations (148)
Number Name Date Kind
4334305 Girardi Jun 1982 A
5396581 Mashiko Mar 1995 A
5677569 Choi Oct 1997 A
5892287 Hoffman Apr 1999 A
5910010 Nishizawa Jun 1999 A
6031729 Berkely Feb 2000 A
6055235 Blanc Apr 2000 A
6417737 Moloudi Jul 2002 B1
6690742 Chan Feb 2004 B2
6721313 Van Duyne Apr 2004 B1
6932618 Nelson Aug 2005 B1
7027529 Ohishi Apr 2006 B1
7248890 Raghavan Jul 2007 B1
7269212 Chau Sep 2007 B1
7477615 Oshita Jan 2009 B2
7535958 Best May 2009 B2
7701957 Bicknell Apr 2010 B1
7978754 Yeung Jul 2011 B2
8004330 Acimovic Aug 2011 B1
8024142 Gagnon Sep 2011 B1
8121541 Rofougaran Feb 2012 B2
8483579 Fukuda Jul 2013 B2
8546955 Wu Oct 2013 B1
8861573 Chu Oct 2014 B2
8948203 Nolan Feb 2015 B1
8982905 Kamble Mar 2015 B2
9088334 Chakraborty Jul 2015 B2
9106229 Hutton Aug 2015 B1
9129935 Chandrasekar Sep 2015 B1
9294313 Prokop Mar 2016 B2
9349707 Sun May 2016 B1
9379878 Lugthart Jun 2016 B1
9432298 Smith Aug 2016 B1
9832006 Bandi Nov 2017 B1
9886275 Carlson Feb 2018 B1
9934842 Mozak Apr 2018 B2
9961812 Suorsa May 2018 B2
10171115 Shirinfar Jan 2019 B1
10410694 Arbel Sep 2019 B1
10439661 Heydari Oct 2019 B1
10642767 Farjadrad May 2020 B1
10678738 Dai Jun 2020 B2
10735176 Heydari Aug 2020 B1
10748852 Sauter Aug 2020 B1
10855498 Farjadrad Dec 2020 B1
10935593 Goyal Mar 2021 B2
11088876 Farjadrad Aug 2021 B1
11100028 Subramaniam Aug 2021 B1
11204863 Sheffler Dec 2021 B2
11841815 Farjadrad Dec 2023 B1
11842986 Ramin Dec 2023 B1
11855043 Farjadrad Dec 2023 B1
11855056 Rad Dec 2023 B1
11893242 Farjadrad Feb 2024 B1
20020122479 Agazzi Sep 2002 A1
20020136315 Chan Sep 2002 A1
20040088444 Baumer May 2004 A1
20040113239 Prokofiev Jun 2004 A1
20040130347 Moll Jul 2004 A1
20040156461 Agazzi Aug 2004 A1
20050041683 Kizer Feb 2005 A1
20050134306 Stojanovic Jun 2005 A1
20050157781 Ho Jul 2005 A1
20050205983 Origasa Sep 2005 A1
20060060376 Yoon Mar 2006 A1
20060103011 Andry May 2006 A1
20060158229 Hsu Jul 2006 A1
20060181283 Wajcer Aug 2006 A1
20060188043 Zerbe Aug 2006 A1
20060250985 Baumer Nov 2006 A1
20060251194 Bublil Nov 2006 A1
20070281643 Kawai Dec 2007 A1
20080063395 Royle Mar 2008 A1
20080143422 Lalithambika Jun 2008 A1
20080186987 Baumer Aug 2008 A1
20080222407 Carpenter Sep 2008 A1
20090113158 Schnell Apr 2009 A1
20090154365 Diab Jun 2009 A1
20090174448 Zabinski Jul 2009 A1
20090220240 Abhari Sep 2009 A1
20090225900 Yamaguchi Sep 2009 A1
20090304054 Tonietto Dec 2009 A1
20100177841 Yoon Jul 2010 A1
20100197231 Kenington Aug 2010 A1
20100294547 Hatanaka Nov 2010 A1
20110029803 Redman-White Feb 2011 A1
20110038286 Ta Feb 2011 A1
20110167297 Su Jul 2011 A1
20110187430 Tang Aug 2011 A1
20110204428 Erickson Aug 2011 A1
20110267073 Chengson Nov 2011 A1
20110293041 Luo Dec 2011 A1
20120082194 Tam Apr 2012 A1
20120182776 Best Jul 2012 A1
20120192023 Lee Jul 2012 A1
20120216084 Chun Aug 2012 A1
20120327818 Takatori Dec 2012 A1
20130181257 Ngai Jul 2013 A1
20130222026 Havens Aug 2013 A1
20130249290 Buonpane Sep 2013 A1
20130285584 Kim Oct 2013 A1
20140016524 Choi Jan 2014 A1
20140048947 Lee Feb 2014 A1
20140126613 Zhang May 2014 A1
20140192583 Rajan Jul 2014 A1
20140269860 Brown Sep 2014 A1
20140269983 Baeckler Sep 2014 A1
20150012677 Nagarajan Jan 2015 A1
20150172040 Pelekhaty Jun 2015 A1
20150180760 Rickard Jun 2015 A1
20150206867 Lim Jul 2015 A1
20150271074 Hirth Sep 2015 A1
20150326348 Shen Nov 2015 A1
20150358005 Chen Dec 2015 A1
20160056125 Pan Feb 2016 A1
20160071818 Wang Mar 2016 A1
20160111406 Mak Apr 2016 A1
20160217872 Hossain Jul 2016 A1
20160294585 Rahman Oct 2016 A1
20170317859 Hormati Nov 2017 A1
20170331651 Suzuki Nov 2017 A1
20180010329 Golding, Jr. Jan 2018 A1
20180082981 Gowda Mar 2018 A1
20180137005 Wu May 2018 A1
20180175001 Pyo Jun 2018 A1
20180190635 Choi Jul 2018 A1
20180315735 Delacruz Nov 2018 A1
20190044764 Hollis Feb 2019 A1
20190058457 Ran Feb 2019 A1
20190108111 Levin Apr 2019 A1
20190198489 Kim Jun 2019 A1
20200257619 Sheffler Aug 2020 A1
20200373286 Dennis Nov 2020 A1
20210082875 Nelson Mar 2021 A1
20210117102 Grenier Apr 2021 A1
20210181974 Ghosh Jun 2021 A1
20210183842 Fay Jun 2021 A1
20210225827 Lanka Jul 2021 A1
20210258078 Meade Aug 2021 A1
20210311900 Malladi Oct 2021 A1
20210365203 O Nov 2021 A1
20220051989 Agarwal Feb 2022 A1
20220159860 Winzer May 2022 A1
20220223522 Scearce Jul 2022 A1
20230039033 Zarkovsky Feb 2023 A1
20230068802 Wang Mar 2023 A1
20230090061 Zarkovsky Mar 2023 A1
20230181599 Erickson May 2023 A1
Non-Patent Literature Citations (7)
Entry
Kurt Lender et al., “Questions from the Compute Express Link Exploring Coherent Memory and Innovative Cases Webinar”, Apr. 13, 2020, CXL Consortium, pp. 1-6.
Planet Analog, “The basics of SerDes (serializers/deserializers) for interfacing”, Dec. 1, 2020, Planet Analog, as preserved by the internet Archive, pp. 1-9.
Block Memory Generator v8.2 LogiCORE IP Product Guide Vivado Design Suite; Xilinx; Apr. 1, 2015.
Universal Chiplet Interconnect Express (UCle) Specification, Revision 1.1, Version 1.0, Jul. 10, 2023.
Hybrid Memory Cube Specification 2.1, Hybrid Memory Cube Consortium, HMC-30G-VSR PHY, 2014.
Farjadrad et al., “A Bunch of Wires (BOW) Interface for Inter-Chiplet Communication”, 2019 IEEE Symposium on High-Performance Interconnects (HOTI), pp. 27-30, Oct. 2019.
Universal Chiplet Interconnect Express (UCIe) Specification Rev. 1.0, Feb. 24, 2022.
Provisional Applications (2)
Number Date Country
63543517 Oct 2023 US
63435540 Dec 2022 US