The present invention relates generally to devices having multiple interconnected processors and memories.
Integrated circuit devices such as microprocessors may have multiple processors and multiple memories coupled to the processors. The processors, often referred to as processing engines or cores, may be subdivided into a number of groups often referred to as blocks, clusters or islands. Each block may have a plurality of processors and one or more memories which are tightly connected to each of the processors by a bus or other network. The blocks may in turn be connected to each other and to the system memory by a network. In this manner, each processor of each block may communicate over the network with a processor or memory of the same block or a processor or memory of another block, or the system memory itself. The network may employ a number of buffers to buffer packets of data awaiting access to the network or a segment of the network.
a, 10b depict one embodiment of operations of hierarchical and parallel partition network control logic in accordance with one aspect of the present description.
As explained in greater detail below, for tightly coupled processors and memories organized in small blocks (also referred to as clusters or islands), in accordance with one aspect of the present description, hierarchical switches are provided to interconnect these blocks in a manner which is believed to improve the energy efficiency of communications both among the blocks and between the blocks and the system memory. In one embodiment, a switch is provided to interconnect a set of blocks organized as a “unit,” and a local memory controller for that unit, at a hierarchical level, referred to herein as the unit hierarchical level or simply the unit level. In addition, the switch, referred to herein as a unit level switch, may have links to other unit level switches and other, non-local memory controllers which may also be organized as hierarchical switches.
In one embodiment, a unit level switch for block to block communications may not have buffering in order to reduce the energy cost expended by the unit level switch. Instead, buffers at the end-points of die-to-die communication channels between the microprocessor die and a memory controller die, for example, are provided for intermediate storage.
It is believed that a hierarchical network architecture in accordance with the present description can provide improved predictability and uniformity in communication latencies from each block to a local memory controller assigned a unit. Furthermore, it is believed that non-local memory controllers may similarly be more equidistant in terms of latency from a central switch such as a unit level switch or in some embodiments, a unit-to-unit level switch. Still further, local block to block communications may have increased energy efficiency and may be restricted to a local switch such as a unit level switch. Moreover, it is believed that a hierarchical network architecture in accordance with the present description may provide improved scalability. For example, when adding more blocks or memory controllers, additional levels or hierarchies may be added to maintain energy efficient communication at the local level.
In another aspect of the present description, a network having a switch such as a unit level switch and a unit-to-unit level switch, for example, may be partitioned into multiple unit level parallel partition networks, each having parallel partition switches and unit-to-unit level parallel partition switches, respectively. It is believed that such an arrangement may further reduce the energy traversal cost through a switch. For example, energy expenditure may be reduced by the same or similar factor as the number of parallel partition switches the monolithic switch is partitioned into by trading reduced bandwidth for reduced energy consumption.
As explained in greater detail below, the processors 102 and the memory controller 106 each include hierarchical and parallel partition networks 108, 110, respectively. The hierarchical and parallel partition networks 108, 110, may improve for example, energy efficiency in communications among the various processors, and memories of the device 100. It is appreciated that other features and advantages may be realized, depending upon the particular application.
In the illustrated embodiment, the hierarchical and parallel partition networks 108 are built in to the die or dies of the processors 102, and the hierarchical and parallel partition networks 110 are built in to the die or dies of the memory controller 106. It is appreciated however that the hierarchical and parallel partition networks 108, 110 may be built in to other circuits of the computer architecture device 100, depending upon the particular application.
The computer architecture device 100 may further include storage 116 (e.g., a non-volatile storage, such as magnetic disk drives, optical disk drives, a tape drive, flash memory, etc.). The storage 116 may comprise an internal storage device or an attached or network accessible storage. Programs in the storage 116 are loaded into the memory 104 and executed by one or more processors of the processors 102 in a manner known in the art. The computer architecture device 100 further includes a network controller or adapter 118 to enable communication with an external network, such as an Ethernet, a Fiber Channel Arbitrated Loop, etc. Further, the architecture may, in certain embodiments, include a video controller 120 to render information on a display monitor, where the video controller 120 may be embodied on a video card or integrated on integrated circuit components mounted on a motherboard or other substrate. An input device 122 is used to provide input to the processor 102, and may include a keyboard, mouse, pen-stylus, microphone, touch sensitive display screen, input pins, sockets, or any other activation or input mechanism known in the art. An output device 124 is capable of rendering information transmitted from the processors 102, or other component, such as a display monitor, printer, storage, output pins, sockets, etc. The network adapter 118 may embodied on a network card, such as a Peripheral Component Interconnect (PCI) card, PCI-express, or some other I/O card, or on integrated circuit components mounted on a motherboard or other substrate.
One or more of the components of the device 100 may be omitted, depending upon the particular application. For example, a network router may lack a video controller 120, for example. Also, any one or more of the components of the computer architecture device 100 may include one or more integrated circuits employing hierarchical and parallel partition network communication in accordance with one aspect of the present description.
The plurality of blocks 200a, 200b, . . . 200n are organized and interconnected by a unit level switch 208a to form a first unit 210a. Similarly, the plurality of blocks 202a, 202b, . . . 202n are interconnected by a second unit level switch 208b to form a second unit 210b. Additional units as represented schematically as unit 210n in
Each unit of the plurality of units 210a, 210b . . . 210n, defines a unit level of a second hierarchical level which is higher than that of the block level. The unit level of the second hierarchical level is referred to herein as the unit hierarchical level or simply the unit level.
A plurality of units including units 210a, 210b . . . 210n are organized and interconnected by a unit-to-unit level switch 220a. The plurality of units including unit 210a, 210b, . . . 210n interconnected by the unit-to-unit level switch 220a define a unit-to-unit level of a third hierarchical level which is higher than that of the unit level. The unit-to-unit level of the third hierarchical level is referred to herein as the unit-to-unit hierarchical level or simply the unit-to-unit level.
The block 200a includes a plurality of processing engines, cores or other processors 310a, 310b . . . 310n and a plurality of shared memories 314a, 314b . . . 314n interconnected by a block network 320 which may be a serial or parallel bus, or cross-bar or matrix switch network, for example. Other types of networks may be used to interconnect the components of the block, depending upon the particular application. Each processor 310a, 310b . . . 310n may communicate locally with any of the other processors 310a, 310b . . . 310n and any of the shared memories 314a, 314b . . . 314n of the block 200a over the block network 320. Access to the block network 320 is granted by control logic 324 which arbitrates conflicting requests for access to the block network 320. Once granted access, a processor 310a, 310b . . . 310n may read or write a packet of data from or to, respectively, any of the other processors 310a, 310b . . . 310n and any of the shared memories 314a, 314b . . . 314n of the block 200a over the block network 320. In the illustrated embodiment, the block network 320 lacks a buffer to reduce energy expenditure. However, it is appreciated that in some embodiments, the block network 320 or one or more blocks may buffer packets in a local buffer coupled to the block network 320.
In accordance with another aspect of the present description, the interconnection of the blocks such as the block 200a, with other blocks of the device 100, and with the memory controller 106 of the device 100, is partitioned into a plurality of parallel partition packet networks, that is, parallel partition packet network 1, parallel partition packet network 2 . . . parallel partition packet network n. Each parallel partition packet network 1, 2, . . . n, is independent of the other parallel partition packet networks of the plurality of parallel partition packet networks. Thus, a processor such as the processor 310a, for example, of the block 200a, for example, may transmit a packet of data to another block or to the memory 106 over a selected parallel partition packet network such as the parallel partition packet network 1, for example, without transmitting the packet in whole or in part over the other parallel partition packet networks 2 . . . n.
In one embodiment, the processors 310a, 310b . . . 310n and the shared memories 314a, 314b . . . 314n interconnected by the block network 320 may be assigned priority for a particular parallel partition packet network. Thus, processor 310a and shared memory 314a, for example, may be assigned priority to transmit or receive packets over parallel partition packet network 1, for example. In a similar manner, processor 310b and shared memory 314b, for example, may be assigned priority to transmit or receive packets over parallel partition packet network 2, for example. The control logic 324 of the block network 320 includes parallel partition packet network control logic which selects the parallel partition packet network of the plurality of parallel partition packet networks 1, 2, . . . n, assigned to a particular processor or shared memory, and transmits a packet for that processor or shared memory through the selected parallel partition packet network independent of the other parallel partition packet networks. Thus, packets on a common bus or cross bar network may be demultiplexed and directed to the assigned parallel partition packet network. The particular assignments may vary depending upon the number of processors, shared memories and parallel partition packet networks, or other factors, depending upon the particular application.
It is believed that partitioning the block to block and block to system memory controller interconnections into independent, parallel partition networks may reduce energy usage. For example, each independent parallel partition network may have reduced energy usage resulting from a lower bandwidth since the packet traffic is distributed over other independent parallel partition packet networks for each block.
In accordance with another aspect of the present description, each parallel partition packet network 1, 2, . . . n, has multiple hierarchical levels.
As shown in
Also coupled to the unit level parallel partition switch 410a of the parallel partition packet network 1, is a plurality of shared memories 414a, . . . 414n and a parallel partition network connection 420aa of the parallel partition packet network 1. The unit level parallel partition switch 410a may be a cross-bar or matrix switch network, for example. Other types of switch networks may be used to interconnect the components of the unit 210a (
In the illustrated embodiment, the unit level parallel partition switch 410a lacks a buffer so as to reduce energy expenditure. Accordingly, the control logic 424 of the unit level parallel partition switch 410a and the control logic 324 of each block network 320 of each of the blocks 200a, 200b . . . 200n of the unit 210a (
In addition to the unit level parallel partition switch 410a, the unit level switch 208a (
The unit level switch 208b (
The block network 320 (
Referring again to the unit level parallel partition switch 410a for the parallel partition network 1, the parallel partition network connection 420aa of the parallel partition packet network 1, connects the unit level parallel partition switch 410a of the unit 210a (
As shown in
In one embodiment, a plurality of shared memories 514a, . . . 514n and a network connection 520a of the parallel partition packet network 1 to a next hierarchical level (if any), may also be coupled to the unit-to-unit level parallel partition switch 510a of the parallel partition packet network 1. The unit-to-unit level parallel partition switch 510a may be a cross-bar or matrix switch network, for example. Other types of switch networks may be used to interconnect the components connected to the unit-to-unit level parallel partition switch 510a, depending upon the particular application. The processors of each unit 210a, 210b, . . . 210n may communicate at the unit-to-unit level with any of the other units 210a, 210b, . . . 210n and any of the shared memories shared memories 514a, . . . 514n over the unit-to-unit level parallel partition switch 510a of the parallel partition packet network 1.
Access to the unit-to-unit level parallel partition switch 510a is granted by control logic 524 which arbitrates conflicting requests for access to the unit-to-unit level parallel partition switch 510a. Once granted access, a processor of units 210a, 210b, . . . 210n may read or write a packet of data from or to, respectively, a processor or memory of units 210a, 210b, . . . 210n or unit-to-unit memories 514a, . . . 514n, over the unit-to-unit level parallel partition switch 510a of the parallel partition packet network 1.
In the illustrated embodiment, the unit-to-unit level parallel partition switch 510a lacks a buffer so as to reduce energy expenditure. Accordingly, the control logic 524 of the unit-to-unit level parallel partition switch 510a, the control logic 424 of the unit level parallel partition switch 410a, 410b, . . . 410n, respectively, of the unit 210a, and the control logic 324 of each block network 320 of each of the blocks of the units 210a, 210b, . . . 210n, cooperate to grant access to the source block network 320, and to the source unit level parallel partition switch 410a of the packet source, and to the intermediate unit-to-unit level parallel partition switch 510a, and to the destination unit level parallel partition switch 410a and to the destination block network 320 of the packet destination such that a packet may be transmitted from its source block to its destination block, via the unit-to-unit level parallel partition switch 510a, and the source and destination unit level parallel partition switches 410a, without buffering.
Thus, in one embodiment, the control logic 524 of the unit-to-unit level parallel partition switch 510a, the control logic 424 of each unit level parallel partition switch 410a of each of the units 210a, 210b, . . . 210n and the control logic 324 of each block network 320 of each of the blocks of each of the units 210a, 210b, . . . 210n, cooperate to grant simultaneous access to a selected block network 320 and a selected unit level parallel partition switch 410a of the packet source, and to a selected block network 320 and a selected unit level parallel partition switch 410a of the packet destination, and to the intermediate unit-to-unit level parallel partition switch 510a, such that a packet may be transmitted from its source block to its destination block, via the unit-to-unit level parallel partition switch 510a, the unit level parallel partition switches 410a and the block networks 320 of the packet's path from source to destination.
In another embodiment, buffering may be added at the block network 320 level or the unit level parallel partition switch 410a level, or the unit-to-unit level parallel partition switch 510a level or in various combinations, depending upon the particular application. Thus, for example, it is appreciated that in some embodiments, the unit-to-unit level parallel partition switch 510a, or one or more units or blocks may buffer packets in a local buffer coupled to the unit-to-unit level parallel partition switch 510a, to the unit level parallel partition switches 410a or to one or more block networks 320 of the various blocks of the units 210a, 210b, . . . 210n.
In addition to the unit-to-unit level parallel partition switch 510a, the unit-to-unit level switch 220a (
Accordingly, the unit level parallel partition switch 410a (
In this manner, the device 100 comprises a plurality of units 210a, 210b, . . . 210n organized to define a unit-to-unit level (
Each parallel partition packet network 1, 2, . . . n has a unit level parallel partition switch such as a unit level parallel partition switch 410a, 410b, . . . 410n, respectively, of unit level switch 208a, for example, at the unit hierarchical level for each unit and coupled to the system memory 104 and to each block network 320 of a plurality of blocks of a particular unit. Each parallel partition packet network 1, 2, . . . n further has a unit-to-unit level parallel partition switch such as a unit-to-unit level parallel partition switch 510a, 510b, . . . 510n, respectively, of unit-to-unit level switch 220a, for example, at the unit-to-unit hierarchical level. Each unit-to-unit level parallel partition switch is coupled to each unit level parallel partition switch of the particular parallel partition packet network at the unit hierarchical level. Parallel partition network control logic switches a packet through the unit-to unit level parallel partition switch of the selected parallel partition packet network at the unit-to-unit hierarchical level, and between selected unit level parallel partition switches of the selected parallel partition packet network.
In addition to routing packets amongst the various blocks of the various units on the die of the microprocessor 102 (
The unit level parallel partition switch 410a of the parallel partition packet network 1 is also coupled by a parallel partition network connection of the parallel partition packet network 1 to an input buffer 460b for temporarily storing inbound packets having a destination on the microprocessor die 102. The input buffer 460b is coupled by another parallel partition network connection of the parallel partition packet network 1 to a die-to-die input channel 470b (of the parallel partition packet network 1), which in this embodiment, is coupled to a die of the memory controller 106.
In the illustrated embodiment of the device 100, each unit 210a, 210b, . . . 210n has an associated memory controller 106a, 106b, . . . 106n (
In this embodiment, the die-to-die channels 470a, 470b of the unit level parallel partition switch 410a of the die of the processors 102, and the die-to-die channels 710a, 710b (
Similarly, each unit level parallel partition switch 410a, 410b . . . 410n of the unit level switch 208b of the unit 210b is coupled by parallel partition network connections of each of the parallel partition networks 1, 2, . . . n, respectively, to the unit-to-unit level parallel partition switches 510a, 510b . . . 510n, respectively, of the unit-to-unit level switch 220a. The remaining unit level parallel partition switches of the remaining unit level switches of the remaining units, including the unit n, are coupled by parallel partition network connections of each of the parallel partition networks 1, 2, . . . n, respectively, to the unit-to-unit level parallel partition switches 510a, 510b . . . 510n, respectively, of the unit-to-unit level switch 220a.
Referring to
The unit level parallel partition switch 720a may be a cross-bar or matrix switch network, for example. Other types of switch networks may be used to interconnect the components of the parallel partition packet network 1 at the unit level for the memory controller 106a, depending upon the particular application.
The unit level parallel partition switch 720a is a partition of a unit level switch 820 (
Access to the unit level parallel partition switch 720a is granted by control logic 734 (
In the illustrated embodiment, the unit level parallel partition switch 720a has buffers 730a, 730b and 460a, 460b (
The memory controller 106a further has a plurality of common die-to-die communication channels 740a, 740b . . . 740n, each of which is adapted for carrying packets from each of the plurality of parallel partition packet networks 1, 2, . . . n. In the illustrated embodiment, each packet (as represented by the packet 910 of
The memory controller 106a further has a common unit level switch 750 coupled to each parallel partition packet networks 1, 2, . . . n. Thus, the unit level parallel partition switch 720a of the parallel partition packet network 1 is shown connected by parallel partition network connection 754a to the common unit level switch 750 for the parallel partition packet networks 1, 2, . . . n. In a similar manner, the remaining unit level parallel partition switch 720b . . . 720n (
Access to the common unit level switch 750 is granted by control logic 760 which arbitrates conflicting requests for access to the common unit level switch 750. Once granted access, the memory controller 106a may read or write a packet of data from or to, respectively, the die-to-die input/output channels 740a, 740b, . . . 740n and the input and output buffers 730a, 730b of parallel partition packet network 1, over the common unit level switch 750 and the unit level parallel partition switch 720a of parallel partition packet network 1. In a similar manner, once granted access, the memory controller 106a may read or write a packet of data from or to, respectively, the die-to-die input/output channels 740a, 740b . . . 740n and the input and output buffers of the associated parallel partition packet network 2 . . . n, over the common unit level switch 750 and the associated unit level parallel partition switch 720b . . . 720n of the parallel partition packet networks 2, 3 . . . n, respectively.
The common unit level switch 750 may be a cross-bar or matrix switch network, for example. Other types of switch networks may be used to interconnect the components of coupled to the common unit level switch 750 for the memory controller 106a, depending upon the particular application.
The parallel partition packet network control logic of the control logic 760 of the common unit level switch 750 multiplexes the packets from each of the parallel partition packet network 1, 2, 3 . . . n onto a selected common die-to-die communication channels 740a, 740b . . . 740n, for carrying packets from each of the plurality of parallel partition packet networks 1, 2, . . . n. The parallel partition packet network control logic of the control logic 760 of the common unit level switch 750, is further adapted to tag a packet from each parallel partition packet network 1, 2, 3 . . . n with a tag 920 (
In the illustrated embodiment, the die-to-die input/output communication channels 740a, 740b . . . 740n are each connected to a die containing a memory region of the system memory 104 (
The parallel partition packet network control logic of the control logic 760 of the common unit level switch 750, is adapted to read a tag 920 (
In the illustrated embodiment, the parallel partition packet network control logic of the control logic 760 (
a, 10b show one example of operations of network control logic in accordance with one embodiment of the present description in connection with parallel partition packet network 1 and unit 210a. The operations of network control logic in connection with parallel partition packet network 1, unit 210a, and memory controller 106a, are representative of operations of network control logic in connection with each of the other parallel partition packet networks 2, 3 . . . n, the other units 210b . . . 210n, and the other memory controllers 106b . . . 106n.
In a first operation, a packet is received (block 1010) for transmission through the network from a source such as a processor 310a (
However, if the destination of the packet is determined (block 1014) to be other than block level local, parallel partition packet network control logic selects (block 1024) a parallel partition packet network, such as the parallel partition network 1, for example, of a plurality of parallel partition packet networks, 1, 2, . . . n. In one embodiment, the particular parallel partition packet network may be selected as a function of the identity of the source or the identity of the destination, or both, of the packet. Hence, processors and shared memories may be assigned to particular parallel partition packet networks, 1, 2, . . . n. In one embodiment, the assignments may be fixed. In other embodiments, the assignments may change as a function of various factors. For example, if the traffic on a particular parallel partition packet networks, 1, 2, . . . n should become too heavy relative to that on other parallel partition packet networks, 1, 2, . . . n, the assignments may be changed to distribute the traffic on the parallel partition packet networks, 1, 2, . . . n more evenly. Assignments of packet sources and/or packet destinations to the parallel partition packet networks, 1, 2, . . . n may be a function of other factors, depending upon the particular application.
The parallel partition packet network control logic transmits (block 1028) the packet through the selected parallel partition packet network independently of the other parallel partition packet networks, to the local unit level switch, such as the unit level switch 410a, of the selected parallel partition packet network 1, of the unit 210a, for example, at the next higher hierarchical level, that is, the unit hierarchical level (
If the destination of the packet is determined (block 1032) to be die local, that is, on the same die as the source of the packet, and if the destination of the packet is determined (block 1036) to be unit level local, that is, within the same unit, such as the unit 210a, for example, as the source of the packet, the parallel partition packet network control logic switches (block 1040) the packet through the local unit level switch (switch 410a in this example) of the selected parallel partition packet network (parallel partition packet network 1 in this example) at the unit hierarchical level. The packet is switched to the block network of the packet destination, such as the block network 230 of the block 200b, for example, of the local unit 210a. The control logic of the block network 230 of the destination block (block 200b in this example of the unit 210a) switches (block 1044) the packet through the block network of the destination block (block 200b in this example) to the destination within the destination block.
If the destination of the packet is determined (block 1032) to be die local, that is, on the same die as the source of the packet, but if the destination of the packet is determined (block 1036) not to be unit level local, that is, not within the same unit, such as the unit 210a, for example, as the source of the packet, the parallel partition packet network control logic switches (block 1050) the packet through the local unit level switch (switch 410a in this example) of the selected parallel partition packet network (parallel partition packet network 1 in this example) at the unit hierarchical level, to the unit-to-unit switch of the of the selected parallel partition packet network (parallel partition packet network 1 in this example) at the unit-to-unit hierarchical level. Thus, the packet may be switched (block 1050) to the unit-to-unit switch 510a (
The parallel partition packet network control logic in turn switches (block 1054) the packet through the unit-to-unit switch (unit-to-unit switch 510a (
The parallel partition packet network control logic switches (block 1058) the packet through the destination unit level switch (switch 410a in this example) of the selected parallel partition packet network (parallel partition packet network 1 in this example) at the unit hierarchical level. The packet is switched to the block network of the packet destination, such as the block network of the block 202a (
If the destination of the packet is determined (block 1032) to be not die local, that is, not on the same die as the source of the packet, the parallel partition packet network control logic switches (block 1070,
The parallel partition packet network control logic switches (block 1078) the packet through the memory controller unit level switch (the unit level switch 720a of the memory controller 106a for the selected parallel partition packet network 1 in this example) and buffers the packet in the buffer of the selected parallel partition packet network (parallel partition packet network 1 in this example), such as input buffer 730a, for example.
In this example, the ultimate destination of the packet is a memory region of the system memory 104 located on yet another die. Accordingly, the parallel partition packet network control logic switches (block 10782) the packet through the memory controller unit level switch (the unit level switch 720a of the memory controller 106a for the selected parallel partition packet network 1 in this example) to a common unit level switch such as the common unit level switch 750, for example, for switching packets from or to each parallel partition packet network of the plurality of parallel partition packet networks 1, 2, . . . n.
The parallel partition packet network control logic tags (block 1086) the packet arriving from the selected parallel partition packet network (parallel partition packet network 1 in this example) with a tag 920 to identify the selected parallel partition packet network (parallel partition packet network 1 in this example) from which the packet arrived. The parallel partition packet network control logic switches (block 1090) the tagged packet through a common unit level switch to a common communication channel for carrying packets of each of the plurality of parallel partition packet networks 1, 2, . . . n. The parallel partition packet network control logic routes (block 1094) the tagged packets through the common communication channel to the ultimate destination such as a memory region on another die. For tagged packets arriving on a common communication channel, the parallel partition packet network control logic reads the tag of the packet arrived from the common communication channel, and switches the arrived packet through the common unit level switch to the particular parallel partition packet network identified by the tag of the packet.
The following examples pertain to further embodiments.
Example 1 is a system, comprising:
a memory;
a plurality of blocks defining a block hierarchical level, each block including a plurality of processors and a block network interconnecting the processors of the block;
a unit defining a unit hierarchical level higher than the block hierarchical level, said unit including a plurality of said blocks at the block hierarchical level; and
a plurality of parallel partition packet networks, each parallel partition packet network being independent of the other parallel partition packet networks of the plurality of parallel partition packet networks, each parallel partition packet network having a unit level switch at the unit hierarchical level coupled to the memory and to each block network of the plurality of blocks of the unit,
In Example 2, The subject matter of Examples 1 and 3-8 can optionally include parallel partition packet network control logic adapted to select a first parallel partition packet network of the plurality of parallel partition packet networks, and transmit a packet through the selected parallel partition packet network independent of the other parallel partition packet networks, said transmitting including switching the packet through the unit level switch of the selected parallel partition packet network at the unit hierarchical level,
In Example 3. the subject matter of Examples 1-2 and 4-8 can optionally include a plurality of said units defining a unit-to-unit hierarchical level higher than the unit hierarchical level, wherein each unit is at the unit hierarchical level and comprises a plurality of said blocks at the block hierarchical level, wherein each parallel partition packet network has a unit level switch at the unit hierarchical level for each unit and coupled to the memory and to each block network of the plurality of blocks of the particular unit, and a unit-to-unit level switch at the unit-to-unit hierarchical level and coupled to each unit level switch of the particular parallel partition packet network at the unit hierarchical level, wherein the parallel partition packet network control logic is further adapted to switch the packet through the unit-to unit level switch of the selected parallel partition packet network at the unit-to-unit hierarchical level, between selected unit level switches of the selected parallel partition packet network.
In Example 4, the subject matter of Examples 1-3 and 5-8 can optionally include a first die, a second die and a plurality of die-to-die communication channels, each parallel partition packet network having a die-to-die communication channel of the plurality of die-to-die communication channels, coupled to the first and second dies, wherein the unit of the plurality of blocks and the unit level switches of the plurality of parallel partition packet networks are on the first die, the device further comprising a plurality of buffers on the first die, each parallel partition packet network having a buffer coupled to the unit level switch and the die-to-die communication channel of the particular parallel partition packet network, wherein the parallel partition packet network control logic is further adapted to buffer a packet in the buffer of the selected parallel partition packet network before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer received from the die-to-die communication channel of the selected parallel partition packet network in the buffer of the selected parallel partition packet network.
In Example 5, the subject matter of Examples 1-4 and 6-8 can optionally include a plurality of unit hierarchical level memory controllers on the second die, each unit level memory controller adapted to control memory transactions between the memory and the processors of an associated unit, a plurality of unit level switches on the second die, wherein each parallel partition packet network has a unit level switch of the plurality of unit level switches on the second die, and a buffer of the plurality of buffers on the second die, each unit level switch on the second die coupled to the unit level switch and buffer of the particular parallel partition packet network on the first die.
In Example 6, the subject matter of Examples 1-5 and 7-8 can optionally include that each memory controller has parallel partition packet network control logic on the second die adapted to switch a packet through the unit level switch of the selected parallel partition packet network on the second die, to buffer a packet in the buffer of the selected parallel partition packet network one the second die before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer on the second die received from the die-to-die communication channel of the selected parallel partition packet network on the second die.
In Example 7, the subject matter of Examples 1-6 and 8 can optionally include a common communication channel for carrying packets from each of the plurality of parallel partition packet networks wherein each packet carried by the common communication channel has a tag identifying a parallel partition packet network, and a common unit level switch coupled to each parallel partition packet network of the plurality of parallel partition packet networks, wherein the parallel partition packet network control logic is adapted to tag a packet from each parallel partition packet network with a tag to identify the parallel partition packet network from which the packet arrived, and switch the tagged packet through the common unit level switch to the common communication channel.
In Example 8, the subject matter of Examples 1-7 can optionally include that the parallel partition packet network control logic is adapted read a tag of a packet from the common communication channel, and to switch the packet through the common unit level switch to the parallel partition packet network identified by the tag of the packet.
Example 9 is a method, comprising:
parallel partition packet network control logic selecting a first parallel partition packet network of a plurality of parallel partition packet networks, coupling a memory of a device, to a plurality of blocks defining a block hierarchical level, each block including a plurality of processors and a block network interconnecting the processors of the block, wherein a plurality of said blocks is organized in at least one unit defining a unit hierarchical level higher than the first hierarchical level, each parallel partition packet network having a unit level switch at the unit hierarchical level coupled to the memory and to each block network of the plurality of blocks of the unit; and
parallel partition packet network control logic transmitting a packet through the selected parallel partition packet network independent of the other parallel partition packet networks, said transmitting including switching the packet through a unit level switch of the selected parallel partition packet network at the unit hierarchical level.
In Example 10, the subject matter of Examples 9 and 11-15 can optionally include
parallel partition packet network control logic switching the packet through a unit-to unit level switch of the selected parallel partition packet network at a unit-to-unit hierarchical level, between selected unit level switches of the selected parallel partition packet network,
wherein a plurality of said units define the unit-to-unit level of a third hierarchical level higher than the second hierarchical level, wherein each unit is at the unit hierarchical level and comprises a plurality of said blocks at the block hierarchical level, wherein each parallel partition packet network has a unit level switch at the unit hierarchical level for each unit and coupled to the memory and to each block network of the plurality of blocks of the particular unit, and a unit-to-unit level switch at the unit-to-unit hierarchical level and coupled to each unit level switch of the particular parallel partition packet network at the unit hierarchical level.
In Example 11, the subject matter of Examples 9-10 and 12-15 can optionally include parallel partition packet network control logic buffering on a first die, a packet in a buffer of the selected parallel partition packet network, each parallel partition packet network having a buffer and a die-to-die communication channel at the unit hierarchical level, each buffer of a parallel partition packet network being coupled to the die-to-die communication channel and the unit level switch of the particular parallel partition packet network, the method further comprising parallel partition packet network control logic transmitting a packet from a buffer to a die-to-die communication channel of the selected parallel partition packet network, receiving a packet from the die-to-die communication channel of the selected parallel partition packet network, and buffering the received packet in the buffer of the selected parallel partition packet network.
In Example 12, the subject matter of Examples 9-11 and 13-15 can optionally include a unit level memory controller of a plurality of unit hierarchical level memory controllers on a second die, controlling memory transactions between a memory and the processors of an associated unit, each unit having an associated memory controller of the plurality of unit level memory controllers on the second die;
parallel partition packet network control logic on the second die switching a packet received from the die-to-die communication channel of the selected parallel partition packet network, through a unit level switch of the selected parallel partition packet network on the second die, each parallel partition packet network having a unit level switch of a plurality of unit level switches on the second die; and
parallel partition packet network control logic buffering on a second die, a packet received from the die-to-die communication channel of the selected parallel partition packet network in a buffer of the selected parallel partition packet network, each parallel partition packet network having a buffer of a plurality of buffers on the second die, coupled to the unit level switch of the particular parallel partition packet network, each unit level switch on the second die being coupled to the die-to-die communication channel of the particular parallel partition packet network.
In Example 13, the subject matter of Examples 9-12 and 14-15 can optionally include parallel partition packet network control logic on the second die switching a packet through a unit level switch of the selected parallel partition packet network on the second die, buffering a packet in the buffer of the selected parallel partition packet network on the second die, and transmitting a packet from the buffer of the selected parallel partition packet network to the die-to-die communication channel of the selected parallel partition packet network.
In Example 14, the subject matter of Examples 9-13 and 15 can optionally include parallel partition packet network control logic tagging a packet arriving from the selected parallel partition packet network with a tag to identify the selected parallel partition packet network from which the packet arrived, and switching the tagged packet through a common unit level switch to a common communication channel coupled to each parallel partition packet network of the plurality of parallel partition packet networks, for carrying packets from each of the plurality of parallel partition packet networks wherein each packet carried by the common communication channel has a tag identifying a parallel partition packet network from which the tagged packet arrived.
In Example 15, the subject matter of Examples 1-14 can optionally include parallel partition packet network control logic reading a tag of a packet arrived from the common communication channel, and switching the arrived packet through the common unit level switch to the parallel partition packet network identified by the tag of the packet.
Example 16 is a device for use with a memory, comprising:
a plurality of blocks defining a block hierarchical level, each block including a plurality of processors and a block network interconnecting the processors of the block;
a unit defining a unit hierarchical level higher than the block hierarchical level, said unit including a plurality of said blocks at the block hierarchical level; and
a plurality of parallel partition packet networks, each parallel partition packet network being independent of the other parallel partition packet networks of the plurality of parallel partition packet networks, each parallel partition packet network having a unit level switch at the unit hierarchical level coupled to the memory and to each block network of the plurality of blocks of the unit.
In Example 17, the subject matter of Examples 16 and 18-23 can optionally include that parallel partition packet network control logic adapted to select a first parallel partition packet network of the plurality of parallel partition packet networks, and transmit a packet through the selected parallel partition packet network independent of the other parallel partition packet networks, said transmitting including switching the packet through the unit level switch of the selected parallel partition packet network at the unit hierarchical level.
In Example 18, the subject matter of Examples 16-17 and 19-23 can optionally include a plurality of said units defining a unit-to-unit hierarchical level higher than the unit hierarchical level, wherein each unit is at the unit hierarchical level and comprises a plurality of said blocks at the block hierarchical level, wherein each parallel partition packet network has a unit level switch at the unit hierarchical level for each unit and coupled to the memory and to each block network of the plurality of blocks of the particular unit, and a unit-to-unit level switch at the unit-to-unit hierarchical level and coupled to each unit level switch of the particular parallel partition packet network at the unit hierarchical level, wherein the parallel partition packet network control logic is further adapted to switch the packet through the unit-to unit level switch of the selected parallel partition packet network at the unit-to-unit hierarchical level, between selected unit level switches of the selected parallel partition packet network.
In Example 19, the subject matter of Examples 16-18 and 20-23 can optionally include a first die, a second die and a plurality of die-to-die communication channels, each parallel partition packet network having a die-to-die communication channel of the plurality of die-to-die communication channels, coupled to the first and second dies, wherein the unit of the plurality of blocks and the unit level switches of the plurality of parallel partition packet networks are on the first die, the device further comprising a plurality of buffers on the first die, each parallel partition packet network having a buffer coupled to the unit level switch and the die-to-die communication channel of the particular parallel partition packet network, wherein the parallel partition packet network control logic is further adapted to buffer a packet in the buffer of the selected parallel partition packet network before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer received from the die-to-die communication channel of the selected parallel partition packet network in the buffer of the selected parallel partition packet network.
In Example 20, the subject matter of Examples 16-19 and 21-23 can optionally include a plurality of unit hierarchical level memory controllers on the second die, each unit level memory controller adapted to control memory transactions between the memory and the processors of an associated unit, a plurality of unit level switches on the second die, wherein each parallel partition packet network has a unit level switch of the plurality of unit level switches on the second die, and a buffer of the plurality of buffers on the second die, each unit level switch on the second die coupled to the unit level switch and buffer of the particular parallel partition packet network on the first die.
In Example 21, the subject matter of Examples 16-20 and 22-23 can optionally include each memory controller has parallel partition packet network control logic on the second die adapted to switch a packet through the unit level switch of the selected parallel partition packet network on the second die, to buffer a packet in the buffer of the selected parallel partition packet network one the second die before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer on the second die received from the die-to-die communication channel of the selected parallel partition packet network on the second die.
In Example 22, the subject matter of Examples 16-21 and 23 can optionally include a common communication channel for carrying packets from each of the plurality of parallel partition packet networks wherein each packet carried by the common communication channel has a tag identifying a parallel partition packet network, and a common unit level switch coupled to each parallel partition packet network of the plurality of parallel partition packet networks, wherein the parallel partition packet network control logic is adapted to tag a packet from each parallel partition packet network with a tag to identify the parallel partition packet network from which the packet arrived, and switch the tagged packet through the common unit level switch to the common communication channel.
In Example 23, the subject matter of Examples 16-22 can optionally include that the parallel partition packet network control logic is adapted read a tag of a packet from the common communication channel, and to switch the packet through the common unit level switch to the parallel partition packet network identified by the tag of the packet.
Example 24 is a computer architecture system, comprising:
a memory;
a plurality of blocks defining a block hierarchical level, each block including a plurality of processors and a block network interconnecting the processors of the block;
a unit defining a unit hierarchical level higher than the block hierarchical level, said unit including a plurality of said blocks at the block hierarchical level; and
a plurality of parallel partition packet networks, each parallel partition packet network being independent of the other parallel partition packet networks of the plurality of parallel partition packet networks, each parallel partition packet network having a unit level switch at the unit hierarchical level coupled to the memory and to each block network of the plurality of blocks of the unit.
In Example 25, the subject matter of Examples 24 and 26-31 can optionally include parallel partition packet network control logic adapted to select a first parallel partition packet network of the plurality of parallel partition packet networks, and transmit a packet through the selected parallel partition packet network independent of the other parallel partition packet networks, said transmitting including switching the packet through the unit level switch of the selected parallel partition packet network at the unit hierarchical level.
In Example 26, the subject matter of Examples 24-25 and 27-31 can optionally include a plurality of said units defining a unit-to-unit hierarchical level higher than the unit hierarchical level, wherein each unit is at the unit hierarchical level and comprises a plurality of said blocks at the block hierarchical level, wherein each parallel partition packet network has a unit level switch at the unit hierarchical level for each unit and coupled to the memory and to each block network of the plurality of blocks of the particular unit, and a unit-to-unit level switch at the unit-to-unit hierarchical level and coupled to each unit level switch of the particular parallel partition packet network at the unit hierarchical level, wherein the parallel partition packet network control logic is further adapted to switch the packet through the unit-to unit level switch of the selected parallel partition packet network at the unit-to-unit hierarchical level, between selected unit level switches of the selected parallel partition packet network.
In Example 27, the subject matter of Examples 24-26 and 28-31 can optionally include a first die, a second die and a plurality of die-to-die communication channels, each parallel partition packet network having a die-to-die communication channel of the plurality of die-to-die communication channels, coupled to the first and second dies, wherein the unit of the plurality of blocks and the unit level switches of the plurality of parallel partition packet networks are on the first die, the device further comprising a plurality of buffers on the first die, each parallel partition packet network having a buffer coupled to the unit level switch and the die-to-die communication channel of the particular parallel partition packet network, wherein the parallel partition packet network control logic is further adapted to buffer a packet in the buffer of the selected parallel partition packet network before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer received from the die-to-die communication channel of the selected parallel partition packet network in the buffer of the selected parallel partition packet network.
In Example 28, the subject matter of Examples 24-27 and 29-31 can optionally include a plurality of unit hierarchical level memory controllers on the second die, each unit level memory controller adapted to control memory transactions between the memory and the processors of an associated unit, a plurality of unit level switches on the second die, wherein each parallel partition packet network has a unit level switch of the plurality of unit level switches on the second die, and a buffer of the plurality of buffers on the second die, each unit level switch on the second die coupled to the unit level switch and buffer of the particular parallel partition packet network on the first die.
In Example 29, the subject matter of Examples 24-28 and 30-31 can optionally include that each memory controller has parallel partition packet network control logic on the second die adapted to switch a packet through the unit level switch of the selected parallel partition packet network on the second die, to buffer a packet in the buffer of the selected parallel partition packet network one the second die before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer on the second die received from the die-to-die communication channel of the selected parallel partition packet network on the second die.
In Example 30, the subject matter of Examples 24-29 and 31 can optionally include a common communication channel for carrying packets from each of the plurality of parallel partition packet networks wherein each packet carried by the common communication channel has a tag identifying a parallel partition packet network, and a common unit level switch coupled to each parallel partition packet network of the plurality of parallel partition packet networks, wherein the parallel partition packet network control logic is adapted to tag a packet from each parallel partition packet network with a tag to identify the parallel partition packet network from which the packet arrived, and switch the tagged packet through the common unit level switch to the common communication channel.
In Example 31, the subject matter of Examples 24-30 can optionally include that the parallel partition packet network control logic is adapted read a tag of a packet from the common communication channel, and to switch the packet through the common unit level switch to the parallel partition packet network identified by the tag of the packet.
Example 32 is a method of routing packets in at least one integrated circuit, comprising:
parallel partition packet network control logic selecting a first parallel partition packet network of a plurality of parallel partition packet networks, coupling a memory of a device, to a plurality of blocks defining a block hierarchical level, each block including a plurality of processors and a block network interconnecting the processors of the block, wherein a plurality of said blocks is organized in at least one unit defining a unit hierarchical level higher than the first hierarchical level, each parallel partition packet network having a unit level switch at the unit hierarchical level coupled to the memory and to each block network of the plurality of blocks of the unit; and
parallel partition packet network control logic transmitting a packet through the selected parallel partition packet network independent of the other parallel partition packet networks, said transmitting including switching the packet through a unit level switch of the selected parallel partition packet network at the unit hierarchical level.
In Example 33, the subject matter of Examples 32 and 34-38 can optionally include
parallel partition packet network control logic switching the packet through a unit-to unit level switch of the selected parallel partition packet network at a unit-to-unit hierarchical level, between selected unit level switches of the selected parallel partition packet network,
wherein a plurality of said units define the unit-to-unit level of a third hierarchical level higher than the second hierarchical level, wherein each unit is at the unit hierarchical level and comprises a plurality of said blocks at the block hierarchical level, wherein each parallel partition packet network has a unit level switch at the unit hierarchical level for each unit and coupled to the memory and to each block network of the plurality of blocks of the particular unit, and a unit-to-unit level switch at the unit-to-unit hierarchical level and coupled to each unit level switch of the particular parallel partition packet network at the unit hierarchical level.
In Example 34, the subject matter of Examples 32-33 and 35-38 can optionally include parallel partition packet network control logic buffering on a first die, a packet in a buffer of the selected parallel partition packet network, each parallel partition packet network having a buffer and a die-to-die communication channel at the unit hierarchical level, each buffer of a parallel partition packet network being coupled to the die-to-die communication channel and the unit level switch of the particular parallel partition packet network, the method further comprising parallel partition packet network control logic transmitting a packet from a buffer to a die-to-die communication channel of the selected parallel partition packet network, receiving a packet from the die-to-die communication channel of the selected parallel partition packet network, and buffering the received packet in the buffer of the selected parallel partition packet network.
In Example 35, the subject matter of Examples 32-34 and 36-38 can optionally include a unit level memory controller of a plurality of unit hierarchical level memory controllers on a second die, controlling memory transactions between a memory and the processors of an associated unit, each unit having an associated memory controller of the plurality of unit level memory controllers on the second die;
parallel partition packet network control logic on the second die switching a packet received from the die-to-die communication channel of the selected parallel partition packet network, through a unit level switch of the selected parallel partition packet network on the second die, each parallel partition packet network having a unit level switch of a plurality of unit level switches on the second die; and
parallel partition packet network control logic buffering on a second die, a packet received from the die-to-die communication channel of the selected parallel partition packet network in a buffer of the selected parallel partition packet network, each parallel partition packet network having a buffer of a plurality of buffers on the second die, coupled to the unit level switch of the particular parallel partition packet network, each unit level switch on the second die being coupled to the die-to-die communication channel of the particular parallel partition packet network.
In Example 36, the subject matter of Examples 32-35 and 37-38 can optionally include parallel partition packet network control logic on the second die switching a packet through a unit level switch of the selected parallel partition packet network on the second die, buffering a packet in the buffer of the selected parallel partition packet network on the second die, and transmitting a packet from the buffer of the selected parallel partition packet network to the die-to-die communication channel of the selected parallel partition packet network.
In Example 37, the subject matter of Examples 32-36 and 38 can optionally include parallel partition packet network control logic tagging a packet arriving from the selected parallel partition packet network with a tag to identify the selected parallel partition packet network from which the packet arrived, and switching the tagged packet through a common unit level switch to a common communication channel coupled to each parallel partition packet network of the plurality of parallel partition packet networks, for carrying packets from each of the plurality of parallel partition packet networks wherein each packet carried by the common communication channel has a tag identifying a parallel partition packet network from which the tagged packet arrived.
In Example 38, the subject matter of Examples 32-37 can optionally include parallel partition packet network control logic reading a tag of a packet arrived from the common communication channel, and switching the arrived packet through the common unit level switch to the parallel partition packet network identified by the tag of the packet.
Example 39 is a computer architecture device for use with a memory, comprising:
a plurality of blocks defining a block hierarchical level, each block including a plurality of processors and a block network interconnecting the processors of the block;
a unit defining a unit hierarchical level higher than the block hierarchical level, said unit including a plurality of said blocks at the block hierarchical level; and
a plurality of parallel partition packet networks, each parallel partition packet network being independent of the other parallel partition packet networks of the plurality of parallel partition packet networks, each parallel partition packet network having a unit level switch at the unit hierarchical level coupled to the memory and to each block network of the plurality of blocks of the unit.
In Example 40, the subject matter of Examples 39 and 41-46 can optionally include parallel partition packet network control logic adapted to select a first parallel partition packet network of the plurality of parallel partition packet networks, and transmit a packet through the selected parallel partition packet network independent of the other parallel partition packet networks, said transmitting including switching the packet through the unit level switch of the selected parallel partition packet network at the unit hierarchical level.
In Example 41, the subject matter of Examples 39-40 and 42-46 can optionally include a plurality of said units defining a unit-to-unit hierarchical level higher than the unit hierarchical level, wherein each unit is at the unit hierarchical level and comprises a plurality of said blocks at the block hierarchical level, wherein each parallel partition packet network has a unit level switch at the unit hierarchical level for each unit and coupled to the memory and to each block network of the plurality of blocks of the particular unit, and a unit-to-unit level switch at the unit-to-unit hierarchical level and coupled to each unit level switch of the particular parallel partition packet network at the unit hierarchical level, wherein the parallel partition packet network control logic is further adapted to switch the packet through the unit-to unit level switch of the selected parallel partition packet network at the unit-to-unit hierarchical level, between selected unit level switches of the selected parallel partition packet network.
In Example 42, the subject matter of Examples 39-40 and 42-46 can optionally include a first die, a second die and a plurality of die-to-die communication channels, each parallel partition packet network having a die-to-die communication channel of the plurality of die-to-die communication channels, coupled to the first and second dies, wherein the unit of the plurality of blocks and the unit level switches of the plurality of parallel partition packet networks are on the first die, the device further comprising a plurality of buffers on the first die, each parallel partition packet network having a buffer coupled to the unit level switch and the die-to-die communication channel of the particular parallel partition packet network, wherein the parallel partition packet network control logic is further adapted to buffer a packet in the buffer of the selected parallel partition packet network before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer received from the die-to-die communication channel of the selected parallel partition packet network in the buffer of the selected parallel partition packet network.
In Example 43, the subject matter of Examples 39-42 and 44-46 can optionally include a plurality of unit hierarchical level memory controllers on the second die, each unit level memory controller adapted to control memory transactions between the memory and the processors of an associated unit, a plurality of unit level switches on the second die, wherein each parallel partition packet network has a unit level switch of the plurality of unit level switches on the second die, and a buffer of the plurality of buffers on the second die, each unit level switch on the second die coupled to the unit level switch and buffer of the particular parallel partition packet network on the first die.
In Example 44, the subject matter of Examples 39-43 and 44-46 can optionally include that each memory controller has parallel partition packet network control logic on the second die adapted to switch a packet through the unit level switch of the selected parallel partition packet network on the second die, to buffer a packet in the buffer of the selected parallel partition packet network one the second die before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer on the second die received from the die-to-die communication channel of the selected parallel partition packet network on the second die.
In Example 45, the subject matter of Examples 39-44 and 46 can optionally include a common communication channel for carrying packets from each of the plurality of parallel partition packet networks wherein each packet carried by the common communication channel has a tag identifying a parallel partition packet network, and a common unit level switch coupled to each parallel partition packet network of the plurality of parallel partition packet networks, wherein the parallel partition packet network control logic is adapted to tag a packet from each parallel partition packet network with a tag to identify the parallel partition packet network from which the packet arrived, and switch the tagged packet through the common unit level switch to the common communication channel.
In Example 46, the subject matter of Examples 39-45 can optionally include that the parallel partition packet network control logic is adapted read a tag of a packet from the common communication channel, and to switch the packet through the common unit level switch to the parallel partition packet network identified by the tag of the packet.
Example 47 is a method of routing packets in at least one integrated circuit, comprising:
parallel partition packet network control logic selecting a first parallel partition packet network of a plurality of parallel partition packet networks, coupling a memory of a device, to a plurality of blocks defining a block hierarchical level, each block including a plurality of processors and a block network interconnecting the processors of the block, wherein a plurality of said blocks is organized in at least one unit defining a unit hierarchical level higher than the first hierarchical level, each parallel partition packet network having a unit level switch at the unit hierarchical level coupled to the memory and to each block network of the plurality of blocks of the unit; and
parallel partition packet network control logic transmitting a packet through the selected parallel partition packet network independent of the other parallel partition packet networks, said transmitting including switching the packet through a unit level switch of the selected parallel partition packet network at the unit hierarchical level.
In Example 48, the subject matter of Examples 47 and 49-53 optionally include
parallel partition packet network control logic switching the packet through a unit-to unit level switch of the selected parallel partition packet network at a unit-to-unit hierarchical level, between selected unit level switches of the selected parallel partition packet network,
wherein a plurality of said units define the unit-to-unit level of a third hierarchical level higher than the second hierarchical level, wherein each unit is at the unit hierarchical level and comprises a plurality of said blocks at the block hierarchical level, wherein each parallel partition packet network has a unit level switch at the unit hierarchical level for each unit and coupled to the memory and to each block network of the plurality of blocks of the particular unit, and a unit-to-unit level switch at the unit-to-unit hierarchical level and coupled to each unit level switch of the particular parallel partition packet network at the unit hierarchical level.
In Example 49, the subject matter of Examples 47-48 and 50-53 can optionally include parallel partition packet network control logic buffering on a first die, a packet in a buffer of the selected parallel partition packet network, each parallel partition packet network having a buffer and a die-to-die communication channel at the unit hierarchical level, each buffer of a parallel partition packet network being coupled to the die-to-die communication channel and the unit level switch of the particular parallel partition packet network, the method further comprising parallel partition packet network control logic transmitting a packet from a buffer to a die-to-die communication channel of the selected parallel partition packet network, receiving a packet from the die-to-die communication channel of the selected parallel partition packet network, and buffering the received packet in the buffer of the selected parallel partition packet network.
In Example 50, the subject matter of Examples 47-49 and 51-53 can optionally include a unit level memory controller of a plurality of unit hierarchical level memory controllers on a second die, controlling memory transactions between a memory and the processors of an associated unit, each unit having an associated memory controller of the plurality of unit level memory controllers on the second die;
parallel partition packet network control logic on the second die switching a packet received from the die-to-die communication channel of the selected parallel partition packet network, through a unit level switch of the selected parallel partition packet network on the second die, each parallel partition packet network having a unit level switch of a plurality of unit level switches on the second die; and
parallel partition packet network control logic buffering on a second die, a packet received from the die-to-die communication channel of the selected parallel partition packet network in a buffer of the selected parallel partition packet network, each parallel partition packet network having a buffer of a plurality of buffers on the second die, coupled to the unit level switch of the particular parallel partition packet network, each unit level switch on the second die being coupled to the die-to-die communication channel of the particular parallel partition packet network.
In Example 51, the subject matter of Examples 47-50 and 52-53 can optionally include parallel partition packet network control logic on the second die switching a packet through a unit level switch of the selected parallel partition packet network on the second die, buffering a packet in the buffer of the selected parallel partition packet network on the second die, and transmitting a packet from the buffer of the selected parallel partition packet network to the die-to-die communication channel of the selected parallel partition packet network.
In Example 52, the subject matter of Examples 47-51 and 53 can optionally include parallel partition packet network control logic tagging a packet arriving from the selected parallel partition packet network with a tag to identify the selected parallel partition packet network from which the packet arrived, and switching the tagged packet through a common unit level switch to a common communication channel coupled to each parallel partition packet network of the plurality of parallel partition packet networks, for carrying packets from each of the plurality of parallel partition packet networks wherein each packet carried by the common communication channel has a tag identifying a parallel partition packet network from which the tagged packet arrived.
In Example 53, the subject matter of Examples 47-52 can optionally include parallel partition packet network control logic reading a tag of a packet arrived from the common communication channel, and switching the arrived packet through the common unit level switch to the parallel partition packet network identified by the tag of the packet.
Example 54 is an apparatus comprising means to perform a method as described in any preceding Example.
Example 55 is a computer architecture device for use with a memory, comprising:
a plurality of blocks defining a block hierarchical level, each block including a plurality of processors and a block network interconnecting the processors of the block;
a unit defining a unit hierarchical level higher than the block hierarchical level, said unit including a plurality of said blocks at the block hierarchical level; and
a plurality of parallel partition packet networks, each parallel partition packet network being independent of the other parallel partition packet networks of the plurality of parallel partition packet networks, each parallel partition packet network having a unit level switch at the unit hierarchical level coupled to the memory and to each block network of the plurality of blocks of the unit.
In Example 56, the subject matter of Examples 55 and 57-62 can optionally include parallel partition packet network control logic adapted to select a first parallel partition packet network of the plurality of parallel partition packet networks, and transmit a packet through the selected parallel partition packet network independent of the other parallel partition packet networks, said transmitting including switching the packet through the unit level switch of the selected parallel partition packet network at the unit hierarchical level.
In Example 57, the subject matter of Examples 55-56 and 58-62 can optionally include a plurality of said units defining a unit-to-unit hierarchical level higher than the unit hierarchical level, wherein each unit is at the unit hierarchical level and comprises a plurality of said blocks at the block hierarchical level, wherein each parallel partition packet network has a unit level switch at the unit hierarchical level for each unit and coupled to the memory and to each block network of the plurality of blocks of the particular unit, and a unit-to-unit level switch at the unit-to-unit hierarchical level and coupled to each unit level switch of the particular parallel partition packet network at the unit hierarchical level, wherein the parallel partition packet network control logic is further adapted to switch the packet through the unit-to unit level switch of the selected parallel partition packet network at the unit-to-unit hierarchical level, between selected unit level switches of the selected parallel partition packet network.
In Example 58, the subject matter of Examples 55-57 and 59-62 can optionally include a first die, a second die and a plurality of die-to-die communication channels, each parallel partition packet network having a die-to-die communication channel of the plurality of die-to-die communication channels, coupled to the first and second dies, wherein the unit of the plurality of blocks and the unit level switches of the plurality of parallel partition packet networks are on the first die, the device further comprising a plurality of buffers on the first die, each parallel partition packet network having a buffer coupled to the unit level switch and the die-to-die communication channel of the particular parallel partition packet network, wherein the parallel partition packet network control logic is further adapted to buffer a packet in the buffer of the selected parallel partition packet network before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer received from the die-to-die communication channel of the selected parallel partition packet network in the buffer of the selected parallel partition packet network.
In Example 59, the subject matter of Examples 55-58 and 60-62 can optionally include a plurality of unit hierarchical level memory controllers on the second die, each unit level memory controller adapted to control memory transactions between the memory and the processors of an associated unit, a plurality of unit level switches on the second die, wherein each parallel partition packet network has a unit level switch of the plurality of unit level switches on the second die, and a buffer of the plurality of buffers on the second die, each unit level switch on the second die coupled to the unit level switch and buffer of the particular parallel partition packet network on the first die.
In Example 60, the subject matter of Examples 55-59 and 61-62 can optionally include that each memory controller has parallel partition packet network control logic on the second die adapted to switch a packet through the unit level switch of the selected parallel partition packet network on the second die, to buffer a packet in the buffer of the selected parallel partition packet network one the second die before transmitting the packet to the die-to-die communication channel of the selected parallel partition packet network, and to buffer a packet in the buffer on the second die received from the die-to-die communication channel of the selected parallel partition packet network on the second die.
In Example 61, the subject matter of Examples 55-60 and 62 can optionally include a common communication channel for carrying packets from each of the plurality of parallel partition packet networks wherein each packet carried by the common communication channel has a tag identifying a parallel partition packet network, and a common unit level switch coupled to each parallel partition packet network of the plurality of parallel partition packet networks, wherein the parallel partition packet network control logic is adapted to tag a packet from each parallel partition packet network with a tag to identify the parallel partition packet network from which the packet arrived, and switch the tagged packet through the common unit level switch to the common communication channel.
In Example 62, the subject matter of Examples 55-61 can optionally include that the parallel partition packet network control logic is adapted read a tag of a packet from the common communication channel, and to switch the packet through the common unit level switch to the parallel partition packet network identified by the tag of the packet. The described operations may be implemented as a method, apparatus or computer program product using standard programming and/or engineering techniques to produce software, firmware, hardware, or any combination thereof. The described operations may be implemented as computer program code maintained in a “computer readable storage medium”, where a processor may read and execute the code from the computer storage readable medium. The computer readable storage medium includes at least one of electronic circuitry, storage materials, inorganic materials, organic materials, biological materials, a casing, a housing, a coating, and hardware. A computer readable storage medium may comprise, but is not limited to, a magnetic storage medium (e.g., hard disk drives, floppy disks, tape, etc.), optical storage (CD-ROMs, DVDs, optical disks, etc.), volatile and non-volatile memory devices (e.g., EEPROMs, ROMs, PROMs, RAMs, DRAMs, SRAMs, Flash Memory, firmware, programmable logic, etc.), Solid State Devices (SSD), etc. The code implementing the described operations may further be implemented in hardware logic implemented in a hardware device (e.g., an integrated circuit chip, Programmable Gate Array (PGA), Application Specific Integrated Circuit (ASIC), etc.). Still further, the code implementing the described operations may be implemented in “transmission signals”, where transmission signals may propagate through space or through a transmission media, such as an optical fiber, copper wire, etc. The transmission signals in which the code or logic is encoded may further comprise a wireless signal, satellite transmission, radio waves, infrared signals, Bluetooth, etc. The program code embedded on a computer readable storage medium may be transmitted as transmission signals from a transmitting station or computer to a receiving station or computer. A computer readable storage medium is not comprised solely of transmissions signals. Those skilled in the art will recognize that many modifications may be made to this configuration without departing from the scope of the present description, and that the article of manufacture may comprise suitable information bearing medium known in the art. Of course, those skilled in the art will recognize that many modifications may be made to this configuration without departing from the scope of the present description, and that the article of manufacture may comprise any tangible information bearing medium known in the art.
In certain applications, a device in accordance with the present description, may be embodied in a computer system including a video controller to render information to display on a monitor or other display coupled to the computer system, a device driver and a network controller, such as a computer system comprising a desktop, workstation, server, mainframe, laptop, handheld computer, etc. Alternatively, the device embodiments may be embodied in a computing device that does not include, for example, a video controller, such as a switch, router, etc., or does not include a network controller, for example.
The illustrated logic of figures may show certain events occurring in a certain order. In alternative embodiments, certain operations may be performed in a different order, modified or removed. Moreover, operations may be added to the above described logic and still conform to the described embodiments. Further, operations described herein may occur sequentially or certain operations may be processed in parallel. Yet further, operations may be performed by a single processing unit or by distributed processing units.
The foregoing description of various embodiments has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit to the precise form disclosed. Many modifications and variations are possible in light of the above teaching.