Field of the Disclosure
Embodiments of the present disclosure generally relate to data and memory storage systems, and more particularly, to a memory device utilizing a three-dimensional Network-on-Chip routing protocol for the interconnection of memory subarrays, mats, arrays, subbanks, and/or banks.
Description of the Related Art
At the heart of a computer is a data storage device, which typically may include rotating magnetic media or a solid state volatile or non-volatile memory device. A number of different technologies exist today for storing information for use in a computing system.
In recent years, there has been a demand for higher density devices—volatile and non-volatile—which maintain a relatively low cost per bit, for use in high capacity storage applications. Today the memory technologies that generally dominate the computing industry are DRAM and NAND flash; however, these memory technologies may not be able to address the current and future capacity or energy demands of next generation computing systems.
Existing memory bank architectures employ a classic fabric routing methodology, which has been widely adopted in SRAM, DRAM, FLASH, MRAM, PCM, and ReRAM, as well as in HMC memory banks. HMC memory banks utilize Through-Silicon-Vias (TSVs) to interconnect stacked dies with a Network-on-Chip-like protocol to access the many dies. However, each die is a classic DRAM die that relies on traditional H-Tree routing techniques to route the many subarrays. This classic methodology limits the density and number of memory cells that may be included in a single die, as well as the bandwidth and number of access points to the same memory bank.
Traditionally, memory banks are architected and organized as banks comprising arrays of subbanks. Each subbank may comprise multiple MATs. Each MAT may be composed of four or more subarrays and predecoding logic. As such, H-Tree routing may be used to route the I/O of the subarrays across the die vertically and horizontally. However, approximately 60% to 80% of the die area is utilized to interconnect the subarrays; therefore, the majority of the surface of the memory is interconnect logic rather than memory. As such, the biggest limitation of existing memory bank architecture is the amount of wire necessary to route the entire memory. This excessive wiring is the main cause of latency in existing memory banks, from SRAM to DRAM. Given the physical limitations of traditional memory banks, subarrays share wordlines for writes and reads; as such, each bank can only access one subarray at a given time. Due to the complexity and cost of implementing additional interfaces, there may be only one physical access interface per bank.
Hence, there is a need in the art for an improved memory device that utilizes an improved three-dimensional routing protocol, allowing for access to any given subarray in parallel while improving total density and bandwidth. Furthermore, there is a need in the art for an improved three-dimensional apparatus and method for routing memory banks without employing a majority of the die for routing.
The present disclosure generally relates to the use of three-dimensional solid state memory structures, both volatile and non-volatile, utilizing a Network-on-Chip routing protocol that provides for access to memory storage via a router. As such, data may be sent to and/or from memory storage as data packets on the chip. The Network-on-Chip routing protocol may be utilized to interconnect unlimited numbers of three-dimensional memory cell matrices, spread on a die or multiple dies, thus allowing for reduced latencies among matrices, selective power control, unlimited memory density growth without major latency penalties, and reduced parasitic capacitance and resistance. Other benefits include an increase in total density as compared to two-dimensional solid state memory structures utilizing a Network-on-Chip routing protocol, improved signal integrity, larger die areas, improved bandwidths, and higher frequencies of operation.
In one embodiment, a three-dimensional memory device is disclosed. The three-dimensional memory device includes a first die, a second die, and a plurality of links. The first die includes a plurality of memory arrays. The second die includes a plurality of routers. Each router is operatively connected to at least one memory array via a link. Each router includes data packet switching logic and at least one aggregator. The at least one aggregator is operatively connected with the data packet switching logic. The plurality of links interconnect each router in a Network-on-Chip routing protocol.
In another embodiment, a multi-layer memory device is disclosed. The multilayer memory device includes two or more first die, a second die, and a second plurality of links for operatively connecting the two or more first die and the second die. Each first die includes a plurality of memory arrays. The two or more first die are each superposed on a first side of the second die. The second die includes a plurality of routers operatively connected via a first plurality of links in a Network-on-Chip routing protocol. Each router includes data packet switching logic and at least one aggregator. The aggregator is operatively connected with the data packet switching logic.
In another embodiment, a memory device is disclosed. The memory device includes a plurality of memory layers. Each memory layer includes a first layer and a second layer. The first layer includes a plurality of routers. Each router is operatively connected to at least one other router of the plurality of routers via a link in a Network-on-Chip routing protocol. Each router includes data packet switching logic and at least one aggregator operatively connected with the data packet switching logic. The second layer includes a plurality of memory arrays. Each router is operatively connected to at least one memory array with a link. The first layer is superposed by the second layer.
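By way of a non-limiting illustration, the structure recited in the first embodiment may be sketched in software. The following Python model is a sketch only; every class and field name is an assumption for exposition and is not part of the disclosure. The array and router counts mirror the eight memory arrays 210 and four routers 212 discussed later.

    from dataclasses import dataclass, field
    from typing import List

    # Illustrative structural sketch of the first embodiment; all names
    # are assumptions for exposition only.

    @dataclass
    class MemoryArray:
        array_id: int

    @dataclass
    class Router:
        router_id: int
        switching_logic: str = "data-packet switching"            # per router
        aggregators: List[str] = field(default_factory=lambda: ["aggregator-0"])
        arrays: List[MemoryArray] = field(default_factory=list)   # via links
        peers: List["Router"] = field(default_factory=list)       # NoC links

    # First die carries the memory arrays; second die carries the routers,
    # interconnected by links in a Network-on-Chip fashion.
    arrays = [MemoryArray(i) for i in range(8)]
    routers = [Router(i) for i in range(4)]
    for i, router in enumerate(routers):
        router.arrays = arrays[2 * i: 2 * i + 2]  # each router serves two arrays
        router.peers = [routers[(i + 1) % 4]]     # minimal ring of router links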
So that the manner in which the above recited features of the present disclosure can be understood in detail, a more particular description of the disclosure, briefly summarized above, may be had by reference to embodiments, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only typical embodiments of this disclosure and are therefore not to be considered limiting of its scope, for the disclosure may admit to other equally effective embodiments.
To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the figures. It is contemplated that elements disclosed in one embodiment may be beneficially utilized on other embodiments without specific recitation.
In the following, reference is made to embodiments of the disclosure. However, it should be understood that the disclosure is not limited to specific described embodiments. Instead, any combination of the following features and elements, whether related to different embodiments or not, is contemplated to implement and practice the disclosure. Furthermore, although embodiments of the disclosure may achieve advantages over other possible solutions and/or over the prior art, whether or not a particular advantage is achieved by a given embodiment is not limiting of the disclosure. Thus, the following aspects, features, embodiments and advantages are merely illustrative and are not considered elements or limitations of the appended claims except where explicitly recited in a claim(s). Likewise, reference to “the disclosure” shall not be construed as a generalization of any inventive subject matter disclosed herein and shall not be considered to be an element or limitation of the appended claims except where explicitly recited in a claim(s).
Embodiments disclosed herein generally relate to the use of three-dimensional solid state memory structures, both volatile and non-volatile, utilizing a Network-on-Chip routing protocol that provides for access to memory storage via a router. As such, data may be sent to and/or from memory storage as data packets on the chip. The Network-on-Chip routing protocol may be utilized to interconnect unlimited numbers of three-dimensional memory cell matrices, spread on a die or multiple dies, thus allowing for reduced latencies among matrices, selective power control, unlimited memory density growth without major latency penalties, and reduced parasitic capacitance and resistance. Other benefits include an increase in total density as compared to two-dimensional solid state memory structures utilizing a Network-on-Chip routing protocol, improved signal integrity, larger die areas, improved bandwidths, and higher frequencies of operation.
In the following description of aspects of the present disclosure, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration of the specific implementations in which the disclosure may be practiced. It should be noted that the figures discussed herein are not drawn to scale and do not indicate actual or relative sizes. Any hatching in the figures is used to distinguish layers and does not represent the type of material used.
A drawback of existing memory bank architecture, such as the architecture of the first memory bank 102 and the second memory bank 104, is the extensive wiring and constant power required to interconnect the subarrays therein.
Each subarray 124 may be connected within the conventional memory bank architecture scheme 120 via wires 126. A conventional memory bank architecture scheme 120 utilizing a line size of eight words of 64 bits maintains a total of 512 bits, or 512 metal tracks, per line. As such, collectively, each conventional memory bank architecture scheme 120 may utilize over 8,000 wires 126 to interconnect the subarrays 124 therewithin. The utilization of the H-Tree routing layout 122 necessitates that power is constantly applied to the entire H-Tree.
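The scale of this wiring overhead follows directly from the line size. The short calculation below reproduces the figures above; the subarray count is an assumption for illustration, as actual designs vary.

    WORDS_PER_LINE = 8    # line size of eight words
    BITS_PER_WORD = 64    # 64-bit words
    SUBARRAYS = 16        # assumed subarray count; actual designs vary

    metal_tracks = WORDS_PER_LINE * BITS_PER_WORD  # 512 metal tracks per line
    total_wires = metal_tracks * SUBARRAYS         # 8192, i.e. over 8,000 wires
    print(f"{metal_tracks} tracks per line, ~{total_wires} wires in total")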
The conventional memory bank 102, 104 may be contrasted with a memory device 200 in which each memory array 210 is operatively connected to a router 212, each router 212 including a plurality of FIFOs 216.
The use of a FIFO 216 may allow clock domains to be broken one-by-one, completely independently, across various channels. As such, a full duplex channel may operate at different bandwidths and/or operating frequencies. Furthermore, each channel may operate at a different and independent frequency. The use of a FIFO 216 may allow an EDA tool to route a Clock Tree Synthesis with improved performance and/or improved signal to noise ratio. Additionally, each FIFO 216 may be implemented with non-volatile and/or volatile technology, for example, SRAM and/or MRAM. Furthermore, the internal logic of the router 212, the data packet switching logic 218, and the aggregator 220 may operate in different clock domains. The different clock domains may run at different and/or multiple frequencies. In another embodiment, the different clock domains may be aligned out of phase. As such, future expansion of the design into GALS (Globally Asynchronous Locally Synchronous) operation may be permitted.
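The clock-domain decoupling performed by the FIFO 216 may be modeled behaviorally. The sketch below assumes a producer domain running at twice the consumer frequency; the depth, rates, and class name are illustrative assumptions only.

    from collections import deque

    # Behavioral sketch of a dual-clock FIFO decoupling two clock domains;
    # depth and frequencies are illustrative assumptions.
    class DualClockFifo:
        def __init__(self, depth=4):
            self.depth = depth
            self.slots = deque()

        def push(self, flit):                 # driven by the producer clock
            if len(self.slots) < self.depth:
                self.slots.append(flit)
                return True
            return False                      # full: producer stalls

        def pop(self):                        # driven by the consumer clock
            return self.slots.popleft() if self.slots else None

    fifo = DualClockFifo()
    sent = received = 0
    for cycle in range(8):
        for _ in range(2):                    # fast domain: two pushes per cycle
            sent += fifo.push(("flit", sent))
        if fifo.pop() is not None:            # slow domain: one pop per cycle
            received += 1
    print(f"sent={sent} received={received} pending={len(fifo.slots)}")

Because the FIFO absorbs the rate mismatch, neither domain needs any knowledge of the other's clock.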
As further shown, the aggregator 220 may be operatively connected to a memory array 210. In some embodiments, the connection between the aggregator 220 and the FIFOs 216 may allow for access to the memory array 210. In certain embodiments, the aggregator 220 may perform translations between a network channel and the memory array 210. In other embodiments, the aggregator 220 may perform translations between multiple network channels and the memory array 210. In one embodiment, the aggregator 220 may be connected to the FIFOs 216 connected to the memory array 210.
Each memory array 210 may be accessed via the router 212. In certain embodiments, each memory array 210 may be accessed via a respective router 212, such as a router operatively connected with the memory array 210. Data packets (not shown) may be fragmented, such that data may be sent to and from the memory array 210 via the router 212 and/or the plurality of first links 206 as fragmented data packets. For example, 64 bits may be broken into four packets of 16 bits or eight packets of eight bits. The same path need not be followed to send each data packet to its destination. As such, in the 16-bit example, four clock cycles plus the number of hops are needed to transit the data packet across the network to read or write the memory at any position.
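The fragmentation just described may be sketched as follows; the little-endian flit order and the helper names are assumptions for illustration only, not the disclosed format.

    # Sketch of fragmenting a 64-bit word into four 16-bit data packets and
    # reassembling it at the destination; field widths follow the example above.
    def fragment(word64, flit_bits=16):
        mask = (1 << flit_bits) - 1
        return [(word64 >> (i * flit_bits)) & mask
                for i in range(64 // flit_bits)]

    def reassemble(flits, flit_bits=16):
        word = 0
        for i, flit in enumerate(flits):     # index acts as a sequence number
            word |= flit << (i * flit_bits)
        return word

    word = 0x0123456789ABCDEF
    flits = fragment(word)                   # four 16-bit packets
    assert reassemble(flits) == word

Because each packet may take a different path, a practical design would carry a sequence number with each flit so that reassembly order is preserved.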
Furthermore, the interconnection of the first die 202 and the second die 204 of the memory device 200 may be in a stacked formation such that the first die 202 is stacked above and/or below the second die 204. The first die 202 may be superposed by the second die 204 such that the first die 202 and the second die 204 overlie one another in parallel planes. As such, the stacking of the first die 202 and the second die 204 is three-dimensional. Each memory array 210 may operate in parallel within the first die 202. Furthermore, in some embodiments, each memory array 210 may maintain a unique channel.
The memory device 200 may also be formed as a multi-layer memory device 200. The two or more first die 202 and the second die 204 of the multi-layer memory device 200 may be operatively connected via a second plurality of links, such that the two or more first die 202 are each superposed on a first side of the second die 204.
Although eight memory arrays 210 and four routers 212 are shown, it is contemplated that any number of memory arrays 210 and routers 212 may be utilized.
In some embodiments, the first layer 232 may be a fabric interconnection die. The first layer 232 may include a plurality of routers 212. Each router 212 may be operatively connected to at least one other router 212 of the plurality of routers 212 via a second link 214 such that the routers are in a Network-on-Chip routing protocol. Each router 212 may include a plurality of FIFOs 216, data packet switching logic 218, and/or at least one aggregator 220. Each router 212 may implement multiple channels. In certain embodiments, each router 212 of the first layer 232 may implement an upper channel and a lower channel. The at least one aggregator 220 may be operatively connected to at least one FIFO 216. In some embodiments, multidirectional access may be provided such that all layers can access the layers above and below.
In some embodiments, the second layer 234 may be a memory array matrices die. The second layer 234 may include a plurality of memory arrays 210. Each router 212 may be operatively connected to at least one memory array 210 with a first link 206. Each router 212 of the first layer 232 may be operatively connected with at least one memory array 210 of a different memory layer 230.
In certain embodiments, the plurality of memory layers 230 may be more than two memory layers 230. As such, by way of example only, the plurality of memory layers 230 may include at least a first memory layer 236, a second memory layer 238, and a third memory layer 240. The second memory layer 238 may be between the first memory layer 236 and the third memory layer 240. Each router 212 of the first memory layer 236, the second memory layer 238, and the third memory layer 240 may be operatively connected with at least one other router 212 of the plurality of routers 212 across the plurality of memory layers 230. The plurality of routers 212 of the second memory layer 238 may be operatively connected with at least one memory array 210 of the second memory layer 238 and at least one memory array 210 of the third memory layer 240. The plurality of memory arrays 210 of the second memory layer 238 may be operatively connected with at least one router 212 of the second memory layer 238 and at least one router 212 of the first memory layer 236. The first memory layer 236 may be superposed by the second memory layer 238. The second memory layer 238 may be superposed by the third memory layer 240.
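The inter-layer connectivity just described may be illustrated with a small model. The grid dimensions below are assumptions for illustration and are not part of the disclosure.

    LAYERS = 3          # first, second, and third memory layers
    ROWS = COLS = 2     # assumed grid of routers per layer

    def neighbors(layer, row, col):
        """All links of one router: in-plane mesh neighbors plus up/down hops."""
        steps = [(0, 1, 0), (0, -1, 0), (0, 0, 1),
                 (0, 0, -1), (1, 0, 0), (-1, 0, 0)]   # N, S, E, W, up, down
        links = []
        for dl, dr, dc in steps:
            l, r, c = layer + dl, row + dr, col + dc
            if 0 <= l < LAYERS and 0 <= r < ROWS and 0 <= c < COLS:
                links.append((l, r, c))
        return links

    # A router on the second (middle) memory layer reaches routers on its
    # own layer and on the layers directly above and below it.
    print(neighbors(1, 0, 0))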
Although twelve memory arrays 210 and twelve routers 212 are shown, it is contemplated that any number of memory arrays 210 and routers 212 may be utilized.
In each of the embodiments described above, each router 212 may be sequentially accessed across the path of a data packet (not shown) without supplying power to the entire memory device 200.
It is contemplated that, in certain embodiments, the memory device 200 may include two or more layers (e.g., an upper layer and a lower layer) that are not dies. The layers may be stacked in the manners described herein, and may further repeat the patterns of the lower layer to create a monolithic three-dimensional element, which may be similar to a die. Data packets may be exchanged via a data packet exchange hop from the upper layer to the lower layer and/or from the lower layer to the upper layer. As such, there may be no direct connection from the I/O interface, as only data packets may be exchanged across the network, except in embodiments in which a super-high-priority channel is created, in which case a single-position FIFO maintains the clock domain dissociation.
The amount of memory available to each router 212 may define the range of addresses for the router 212. To accommodate the Network-on-Chip routing protocol, each router 212 may have a range of addresses, rather than a single address. As such, in some embodiments, each router 212 may have a unique range of addresses rather than X and/or Y coordinates. In certain embodiments, the unique range of addresses for each router 212 may be a unique address. In some embodiments, the range of addresses for each router 212 may be a sequential range of addresses. In some embodiments, the range of addresses for each router 212 may not be a sequential range of addresses, depending on the chosen organization. Additionally, each memory array 210 may have a unique address and/or a unique range of addresses rather than X and/or Y coordinates. The range of addresses for each memory array 210 may be a sequential range of addresses, as each memory array 210 is a linear sequence of memory addresses.
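A sketch of how sequential address ranges might be assigned, sized by the memory attached to each router, follows; the router names and the per-router line counts are illustrative assumptions.

    # Assign each router a unique, sequential address range sized by its
    # attached memory; per-router line counts are illustrative only.
    lines_per_router = {"R0": 1024, "R1": 2048, "R2": 1024, "R3": 1024}

    address_map, base = {}, 0
    for name, lines in lines_per_router.items():
        address_map[name] = range(base, base + lines)
        base += lines

    def owner(address):
        """Return the router whose address range contains the given address."""
        for name, span in address_map.items():
            if address in span:
                return name
        return None

    print(owner(1500))   # "R1": addresses 1024 through 3071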
As such, a data packet (not shown) may be sent to an address. Each router 212 may have a range of addresses defined by the amount of memory available therein, such as within the attached memory array 210. As such, each router 212 and attached memory array 210 may span, by way of example only, 1024 lines or 2048 lines, independently, in the same design. Therefore, the data packet switching logic 218 may match a row and column to a field of the data packet and send the data packet to a local port connected with a memory array 210. In certain embodiments, a calculation may be performed and the result compared to the properties of the router 212 and the attached memory arrays 210. If the addressing of the row is larger or smaller than that of the router 212 and attached memory array 210, the data packet may be routed north or south, respectively. If the addressing of the column is larger or smaller, the data packet may be routed east or west, respectively. Furthermore, if the addressing of the layer is larger or smaller, the data packet may be routed up or down, respectively. As such, the topology may enforce the address routing mechanism on the network. The data packet switching logic 218 may perform a calculation to verify that the data packet address is inside the range of the global address space. If the data packet address is not within the range of the global space, multiple different routing algorithms may be calculated on the fly to reroute the data packet. If the address matches the router's range, the base address may be subtracted from the data packet address to yield the address inside the memory array range, and the data packet may be delivered to the local port. If the address does not match, the data packet may be routed to another port; the choice of port may depend on the topology of the memory device and a routing table. Additionally, the Network-on-Chip routing protocol may be built in any shape without modifying or correcting the addressing logic.
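Under such a topology, the routing decision reduces to per-dimension comparisons. The sketch below assumes a fixed 1024 lines per memory array, a 4x4 grid of routers per layer, and a flat sequential address layout; these parameters and names are assumptions for illustration, not the disclosed implementation.

    LINES = 1024                 # assumed lines per memory array
    ROWS = COLS = 4              # assumed routers per layer

    def coords_of(addr):
        """Map a flat address to (layer, row, col) under the assumed layout."""
        index = addr // LINES
        layer, rest = divmod(index, ROWS * COLS)
        row, col = divmod(rest, COLS)
        return layer, row, col

    def route(packet_addr, router):
        """Pick the output port for a data packet arriving at one router."""
        base = router["base"]
        if base <= packet_addr < base + router["size"]:
            # Subtract the base address to obtain the offset inside the
            # attached memory array and deliver on the local port.
            return ("local", packet_addr - base)
        layer, row, col = coords_of(packet_addr)
        if layer != router["layer"]:
            return ("up",) if layer > router["layer"] else ("down",)
        if row != router["row"]:
            return ("north",) if row > router["row"] else ("south",)
        return ("east",) if col > router["col"] else ("west",)

    router = {"layer": 0, "row": 0, "col": 0, "base": 0, "size": LINES}
    print(route(5000, router))   # address 5000 lies one row north: ('north',)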
In certain embodiments, the multi-layer memory device 200 may further include a plurality of modulators and/or a plurality of demodulators.
Benefits of the present disclosure include higher density memory banks, capable of functioning with higher bandwidths than previously known in the art. Additional benefits include that the Network-on-Chip routing protocol may be utilized to interconnect unlimited numbers of three-dimensional memory cell matrices, spread on a die, thus allowing for reduced latencies among matrices, selective power control, unlimited memory density growth without major latency penalties, and reduced parasitic capacitance and resistance. Other benefits include an increase in total density as compared to two-dimensional solid state memory structures utilizing a Network-on-Chip routing protocol, improved signal integrity, larger die areas, improved bandwidths, and higher frequencies of operation.
Furthermore, the amount of wire required to route the memory may be greatly reduced as compared to the conventional memory bank architecture scheme 120, such that a majority of the die is no longer employed for routing.
Additionally, given the layout and hierarchy defined by the Network-on-Chip routing protocol, the total latency of any communication in a given system may be predicted prior to fabrication. Furthermore, the final power consumption of a memory device may be accurately predicted. In addition, specific routers 212 may be activated sequentially across the path of a data packet as it travels across the network, without having to power the entire network. Additionally, the memory arrays may not require power unless the memory arrays are in use, regardless of any ongoing communication traffic in the memory device 200.
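As one illustration of such a pre-fabrication estimate, the total transfer time may be approximated as the serialization cycles plus the hop count between endpoints, consistent with the "four clock cycles plus hops" example above; the cycle figures below are assumptions for illustration.

    def predicted_cycles(src, dst, flits=4, cycles_per_hop=1):
        """Manhattan hop count across (layer, row, col) plus serialization."""
        hops = sum(abs(a - b) for a, b in zip(src, dst))
        return flits + hops * cycles_per_hop

    # A 64-bit access split into four 16-bit flits, crossing two rows and
    # one layer, completes in a statically predictable cycle count: 4 + 3 = 7.
    print(predicted_cycles(src=(0, 0, 0), dst=(1, 2, 0)))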
While the foregoing is directed to embodiments of the present disclosure, other and further embodiments of the disclosure may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.