This invention is related to integrated circuit devices and in particular to random access semiconductor memories, including dynamic random access memories (DRAMs).
A DRAM is a type of random access memory that typically stores each bit of data in a separate capacitor coupled to a transistor within the integrated circuit. Lithographic scaling and process enhancements usually quadruple number of bits of storage in a DRAM integrated circuit about every three years, sometimes referred to as “Moore's Law.” However, the individual memory cells are now so small that maintaining the capacitance of each cell, as well as reducing charge leakage from the cell, may significantly inhibit continual size reductions.
What is needed is a memory cell that is smaller than the conventional one-transistor, one-capacitor cell, that is readily scalable below 20 nm design rules, that is compatible with standard bulk silicon processing technology, and that consumes lower power, both statically and dynamically. One replacement for the conventional one-transistor, one-capacitor cell is a vertically arranged thyristor, or semiconductor-controlled rectifier (SCR), memory cell. Examples of vertically arranged thyristor, or vertical layer thyristor (VLT), memory cells are described in U.S. Pat. No. 9,564,199, which issued on Feb. 7, 2017 and assigned to the present assignee, and related cases.
The present invention utilizes such VLT memory cell arrays. But there is still the problem of reading the bit information stored in a memory cell array and of writing bit information into a memory cell array for storage. For DRAM memory cell arrays, there is a DDR SDRAM (Double Data Rate Synchronous DRAM) standard, or more precisely, a series of standards, developed by JEDEC (Joint Electron Device Engineering Council), under which bits are transferred to and from DRAM integrated circuits at high speed. The DDR SRAM (or DDR for short) standard also defines high speed transfer of bits to and from memory modules of DRAM integrated circuits along the buses of a computer system.
It is desirable that a thyristor-based memory integrated circuit, which includes, or is based on, a VLT memory be designed to be compatible with DDR standards. In particular, it is desirable that a VLT memory integrated circuit be designed to interface with a standard DDR controller or with a customized DDR controller.
The present invention provides for a memory array in a semiconductor substrate which has: a plurality of parallel bit lines in a first direction and a plurality of parallel word lines in a second direction perpendicular to the first direction; a plurality of thyristor memory cells, each thyristor memory cell at the intersection of a bit line and a word line, and formed by semiconductor layers vertically arranged in the semiconductor substrate, the semiconductor layers doped with alternating P and N-type dopants, a top semiconductor layer connected to a bit or word lines, a bottom semiconductor layer connected to the other of the bit or word lines; and each of the bit lines connected to an input of a multiplexer, the output of the multiplexer coupled to a sense amplifier for reading the value of a bit stored in a selected thyristor memory cell. The memory array has a greater efficiency in comparison to a DRAM memory array. The memory array has at least 8G (233) memory cells which are formed by 2X-nm process technology.
The present invention further provides for the memory array to be comprised of a plurality of MATs (Memory Array Tiles), each MAT having at least 8M (223) thyristor memory cells.
Other objects, features, and advantages of the present invention will become apparent upon consideration of the following detailed description and the accompanying drawings, in which like reference designations represent like features throughout the figures.
In order to show VLT memory arrays can be adapted to DDR standard requirements, it may be helpful to review how conventional DRAMs operate under the DDR standard. In this particular example the version of the DDR standard is LPDDR4 (Low-Power Double Data Rate 4), which represents a low-power version of the fourth iteration of DDR SDRAM standard.
Many aspects of DRAM operation derive from the characteristics of a capacitive storage cell. The leakiness of the capacitor results in a need for refresh operations to recharge the cell capacitor, and the fundamental way the storage cell is read affects other aspects of how the DRAM memory is organized. Charge sharing is used to sense the bit value of a memory cell. The bit line is first pre-charged to a voltage between a logic 1 and a logic 0, i.e., VDD/2. The cell is then selected by turning on the access transistor of the cell so charge moves between the bit line and the cell. If the bit line is at a higher voltage than the cell, then charge moves out of the cell onto the bit line; if the bit line is at a lower voltage than the cell, then charge moves from the bit line into the cell. A charge transfer changes the voltage on the bit line, which is sensed and latched as the read value of the cell. However, a loss or gain of charge in the storage capacitor alters the original charge on the node, so that the read process is destructive. Therefore, after each read operation, the original charge levels must be restored through a write-back operation.
Two ways of looking at a DRAM are: logically, where physical details are abstracted; and physically, where silicon array characteristics are considered. The logical arrangement should correspond to the physical arrangement. For example, an LPDDR4 memory chip may have a capacity for 8 Gb of storage, implemented as two independent channels of 4 Gb each. A channel may have eight banks of memory. Each bank may have a size of 512 Mb arranged as 32K pages of 16K bits each.
A full LPDDR4 memory integrated circuit consists of two high-level elements: the memory cell array and the DDR interface. Some operations affect the memory cell array; others affect the interface. Activity between the interface and the array may take place in the background while other activity is present between the interface and the external system.
LPDDR4 functionality fundamentally consists of four basic operations: activate, read, write, and pre-charge. Variations on these (like burst read/write, auto-pre-charge, and so forth) may result in a longer list of commands but do not change the technology. These instructions are supplemented with maintenance operations, such as refresh, training, and mode register operations for a complex set of operational commands.
The basic operations may be summarized as follows: An Activate operation causes a page of memory to be opened by selecting the appropriate word lines in the array. (A page is a fixed-length contiguous block of virtual memory, described by a single entry in the page table. It is the smallest unit of data for memory management in a virtual memory operating system and hence the smallest unit of data to move into, or from memory, in a computer system.) The contents of the page are sensed and latched; the page is left open either for write-back (in the case of a read operation) or write (in what is effectively a read-modify-write). A Read operation starts the read-out sequence. Each burst group is loaded from the sense amplifier latches into the DDR register, and then the DDR register is read sequentially, one 16-bit word at a time. Write-back occurs in the background via bit line sense amplifiers via the still-open page. A Write operation loads data into a DDR register, one 16-bit word at a time. The contents are then loaded into the Shadow Register for writing into the array (via the still-open page). While they are written, a new 256-bit value can be loaded into the DDR register for a subsequent write, if desired. A Pre-charge prepares the array for the next operation after the final burst group has been read or written. In the case of a writing operation, a write-recovery delay is needed to ensure that the last burst group is successfully written to the array before proceeding. The opened page is closed, which allows the bit lines to float and be charged back to a VDD/2 level.
Only the Activate operation involves memory array sensing; the Read operation involves transfer of data between the latched sensed data and the DDR register and the readout of the DDR register.
Two ways of looking at a DRAM are: logically, where physical details are abstracted; and physically, where silicon array characteristics are considered. The logical arrangement should correspond to the physical arrangement.
A DDR register is the main interface between the outside system and the memory array. When reading, memory cell array data is loaded first into the DDR register; when writing, the desired data is first written from outside into the register. Because reading the traditional DRAM cell array contents is destructive, each read operation must be followed by a write-back operation that restores the original values into the cells. After reading, the DDR register contents are copied to a Shadow Register. While the contents of the DDR register are read by the outside system, the Shadow Register implements the write-back to restore the values on the selected page. Likewise, when writing to the array, the value in the DDR register is transferred to the Shadow Register for writing. While that value is being written, a new value can be loaded into the DDR register.
Reading a page of memory involves a sequence of events that resemble two nested software DO-loops. Each page is divided into burst groups of 256 bits each. Thus, a single bank's 16K-bit page has 64 burst groups. A full page is read by sequentially reading each burst group; this resembles the outer DO loop. Each burst group is loaded into the 256-bit DDR register. That register is divided into 16 16-bit words, and the contents are read out by sequentially, delivering each 16-bit word on each clock edge. This behaves as the inner DO loop. The row address signals, entered as the RAS, select the page. The column address signals, or CAS, select both the burst group and set the starting word to be read from the DDR register, since one doesn't need to start reading at the left side of the DDR register.
This sequence of reading values out of (or writing values into) the DDR register that proceeds while the Shadow Register is implementing either write-back of the values just read or writing of the values previously loaded into the DDR register.
Timing may be complex, depending on the desired sequence of operations. If a read in one bank is followed by a read in a different bank, timing requirements are eased, since there is no need to wait for write-back and pre-charge in the original bank before reading from the next bank. The toughest timing requirements involve successive reads and writes from the same bank.
While a memory bank would appear logically to be a single array with 32K rows and 16K columns, it's not physically possible to build such an array given known techniques. Drivers that select pages have a limited drive capacity; thus, only a limited number of select transistors can be driven without performance degrading below specification. Sense amplifiers can support only a limited number of storage cells. With too many cells, the change in voltage due to charge sharing becomes too small and is swamped by noise.
Therefore, each memory implementation results in some maximum physical array that can be built in order to guarantee a manufacturable memory that is fast and reliable. This maximum array is referred to as a memory array tile, or MAT. Each MAT is a self-contained array with its own word line and bit line decoding and sense amps. The memory is formed by a replication of a MAT in some arrangement.
One example MAT from a conventional DRAM array at the 2X-nm process (2X-nm process is a shorthand designation of semiconductor processes with a range of 20 to 29 nm critical dimensions) node is sized at 1024 bitlines and 620 wordlines. This non-power-of-2 sizing poses some decoding challenges and may only partially use the last MATs. A bank with this size MAT is then built by creating an array of MATs that is 16 wide and 53 deep, for a total of 848 MATs, as shown by the left hand drawing in
In an embodiment of the present invention, a thyristor-based memory can be used in place of the conventional one-capacitor, one transistor DRAM. In particular, vertical layer thyristor (VLT) memory cell arrays are designed to be compatible with LPDDR4 (Low-Power Double Data Rate 4) DRAM memory. Banks of VLT memory can mimic banks of conventional DRAM, with similar, equivalent, or compatible timing and a VLT memory may even interface to a standard DDR controller or to a customized DDR controller.
An array of VLT thyristor memory bit cells are arranged in a cross-point grid and interconnected by metal conductors and buried doped layers which form row and column lines. In one embodiment of the present invention, each memory cell thyristor is formed by a PNPN stack built over a P-well in a semiconductor substrate with an anode connected to a row line and a cathode connected to a column line. The bottom N layers (i.e., the cathode) of a column of memory cell thyristors are connected together by a buried bit line between deep-trench isolation regions which separate the memory cell columns. Shallow trench isolation (STI) regions separate the memory cell rows. In one embodiment, the bottom of the STI regions is made of tungsten so that a buried bit is formed by alternating sections of N regions and tungsten. Tungsten is relatively resistive so that the voltage along the line is maintained by tapping from a copper metal 1 layer. Copper permits a much longer bit line to be supported than is possible with conventional DRAM. The bottom N region removes minority carriers, such as holes, in a memory cell that may otherwise disturb adjacent memory cells.
It should be noted that the VLT memory cells can also be constructed having row and column connections reversed with corresponding modifications well understood by semiconductor circuit designers. Furthermore, through the drawings show rows as horizontal and columns as vertical, the words, row and column, should be understood in the general sense of a first direction and a perpendicular second direction.
With VLT memory cells a RAM (Random Access Memory) integrated circuit can be constructed as a replacement for a DRAM integrated circuit. As shown in
A significant difference from a conventional DRAM integrated circuit, however, is that the memory cells in the array 10 do not require refresh operations, which are intimately woven into DRAM operations. Whether the DRAM memory is idle or being accessed, refresh events must occur to prevent loss of data. During a read operation all bit lines of a conventional DRAM must be read for write-back purposes, i.e., all the bit lines are read so that the read values are written back into the read memory cells. On the other hand, a read operation in a VLT memory cell array is not destructive and no write-back is needed. Therefore the VLT bit cells can share sense amps, as represented in
The sensing mechanism for thyristor memory cells arrays does not rely on charge sharing so that the VLT sense amplifiers can support longer bit lines than the DRAM sense amplifiers. Therefore, the VLT technology can support a MAT that is 2K (211) bits wide and 4K (212) bits deep, or 8M (223) bits in total—much larger than a conventional MAT. A memory made of fewer tiles has less overhead than one made with more tiles. In one embodiment, the array efficiency of a basic VLT array is 77% as compared to 64% for conventional DRAM fabricated on the same 2X-nm process technology. With a larger MAT, an LPDDR4 bank can be built with fewer MATs. From a pure bit-count standpoint, a VLT-based bank contains 64 MATs, as shown by the right hand drawing of
Page access differs between a conventional DRAM, which selects all MATs in a row (shown in the cross-hatched MAT row in left hand drawing of
An effective arrangement of the 64 MATs has each MAT with 512 sense amplifiers supporting the 4K bit lines; each sense amplifier has a multiplexer to select between 8 bit lines, as shown in detail in
In operation, for each page selected one eighth of the bit lines in each MAT are selected, in contrast with the conventional DRAM implementation, where all bit lines are selected. This more efficient use of sense amplifiers would not be possible in a conventional DRAM, where all bit lines must be read for write-back purposes. Because the VLT read is not destructive, no write-back is needed, and therefore bit cells can share sense amps.
Given the flexibility of a VLT MAT, the physical arrangement of MATs may be decoupled from the logical arrangement. Any number of physical arrangements may be possible so long as the busses are routed properly. In one example shown in
To the outside, all LPDDR4 operations for the VLT memory behave as they would for a conventional DRAM memory. However, internal operations will vary, since, for example, write-back and pre-charge may not be required. However, any such internal difference may not be evident to the DDR controller since the internal memory logic maps all of the operations to their VLT equivalents, with all timing requirements being met.
In an embodiment, a benefit of the VLT bit cell is that it requires no refresh, but refresh is intimately woven into DRAM operation. Whether the DRAM memory is idle or being accessed, refresh events must occur to prevent loss of data. The state machine for DDR controller illustrates the impact of refresh on operation. All states relating to refresh (or branches based on refresh) are rendered redundant for VLT memory and may be removed. Thus, in an embodiment, a VLT-based memory may use a DDR controller that has been adapted, or modified, to accommodate non-existent states in the memory.
This description of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form described, and many modifications and variations are possible in light of the teaching above. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications. This description will enable others skilled in the art to best utilize and practice the invention in various embodiments and with various modifications as are suited to a particular use. The scope of the invention is defined by the following claims.
This patent application claims priority from U.S. Provisional Patent Application No. 62/405,848, entitled, “Memory Array Tile for Thyristor Memory Cell,” and filed Oct. 7, 2016.
Number | Date | Country | |
---|---|---|---|
62405848 | Oct 2016 | US |