Semiconductor memory has been used to make Solid State Drives (SSD). A SSD, also known as a solid-state disk although it contains neither an actual disk nor a drive motor to spin a disk, is a solid-state storage device that uses integrated circuit assemblies as memory to store data persistently. SSDs primarily (but not always) use electronic interfaces compatible with traditional block input/output (I/O) hard disk drives, which permit simple replacements in common applications. Additionally, new I/O interfaces, like SATA Express, have been designed to address specific requirements of the SSD technology.
SSDs have no moving (mechanical) components. This distinguishes them from traditional electromechanical magnetic disks such as hard disk drives (HDDs) or floppy disks, which contain spinning disks and movable read/write heads. Compared with electromechanical disks, SSDs are typically more resistant to physical shock, run silently, have lower access time, less latency and lower power consumption. However, while the price of SSDs has continued to decline over time, consumer-grade SSDs are still more expensive per unit of storage than consumer-grade HDDs.
Like-numbered elements refer to common components in the different figures.
Technology is proposed for an SSD with a package optimized for semiconductor wafers. An SSD is configured by thinning a plurality of wafers, stacking the undiced wafers, and connecting the wafers by through silicon vias (TSV). A TSV is a vertical electrical connection (via) passing through a silicon wafer or die. TSVs are a high performance interconnect technique used as an alternative to wire-bond and flip chips to create three dimensional (3D) packages and 3D integrated circuits.
Some SSDs use NAND flash memory (including two dimensional and three dimensional memory structures). However, other forms of non-volatile semiconductor memory can also be used. One technology that can be used to implement memory for an SSD includes charge-trapping material arranged vertically in a 3D stacked memory structure. One example of a 3D memory structure is the Bit Cost Scalable (BiCS) architecture which comprises a stack of alternating conductive and dielectric layers. A memory hole is formed in the stack and a NAND string is then formed by filling the memory hole with materials including a charge-trapping layer to create a vertical column of memory cells. A straight NAND string extends in one memory hole. Control gates of the memory cells are provided by the conductive layers.
The following discussion provides details of one example of a suitable structure for memory devices that can be used to implement a non-volatile memory system for a SSD.
In one example implementation, the length of the plane in the x-direction, represents a direction in which signal paths for word lines extend (a word line or SGD line direction), and the width of the plane in the y-direction, represents a direction in which signal paths for bit lines extend (a bit line direction). The z-direction represents a height of the memory device.
Memory structure 126 may comprise one or more arrays of memory cells including a 3D array. The memory structure may comprise a monolithic three dimensional memory structure in which multiple memory levels are formed above (and not in) a single substrate, such as a wafer, with no intervening substrates. The memory structure may comprise any type of non-volatile memory that is monolithically formed in one or more physical levels of arrays of memory cells having an active area disposed above a silicon substrate. The memory structure may be in a non-volatile memory device having circuitry associated with the operation of the memory cells, whether the associated circuitry is above or within the substrate.
Control circuitry 110 cooperates with the read/write circuits 128 to perform memory operations (e.g., erase, program, read, and others) on memory structure 126, and includes a state machine 112, an on-chip address decoder 114, and a power control module 116. The state machine 112 provides chip-level control of memory operations. Code and parameter storage 113 may be provided for storing operational parameters and software. In one embodiment, state machine 112 is programmable by the software stored in code and parameter storage 113. In other embodiments, state machine 112 does not use software and is completely implemented in hardware (e.g., electrical circuits).
The on-chip address decoder 114 provides an address interface between addresses used by host 140 or memory controller 122 to the hardware address used by the decoders 124 and 132. Power control module 116 controls the power and voltages supplied to the word lines and bit lines during memory operations. It can include drivers for word line layers (discussed below) in a 3D configuration, select transistors (e.g., SGS and SGD transistors, described below) and source lines. Power control module 116 may include charge pumps for creating voltages. The sense blocks include bit line drivers. An SGS transistor is a select gate transistor at a source end of a NAND string, and an SGD transistor is a select gate transistor at a drain end of a NAND string.
Any one or any combination of control circuitry 110, state machine 112, decoders 114/124/132, code and parameter storage 113, power control module 116, sense blocks SB1, SB2, . . . , SBp, read/write circuits 128, and controller 122 can be considered one or more control circuits that performs the functions described herein.
Any one or any combination of control circuitry 110, state machine 112, decoders 114/124/132, code and parameter storage 113, power control module 116, sense blocks SB1, SB2, . . . , SBp, read/write circuits 128, and controller 122 can be considered peripheral circuits.
The (on-chip or off-chip) controller 122 may comprise a processor 122c, ROM 122a, RAM 122b and a Memory Interface 122d. The storage devices (ROM 122a, RAM 122b) comprises code such as a set of instructions, and the processor 122c is operable to execute the set of instructions to provide the functionality described herein. Alternatively or additionally, processor 122c can access code from a storage device in the memory structure, such as a reserved area of memory cells connected to one or more word lines. Memory interface 122d, in communication with ROM 122a, RAM 122b and processor 122c, is an electrical circuit that provides an electrical interface between controller 122 and memory die 108. For example, memory interface 122d can change the format or timing of signals, provide a buffer, isolate from surges, latch I/O, etc.
Multiple memory elements in memory structure 126 may be configured so that they are connected in series or so that each element is individually accessible. By way of non-limiting example, flash memory devices in a NAND configuration (NAND flash memory) typically contain memory elements connected in series. A NAND string is an example of a set of series-connected memory cells and select gate transistors.
A NAND flash memory array may be configured so that the array is composed of multiple NAND strings of which a NAND string is composed of multiple memory cells sharing a single bit line and accessed as a group. Alternatively, memory elements may be configured so that each element is individually accessible, e.g., a NOR memory array. NAND and NOR memory configurations are exemplary, and memory cells may be otherwise configured.
The memory cells may be arranged in the single memory device level in an ordered array, such as in a plurality of rows and/or columns. However, the memory elements may be arrayed in non-regular or non-orthogonal configurations, or in structures not considered arrays.
A three dimensional memory array is arranged so that memory cells occupy multiple planes or multiple memory device levels, thereby forming a structure in three dimensions (i.e., in the x, y and z directions, where the z direction is substantially perpendicular and the x and y directions are substantially parallel to the major surface of the substrate).
As a non-limiting example, a three dimensional memory structure may be vertically arranged as a stack of multiple two dimensional memory device levels. As another non-limiting example, a three dimensional memory array may be arranged as multiple vertical columns (e.g., columns extending substantially perpendicular to the major surface of the substrate, i.e., in the y direction) with each column having multiple memory cells. The vertical columns may be arranged in a two dimensional configuration, e.g., in an x-y plane, resulting in a three dimensional arrangement of memory cells, with memory cells on multiple vertically stacked memory planes. Other configurations of memory elements in three dimensions can also constitute a three dimensional memory array.
By way of non-limiting example, in a three dimensional NAND memory array, the memory elements may be coupled together to form a vertical NAND string that traverses across multiple horizontal memory device levels. Other three dimensional configurations can be envisioned wherein some NAND strings contain memory elements in a single memory level while other strings contain memory elements which span through multiple memory levels. Three dimensional memory arrays may also be designed in a NOR configuration and in a ReRAM configuration.
A person of ordinary skill in the art will recognize that the technology described herein is not limited to a single specific memory structure, but covers many relevant memory structures within the spirit and scope of the technology as described herein and as understood by one of ordinary skill in the art.
The interface between controller 122 and non-volatile memory die 104 may be any suitable flash interface, such as Toggle Mode 200, 400, or 800. In one embodiment, memory system 100 may be a card based system, such as a secure digital (SD) or a micro secure digital (micro-SD) card. In an alternate embodiment, memory system 100 may be part of an embedded memory system. For example, the flash memory may be embedded within the host, such as in the form of a solid state disk (SSD) drive installed in a personal computer.
In some embodiments, non-volatile memory system 100 includes a single channel between controller 122 and non-volatile memory die 108, the subject matter described herein is not limited to having a single memory channel. For example, in some memory system architectures, 2, 4, 8 or more channels may exist between the controller and the memory die, depending on controller capabilities. In any of the embodiments described herein, more than a single channel may exist between the controller and the memory die, even if a single channel is shown in the drawings.
As depicted in
The components of controller 122 depicted in
Controller 122 may include recondition circuitry 212, which is used for reconditioning memory cells or blocks of memory. The reconditioning may include refreshing data in its current location or reprogramming data into a new word line or block as part of performing word line maintenance, as described herein.
Referring again to modules of the controller 122, a buffer manager/bus controller 214 manages buffers in random access memory (RAM) 216 and controls the internal bus arbitration of controller 122. A read only memory (ROM) 218 stores system boot code. Although illustrated in
Front end module 208 includes a host interface 220 and a physical layer interface (PHY) 222 that provide the electrical interface with the host or next level storage controller. The choice of the type of host interface 220 can depend on the type of memory being used. Examples of host interfaces 220 include, but are not limited to, SATA, SATA Express, SAS, Fibre Channel, USB, PCIe, and NVMe. The host interface 220 typically facilitates transfer for data, control signals, and timing signals.
Back end module 210 includes an error correction controller (ECC) engine 224 that encodes the data bytes received from the host, and decodes and error corrects the data bytes read from the non-volatile memory. A command sequencer 226 generates command sequences, such as program and erase command sequences, to be transmitted to non-volatile memory die 108. A RAID (Redundant Array of Independent Drives) module 228 manages generation of RAID parity and recovery of failed data. The RAID parity may be used as an additional level of integrity protection for the data being written into the non-volatile memory system 100. In some cases, the RAID module 228 may be a part of the ECC engine 224. A memory interface 230 provides the command sequences to non-volatile memory die 108 and receives status information from non-volatile memory die 108. In one embodiment, memory interface 230 may be a double data rate (DDR) interface, such as a Toggle Mode 200, 400, or 800 interface. A flash control layer 232 controls the overall operation of back end module 210.
Additional components of system 100 illustrated in
The FTL or MML 238 may be integrated as part of the flash management that may handle flash errors and interfacing with the host. In particular, MML may be a module in flash management and may be responsible for the internals of NAND management. In particular, the MML 238 may include an algorithm in the memory device firmware which translates writes from the host into writes to the flash memory 126 of die 108. The MML 238 may be needed because: 1) the flash memory may have limited endurance; 2) the flash memory 126 may only be written in multiples of pages; and/or 3) the flash memory 126 may not be written unless it is erased as a block. The MML 238 understands these potential limitations of the flash memory 126 which may not be visible to the host. Accordingly, the MML 238 attempts to translate the writes from host into writes into the flash memory 126. As described below, erratic bits may be identified and recorded using the MML 238. This recording of erratic bits can be used for evaluating the health of blocks and/or word lines (the memory cells on the word lines).
The block depicted in
Although
For ease of reference, drain side select layers SGD0, SGD1, SGD2 and SGD3; source side select layers SGS0, SGS1, SGS2 and SGS3; dummy word line layers DWLL1a, DWLL1b, DWLL2a and DWLL2b; and word line layers WLL0-WLL47 collectively are referred to as the conductive layers. In one embodiment, the conductive layers are made from a combination of TiN and Tungsten. In other embodiments, other materials can be used to form the conductive layers, such as doped polysilicon, metal such as Tungsten or metal silicide. In some embodiments, different conductive layers can be formed from different materials. Between conductive layers are dielectric layers DLO-DL59. For example, dielectric layers DL49 is above word line layer WLL43 and below word line layer WLL44. In one embodiment, the dielectric layers are made from SiO2. In other embodiments, other dielectric materials can be used to form the dielectric layers.
The non-volatile memory cells are formed along vertical columns which extend through alternating conductive and dielectric layers in the stack. In one embodiment, the memory cells are arranged in NAND strings. The word line layer WLL0-WLL47 connect to memory cells (also called data memory cells). Dummy word line layers DWLL1a, DWLL1b, DWLL2a and DWLL2b connect to dummy memory cells. A dummy memory cell does not store user data, while a data memory cell is eligible to store user data. Drain side select layers SGD0, SGD1, SGD2 and SGD3 are used to electrically connect and disconnect NAND strings from bit lines. Source side select layers SGS0, SGS1, SGS2 and SGS3 are used to electrically connect and disconnect NAND strings from the source line SL.
Drain side select gate layer SGD0 (the top layer) is also divided into regions 420, 430, 440 and 450, also known as fingers or select line fingers. In one embodiment, the four select line fingers on a same level are connected together. In another embodiment, each select line finger operates as a separate word line.
When a memory cell is programmed, electrons are stored in a portion of the charge trapping layer 473 which is associated with the memory cell. These electrons are drawn into the charge trapping layer 473 from the channel 471, through the tunneling dielectric 472, in response to an appropriate voltage on word line region 476. The threshold voltage (Vth) of a memory cell is increased in proportion to the amount of stored charge. In one embodiment, the programming is achieved through Fowler-Nordheim tunneling of the electrons into the charge trapping layer. During an erase operation, the electrons return to the channel or holes are injected into the charge trapping layer to recombine with electrons. In one embodiment, erasing is achieved using hole injection into the charge trapping layer via a physical mechanism such as gate induced drain leakage (GIDL).
Although the example memory system discussed above is a three dimensional memory structure that includes vertical NAND strings with charge-trapping material, other (2D and 3D) memory structures can also be used with the technology described herein. For example, floating gate memories (e.g., NAND-type and NOR-type flash memory), ReRAM cross-point memories, magnetoresistive memory (e.g., MRAM), and phase change memory (e.g., PCRAM) can also be used.
One example of a ReRAM cross point memory includes reversible resistance-switching elements arranged in cross point arrays accessed by X lines and Y lines (e.g., word lines and bit lines). In another embodiment, the memory cells may include conductive bridge memory elements. A conductive bridge memory element may also be referred to as a programmable metallization cell. A conductive bridge memory element may be used as a state change element based on the physical relocation of ions within a solid electrolyte. In some cases, a conductive bridge memory element may include two solid metal electrodes, one relatively inert (e.g., tungsten) and the other electrochemically active (e.g., silver or copper), with a thin film of the solid electrolyte between the two electrodes. As temperature increases, the mobility of the ions also increases causing the programming threshold for the conductive bridge memory cell to decrease. Thus, the conductive bridge memory element may have a wide range of programming thresholds over temperature.
Magnetoresistive memory (MRAM) stores data by magnetic storage elements. The elements are formed from two ferromagnetic plates, each of which can hold a magnetization, separated by a thin insulating layer. One of the two plates is a permanent magnet set to a particular polarity; the other plate's magnetization can be changed to match that of an external field to store memory. This configuration is known as a spin valve and is the simplest structure for an MRAM bit. A memory device is built from a grid of such memory cells. In one embodiment for programming, each memory cell lies between a pair of write lines arranged at right angles to each other, parallel to the cell, one above and one below the cell. When current is passed through them, an induced magnetic field is created.
Phase change memory (PCRAM) exploits the unique behavior of chalcogenide glass. One embodiment uses a GeTe-Sb2Te3 super lattice to achieve non-thermal phase changes by simply changing the co-ordination state of the Germanium atoms with a laser pulse (or light pulse from another source). Therefore, the doses of programming are laser pulses. The memory cells can be inhibited by blocking the memory cells from receiving the light. Note that the use of “pulse” in this document does not require a square pulse, but includes a (continuous or non-continuous) vibration or burst of sound, current, voltage light, or other wave.
The above-described technologies can be used to implement memory circuits to be used in an SSD. The memory circuits, peripheral circuits and/or separate controllers (optional) are manufactured on semiconductor wafers, and these wafers are then used to make an SSD. Technology is proposed for an SSD with a package optimized for the semiconductor wafers. In one embodiment, an SSD is configured by thinning a plurality of wafers, stacking the undiced wafers, and connecting the wafers by TSV (through silicon via). By “undiced” it is meant that the wafers have not been cut into separate dies.
Step 1: Wafers in which flash memories are formed are prepared.
Step 2: The wafers are thinned. The wafers are not diced (not cut into separate dies) and the wafers are stacked on top of each other, and adhered to each other. (See
Step 3: The wafers are connected to each other by TSV (through silicon via) (See
Step 4: Testing process is performed on the stacked wafers. Memory blocks that are defective and/or of low performance are detected. The SSD is configured such that the detected defective or low performance memory blocks will not be used.
Step 5: The stacked wafers are housed in a Casing that comprises a power supply circuit connected to a Power Supply Line, communication interface connected to a Connection Line, substrate and etc. The casing may have a cylindrical shape (e.g., the same cylindrical shape as the Stacked Wafers) of which diameter is approximately 300 mm (or a different size). (See
Traditionally. there has been a need for making the shape of an SSD the same as the shape of a HDD. However, in future, the demand for the SSD itself other than as a replacement for HDDs will be increased. Then, the shape of SSD casing does not need to be matched to the shape of the HDD casing. Therefore, it becomes possible to adopt the casing housing which is optimized with respect to the wafers as in the proposed technology (e.g., cylindrical shape with diameter of approximately 300 mm). The proposed technology is advantageous in a market which does not demand the downsizing of SSD, especially a data center.
There are many advantages to this technology. First, the stacked wafers do not need to be diced into chips. Therefore, it is possible to allow for lower price of SSD because:
(1) Areas required for dicing (e.g., dicing lines) will no longer be required. Effective use of wafer area can be achieved.
(2) Post-processes can be simplified, for example, dicing of chips, testing of chips, and soldering chips to board are no longer performed.
Additionally, the concept of “chips” can be eliminated. Therefore, the degree of freedom for reticle layout can be drastically improved. As a result, the effective area of wafer can be increased. Details will be described herein below.
On the other hand,
In a current exposure machine, a region of “26 mm*33 mm” is a maximum region that can be exposed with one shot. In the proposed technology, it becomes possible to repeat the exposure at this maximum size.
On the other hand,
A peripheral circuit and memory cells can be on separate wafers. A plurality of 1st wafers comprising only memory cells is stacked on top of each other. As depicted in
Several kinds of problems undesirably occur if memory cells and a peripheral circuit are formed on a same wafer (e.g., thermal budget of the memory cells gives negative effect on the peripheral circuit). In one embodiment, this problem can be resolved by locating them on different wafers positioned in a common housing.
The above-described technology for staking semiconductor wafers includes a challenge with respect to heat dissipation. For example, the ability to dissipate heat may be low due to the wafers being stacked. Therefore, a structure for cooling is proposed so that data retention lifetime is elongated.
Each one of the assemblies 705 include a plurality of units 704 that are stacked vertically on top of each other. Each unit 704 includes a group of stacked semiconductor wafers. Therefore, one example SSD 701 includes multiple assemblies 705 and each assembly 705 includes multiple groups 704 of stacked semiconductor wafers. In one embodiment, each unit 704 includes a group of 10-20 stacked semiconductor wafers. In other embodiments, different numbers of stacked wafers can be implemented. In one embodiment, units 704 are the stacked wafers described above with respect to
In one embodiment, bumps 710 are made of a metal with a high heat conductivity. This will improve the heat sink function that is inherent in the bumps. Further, a height H1 of the bumps 710 can be freely set. Since a cross sectional height of the ventilation holes 707 (ventilation paths) is greater than the height H1 of the bumps, better cooling of the assemblies 705 can be realized.
The number of semiconductor wafers 711 that comprises only memory cells can be suitably determined depending on the required capacity. One example is the situation where one semiconductor wafer has a memory volume of 65 TB and an SSD of 128 PB is created. If one unit 704 is formed of twenty semiconductor wafers 711 that comprises only memory cells, that makes 1.28 PB (per unit). If one assembly 705 is formed of ten units 704, that makes 12.8 PB (per assembly). If one SSD is formed of 10 assemblies 705, a large capacity of 128 PB can be realized
In one embodiment, the inside of the casing 702 can be cooled by various methods. The following variations are examples: (1) the casing 702 is sealed and helium gas is filled therein; (2) an air conditioner is mounted in the casing of SSD 701 and cooled air is supplied from the blow fan 703; and/or (3) one or more cooling pipes are inserted in the intermediate layers for transmitting fluids in the ventilation paths so that the assemblies 705 can be water-cooled (or cooled by a different liquid).
One embodiment includes a non-volatile storage system, comprising: a plurality of stacked undiced wafers that each include non-volatile memory cells; and one or more control circuits connected to the memory cells.
One embodiment includes an apparatus, comprising: a plurality of semiconductor wafers connected to each other by through silicon vias, a subset of the wafers include memory circuits, one wafer not in the subset includes peripheral circuits.
One embodiment includes a method, comprising: preparing semiconductor wafers in which flash memory cells are formed; thinning the wafers, the wafers are not diced; stacking the wafers on top of each other and adhering the wafers to each other; connecting the wafers to each other by through silicon vias; and performing a testing process on the stacked wafers.
One embodiment includes a non-volatile storage system, comprising: multiple groups of stacked semiconductor wafers, the groups of stacked semiconductor wafers are stacked with ventilation paths between groups, within each group the semiconductor wafers are electrically connected to adjacent semiconductor wafers by through silicon vias, at least a subset of the semiconductor wafers include non-volatile memory cells.
One embodiment includes a non-volatile storage system, comprising: a plurality of semiconductor wafers that include non-volatile memory cells, the plurality of semiconductor wafers are arranged in stacks; and means for cooling the plurality of semiconductor wafers.
For purposes of this document, reference in the specification to “an embodiment,” “one embodiment,” “some embodiments,” or “another embodiment” may be used to describe different embodiments or the same embodiment.
For purposes of this document, a connection may be a direct connection or an indirect connection (e.g., via one or more others parts). In some cases, when an element is referred to as being connected or coupled to another element, the element may be directly connected to the other element or indirectly connected to the other element via intervening elements. When an element is referred to as being directly connected to another element, then there are no intervening elements between the element and the other element. Two devices are “in communication” if they are directly or indirectly connected so that they can communicate electronic signals between them.
For purposes of this document, the term “based on” may be read as “based at least in part on.”
For purposes of this document, without additional context, use of numerical terms such as a “first” object, a “second” object, and a “third” object may not imply an ordering of objects, but may instead be used for identification purposes to identify different objects.
For purposes of this document, the term “set” of objects may refer to a “set” of one or more of the objects.
The foregoing detailed description has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. The described embodiments were chosen in order to best explain the principles of the proposed technology and its practical application, to thereby enable others skilled in the art to best utilize it in various embodiments and with various modifications as are suited to the particular use contemplated. It is intended that the scope be defined by the claims appended hereto.
The application claims the benefit of U.S. Provisional Application 62/387,414, filed on Dec. 23, 3015.
Number | Date | Country | |
---|---|---|---|
62387414 | Dec 2015 | US |