The present invention relates generally to semiconductor devices and more particularly to memory devices and methods of manufacturing asymmetric memory cells using a single halo implant selective to the drain side to reduce the number of masks required and to improve programming performance in embedded memory technologies.
A conventional MOS transistor generally includes a semiconductor substrate, such as silicon, having a source, a drain, and a channel positioned between the source and drain. A gate stack composed of a conductive material (a gate conductor), an oxide layer (a gate oxide), and sidewall spacers, is typically located above the channel. The gate oxide is typically located directly above the channel, while the gate conductor, generally comprised of polycrystalline silicon (poly or polysilicon) material, is located above the gate oxide. The offset and sidewall spacers protect the sidewalls of the gate conductor.
Memory devices, for example, electrically erasable, programmable read only memory (EEPROM), and FLASH or EEPROM memory use a mixture of such MOS transistors on an integrated circuit chip which may include low voltage CMOS (LVCMOS), high voltage CMOS (HVCMOS), and the memory array or FLASH/EEPROM memory array areas. Such MOS transistor areas are commonly embedded together utilizing embedded memory technologies to integrate these LVCMOS, HVCMOS, and FLASH/EEPROM memory cell arrays together. For example, an embedded memory device may include carefully tailored memory cell transistors utilized in the memory array (FLASH/EEPROM region) as well as general purpose CMOS logic transistors used in various peripheral control circuits (peripheral or logic region) which control access to the array and perform various other control or CPU functions.
Embedded FLASH/EEPROM arrays range from very large arrays, (e.g., utilizing kbits, Mbits), where the need for a smaller area may justify considerably greater process complexity, through the very small examples (e.g., several dozens of bits) where keeping the added process complexity to a minimum justifies a larger area for each bit. Thus, the very large memory arrays generally utilize greater process complexity such as more masking steps and layers to obtain smaller cells, while smaller array application may utilize less process complexity such as single level poly (SLP) to save process costs utilizing larger area cells.
For example, It is common to add as many a six photo mask operations and associated processing steps in an effort to achieve a high density of memory bits when embedding FLASH memory to digital CMOS. Some of these process operations that are used to fabricate higher density FLASH/EEPROM (e.g., stack etch, where multiple layers are etched or patterned using a single masking layer, or additional layers of poly-crystalline silicon and multiple thicknesses of gate or tunnel oxide) are operations that are not normally required to build designs without FLASH/EEPROM. These process operations add cost and may require specialized equipment and skills.
Thus, there is significant motivation to minimize the number of masks required to accomplish the high levels of integration utilized in embedded FLASH memory devices, particularly at the higher end of the spectrum of process costs in higher density embedded FLASH.
The basic FLASH/EEPROM storage element is called a floating gate transistor. One prior-art floating-gate transistor has a source S, a polysilicon floating gate (FG) storage node with no connection permitted, and a drain (D). The gate is said to float without any direct electrical contact, embedded within a high-quality insulator, with the floating gate capacitively charge coupled through an electrically isolated control gate (CG). A charge placed on such a floating gate typically represents a data state or bit of data, and may be retained for about a decade or more.
FLASH/EEPROM memories using Fowler-Nordheim (F-N) tunneling are often programmed by applying a relatively high voltage level of from about 5 volts to about 30 volts across a tunneling region (e.g., a tunnel oxide or gate oxide) for a controlled period of time. Typical tunnel oxide thicknesses range from about 50 Angstroms to about 200 Angstroms. The silicon under at least part of the tunnel oxide area is doped sufficiently to avoid excessive depletion when programming voltages are applied. Electrons are placed on the floating gate storage node as charge flows through the gate oxide or tunnel oxide, reducing the electric field as the current falls towards zero. A reversed polarity results in reverse charge flow, providing the ability for a large but finite number of write/erase cycles.
FLASH/EEPROM memory is typically arranged as a matrix of memory cells fabricated in an integrated circuit chip, and address decoding in the chip allows access to each cell for read/write functions. These FLASH memory cells are often arranged in rows so that blocks of data such as words or bytes can be written or read simultaneously. FLASH EPROM and EEPROM memory cells have many variations.
FLASH memory cells of arrays may also be formed as MOS transistors having asymmetric source and drain regions to provide better programming performance than symmetrically formed structures. One prior art provides an asymmetric structure using birds beak like structures underlying the edge of the gate in the channel region. Another prior art forms an asymmetric structure using offset and varied thickness gate oxides. However, such asymmetric flash cell structures having asymmetric physical geometries may require careful alignment tolerances and/or one or more associated masks which may result in higher manufacturing costs.
Accordingly, there is a need for an improved memory cell of an embedded memory device and method that effectively provides asymmetric source and drain formation while reducing the number of masks required and the probability of damage defects during implantation in order to improve programming performance in embedded memory technologies.
The following presents a simplified summary in order to provide a basic understanding of one or more aspects of the invention. This summary is not an extensive overview of the invention, and is neither intended to identify key or critical elements of the invention, nor to delineate the scope thereof. Rather, the primary purpose of the summary is to present some concepts of the invention in a simplified form as a prelude to the more detailed description that is presented later.
The present invention relates to a device and method of fabrication, wherein the MOS transistors, for example, generally NMOS transistors of an FLASH memory device having FLASH array, low voltage CMOS (LVCMOS) regions, and/or high voltage CMOS (HVCMOS) (e.g., logic control, peripheral, or processing portions) exhibit improved programming performance (e.g., shorter programming time) due to the asymmetric formation of source and drain regions in the (e.g., NMOS) FLASH region. Conventionally, this asymmetry may be accomplished with an additional mask and implants.
The present invention, however, beneficially accomplishes this goal by utilizing a single halo implant selectively implanted from the drain side of the MOS transistors without any EE mask other than the gate stack. Consequently, less dopant is implanted on the source side, thereby allowing a greater dopant concentration to remain on the source side of the gate and to a greater depth. Therefore, one masking process is avoided, one halo implant is eliminated (along with the two 90° wafer rotations), and the additional damage defects introduced by implanting the source are also avoided. The asymmetric FLASH structure has been shown to provide a higher electrical field near the drain side for the same bias levels, which produces a higher hot electron injection current to the gate and a shorter programming time than symmetric structures.
In accordance with one aspect of the present invention, an embedded memory device having a LVCMOS or logic region and a FLASH or EEPROM region comprises one or more CMOS logic transistors within the LVCMOS region of the memory device, the logic transistors having a semiconductor body of a first conductivity type, source and drain regions of a second conductivity type formed in the semiconductor body on opposing sides of a channel. The device also has substantially symmetric halo regions implanted using the first conductivity type at an angle underlying a channel into both of the source and drain regions, the halos implanted from both source and drain region sides of the channel.
The memory device further comprises one or more asymmetric FLASH/EEPROM memory cell transistors within the FLASH/EEPROM region of the device, the cell transistors having a semiconductor body of the first conductivity type and source and drain regions of the second conductivity type formed in the semiconductor body on opposing sides of a channel. The device also comprises asymmetric halo regions implanted using the first conductivity type at an angle underlying the channel of the memory cell transistors, selectively implanted only from the drain side of the channel and not from the source side of the channel, wherein the halo formed on the drain side is substantially larger than the halo formed on the source side, and wherein a greater dopant concentration is provided on the source side than on the drain side.
In one aspect of the invention, the FLASH memory cell transistors of the embedded memory device have a semiconductor body of the first conductivity type comprising NMOS transistors of a 1T FLASH/EEPROM cell.
In accordance with a method of the present invention, asymmetric FLASH memory cell transistors are formed in an embedded FLASH memory device. The method comprises masking over the FLASH/EEPROM array region; halo implanting a first conductivity (e.g., p-type) dopant in NMOS regions of the LVCMOS or logic region in first and second implant directions; masking over the logic region; halo implanting the first conductivity dopant in NMOS regions of the FLASH/EEPROM region in the second implant direction only, thereby reducing the number of masks required; masking over the FLASH/EEPROM region; halo implanting a second conductivity dopant (e.g., n-type) in PMOS regions of the logic region in the first and second implant directions. Thereafter, conventional back-end processes may be utilized.
In accordance with another aspect of the present invention, a method of asymmetrically forming FLASH memory cell transistors is provided, wherein a gate structure is formed over a semiconductor body, thereby defining a channel region therebelow in the semiconductor body. The method further comprises masking over the FLASH/EEPROM array region; halo implanting a first conductivity (e.g., p-type) dopant in NMOS regions of the logic region in first and second implant directions; masking over the logic region; halo implanting the first conductivity dopant in NMOS regions of the FLASH/EEPROM region in the second implant direction only, thereby reducing the number of masks required; masking over the FLASH/EEPROM region; halo implanting a second conductivity dopant (e.g., n-type) in PMOS regions of the logic region in the first and second implant directions. Thereafter, conventional back-end processes may be utilized.
In another aspect, the first conductivity type is p-type and the second conductivity type is n-type, and wherein the halo formed asymmetrically on the source side of the memory cell transistors is laterally displaced away from the channel further than the halo formed on the drain side as a result of the single drain side halo implant.
In yet another aspect, the halo implanting in the first implant direction comprises implanting at an angle underlying the channel from the source region side of the channel, and the halo implanting in the second implant direction comprises implanting at an angle underlying the channel from the drain region side of the channel.
The following description and annexed drawings set forth in detail certain illustrative aspects and implementations of the invention. These are indicative of but a few of the various ways in which the principles of the invention may be employed.
One or more implementations of the present invention will now be described with reference to the attached drawings, wherein like reference numerals are used to refer to like elements throughout, and wherein the illustrated structures are not necessarily drawn to scale. The invention provides an asymmetric MOS transistor structure for FLASH/EEPROM memory devices and methods in which cell programming times are improved while minimizing defects and eliminating a masking operation, by using a single halo implant selective to the drain region of the cell transistors in the FLASH/EEPROM region of embedded memory devices. These concepts and benefits are further revealed in association with the following exemplary figures and discussions.
In addition, a regular or “normal implant” 5 may also be utilized to implant dopants, wherein the beam impacts the wafer at an angle normal or perpendicular to the surface of the wafer 8. For example, normal implants 5 as well as angled implants 1, 2, 3, 4 to the semiconductor substrate of wafer 8 may be utilized to form low density drain LDD regions therein alongside gate structures for example. The LDD regions are formed by normal 5 or angled implantations 1, 2, 3, 4 using appropriate dopant species atoms through openings in the mask or resist layer. The LDD implant is typically implanted substantially perpendicular to the surface of the substrate or wafer 8 (at a 0° angle), so an unrestricted LDD region width is available as an opening for the LDD implantation.
In some semiconductor cells, all features (e.g., active regions or gates) are oriented in one direction. In such cases, a pair of pocket or halo implants may be used to introduce dopants into both sides of these active regions or under both sides of the gate in the channel region. These implants may be referred to as “two rotations”, from the use of two 90° rotations of the wafer mounting disc in an ion implanter. The features of other semiconductor cells may have multiple orientations, wherein four pocket rotations are typically used to implant both sides of all features substantially equally. Pocket shadowing from the edge of the mask or the gate structure of a cell may occur from any direction, including the directions parallel to, or perpendicular to these features. Generally speaking, such pocket shadowing has produced detrimental results in MOS transistors, however, the shadowing from the gate structure is advantageously used in association with the asymmetrically formed cell transistors of the present invention.
Referring now to
An individual FLASH cell is selected via the word line and a pair of bit lines bounding the associated cell. For example, in reading the FLASH cell 20a, a conduction path would be established when a positive voltage is applied to the bit line (BL0) 25 coupled to the drain of FLASH cell 20a, and the source 30 which is coupled to the bit line (BL1) 26, is selectively coupled to ground (VSS). Thus, a virtual ground is formed by selectively switching to ground the bit line associated with the source terminal of only those selected FLASH cells which are to be programmed or read.
Offset spacers 37 may then be formed over the lateral sidewalls of the gate stack 22 to help direct or align an LDD implant region 39 to the lateral edges of the offset spacers 37. Halo implants 1 and 2 from and corresponding with both the source 30 and drain 31 sides, respectively, of the cell 20 are implanted at an angle under the channel region to form symmetric source and drain halos 40 and 41, respectively. Thereafter, sidewall spacers 38 may then be formed over the lateral sidewalls of the offset spacers 37 to direct the deep source/drain implant.
Conventional FLASH cell 20 is formed symmetrically, for example, in one or more FLASH/EEPROM regions 11, which are implanted by the two halo implants (e.g., 1 and 2), symmetrically from opposing directions (e.g., implant rotations 1 and 2, or 3 and 4), corresponding with both source and drain region sides of the cell 20. The symmetrical cell structure requires an EE mask pattern that provides an opening in the one or more FLASH/EEPROM regions 11, while covering the LVCMOS 12 and HVCMOS regions 13, for example.
To make the symmetric structure into the more advantageous asymmetric structure, however, an additional (second) EE mask would be needed to cover the source region 30 during halo implantations (e.g., rotations 1 and 2) in order to implant more dopant at the drain region 31. In particular, the FLASH/EE mask pattern provides an opening in the one or more FLASH/EEPROM regions 11, while covering the LVCMOS 12 and HVCMOS regions 13. This extra EE mask is not required, however, in the method of the present invention, which instead uses a single sided halo implant selective to the drain region.
The net dopant concentrations of
FLASH cell 50 similarly comprises a semiconductor body 32 (e.g., a p-type substrate material) having source 54 and drain 55 regions formed asymmetrically within the substrate 32 on opposing sides of the gate stack 52. The gate stack 52 comprises a gate oxide dielectric 33 formed overlying the substrate 32 in a channel region. The gate stack 52 further comprises a first polysilicon floating gate 34 formed over the gate oxide 33, an insulating material 35 formed over the floating gate 34, and a second polysilicon material 36 formed over the insulating material 35.
An offset spacer 37 and a sidewall spacer 38 may then be formed over the lateral sidewalls of the gate stack 52 to help direct or align an LDD implant region 53 to the lateral edges of the sidewall spacer 38. The halo implant (e.g., implant rotation 2), selectively implanted from only the drain 55 side of the cell 50, is implanted at an angle under the channel region to form asymmetric source halos 56 and drain halos 57, respectively.
FLASH cell 50 is asymmetrically formed in accordance with the method of the present invention, without using an extra (second) EE mask pattern, such as would be required to convert the symmetric cell 20 of
Similar to the net dopant concentrations of
Referring now to
While the exemplary method 70 is illustrated and described below as a series of acts or events, it will be appreciated that the present invention is not limited by the illustrated ordering of such acts or events. For example, some acts may occur in different orders and/or concurrently with other acts or events apart from those illustrated and/or described herein, in accordance with the invention. In addition, not all illustrated steps may be required to implement a methodology in accordance with the present invention. Furthermore, the methods according to the present invention may be implemented in association with the fabrication of MOS transistors, FLASH EEPROM cells, ICs and composite transistors illustrated and described herein, as well as in association with other cells, transistors, and structures not illustrated, including but not limited to NMOS and/or PMOS composite transistors formed in the memory region and the logic region of an embedded FLASH memory device.
Initially, before the method 70 begins transistor fabrication may be initiated by the formation of gate structures in peripheral or logic regions (e.g., logic region 14 comprising LVCMOS 12 and/or HVCMOS 13 regions of
Method 70 then begins at 71, wherein a mask is formed over the structures in memory array (FLASH/EEPROM) regions (e.g., 11 of
At 73 a mask is formed over the logic regions 14 (e.g., LVCMOS region 12 and/or HVCMOS region 13 of
At 75 a mask is formed over the memory array (FLASH/EEPROM) region (e.g., 11 of
For example, extension regions and source/drain regions may be subsequently formed in the active region of the silicon body. For example, lightly doped, medium doped or heavily doped extension region implants are performed in the NMOS and PMOS regions, or alternatively, the NMOS regions and PMOS regions may be implanted separately with differing dopants by mask off each region, respectively. Since the extension region implants are formed after the offset spacer, it is self-aligned with respect to the offset spacer, thereby placing both regions extremely close to the lateral edge of the gate structure within the semiconductor body. A thermal process such as a rapid thermal anneal may then be employed to activate the extension region dopants, which causes the extension regions to diffuse laterally underneath the offset spacer and slightly overlap the gate stack toward the channels.
Source/drain sidewall spacers may then also be formed on the gate structures. The sidewall spacers comprise an insulating material such as an oxide, a nitride or a combination of such layers. The spacers are formed by depositing a layer of such spacer material(s) over the device in a generally conformal manner, followed by an anisotropic etch thereof, thereby removing such spacer material from the top of the gate structure and from the moat or active area and leaving a region on the lateral edges of the gate structure, overlying the offset spacers. The sidewall spacers are substantially thicker than the offset spacers, thereby resulting in the subsequently formed source/drain regions to be offset from lateral edges of the gate structure. The source/drain regions are then formed by implantation, wherein a source/drain dopant is introduced into the exposed areas (top of gate electrode and active areas not covered by sidewall spacers). The source/drain regions are then completed with a thermal process to activate the dopant.
Other processing may then include silicide processing, wherein a metal layer is formed over the device, followed by a thermal process, wherein the metal and silicon interfaces react to form a silicide (on top of the gate and in the source/drain regions). Unreacted metal is then stripped away, and other back end processing such as interlayer dielectric and metallization layers are formed to conclude the device formation.
Metal contacts Met1110 and Met2120 are subsequently formed overlying and contacting the source S 54 and drain D 55 regions to provide separate bitline (BL) connections thereto. The isolation regions STI 150 between the source S 54 and drain D55 regions of neighboring memory cells and the extra set of metal bitline connections thereto, are only needed because the asymmetric doping differences between the source and drain regions require and produce different BL voltages on the respective source and drain terminals during memory operations.
Halo implant rotation 2 is shown on the right hand side of the array 100, illustrating the direction of the single halo implant selective to the drain side of the gate 130 or channel. Advantageously, this single-sided halo implant avoids the additional defects associated with implanting a second time on the source side of the gate/channel, avoids the second implant rotation and time required, and avoids the added mask and costs required of a mask produced asymmetry.
The array 15 of
The array 200 of
Thus, the resulting transistors (e.g., NMOS memory cell transistors 50) of the array 100 of
Accordingly, the systems and methods of the present invention provide a memory device, for example, wherein the NMOS transistors of an FLASH EEPROM memory device having memory (FLASH/EEPROM) and logic (e.g., LVCMOS and/or HVCMOS) regions, exhibit improved programming performance of embedded memory technology due to the application of a single halo implant selective to the drain side of the channel, and reduced defects associated with the avoidance of the halo implant from the source side of the channel in the FLASH/EEPROM region of the device.
Further, while the invention is generally described above with respect to NMOS transistor fabrication within the memory (array) region (e.g., 11 of
Although the invention has been shown and described with respect to one or more implementations, equivalent alterations and modifications will occur to others skilled in the art based upon a reading and understanding of this specification and the annexed drawings. The invention includes all such modifications and alterations and is limited only by the scope of the following claims. In addition, while a particular feature or aspect of the invention may have been disclosed with respect to only one of several implementations, such feature or aspect may be combined with one or more other features or aspects of the other implementations as may be desired and advantageous for any given or particular application. Furthermore, to the extent that the terms “includes”, “having”, “has”, “with”, or variants thereof are used in either the detailed description or the claims, such terms are intended to be inclusive in a manner similar to the term “comprising.” Also, the term “exemplary” is merely meant to mean an example, rather than the best. It is also to be appreciated that layers and/or elements depicted herein are illustrated with particular dimensions relative to one another (e.g., layer to layer dimensions and/or orientations) for purposes of simplicity and ease of understanding, and that actual dimensions of the elements may differ substantially from that illustrated herein. Additionally, the layers can be formed in any number of suitable ways, such as with spin-on techniques, sputtering techniques (e.g., magnetron or ion beam sputtering), (thermal) growth techniques and/or deposition techniques such as chemical vapor deposition (CVD), for example.
This is a division of application Ser. No. 11/443,779, filed May 31, 2006, the entire disclosure of which is hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 11443779 | May 2006 | US |
Child | 12407624 | US |