The present invention relates to low power programmable logic devices. Traditionally, integrated circuit (IC) devices such as custom, semi-custom, or application specific integrated circuit (ASIC) devices have been used in electronic products to reduce cost, enhance performance or meet space constraints. However, the design and fabrication of custom or semi-custom ICs can be time consuming and expensive. The customization involves a lengthy design cycle during the product definition phase and high Non Recurring Engineering (NRE) costs during manufacturing phase. In the event of finding a logic error in the custom or semi-custom IC during final test phase, the design and fabrication cycle has to be repeated. Such lengthy correction cycles further aggravate the time to market and engineering cost. As a result, ASICs serve only specific applications and are custom built for high volume and low cost.
Another type of semi custom device called a Gate Array customizes modular blocks at a reduced NRE cost by synthesizing the design using a software model similar to the ASIC. The missing silicon level design verification results in multiple spins and lengthy design iterations. Structured ASICs come under larger module Gate Arrays.
In recent years there has been a move away from custom or semi-custom ICs toward field programmable components whose function is determined not when the integrated circuit is fabricated, but by an end user “in the field” prior to use. Off the shelf, generic Programmable Logic Device (PLD) or Field Programmable Gate Array (FPGA) products greatly simplify the design cycle. These products offer user-friendly software to fit custom logic into the device through programmability, and the capability to tweak and optimize designs to improve silicon performance. The flexibility of this programmability is expensive in terms of silicon real estate, but reduces design cycle and upfront NRE cost to the designer.
FPGAs offer the advantages of low non-recurring engineering costs, fast turnaround (designs can be placed and routed on an FPGA in typically a few minutes), and low risk since designs can be easily amended late in the product design cycle. It is only for high volume production runs that there is a cost benefit in using the more traditional approaches. Compared to PLD and FPGA, an ASIC has hard-wired logic connections, identified during the chip design phase. ASIC has no multiple logic choices and no configuration memory to customize logic. This is a large chip area and cost saving for the ASIC. Smaller ASIC die sizes lead to better performance. A full custom ASIC also has customized logic functions which take less gate counts compared to PLD and FPGA configurations of the same functions. Thus, an ASIC is significantly smaller, faster, cheaper and more reliable than an equivalent gate-count PLD or FPGA. The trade-off is between time-to-market (PLD and FPGA advantage) versus low cost and better reliability (ASIC advantage). The cost of Silicon real estate for programmability provided by the PLD and FPGA compared to ASIC determines the extra cost the user has to bear for customer re-configurability of logic functions.
In an FPGA and PLD, a complex logic design is broken down to smaller logic blocks and shorter interconnect segments. Those elements are programmed into logic blocks and interconnect fabric provided in the FPGA. Logic elements offer sequential and combinational logic design implementations. Combinational logic has no memory and outputs reflect a function solely of present inputs. Sequential logic is implemented by inserting memory into the logic path to store past history. Existing PLD and FPGA logic elements include transistor pairs, NAND or OR type gates, multiplexers, look-up-tables (LUTs) and AND-OR type structures. In a PLD the basic logic element is labeled as macro-cell. Hereafter the terminology FPGA will include both FPGAs and PLDs, and the terminology logic element will include both logic elements and macro-cells. Granularity of a FPGA refers to logic content of a basic logic element. Smaller blocks of a complex logic design are customized to fit into FPGA grain. In fine-grain architectures, a small basic logic element is enclosed in a routing matrix and replicated. These offer easy logic fitting at the expense of complex routing. In course-grain architectures, many basic logic elements are combined with local routing and wrapped in a routing matrix to form a logic block. The logic block is then replicated with global routing. Larger logic blocks make the logic fitting difficult and the routing easier. A challenge for FPGA architectures is to provide easy logic fitting (like fine-grain) and maintain easy routing (like course-grain).
Inputs and outputs for the Logic Element or Logic Block is selected from the programmable interconnect fabric, also know as Routing Matrix. An exemplary routing matrix containing logic elements described in Ref-1 (Seals & Whapshott) is shown in
FPGA architectures are discussed in IDS references. Those disclose logic elements and specialized routing blocks to connect the logic elements in FPGA's. In all cases the routing block is programmed to define inputs and outputs for the logic blocks, while a logic block performs a specific logic function. Methods for implementing programmable point to point connections, synonymous with programmable switches, include fuses, antifuses, EPROM cells, EEPROM cells, Flash cells and SRAM cells. Fuse and anti-fuse are both one time programmable due to the non-reversible nature of the change. EPROM, EEPROM, Flash & SRAM incur non destructive changes. In pass-gate logic, the gate signal is generated by a configuration circuit that includes memory. The choice of memory varies from user to user. Anti-fuse and SRAM based architectures are widely used in commercial FPGA's, while EPROM, EEPROM, anti-fuse and fuse links are widely used in commercial PLD's. Volatile SRAM memory needs no high programming voltages, is freely available in every logic process, is compatible with standard CMOS SRAM memory, lends to process and voltage scaling and has become the de-facto choice for modern very large FPGA devices.
A volatile six transistor SRAM based configuration circuit is shown in
A programmable MUX utilizes a plurality of point to point switches. The 4:1 MUX in
FPGA and ASICs require buffers to improve signal propagation delay in long wires. This is shown in
A particular difficulty with volatile configuration memory based FPGA's is the power up sequence. When a system comprising an FPGA is booted, the FPGA is not configured. Certain power up protocols must be followed to ensure proper sequencing of different circuit blocks within the FPGA. First the FPGA needs to be powered up without configuration data. Second, the FPGA must generate a signal requesting configuration data when ready to receive such data. Third, the FPGA must generate a signal that it has received the proper configuration data and its ready for operation. Without configuration data, the volatile configuration memory elements may power up at any random data state—thus all logic elements and interconnects may interact in a harmful manner. The power up time penalty and extra circuit overhead (hence cost) penalty can be considerable to avoid contentious conditions. Another undesirable effect during power up is the power consumption during power-up. The buffer structures shown in
A standard prior art technique to reduce power-up transient current is to add extra power control circuitry as shown in
What is desirable is to have easy to use, efficient and inexpensive power management circuits and power up conditions for FPGAs. It is also beneficial to have the ability to shut off unused power consuming devices. Such circuits should have reasonable cost parity to ASICs, easy user interface, and also lend to FPGA to ASIC easy design conversions. Many users may desire identical timing ASICs to reduce cost, save power up time and power, and at the same time not having extra time and NRE cost for design conversion to ASIC. The users further seek to avoid using an external Boot ROM as the data stream from the Boot ROM can be easily stolen during power up phase.
In one aspect, a configuration circuit for a programmable logic device comprising a plurality of memory elements and a mode to force all control signals generated by the memory elements to a zero voltage state regardless of the stored memory data.
Implementations of the above aspect may include one or more of the following. A configuration circuit comprises a memory element to generate a control signal. The control signal is coupled to a programmable element to configure the element. Most common programmable element is a pass-gate. A pass-gate is an NMOS transistor, a PMOS transistor or a CMOS transistor pair that can electrically connects two points. The gate electrode signal on these pass-gates allows a programmable method of controlling an on and off connection. Other programmable logic elements may be inverters, multiplexes and buffers. The logic device comprises circuits consisting of CMOS transistors that include AND, NAND, OR, NOR and pass-gate type logic structures. Multiple logic circuits are combined into larger logic blocks. Configuration circuits are used to change programmable logic functionality. Configuration circuits have memory elements and access circuitry to change memory data. Each memory element can be a transistor or a diode or a group of electronic devices. The memory elements can be made of CMOS devices, capacitors, diodes on a substrate. The memory elements can be made of thin film devices such as thin film transistors (TFT), capacitors and diodes. The memory element can be selected from the group consisting of volatile and non volatile memory elements. The memory element can also be selected from the group of fuses, antifuses, SRAM cells, DRAM cells, optical cells, metal optional links, EPROMs, EEPROMs, flash, magnetic and ferro-electric elements. One or more redundant memory elements can be provided for controlling the same circuit block. The memory element can generate an output signal to control logic gates. Memory element can generate a signal that is used to derive a control signal. The control signal is coupled to pass-gate logic element, AND array, NOR array, a MUX or a Look-Up-Table (LUT) logic. Configuration circuits take a large Silicon foot print, adding to cost of PLDs compared to a similar functionality ASICs. Reducing configuration circuit Silicon usage helps reduce programmable logic cost. A 3-dimensional integration of configuration circuits provides such a cost reduction. Logic circuits are fabricated using a basic logic process capable of making CMOS transistors. The pass-gates are formed on P-type, N-type, epi or SOI substrate wafers. The configuration circuits may be formed above the logic transistors. This may be achieved by inserting a thin-film transistor (TFT) module at contact layer of the logic process. The thin-film transistor outputs may be directly coupled to gate electrodes of pass-gates on substrate to provide logic control. Buried contacts may be used to make these connections. The contacts may be filled with doped poly-silicon, Titanium-Tungsten, Tungsten Silicide, or some other refractory metal. Memory elements may be constructed also with TFT transistors, capacitors and diodes. The TFT layers may be restricted for only configuration circuits, not used for logic signal lines. Metal layers above the TFT layers may be used for all the routing for the storage device, and configuration device. All signal paths may utilize wires and storage circuitry with no impact from TFT layers used for configuration circuits. This simple pass-gate switch with a vertically integrated configuration circuit reduces programmable logic cost. When the memory element is volatile, the configuration circuit has to be loaded from an external memory source called a Boot ROM. Until valid data is received the control signals have values that may be arbitrary. Such control signals may cause harmful behavior in the PLD due to contentions. A configuration circuit that guarantees a known control signal state prevents damage and reduces power consumption during power-up and configuration time.
In a second aspect, a multiplexer (MUX) for a programmable logic device, the MUX comprising: a plurality of inputs and an output; and a configuration circuit comprising a plurality of memory elements, each memory element generating a control signal, the configuration circuit comprising a first mode of operation to force each of said control signals to a first voltage level regardless of the memory state in the memory element; and a first device coupling a power supply voltage to the output, said first device having a gate electrode controlled by a said control signal of the configuration circuit; and one or more second devices coupling one or more inputs to the output, each said second device having a gate electrode controlled by a said control signal of the configuration circuit; wherein, the first device is in a conducting state to couple the power supply voltage to the MUX output during the first mode of operation of the configuration circuit.
Programmable logic devices use extensive multiplexing techniques to provide programmable choices. A plurality of wires can couple to a single wire through a MUX. The MUX is programmed by a configuration circuit. When the configuration data is missing, the MUX inputs are undefined. It is desirable to have a known input condition even when the configuration data is missing, or is incorrect. A configuration circuit that can guarantee a known zero control signal data state during a specific mode can be used to force a known condition to programmable MUX inputs. A PMOS device coupled to power-supply will conduct when the control gate is at zero volts, has the input of a MUX with a PMOS pass-gate will be forced to Vcc voltage. When the configuration circuit has this mode, the PMOS gate can either stay at zero, or programmed to one state to shut-off the PMOS. Thus a MUX structure can always posses a known data state at the input, avoiding a floating disastrous voltage level.
A buffer circuit of a programmable logic device, the circuit comprising: a buffer having a plurality of inverters coupled in series, the first inverter in the series having a buffer input; and a programmable multiplexer (MUX) comprising: a plurality of MUX inputs and a MUX output coupled to the buffer input; and a configuration circuit comprising a plurality of memory elements, each memory element generating a control signal, the configuration circuit adapted to force each of said control signals to a first voltage level regardless of the memory state in the memory element; and a first MUX device coupling a power supply voltage to the MUX output, said first device having a gate electrode controlled by a said control signal of the configuration circuit; and one or more second MUX devices coupling one or more MUX inputs to the MUX output, each said second device having a gate electrode controlled by a said control signal of the configuration circuit; wherein, the first MUX device is in a conducting state to couple the power supply voltage to the buffer input during power up and configuration of memory elements in the configuration circuit.
Implementations of the above aspect may include one or more of the following. A buffer is a circuit to amplify a signal. A simple buffer is two or three inverters in series, sized sequentially larger to generate a high drive current. Buffer output polarity can be the same or opposite of that at input. Odd and even number of stages determine the polarity. A buffer is used to transmit signals in long wires, or wires comprising a high capacitive load. A wire carries data in one direction, from a source to a sink node. A programmable wire allows the direction of data flow to be chosen by the user. A programmable buffer allows bi-directional data flow to facilitate long wire connections in programmable logic. Application specific logic also utilizes buffers to amplify signals. Buffers cause significant power surges during power up. The input conditions to buffers must be well defined. FPGA's have nondeterministic input conditions in buffers during power up and configuration. A configuration circuit with a defined mode to force control signals to zero state can aid in turning buffers off. PMOS pass-gate coupled to buffer input allows the buffer input to couple to Vcc during the special mode of configuration circuit, thus ensuring no crowbar currents in the buffer, and to prevent back-to-back bi-directional buffers entering into a ring-oscillation.
In a third aspect, a programmable logic device comprising: a configuration circuit having a plurality of memory elements, each memory element generating a control signal, the configuration circuit comprising a first mode to force all said control signals to ground voltage level; and a programmable logic circuit comprising a plurality of programmable elements, each said element coupled to one or more of said control signals; and a programmable buffer having an input coupled by a PMOS pass-gate to a power supply voltage, the gate electrode of said PMOS pass-gate coupled to a said control signal; wherein the configuration circuit is held in first mode to power up and configure the programmable logic device. In a fourth aspect, a power up sequence of a programmable logic device comprises forcing a configuration circuit into a mode wherein all control signals generated by memory elements in the circuit are forced to zero voltage until the configuration circuit is configured.
Implementations of the above aspect may further include one or more of the following. The memory element can be selected from the group of volatile or non-volatile memory elements. Volatile memory needs to be loaded at power up. The memory can be implemented using a TFT process technology that contains one or more of Fuses, Anti-fuses, DRAM, EPROM, EEPROM, Flash, Ferro-Electric, optical, magnetic and SRAM elements. Configuration circuits may include thin film elements such as diodes, transistors, resistors and capacitor. The process implementation is possible with any memory technology where the programmable element is vertically integrated in a removable module. The configuration circuit includes a predetermined conductive pattern in lieu of memory elements to control the programmable logic in the storage circuits. Multiple memory bits exist to customize logic functions. Each memory bit pattern has a corresponding unique conductive pattern to duplicate the same customized logic function. Circuit performance of the logic function is not affected by the choice of logic control: memory elements or conductive pattern. Forcing control signals to zero state regardless of memory values allow the logic circuits to power up in known states to prevent logic contention and high power consumption.
Implementations of the above aspects may include one or more of the following. The novel circuits and power up sequence constitutes fabricating a power managed VLSI IC product. The IC product is re-programmable in its initial stage with turnkey conversion to an ASIC. The IC has the end ASIC cost structure and FPGA re-programmability with superior power management. The IC product offering occurs in two phases: the first stage is a generic FPGA that has re-programmability containing a programmable module, and the second stage is an ASIC with the entire programmable module replaced by a customized metal mask.
A series product families can be provided with a modularized programmable element in an FPGA version followed by a turnkey custom ASIC with the same base die with 1-2 custom masks. The vertically integrated programmable module does not consume valuable silicon real estate of a base die. Furthermore, the design and layout of these product families adhere to removable module concept: ensuring the functionality and timing of the product in its FPGA and ASIC canonicals. These IC products can replace existing PLD and FPGA products and compete with existing Gate Arrays and ASIC's in cost and performance. An easy turnkey customization of an ASIC from an original smaller cheaper and faster PLD or FPGA would greatly enhance time to market, performance, and product reliability.
In the following detailed description of the invention, reference is made to the accompanying drawings which form a part hereof, and in which is shown, by way of illustration, specific embodiments in which the invention may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the invention. Other embodiments may be utilized and structural, logical, and electrical changes may be made without departing from the scope of the present invention.
Definitions: The terms wafer and substrate used in the following description include any structure having an exposed surface with which to form the integrated circuit (IC) structure of the invention. The term substrate is understood to include semiconductor wafers. The term substrate is also used to refer to semiconductor structures during processing, and may include other layers that have been fabricated thereupon. Both wafer and substrate include doped and undoped semiconductors, epitaxial semiconductor layers supported by a base semiconductor or insulator, SOI material as well as other semiconductor structures well known to one skilled in the art. The term conductor is understood to include semiconductors, and the term insulator is defined to include any material that is less electrically conductive than the materials referred to as conductors.
The term module layer includes a structure that is fabricated using a series of predetermined process steps. Wafer processing includes masking layers, deposition and etching techniques. The boundary of the structure is defined by a first step, one or more intermediate steps, and a final step. The resulting structure is formed on a substrate. A cross section of the fully fabricated device may be used to delineate the module layers.
The term pass-gate refers to a structure that can pass a signal when on, and blocks signal passage when off. A pass-gate connects two points when on, and disconnects two points when off. A pass-gate can be a floating-gate transistor, an NMOS transistor, a PMOS transistor or a CMOS transistor pair. The gate electrode of pass-gate determines the state of the connection. A CMOS pass-gate requires complementary signals coupled to NMOS and PMOS gate electrodes. A control logic signal is connected to gate electrode of a pass-gate for programmable logic.
The term configuration circuit includes one or more configurable elements and connections that can be programmed for controlling one or more circuit blocks in accordance with a predetermined user-desired functionality. The configuration circuit includes the memory element and the access circuitry, herewith called memory circuitry (memory circuits include decoders, word lines, bit lines and sensing devices required to access individual memory bits arranged in an array) to modify said memory element. In configuration circuits, each memory element generates a control signal that is coupled to a programmable circuit to program the programmable circuit. In a typical memory circuit of a memory device, as all memory elements are coupled together, no individual memory element generates a control signal. Configuration circuit does not include the logic pass-gate controlled by the memory element. In one embodiment, the configuration circuit includes a plurality of memory circuits to store instructions to configure an FPGA. In another embodiment, the configuration circuit includes a first selectable configuration where a plurality of memory circuits is formed to store instructions to control one or more circuit blocks. The configuration circuits include a second selectable configuration with a predetermined conductive pattern formed in lieu of the memory circuit to control substantially the same circuit blocks. The configuration circuit includes elements such as diode, transistor, resistor, capacitor, metal link, among others. The configuration circuit also includes thin film elements. In yet another embodiment, the configuration circuits include a predetermined conductive pattern, via, resistor, capacitor or other suitable circuits formed in lieu of the memory circuit to control substantially the same circuit blocks.
The term “horizontal” as used in this application is defined as a plane parallel to the conventional plane or surface of a wafer or substrate, regardless of the orientation of the wafer or substrate. The term “vertical” refers to a direction perpendicular to the horizontal direction as defined above. Prepositions, such as “on”, “side”, “higher”, “lower”, “over” and “under” are defined with respect to the conventional plane or surface being on the top surface of the wafer or substrate, regardless of the orientation of the wafer or substrate. The following detailed description is, therefore, not to be taken in a limiting sense.
A significant draw back with the SRAM configuration circuit in
In
A new kind of a programmable logic device utilizing thin-film transistor configurable circuits is disclosed in application Ser. No. 10/267,483 entitled “Three Dimensional Integrated Circuits”, application Ser. No. 10/267,484 entitled “Methods for Fabricating Three-Dimensional Integrated Circuits”, and U.S. Pat. No. 6,747,478 and list as inventor Mr. R. U. Madurawe, the contents of which are incorporated-by-reference. The disclosures describe a programmable logic device and an application specific device fabrication from the same base Silicon die. The PLD is fabricated with a programmable memory module, while the ASIC is fabricated with a conductive pattern in lieu of the memory. Both memory module and conductive pattern provide identical control of logic circuits. For each set of memory bit patterns, there is a unique conductive pattern to achieve the same logic functionality. The vertical integration of the configuration circuit leads to a significant cost reduction for the PLD, and the elimination of TFT memory for the ASIC allows an additional cost reduction for the user. The TFT vertical memory integration scheme is briefly described next.
Fabrication of the IC also follows a modularized device formation. Formation of transistors 850 and routing 854 is by utilizing a standard logic process flow used in the ASIC fabrication. Extra processing steps used for memory element 852 formation are inserted into the logic flow after circuit layer 850 is constructed. A full disclosure of the vertical integration of the TFT module using extra masks and extra processing is in the incorporated by reference applications discussed above.
During the customization, the base die and the data in those remaining mask layers do not change making the logistics associated with chip manufacture simple. Removal of the SRAM module provides a low cost standard logic process for the final ASIC construction with the added benefit of a smaller die size. The design timing is unaffected by this migration as lateral metal routing and silicon transistors are untouched. Software verification and the original FPGA design methodology provide a guaranteed final ASIC solution to the user. A full disclosure of the ASIC migration from the original FPGA is in the incorporated by reference applications discussed above.
In
In yet another embodiment of a programmable multi-dimensional semiconductor device, a first module layer is fabricated having a plurality of circuit blocks formed on a first plane. The programmable multi-dimensional semiconductor device also includes a second module layer formed on a second plane. A plurality of configuration circuits is then formed in the second plane to store instructions to control a portion of the circuit blocks.
The fabrication of thin-film transistors to construct configuration circuits is discussed next. A full disclosure is provided in application Ser. No. 10/413,809 entitled “Semiconductor Switching Devices”, filed on Apr. 14, 2003, which lists as inventor Mr. R. U. Madurawe, the contents of which are incorporated herein by reference.
The following terms used herein are acronyms associated with certain manufacturing processes. The acronyms and their abbreviations are as follows:
VT Threshold voltage
LDN Lightly doped NMOS drain
LDP Lightly doped PMOS drain
LDD Lightly doped drain
RTA Rapid thermal annealing
Ni Nickel
Ti Titanium
TiN Titanium-Nitride
W Tungsten
S Source
D Drain
G Gate
ILD Inter layer dielectric
C1 Contact-1
M1 Metal-1
P1 Poly-1
P− Positive light dopant (Boron species, BF2)
N− Negative light dopant (Phosphorous, Arsenic)
P+ Positive high dopant (Boron species, BF2)
N+ Negative high dopant (Phosphorous, Arsenic)
Gox Gate oxide
C2 Contact-2
LPCVD Low pressure chemical vapor deposition
CVD Chemical vapor deposition
ONO Oxide-nitride-oxide
LTO Low temperature oxide
A logic process is used to fabricate CMOS devices on a substrate layer for the fabrication of storage circuits. These CMOS devices may be used to build AND gates, OR gates, inverters, adders, multipliers, memory and pass-gate based logic functions in an integrated circuit. A CMOSFET TFT module layer or a Complementary gated FET (CGated-FET) TFT module layer may be inserted to a logic process at a first contact mask to build a second set of TFT MOSFET or Gated-FET devices. Configuration circuitry is build with these second set of transistors. An exemplary logic process may include one or more following steps:
P-type substrate starting wafer
Shallow Trench isolation: Trench Etch, Trench Fill and CMP
Sacrificial oxide
PMOS VT mask & implant
NMOS VT mask & implant
Pwell implant mask and implant through field
Nwell implant mask and implant through field
Dopant activation and anneal
Sacrificial oxide etch
Gate oxidation/Dual gate oxide option
Gate poly (GP) deposition
GP mask & etch
LDN mask & implant
LDP mask & implant
Spacer oxide deposition & spacer etch
N+ mask and NMOS N+ G, S, D implant
P+ mask and PMOS P+ G, S, D implant
Ni deposition
RTA anneal—Ni salicidation (S/D/G regions & interconnect)
Unreacted Ni etch
ILD oxide deposition & CMP
C1 mask & etch
W-Silicide plug fill & CMP
200A to 500A poly P1 (amorphous poly-1) deposition
P1 mask & etch
Vtn mask & P− implant (NMOS Vt)
Vtp mask & N− implant (PMOS Vt)
TFT Gox (70A PECVD) deposition
200A to 500A P2 (amorphous poly-2) deposition
P2 mask & etch
LDN mask and NMOS N− tip implant
LDP mask and PMOS P− tip implant
Spacer LTO deposition
Spacer LTO etch to form spacers & expose P1
N+ mask implant (NMOS G/S/D & interconnect)
P+ mask & implant (PMOS G/S/D & interconnect)
Ni deposition
RTA salicidation and poly re-crystallization (G/S/D regions & interconnect)
Dopant activation anneal
Excess Ni etch
ILD oxide deposition & CMP
C2 mask & etch
W plug formation & CMP
M1 deposition and back end metallization
The TFT process technology consists of creating NMOS & PMOS poly-silicon transistors. In the embodiment in
After gate poly of regular transistors are patterned and etched, the poly is salicided using Nickel & RTA sequences. Then the ILD is deposited, and polished by CMP techniques to a desired thickness. In the shown embodiment, the contact mask is split into two levels. The first C1 mask contains all contacts that connect latch outputs to substrate transistor gates and active nodes. Then the C1 mask is used to open and etch contacts in the ILD film. Ti/TiN glue layer followed by W-Six plugs, W plugs or Si plugs may be used to fill the plugs, then CMP polished to leave the fill material only in the contact holes. The choice of fill material is based on the thermal requirements of the TFT module.
Then, a first P1 poly layer, amorphous or crystalline, is deposited by LPCVD to a desired thickness as shown in
Patterned and implanted P1 may be subjected to dopant activation and crystallization. In one embodiment, RTA cycle is used to activate & crystallize the poly after it is patterned to near single crystal form. In a second embodiment, the gate dielectric is deposited, and buried contact mask is used to etch areas where P1 contacts P2 layer. Then, Ni is deposited and salicided with RTA cycle. All of the P1 in contact with Ni is salicided, while the rest poly is crystallized to near single crystal form. Then the unreacted Ni is etched away. In a third embodiment, amorphous poly is crystallized prior to P1 patterning with an oxide cap, metal seed mask, Ni deposition and MILC (Metal-Induced-Lateral-Crystallization).
Then the TFT gate dielectric layer is deposited followed by P2 layer deposition. The dielectric is deposited by PECVD techniques to a desired thickness in the 30-200A range, desirably 70A thick. The gate may be grown thermally by using RTA. This gate material could be an oxide, nitride, oxynitride, ONO structure, or any other dielectric material combination used as gate dielectric. The dielectric thickness is determined by the voltage level of the process. At this point an optional buried contact mask (BC) may be used to open selected P1 contact regions, etch the dielectric and expose P1 layer. BC could be used on P1 pedestals to form P1/P2 stacks over C1. In the P1 salicided embodiment using Ni, the dielectric deposition and buried contact etch occur before the crystallization. In the preferred embodiment, no BC is used.
Then second poly P2 layer, 200A to 2000A thick, preferably 300A is deposited as amorphous or crystalline poly-silicon by LPCVD as shown in
A spacer oxide is deposited over the LDD implanted P2 using LTO or PECVD techniques. The oxide is etched to form spacers. The spacer etch leaves a residual oxide over P1 in a first embodiment, and completely removes oxide over exposed P1 in a second embodiment. The latter allows for P1 salicidation at a subsequent step. Then NMOS devices & N+ poly interconnects are blanket implanted with N+. The implant energy ensures full or partial dopant penetration into the 80A residual oxide in the S/D regions adjacent to P2 layers. This doping gets to gate, drain & source of all NMOS devices and N+ interconnects. The P+ mask is used to select PMOS devices and P+ interconnect, and implanted with P+ dopant as shown in
After the P+/N+ implants, Nickel is deposited over P2 and salicided to form a low resistive refractory metal on exposed poly by RTA. Un-reacted Ni is etched as shown in
An LTO film is deposited over P2 layer, and polished flat with CMP. A second contact mask C2 is used to open contacts into the TFT P2 and P1 regions in addition to all other contacts to substrate transistors. In the shown embodiment, C1 contacts connecting latch outputs to substrate transistor gates require no C2 contacts. Contact plugs are filled with tungsten, CMP polished, and connected by metal as done in standard contact metallization of IC's as shown in
A TFT process sequence similar to that shown in
C1 mask & etch
W-Silicide plug fill & CMP
˜200A poly P1 (amorphous poly-1) deposition
P1 mask & etch
Blanket or masked Vtn N− implant (Gated-NFET VT)
Vtp mask & P− implant (Gated-PFET VT)
TFT Gox (70A-200A PECVD) deposition 200A P2 (amorphous poly-2) deposition
Blanket or masked P+ implant (Gated-NFET gate & interconnect)
N+ mask & implant (Gated-PFET gate & interconnect)
P2 mask & etch
Blanket or masked LDN Gated-NFET N tip implant
LDP mask and Gated-PFET P tip implant
Spacer LTO deposition
Spacer LTO etch to form spacers & expose P1
Ni deposition
RTA salicidation and poly re-crystallization (exposed P1 and P2)
Fully salicidation of exposed P1 S/D regions
Dopant activation anneal
Excess Ni etch
ILD oxide deposition & CMP
C2 mask & etch
W plug formation & CMP
M1 deposition and back end metallization
As the discussions demonstrate, memory controlled pass transistor logic elements provide a powerful tool to make switches. The ensuing high cost of memory can be drastically reduced by the 3-dimensional integration of configuration elements and the replaceable modularity concept for said memory. When the area is smaller, the wire lengths are shorter and power required to switch wire loads is less. These advances allow much lower power consumption in 3D FPGA designs. In one aspect, a cheaper memory element allows use of more memory for programmability. That enhances the ability to build large logic blocks (i.e. course-grain advantage) while maintaining smaller element logic fitting (i.e. fine-grain advantage). Furthermore larger grains need less connectivity: neighboring cells and far-away cells. That further simplifies the interconnect structure and power to switch interconnects. Additional methods to reduced power such as power up circuits and power up sequences are discussed next.
The semiconductor device of
The programmable logic device of
In a first embodiment of
FPGA to ASIC conversion is described next. In
Such an ASIC derived from the FPGA in
Although an illustrative embodiment of the present invention, and various modifications thereof, have been described in detail herein with reference to the accompanying drawings, it is to be understood that the invention is not limited to this precise embodiment and the described modifications, and that various changes and further modifications may be effected therein by one skilled in the art without departing from the scope or spirit of the invention as defined in the appended claims.