The invention is directed to chemical-mechanical polishing (CMP), and in particular, to CMP of a copper layer on a semiconductor wafer.
Chemical-mechanical polishing (CMP) is widely used in integrated circuit manufacturing to form copper interconnects. Interconnects may be vertical (vias) or horizontal (lines). In the copper damascene process, copper is typically electroplated into holes or trenches that have been formed in a dielectric layer on a semiconductor wafer. The electroplating process results in a layer of copper formed over the surface of the dielectric. The layer is removed by polishing, which also separates copper features defined by the holes or trenches to form the interconnects. When a copper polishing step is used to form vias and lines simultaneously, the process is referred to as “dual damascene.” When one polishing step is used to form the vias, and another is used to form lines, the process is referred to as “single damascene.”
The CMP process typically is performed on a process tool that includes several platens, each having a polishing pad fixed thereto. A wafer is placed face down on the polishing pad, and a polishing slurry is dispensed as the platen rotates. Material is removed from the surface in a manner that reduces the height of surface topography. The polishing step may be terminated when a predetermined thickness of material is removed, or when the surface of the wafer undergoes a change in optical reflectivity that can be detected by an end-point system.
In the copper damascene process, the copper layer is removed on a platen set up for copper removal only. A barrier layer and a portion of the dielectric layer underlying the copper layer are removed on a second platen set up for that purpose. Because the removal of the copper layer requires significantly more time than removal of the barrier layer and dielectric portion, the second platen is idle for some of the time required to remove the copper layer on the first platen. This idle time represents wasted processing time and results in lower tool throughput than would be the case if the second platen were used more continuously. This wasted time results in increased cost per wafer in a fiercely competitive industry.
Accordingly, what is needed in the art is a method of polishing semiconductor wafers that reduces the idle time of polishing platens on the polishing tool.
To address the above-discussed deficiencies of the prior art, the invention provides, in one aspect, a method of manufacturing a semiconductor device. The method includes removing a first thickness of a copper layer located over a semiconductor substrate by chemical-mechanical polishing. The copper layer is located over a barrier layer. The removing is conducted on a first platen using a first polishing slurry. A remaining thickness of the copper layer is removed on a second platen using a second polishing slurry. A portion of the barrier layer on is removed on the second platen using a third polishing slurry. The third polishing slurry has a substantially different composition from the second polishing slurry.
Another embodiment is a method of manufacturing a semiconductor device. A first thickness of a copper layer located on a semiconductor substrate is removed by chemical-mechanical polishing conducted on a first platen using a first polishing slurry that includes colloidal silica particles. The copper layer is located over a barrier layer. The semiconductor substrate is transferred to a second platen after removing the first thickness. After transferring to the second platen, a remaining thickness of the copper layer is removed on the second platen using the first polishing slurry. The barrier layer and a portion of a dielectric layer underlying the barrier layer are also removed on the second platen using a second polishing slurry having a composition substantially different from that of the first polishing slurry. The colloidal silica particles are dispersed in the first slurry and remain substantially dispersed within the second polishing slurry when mixed therewith on the second platen. Further, a residence time of the semiconductor substrate with respect to the first platen is about equal to a residence time of the semiconductor substrate with respect to the second platen.
Another embodiment is a semiconductor device. The device has transistors formed on a substrate and a dielectric layer formed over the transistors. Interconnects are formed within the dielectric layer and configured to connect the transistors to other circuit components located on the substrate. The interconnects are formed by forming trenches in said dielectric layer. A barrier layer is deposited on a sidewall of the trenches and on a surface of the dielectric layer parallel to the substrate. Copper is placed over the barrier layer, where the placing forms a layer of copper parallel to the substrate. A first thickness of the copper layer is removed by chemical-mechanical polishing on a first platen using a first polishing slurry. A remaining thickness of the copper layer is removed on a second platen using a second polishing slurry having a composition substantially different from that of the first polishing slurry. The barrier layer parallel to the substrate is removed on the second platen using a third polishing slurry.
Reference is now made to the following descriptions taken in conjunction with the accompanying drawings, in which:
Previous interconnect levels may include contacts, vias and metal lines formed by any process. Examples include tungsten contacts, aluminum or copper vias, and aluminum or copper lines. The dielectric layer 120 may be homogeneous or heterogeneous, and may include any conventional or future discovered dielectric material.
At this point in the manufacturing process, the substrate 110 is polished to remove a portion of the copper layer 150 and the barrier layer 140.
The incoming semiconductor substrate 110 is initially placed face down in contact with the polishing pad on the first platen 210. In a conventional process, the first platen 210 removes substantially all of a bulk portion of the copper layer 150. The bulk copper removal process uses a first polishing slurry designed to remove copper. Bulk copper is that portion of the copper layer 150 above the barrier layer portion 141 that is sufficiently distant from the barrier layer portion 141 to be free of interfacial effects such as alloying and interdiffusion of the barrier material. “Substantially all” means about 90% or greater of the initial thickness 151.
The wafer is then transferred the second platen 220. In the conventional process, a second polishing slurry, substantially different from that of said first polishing slurry, removes any remaining copper, the barrier layer portion 141, and some of the dielectric layer 120 below. The wafer is then transferred to the third platen 230. A third polishing slurry may optionally be used to buff the wafer to remove potential sources of defects, such as scratches and organic residue.
In a conventional copper CMP process, bulk copper removal and barrier removal are generally not performed on the same platen. In the past, a slurry designed for copper removal was used that included a carrier fluid with an acidic pH (less than about 6). The slurry includes a carrier fluid and may additionally include abrasive particles. The abrasive particles may include particles of a ceramic such as silica, alumina or titania. The precise composition of polishing slurries is generally proprietary, but in general may include pH buffers, surfactants and corrosion inhibitors.
Copper polishing slurries used in prior art processes typically included fused silica particles as the abrasive. In such a slurry, the acidic pH is necessary in part to maintain dispersion of slurry particles suspended in the slurry. When the pH exceeds a value, typically in the neutral range (pH=7), the silica particles tend to form aggregates. Aggregates may cause scratches and particle defects on the semiconductor substrate, reducing product yield and increasing manufacturing costs.
Moreover, the slurry used for barrier removal includes a carrier fluid that is typically basic, with a pH greater than about 8. If the acidic copper removal slurry and the barrier removal slurry are mixed, the silica abrasive in the copper slurry will generally aggregate. Thus, copper and barrier removal are strictly limited to different platens in prior art copper CMP processes.
Recent developments in copper polishing slurry compositions include replacing fused silica particles with colloidal silica for improved uniformity of particle size. In addition, the colloidal silica particles are chemically treated to enable the particles to remain substantially dispersed at a pH higher than about 6 to 7. Particles are substantially dispersed when a significant majority of the particles are not bound to other particles by covalent, ionic or van der Waals forces. A significant majority may in some cases be greater than 95%, and may be greater than 99%.
The invention recognizes for the first time that these recent advances in slurry composition can be exploited to improve throughput of a copper CMP process. Because treated colloidal silica particles remain substantially dispersed as isolated particles at a higher pH than the fused silica particles used in earlier copper polishing slurries, the slurries may both be used sequentially on the same platen to remove a portion of the copper layer 150 and the barrier layer portion 141.
When copper and barrier removal are combined on a single platen, a degree of freedom in the CMP process advantageously results. A portion of the copper removal process load may then be transferred from the first platen 210 to the second platen 220. In this manner, the polishing load on the first platen 210 and the second platen 220 can be made more equal. Thus, idle time of the second platen is reduced, resulting in greater efficiency.
The subprocess 301 removes the first thickness 160 of the copper layer 150. In a step 305, a wafer from a process lot is provided to the first platen 210. The first platen 210 includes a suitable polishing pad, such as a polyurethane pad. Such pads are well known to those skilled in the art, and are available from multiple equipment vendors. In a step 310, a first polishing slurry is used to remove the first thickness 160. In a step 315, the subprocess 301 transfers the wafer to the second platen 220 for removal of the remaining thickness 165. After transfer of the wafer, the subprocess 301 returns to the step 305 to accept another wafer for processing if any remain in the process lot.
The first thickness 160 of the copper layer 150 is substantially less than the initial thickness 151. In one aspect, the first thickness 160 ranges from about 50% to about 90% of the thickness 151. In another aspect, the first thickness 160 is the amount of copper removed by the subprocess 301 in the time that the subprocess 330 removes the remaining thickness 165 and the barrier layer portion 141. In some cases, the first thickness 160 ranges from about 60% to about 75% of the initial thickness 151.
Referring to the subprocess 330, in a step 335, the wafer is accepted from the subprocess 301. In a step 340, the remaining thickness 165 of copper is removed using a second slurry. In some cases, the first slurry and the second slurry have a different composition. Differences may include, but are not limited to pH, concentration of solids and abrasive composition or size distribution. In other cases, the first slurry and the second slurry have a substantially comparable composition.
The compositions of the first and second slurries are substantially comparable when they are provided by a source that provides the same nominal slurry composition. For example, when the first and second slurries are drawn from the same or different reservoirs of slurry represented by the same vendor part number, the slurries are substantially comparable. However, transient or local variations in slurry composition may occur due to mixing effects, settling of solids, etc. without departing from substantial comparability. The first and second slurries are also substantially comparable when, regardless of their sources, the solids concentration, pH, additives and abrasive are each substantially the same in each slurry.
In a step 345, an optional pad rinse may be performed. If done, the pad rinse may act to flush abrasive particles provided by the first slurry from the polishing pad. This rinse may also dilute the carrier fluid provided by the first slurry. In some cases, the wafer may remain in contact with the polishing pad during the rinse. In other cases, the wafer may be removed from contact with the pad during the step 345.
In a step 350, the wafer is placed in contact with the polishing pad if it was removed during the step 345. The barrier layer portion 141 and a portion of the dielectric layer 120 are removed using a third polishing slurry. The third polishing slurry has a composition substantially different from that of the second polishing slurry. After the barrier layer portion 141 is removed in the step 350, the wafer is transferred to the third platen 230 in a step 355.
The third polishing slurry is substantially different when a characteristic of one or more of the components of the third polishing slurry differs significantly from the same characteristic of the one or more components of the second polishing slurry. Nonlimiting examples include: pH of the carrier fluid; atomic makeup of abrasive particles; average size or size distribution of the abrasive particles; and any stabilizers, corrosion inhibitors or surfactants.
In one aspect, the second and the third polishing slurries have a different pH. In some cases, the pH differs by at least 1 pH unit. In other cases, the pH of the second polishing slurry ranges from about 6.8 to about 7.3, and the pH of the third polishing slurry ranges from about 8.5 to about 10.5.
When the third slurry is dispensed onto the polishing pad, residual slurry particles from the second slurry may remain embedded in the pad or suspended in residual carrier fluid from the second polishing slurry. The residual slurry particles or the residual carrier fluid may mix with the third slurry on the pad. Although the second and the third polishing slurries have a substantially different composition, any residual slurry particles from the second polishing slurry remain substantially dispersed in the third slurry.
Without limitation by theory, it is thought that the chemical treatment of the slurry particles enables their suspension in spite of the carrier fluid having a higher pH than previous copper polishing slurries. It is further thought that this surface treatment enables the abrasive particles to remain suspended at the higher pH of the carrier fluid of the third slurry. Without this recent advance in the copper slurry composition, the slurry particles would tend to form aggregates as previously described.
Thus, while experience in the CMP arts would firmly guide one of ordinary skill to prevent mixing of the second and third slurries, the discovery that these particles remain suspended when mixed as described in the process 300 provides the ability to shift some of the copper removal burden from the first platen 210 to the second platen 220. A residence time of the wafer with respect to the first platen 210 and the second platen 220 may thereby be made about equal. Because the copper removal step is often the rate-limiting step of the copper CMP process, the overall throughput of the process is improved when the residence time is balanced between the first and second platens.
Referring to the subprocess 360, the optional buff polish may be performed after barrier removal. However, the subprocess 360 is not necessary to practice the embodiments described herein. When used, a step 365 accepts the wafer from the subprocess 330. In a step 370, the wafer is placed in contact with a polish pad on the third platen 230. The wafer surface remains in contact with the pad while a buff slurry is dispensed onto the pad. The duration of the buff polish is typically less than the residence time of the wafer with respect to the first and the second platens, so the step 370 generally does not limit the throughput of the process 300. In a step 375, the wafer is rinsed and placed in a receiving carrier for later transfer to a following process step.
The concept of residence time is illustrated in
The residence time of a second wafer with respect to the second platen 220 is described by timelines 420, 425, 430. The residence time of the second wafer begins at a time 435 when the surface of the wafer is placed in contact with the polishing pad. The residence time ends when the second wafer is finally removed from contact with the pad at a time 440. Remaining bulk copper (the remaining thickness 165, e.g.) is removed between the time 435 and the time 445. In cases for which the step 340 is terminated by end-point detection, the duration of contact of the wafer with the pad may vary due to variations of the remaining thickness 165. In such cases, there may be some uncertainty regarding the duration between the time 435 and the time 445.
The timeline 425 represents the duration of the optional step 345, during which the polishing pad on the second platen 220 is rinsed. The rinse begins at a time 450 and ends at a time 455. During the duration of the timeline 425, the wafer may or may not be in contact with the polishing pad.
The timeline 430 represents the duration of the step 350 during which the barrier layer portion 141 and a portion of the dielectric layer 120 are removed. In the case that the wafer was removed from the pad in the step 345, the wafer is placed again in contact with the pad at a time 460. In the case in which the wafer remained in contact with the pad during the step 345, the flow of the third polishing slurry begins at the time 460. The step 350 ends at the time 440 when the wafer is removed for the final time from the polishing pad.
When the residence time of the first wafer is about equal to the residence time of the second wafer about equal, loss of throughput due to idle platen time is advantageously minimized. The residence times are about equal when they differ by a maximum difference. In some embodiments, the maximum difference is less that about 25% of the longer residence time. In other embodiments, the maximum difference is less that about 10% of the longer residence time. In still other embodiments, the maximum difference is less that about 5% of the longer residence time.
The interconnects 560 are formed using the process 300 or a similar embodiment. The interconnects 560 may be formed using a single damascene 570 or a dual damascene 580 architecture. The device 500 may include any number of interconnect levels called for by the design of the device 500.
Although the invention has been described in detail, those skilled in the pertinent art should understand that they can make various changes, substitutions and alterations herein without departing from the scope of the invention in its broadest form.