This document claims priority to Japanese Patent Application No. 2021-031850 filed Mar. 1, 2021, the entire contents of which are hereby incorporated by reference.
With a recent trend toward higher integration and higher density in semiconductor devices, circuit interconnects become finer and finer and the number of levels in multilayer interconnect is increasing. In the process of achieving the multilayer interconnect structure with finer interconnects, film coverage of step geometry (or step coverage) is lowered through thin film formation as the number of interconnect levels increases, because surface steps grow while following surface irregularities on a lower layer. Therefore, in order to fabricate the multilayer interconnect structure, it is necessary to improve the step coverage and planarize the surface in an appropriate process. Further, since finer optical lithography entails shallower depth of focus, it is necessary to planarize surfaces of semiconductor device so that irregularity steps formed thereon fall within a depth of focus in optical lithography.
Accordingly, in a manufacturing process of the semiconductor devices, a planarization technique of a surface of the semiconductor device is becoming more important. The most important technique in this planarization technique is chemical mechanical polishing. This chemical mechanical polishing (which will be hereinafter called CMP) is a process of polishing a substrate, such as a wafer, by placing the substrate in sliding contact with a polishing pad while supplying a polishing liquid containing abrasive grains, such as silica (SiO2), onto the polishing pad.
A polishing apparatus for performing CMP includes a polishing table that supports a polishing pad having a polishing surface, and a substrate holder, which is referred to as a polishing head (substrate holding apparatus) for holding a wafer. When the wafer is polished with such a polishing apparatus, the polishing table and the polishing head are moved relative to each other while supplying the polishing liquid (slurry) onto the polishing pad disposed on the polishing table, and the wafer is pressed against the polishing surface of the polishing pad under a predetermined pressure by the polishing head. The wafer is brought into sliding contact with the polishing surface in the presence of the polishing liquid, so that the surface of the wafer is polished to a flat finish by a combination of a chemical action of the slurry and a mechanical action of abrasive grains contained in the slurry.
During polishing of the substrate, the surface of the substrate is brought into contact with the rotating polishing pad, and thus a friction force is exerted on the substrate. Therefore, in order to prevent the substrate from being detached from the polishing head during polishing of the substrate, the polishing head is provided with a retainer ring. This retainer ring is disposed so as to surround the substrate. During polishing of the substrate, the retainer ring rotates and presses the polishing pad at the outside of the substrate.
In recent years, there has been increasing demand for more precise control of thickness profile in a substrate (i.e., improvement of uniformity within the surface, which represents flatness of the substrate surface) in order to respond to various initial film thickness profiles which vary depending on semiconductor devices and CMP processes, as well as to improve yields.
Further. an allowable range with respect to a target film thickness is becoming narrower, and thus it is becoming difficult to maintain the film thickness within the required allowable range in the conventional polishing method, which uses a film thickness sensor to acquire a film thickness index value of the substrate during polishing, and terminates polishing of the substrate based on the thickness index value. For example, an amount of polishing during one rotation of the polishing table may be larger than the allowable range. In this case, if an end point of polishing is determined based on the film thickness index value obtained from the film thickness sensor disposed in the polishing table, the film thickness obtained after the end of polishing may exceed the allowable range with respect to the target film thickness.
Further, the film thickness profile before polishing is different among substrates to be polished, and polishing conditions (for example, conditions of the polishing surface of the polishing pad) are different among polishing apparatuses. Due to these combined factors also, it is becoming increasingly difficult to preciously control the film thickness profile, and bring the film thickness profile within the required allowable range.
Therefore, there is provided a polishing apparatus capable of obtaining a desired film thickness profile. Further, there is provided a polishing method of polishing substrate using such a polishing apparatus.
Embodiments, which will be described below, relate to a polishing apparatus and a polishing method for a substrate, such as a wafer, and particularly relates to a polishing apparatus and a polishing method for polishing a substrate to obtain a desired thickness profile. Further, embodiments, which will be described below, relate to a polishing method for polishing a substrate by use of such polishing apparatus.
In one embodiment, there is provided a polishing apparatus, comprising: at least one polishing unit including a polishing table for supporting a polishing pad, and a substrate holder for pressing a substrate against the polishing pad a film thickness measuring device configured to measure a film thickness profile of the substrate; and a controller configured to control at least operations of the polishing unit and the film thickness measuring device, wherein the substrate holder includes: an elastic membrane film to form a plurality of pressure chambers for pressing the substrate; a head body to which the elastic membrane is attached; and a retainer ring arranged so as to surround the substrate, the controller stores in advance a response model which is created by taking into consideration variation in the amount of polishing between monitored areas of the substrate due to variations in a pressure of a pressurized fluid supplied to each of the pressure chambers, and the controller obtains a film thickness profile of the substrate before polishing by use of a film thickness measuring device; causes the substrate to be polished with an optimized polishing recipe including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on the response model and a target polishing amount, which is a difference between the film thickness profile of the substrate before polishing and the target film thickness of the substrate; and causes a next substrate to be polished with a new optimized polishing recipe which is created based on a target polishing amount of the next substrate and a response model corrected by use of the optimized polishing recipe and film thickness profiles of the substrate before and after polishing.
In one embodiment, the optimal polishing recipe is created by use of an optimization calculation that minimizes an objective function including at least a term for difference between the target polishing amount and a predicted polishing amount calculated by use of the response model.
In one embodiment, the objective function includes a term for differences between pressures of the pressurized fluid of the optimized polishing recipe and preset reference pressures of the pressurized fluid, and/or a term for differences between the pressures of the pressurized fluid of the optimized polishing recipe and pressures of the pressurized fluid used in polishing of the previous substrate.
In one embodiment, the optimization calculation is quadratic programming method.
In one embodiment, the number of the plurality of monitored areas is greater than the number of the plurality of pressure chambers.
In one embodiment, the response model is a response model created by further taking into consideration variation in an amount of polishing between the monitored areas of the substrate due to variation in a pressing force of a retaining ring on the polishing pad, and the optimized polishing recipe further includes the pressing force of the retainer ring.
In one embodiment, the film thickness measuring device is configured to measure a film thickness at a plurality of measurement points set in each of the plurality of monitored areas.
In one embodiment, the response model includes a response coefficient representing an amount of increase in the polishing rate per unit polishing pressure in each of the plurality of monitored areas.
In one embodiment, the polishing apparatus further comprising a plurality of local-load exerting devices for applying a local load to a part of the retainer ring, and the response model is created by further taking into consideration variation in an amount of polishing between the monitored areas due to variation in the local load.
In one embodiment, there is provided a polishing apparatus, comprising: at least one polishing unit including a polishing table for supporting a polishing pad, and a substrate holder for pressing a substrate against the polishing pad; a film thickness measuring device configured to measure a film thickness profile of the substrate; and a controller configured to control at least operations of the polishing unit and the film thickness measuring device, wherein the substrate holder includes: an elastic membrane film to form a plurality of pressure chambers for pressing the substrate; a head body to which the elastic membrane is attached; and a retainer ring arranged so as to surround the substrate, the controller stores in advance film thickness profiles of a plurality of substrates before polishing and response models for polishing each of the plurality of substrates, the response models being created by taking into consideration variation in the amount of polishing between monitored areas of the substrate due to variations in a pressure of each of the pressure chambers, and classifies in advance the film thickness profiles of the plurality of substrates before polishing into a plurality of groups to which the film thickness profiles similar to each other belong, the controller obtains a film thickness profile of the substrate before polishing by use of the film thickness measuring device, determines a group, to which the film thickness profile of the substrate before polishing belongs, from the plurality of groups, causes the substrate to be polished with an optimized polishing recipe including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before polishing and the target film thickness of the substrate, and a response model associated with the determined group, and causes a next substrate to be polished with a new optimized polishing recipe which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe and film thickness profiles of the substrate before and after polishing.
In one embodiment, there is provided a polishing apparatus, comprising: a plurality of polishing units each including a polishing table for supporting a polishing pad, and a substrate holder for pressing a substrate against the polishing pad; and a controller configured to control at least operation of the polishing units, wherein the substrate holder includes: an elastic membrane film to form a plurality of pressure chambers for pressing the substrate; a head body to which the elastic membrane is attached; and a retainer ring arranged so as to surround the substrate, the substrate is a substrate which is polished by a plurality of polishing processes including a first polishing and a second polishing performed in the polishing unit different from the polishing unit in which the first polishing has been performed, the polishing unit in which the first polishing is performed has a film thickness sensor for measuring a thickness profile of the substrate, the controller stores in advance a response model for the second polishing which is created by taking into consideration variation in the amount of polishing between monitored areas of the substrate due to variations in a pressure of a pressurized fluid supplied to each of the pressure chambers, and the controller obtains, after performing the first polishing a film thickness profile of the substrate before the second polishing by use of the film thickness sensor, causes the substrate to be second polished with an optimized polishing recipe for the second polishing including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before the second polishing and the target film thickness of the substrate, and the response model for the second polishing, and causes a next substrate to be second polished with a new optimized polishing recipe for the second polishing which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe for the second polishing and film thickness profiles of the substrate before and after the second polishing.
In one embodiment, there is provided a polishing apparatus, comprising: at least one polishing unit including a polishing table for supporting a polishing pad, and a substrate holder for pressing a substrate against the polishing pad; a film thickness measuring device configured to measure a film thickness profile of the substrate; and a controller configured to control at least operations of the polishing unit and the film thickness measuring device, wherein the substrate holder includes: an elastic membrane film to form a plurality of pressure chambers for pressing the substrate; a head body, to which the elastic membrane is attached; and a retainer ring arranged so as to surround the substrate, the substrate is a substrate which is polished by a plurality of polishing processes including a first polishing and a second polishing performed in the polishing unit different from the polishing unit in which the first polishing has been performed, the polishing unit in which the second polishing is performed has a film thickness sensor for measuring a film thickness profile of the substrate, the controller stores in advance a response model for the first polishing which is created by taking into consideration variation in the amount of polishing between monitored areas of the substrate due to variations in a pressure of a pressurized fluid supplied to each of the pressure chambers, and the controller obtains a film thickness profile of the substrate before the first polishing by use of the film thickness sensor, causes the substrate to be first polished with an optimized polishing recipe for the first polishing including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before the first polishing and the target film thickness of the substrate, and the response model for the first polishing, and causes the substrate to be transferred, after performing the first polishing to the polishing unit having the film thickness sensor, to obtain a film thickness profile of the substrate after the first polishing by use of the film thickness sensor, and causes a next wafer to be first polished with a new optimized polishing recipe for first polishing which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe for the first polishing and film thickness profiles of the substrate before and after the first polishing.
In one embodiment, there is provided a polishing apparatus, comprising: a plurality of polishing units each including a polishing table for supporting a polishing pad, a substrate holder for pressing, a substrate against the polishing pad, and a film thickness sensor for measuring a film thickness profile of the substrate; and a controller configured to control at least operation of the polishing units, wherein the substrate holder includes: an elastic membrane film to form a plurality of pressure chambers for pressing the substrate; a head body to which the elastic membrane is attached; and a retainer ring arranged so as to surround the substrate, the substrate is a substrate which is polished by a plurality of polishing processes including a first polishing and a second polishing performed in the polishing unit different from the polishing unit in which the first polishing has been performed, the controller stores in advance a response model for the first polishing and a response model for the second polishing which is created by taking into consideration variation in the amount of polishing between monitored areas of the substrate due to variations in a pressure of a, pressurized fluid supplied to each of the pressure chambers, and the controller causes the substrate to be transferred to any one of the plurality of polishing units to obtain a film thickness profile of the substrate before the firs polishing, causes the substrate to be first polished with an optimized polishing recipe for the first polishing including at least the pressurises of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before the first polishing and the target film thickness of the substrate, and the response model for the first polishing, obtains a film thickness profile of the substrate before the second polishing by use of the film thickness sensor, causes the substrate to be second polished with an optimized polishing recipe for the second polishing including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before the second polishing and the target film thickness of the substrate, and the response model for the second polishing, and obtains a film thickness profile of the substrate after the second polishing by use of film thickness sensor, causes a next substrate to be polished with a new optimized polishing recipe for first polishing which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe for the first polishing and film thickness profiles of the substrate before and after the first polishing causes the next substrate to be second polished with a new optimized polishing recipe for the second polishing which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe for the second polishing and film thickness profiles of the substrate before and after the second polishing.
In one embodiment, there is provided a polishing method in which a substrate held by a substrate holder is pressed against a polishing pad supported by a polishing table to thereby be polished, the substrate holder including an elastic membrane film to form a plurality of pressure chambers for pressing the substrate, a head body to which the elastic membrane is attached, and a retainer ring arranged so as to surround the substrate, comprising: obtaining a film thickness profile of the substrate before polishing by use of a film thickness measuring device; polishing the substrate with an optimized polishing recipe including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on the response model and a target polishing amount, which is a difference between the film thickness profile of the substrate before polishing and the target film thickness of the substrate; and polishing; a next substrate with a new optimized polishing recipe which is created based on a target polishing amount of the next substrate and a response model corrected by use of the optimized polishing recipe and film thickness profiles of the substrate before and after polishing, wherein the response model is created by taking into consideration variation in the amount of polishing between monitored areas of the substrate, due to variations in a pressure of a pressurized fluid supplied to each of the pressure chambers.
In one embodiment, the optimal polishing recipe is created using an optimization calculation that minimizes an objective function including at least a term for difference between the target polishing amount and a predicted polishing amount calculated using the response model.
In one embodiment, the objective function includes a term for differences between pressures of the pressurized fluid of the optimized polishing recipe and preset reference pressures of the pressurized fluid, and/or a term for differences between the pressures of the pressurized fluid of the optimized polishing recipe and pressures of the pressurized fluid used in polishing of the previous substrate.
In one embodiment, the optimization calculation is quadratic programming method.
In one embodiment, the number of the plurality of monitored areas is greater than the number of the plurality of pressure chambers.
In one embodiment, the response model is a response model created by further taking into consideration variation in an amount of polishing between the monitored areas of the substrate due to variation in a pressing force of a retaining ring on the polishing pad, and the optimized polishing recipe further includes the pressing force of the retainer ring.
In one embodiment, the film thickness measuring device is configured to measure a film thickness at a plurality of measurement points set in each of the plurality of monitored areas.
In one embodiment, the response model includes a response coefficient representing an amount of increase in the polishing rate per unit polishing pressure in each of the plurality of monitored areas.
In one embodiment, the response model is created by further taking into consideration variation in an amount of polishing between the monitored areas due to variation in the local load applied to a part of the retainer ring by a plurality of local-load exerting devices.
In one embodiment, there is provided a polishing method in which a substrate held by a substrate holder is pressed against a polishing pad supported by a polishing table to thereby be polished, the substrate holder including an elastic membrane film to form a plurality of pressure chambers for pressing the substrate, a head body to which the elastic membrane is attached, and a retainer ring arranged so as to surround the substrate, comprising: storing in advance film thickness profiles of a plurality of substrates before polishing and response models for polishing each of the plurality of substrates, the response models being created by taking into consideration variation in the amount of polishing between monitored areas of the substrate due to variations in a pressure of each of the pressure chambers, classifying in advance the film thickness profiles of the plurality of substrates before polishing into a plurality of groups to which the film thickness profiles similar to each other belong, obtains a film thickness profile of the substrate before polishing by use of the film thickness measuring device to determine a group, to which the film thickness profile of the substrate before polishing belongs, from the plurality of groups, polishing the substrate with an optimized polishing recipe including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before polishing and the target film thickness of the substrate, and a response model associated with the determined group, and polishing a next substrate with a new optimized polishing recipe which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe and film thickness profiles of the substrate before and after polishing.
In one embodiment, there is provided a polishing method in which a substrate is polished by a plurality of polishing process performed in a plurality of polishing unit each including a polishing table for supporting a polishing pad and a substrate holder for pressing a substrate against the polishing pad, wherein the substrate holder includes: an elastic membrane film to form a plurality of pressure chambers for pressing the substrate; a head body to which the elastic membrane is attached; and a retainer ring arranged so as to surround the substrate, the plurality of polishing processes includes a first polishing and a second polishing performed in the polishing unit different from the polishing unit in which the first polishing has been performed, the polishing unit performing the first polishing has a film thickness sensor for measuring a film thickness profile of the substrate, and the polishing method comprising: preparing in advance a response model for the second polishing which is created by taking into consideration variation in the amount of polishing between monitored areas of the substrate due to variations in a pressure of a pressurized fluid supplied to each of the pressure chambers; obtaining, after performing the first polishing, a film thickness profile of the substrate before the second polishing by use of the film thickness sensor; second polishing the substrate with an optimized polishing recipe for the second polishing including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before the second polishing and the target film thickness of the substrate, and the response model for the second polishing; and second polishing a next substrate with a new optimized polishing recipe for the second polishing which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe for the second polishing and film thickness profiles of the substrate before and after the second polishing.
In one embodiment, there is provided a polishing method in which a substrate is polished by a plurality of polishing process performed in a plurality of polishing unit each including a polishing table for supporting a polishing pad, and a substrate holder for pressing a substrate against the polishing pad, wherein the substrate holder includes: an elastic membrane film to form a plurality of pressure chambers for pressing the substrate; a head body to which the elastic membrane is attached; and a retainer ring arranged so as to surround the substrate, the plurality of polishing processes includes a first polishing and a second polishing performed in the polishing unit different from the polishing unit in which the first polishing has been performed, the polishing unit performing the second polishing has a film thickness sensor for measuring a film thickness profile of the substrate, and the polishing method comprising: preparing in advance a response model for the first polishing which is created by taking into consideration variation in the amount of polishing between monitored areas of the substrate due to variations in a pressure of a pressurized fluid supplied to each of the pressure chambers; obtaining a film thickness profile of the substrate before the first polishing by use of the film thickness sensor; first polishing the substrate with an optimized polishing recipe for the first polishing including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before the first polishing and the target film thickness of the substrate, and the response model for the first polishing, and transferring, after performing the first polishing, the substrate to the polishing unit having the film thickness sensor, obtaining a film thickness profile of the substrate after the first polishing by use of the film thickness sensor, and first polishing a next wafer with a new optimized polishing recipe for first polishing which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe for the first polishing and film thickness profiles of the substrate before and after the first polishing.
In one embodiment, there is provided a polishing method in which a substrate is polished by a plurality of polishing process performed in a plurality of polishing unit each including a polishing table for supporting a polishing pad, and a substrate holder for pressing a substrate against the polishing pad, wherein the substrate holder includes: an elastic membrane film to form a plurality of pressure chambers for pressing the substrate; a head body to which the elastic membrane is attached; and a retainer ring arranged so as to surround the substrate, the plurality of polishing processes includes a first polishing and a second polishing performed in the polishing unit different from the polishing unit in which the first polishing has been performed, each of the polishing unit performing the first polishing and the polishing unit performing the second polishing has a film thickness sensor for measuring a film thickness profile of the substrate, and the polishing method comprising: preparing in advance a response model for the first polishing and a response model for the second polishing which is created by taking into consideration variation in the amount of polishing between monitored areas of the substrate due to variations in a pressure of a pressurized fluid supplied to each of the pressure, chambers, and transferring the substrate to the polishing unit for performing the first polishing to obtain a film thickness profile of the substrate before the firs polishing, first polishing the substrate with an optimized polishing recipe for the first polishing including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before the first polishing and the target film thickness of the substrate, and the response model for the first polishing, obtaining a film thickness profile of the substrate before the second polishing by use of the film thickness sensor, second polishing the substrate with an optimized polishing recipe for the second polishing including at least the pressures of the pressurized fluid supplied to each of the plurality of pressure chambers and the polishing time, the optimized polishing recipe being created based on a target polishing amount, which is a difference between the film thickness profile of the substrate before the second polishing and the target film thickness of the substrate, and the response model for the second polishing, and obtaining a film thickness profile of the substrate after the second polishing by use of film thickness sensor, first polishing a next substrate to be polished with a new optimized polishing recipe for first polishing which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe for the first polishing and film thickness profiles of the substrate before and after the first polishing, and second polishing the next substrate to be second polished with a new optimized polishing recipe for the second polishing which is created based on a target polishing amount of the next substrate, and a response model corrected by use of the optimized polishing recipe for the second polishing and film thickness profiles of the substrate before and after the second polishing.
According to the above-described embodiments, the optimized polishing recipe obtained by use of the response model takes into account variation in the amount of polishing between the areas of the substrate due to the variation the pressures in each of the pressure chambers change. Therefore, the film thickness profile of the substrate can be controlled more precisely.
Embodiments according to the above-described embodiments will be described in detail below with reference to the drawings.
As shown in
In the housing 10, there are disposed a plurality of (four in this embodiment) polishing units 14a to 14d each for polishing the wafer, a first cleaning unit 16 and a second cleaning unit 18 each for cleaning a polished wafer, and a drying unit 20 for drying a cleaned wafer. The polishing units 14a to 14d are arranged along a longitudinal direction of the polishing apparatus, and the cleaning units 16, 18 and the drying unit 20 are also arranged along the longitudinal direction of the polishing apparatus. Further, the polishing apparatus has a controller 30, which is disposed in the housing 10, and is configured to control operations of each of units.
A first substrate transfer robot 22 is disposed in an area surrounded by the loading port 12, the polishing unit 14a, and the drying unit 20. Further, a substrate transport device 24 is disposed parallel to the polishing units 14a to 14d. The first substrate transfer robot 22 receives a wafer, to be polished, from the loading port 12 and transfers the wafer to the substrate transport device 24, and receives a dried wafer from the drying unit 20 and returns the dried wafer to the loading port 12. The substrate transport device 24 transports a wafer received from the first substrate transfer robot 22, and transfers the substrate between the polishing units 14a, 14b, 14c, 14d.
A second substrate transfer robot 26 for transferring a substrate between the cleaning units 16, 18 and the substrate transport device 24 is provided between the first cleaning unit 16 and the second cleaning unit 18. A third substrate transfer robot 28 for transferring a substrate between the second cleaning unit 18 and the drying unit 20 is provided between these units 18, 20. Further, an operation controller 30 for controlling operations of each unit of the substrate processing apparatus is provided in the housing 10. The first substrate transfer robot 22, the substrate transport device 24, the second substrate transfer robot 26, and the third substrate transfer robot 28 constitute a substrate transfer unit for transferring the wafer between the polishing units 14a to 14d, the cleaning units 16, 18, and the drying unit 20.
In this embodiment, the first cleaning unit 16 is a substrate cleaning apparatus configured to clean a wafer by scrubbing both a front surface and a rear surface of the wafer with roll sponges, in the presence of a chemical liquid. The second cleaning unit 18 is a substrate cleaning apparatus in which a pen-type sponge (pen sponge) is used. In one embodiment, the second cleaning unit 18 may be a substrate cleaning apparatus configured to clean a substrate by scrubbing both a front surface and a rear surface of the substrate with roll sponges in the presence of a chemical liquid. Further, the drying unit 20 is a spin drying apparatus configured to hold a substrate, eject IPA vapor from a moving nozzle to dry the substrate, and rotate the substrate at a high velocity to further dry the substrate.
The wafer is polished by at least one of the polishing units 14a to 14d. The polished wafer is cleaned by the first cleaning unit 16 and the second cleaning unit 18, and the cleaned substrate is then dried by the drying unit 20. In one embodiment, the polished wafer may be cleaned by either the first cleaning unit 16 or the second cleaning unit 18.
As shown in
The film thickness measuring device 8 is connected to a controller 30, and the controller 30 is configured to control operation of the film thickness measuring device 8. Further, the film thickness measuring device 8 transmits measured values thereof to the controller 30, and the controller 30 can acquire the film thickness profile of the wafer from the measured values transmitted from the film thickness measuring device 8.
The polishing unit 14a shown in
The polishing table 35 is coupled to a table motor 31 through a table shaft 35a, so that the polishing table 35 is rotated by this table motor 31 in a direction indicated by arrow. The table motor 31 is located below the polishing table 3. The polishing pad 33 is attached to an upper surface of the polishing table 35. The polishing pad 33 has an upper surface, which provides a polishing surface 33a for polishing the wafer. The polishing head 37 is coupled to a lower end of a head shaft 36. The polishing head 37 is configured to be able to hold the wafer on its lower surface by vacuum suction. The head shaft 36 is elevated and lowered by an elevating mechanism (not shown).
The head shaft 36 is rotatably supported to a head arm 42. The head arm 42 is actuated by a head pivot motor 54 to pivot on a head pivot shaft 43. When the head pivot motor 54 is set in motion, the polishing head 37 is moved between a polishing position above polishing pad 33, and a standby position beside the polishing pad 33.
The dressing apparatus 40 includes a dresser 41 which is brought into sliding contact with the polishing pad 33, a dresser shaft 45 to which the dresser 41 is coupled, a pneumatic cylinder 47 mounted to an upper end of the dresser shaft 45, and a dresser arm 27 for rotatably supporting the dresser shaft 23. A lower surface of the dresser 41 serves as a dressing surface 41a, and this dressing surface 41a is formed by abrasive grains (e.g., diamond particles). The pneumatic cylinder 47 is disposed on a support base 50 which is supported by a plurality of columns 51, and these columns 51 are fixedly mounted to the dresser arm 48.
The dresser arm 48 is actuated by a dresser pivot motor 55 to pivot on a dresser pivot shaft 49. The dresser shaft 45 is rotated about its own axis by an actuation of a motor (not shown), thus rotating the dresser 41 about the dresser shaft 45 in a direction indicated by arrow. The pneumatic cylinder 47 serves as an actuator for moving the dresser 41 vertically through the dresser shaft 45 and for pressing the dresser 41 against the polishing surface (surface) 33a of the polishing pact 33 at a predetermined force.
Next, an example of the polishing head 37, which is installed in the polishing unit 14a, will be described with reference to
The elastic membrane 5 has a plurality of (eight in the drawing) annular circumferential walls 5a, 5b, 5c, 5d, 5e, 5f, 5g and 5h which are arranged concentrically. These circumferential walls 5a to 5h form a circular central pressure chamber 7a located at a center of the elastic membrane 5, an annular edge pressure chamber 7h located at the outermost part of the elastic membrane 5, and six annular intermediate pressure chambers (i.e., first to sixth intermediate pressure chambers) 7b, 7c, 7d, 7e, 7f, and 7g located between the central pressure chamber 7a and the edge pressure chamber 7h. These pressure chambers 7a to 7h are located between an upper surface of the elastic membrane 5 and the lower surface of the head body 2. In the present embodiment, the number of pressure chambers formed by the elastic membrane 5 is eight, but the number of pressure chambers is not limited to the present embodiment. The number of pressure chambers may be, increased or decreased according to the configuration of the elastic membrane 5.
The head body 2 has fluid passages 4a, 4b, 4c, 4d, 4e, 4f, 4g, and 4h formed therein, which communicate with the pressure chambers 7a to 7h, respectively. These fluid passages 4a, 4b, 4c, 4d, 4e, 4f, 4g, and 4h are coupled to a pressure regulating device 65 through fluid lines 6a, 6b, 6c, 6d, 6e, 6f, 6g, and 6h, respectively. The pressure regulating device 65 is connected to the controller 30, and thus the controller 30 can control the pressure regulating device 65.
A retainer chamber 34 is formed immediately above the retaining ring 3. This retainer chamber 34 is coupled to the pressure regulating device 65 through a fluid passage 4i formed in the head body 2 and a fluid line 6i.
According to the polishing head 37 configured as shown in
With the retaining ring 3 pressing the polishing pad 33, a shape of the pad 33 changes in response to the pressing force thereof. Therefore, the pressing force of the retainer ring 3 on the polishing pad 33 is also a factor that affects the film thickness profile of the wafer W after polishing.
The retaining ring 3 is made of resin, such as engineering plastic (e.g., PEEK), and the elastic membrane 5 is made of a highly strong and durable rubber material, such as ethylene propylene rubber (EPDM), polyurethane rubber, silicone rubber, or the like.
Next, configuration of the pressure regulating device 65 shown in
Further, pressure release lines 151 to 159 are coupled to the fluid lines 6a to 6i, respectively. Pressure release valves L1 to L9 are attached to the pressure release lines 151 to 159, respectively.
The pressure regulators R1 to R9 have a pressure regulating function for regulating pressures of the pressurized fluid supplied from the fluid supply source 32 to the pressure chambers 7a to 7h and the retainer chamber 34, respectively. The pressure regulators R1 to R9, the opening and closing valves V1 to V9, and the pressure release valves L1 to L9 are connected to the controller 30, and thus operations of these pressure regulators and these, valves are controlled by the controller 30. When the pressure release valves L1 to L9 are operated, each of the chambers 7a to 7h and 34 are open to the atmosphere and become atmospheric pressure.
Although not shown in the drawing, vacuum lines are coupled to the fluid lines 6a to 6i, respectively, and negative pressure is formed in each chambers 7a to 7h and 34 through these vacuum lines. Thus, each chambers 7a, to 7h and 34 is regulated into one of pressurized state, negative pressure state, and atmospheric pressure state by the pressure regulating device 65.
When a vacuum is formed in any of the intermediate pressure chambers 7b to 7g (for example, the intermediate pressure chamber 7d) in a state where the wafer W is placed in contact with the lower surface of the elastic membrane 5, the wafer W is held by the polishing head 37 by vacuum attraction. Further, when the pressurized fluid is supplied to any of the intermediate pressure chambers 7b to 7g (for example, the intermediate pressure chamber 7d) in a state where the wafer W is separated from the polishing pad 33, the wafer W is released from the polishing head 37.
Next, a polishing method for polishing the wafer W using this polishing apparatus will be described.
Next, the controller 30 creates an optimized polishing recipe based on the film thickness profile of the wafer W before polishing and a response model (see STEP 2 in
In the following, a method of creating the optimized polishing recipe according to an embodiment is described.
First, the controller 30 calculates initial film thicknesses for each of monitored areas in the wafer W, corresponding to each of the pressure chambers 7a, to 7h of the elastic. membrane 5, from the film thickness profile of the wafer W before polishing, which has been acquired by the film thickness measuring device 8. Hereinafter, for convenience of explanation, the monitored area in the wafer W which corresponds to the pressure chamber 7a is represented as Da, and the initial film thickness in the monitored area Da is represented as Ta. Similarly, the monitored area in wafer W which corresponds to the pressure chamber 7b is represented as Db, and the initial film thickness in the monitored area Db is represented as Tb. The monitored area, in wafer W which corresponds to the pressure chamber 7c is represented as Dc, and the initial film thickness in the monitored area Dc is represented as Tc. The monitored area in wafer W which corresponds to the pressure chamber 7d is represented as Dd, and the initial film thickness in the monitored area Dd is represented as Td. The monitored area, in wafer W which corresponds to the pressure chamber 7e is represented as De, and the initial film thickness in the monitored area De is represented as Te. The monitored area in wafer W which corresponds to the pressure chamber 7f is represented as Df, and the initial film thickness in the monitored area Df is represented as Tf. The monitored area in wafer W which corresponds to the pressure chamber 7g is represented as Dg, and the initial film thickness in the monitored area Dg is represented as Tg. The monitored area in wafer W which corresponds to the pressure chamber 7h is represented as Dh, and the initial film thickness in the monitored area Dh is represented as Th.
The film thickness measuring device 8 measures the film thicknesses at a plurality of measurement points in each of the monitored areas Da to Dh, and then sends the measured values thereof to the controller 30. The controller 30 determines representative values of the plurality of measured film thickness values in each of the monitored areas Da to Dh as the initial film thicknesses Ta to Th. The representative value is, for example, the average of the plurality of measured film thickness values. Next, the controller 30 calculates differences between each of the initial film thicknesses Ta to Th and a target film thickness Tt, respectively, and calculates target polishing amounts Ra′ to Rh′ for each of the monitored areas Da to Dh.
Next, the controller 30 uses the response model stored in the controller 30 in advance to calculate predicted polishing amounts Ra to Rh in each of the monitored areas Da to Dh. The controller 30 calculates at least pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h and polishing time by use of optimization calculations, such that the predicted polishing amount Ra to Rh is close o the calculated target polishing amount Ra′ to Rh′, respectively.
In the optimization calculations, for example,: an objective function shown in a following equation (1) is used. In the optimization calculations, the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h and the retainer chamber 34, and the polishing time are calculated, which minimize the objective function including the differences between the predicted polishing amounts Ra to Rh and the target polishing amounts Ra′ to Rh′.
Objective function=Σ|Predicted polishing amount−Target polishing amount|2 (1)
Although the objective function shown in equation (1) takes into consideration only the polishing amounts, this embodiment is not limited to this example. For example, as shown in a following equation (2), it is also preferred that the objective function include a term for differences between the calculated optimum pressures of the pressurized fluid and the preset reference pressures of the pressurized fluid, and a term for differences between the calculated optimum pressures of the pressurized fluid and the optimum pressures of the pressurized fluid at the previous polishing of the wafer.
Objective function=Σ|Predicted polishing amount−Target polishing amount|2+λ·Σ|Calculated optimum pressure of pressurized fluid−Preset reference pressure of pressurized fluid|2+γ·Σ|Calculated optimum pressure of pressurized fluid−Optimum pressure of pressurized fluid at previous polishing|2 (2)
wherein, λ and γ are weight coefficients that determine weights of each term, and can be set to any real number greater than or equal to 0. The addition of these terms prevents the calculated optimum pressures of the pressurized fluid from changing significantly in each wafer to be processed, and thus enables stable optimum polishing conditions to be obtained.
The pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h correspond to the polishing pressures in each of the monitored areas Da to Dh. Further, the pressing force of the retainer ring 3 on the polishing pad 33 corresponds to the pressure of the pressurized fluid supplied to the retainer chamber 34.
In this embodiment, the monitored areas of film thickness in the wafer W correspond to each of the pressure chambers 7a to 7h of the elastic membrane 5, and are divided into a number of areas equal to the total number of pressure chambers of the elastic membrane 5 (see
In still another method of dividing the monitor areas into smaller areas, an entire surface of the wafer or each of the monitored areas Da to Dh may be divided into evenly sized areas (e.g., at 1 mm intervals), and each divided areas may be used as new monitored areas D1 to Dn (see
When determining the monitored areas of the film thickness in the wafer W, it is not necessary to select only one of the above described methods for setting the monitored areas, and a combination of several methods may be used. For example, in an inner area of wafer W (e.g., an area corresponding to the pressure chambers 7a to 7f), the monitored areas of the film thickness in the wafer W may be set to the monitor areas Da to Df corresponding to the pressure chambers 7a to 7f, and in an outer area of wafer W (e.g., an area corresponding to the pressure chambers 7g to 7h), the monitored areas of the film thickness in the wafer W may be set to the monitor areas corresponding to each measurement point MP of film thickness with the film thickness measuring device 8 (see
Next, a method of creating the response model will be described. The response model is created, for example, by experiments. In the experiments, first, wafers whose film thickness profiles have already been acquired by the thickness measuring device 8 are polished using a reference polishing recipe (i.e., a predetermined polishing pressures and a predetermined polishing time), and the film thickness profiles of the wafers after polishing are obtained by the film thickness measuring device 8. Next, a number of wafers, which are different from the wafers W polished with the reference polishing recipe, are polished while changing the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h of the elastic membrane 5 and to the retainer chamber 34 from the pressures of the reference polishing recipe. In these procedures, film thickness profiles of a number of wafers before and after polishing are obtained by the film thickness measuring device 8.
As described above, the film thickness measuring device 8 can measure the film thickness at the plurality of measurement points our the wafer W. Therefore, the controller 30 can calculate polishing rates at each measurement point from the film thickness profile of the wafer polished with the reference polishing recipe, and the film thickness profiles before and after polishing of a large number of wafers.
Next, the controller 30 calculates an amount of increase in a polishing rate per unit polishing pressure (e.g., 1 hPa) from the polishing rates at each measurement point in a large number of wafers. In the present specification, the amount of increase in the polishing rate per unit polishing pressure is referred to as the “response coefficient”. Further, an offset amount D is calculated such that the predicted polishing rate R, which is obtained by substituting the calculated response coefficient into an equation (3) described below, is equal to an actual polishing rate obtained by polishing of the wafer with the aforementioned reference polishing recipe.
Further, from the calculated response coefficients and the offset amounts at each measurement point, response coefficients and offset amounts in each of the aforementioned monitored areas are calculated by an interpolation process, and thus the response coefficients and the offset amounts in each monitored area are determined.
The response coefficients and offset amounts calculated in this manner are included in the response model. The controller 30 uses the response model to create an optimized polishing recipe. Specifically, the controller 30 uses a following equation (3) to calculate the predicted polishing amount R from the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h and the retainer chamber 34, and the polishing time.
R=Tp·(C·X+D) (3)
Further, the calculated predicted polishing amount R is substituted into equation (1) or equation (2) to thereby calculate objective function value. The controller 30 uses optimization calculations to obtain the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h and the polishing time that minimize the objective function value, thereby creating the optimized polishing recipe. Since the aforementioned equation (1) or (2) can be expressed in the form of a quadratic equation in X, the optimized polishing recipe can be uniquely determined by using quadratic programming method in the optimization calculation. As other optimization calculation method, a gradient method, such as the steepest descent method, or the Monte Carlo method can also be used.
In equation (3), R is a matrix consisting of the predicted polishing amount R1 to Rm in each of the monitored areas D1 to Dm of the wafer W, Tp is the polishing time, C is a matrix consisting of the response coefficients in each of the monitored areas of wafer W, X is a matrix consisting of the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h, and D is a matrix consisting of the offsets of each of the monitored areas D1 to Dm in the wafer W. The subscript “m” corresponds to the number of monitored areas in the wafer W.
R, C, D, and X are represented as matrices as shown in
The former of the two subscripts of the response coefficient in the matrix C corresponds to each of the monitored areas D1 to Dm in the wafer W, and the latter corresponds to the pressure of the pressurized fluid to be supplied to each of the pressure chambers 7a to 7h. For example, the response coefficient C2b is a response coefficient corresponding to the pressure of the pressurized fluid to be supplied to the pressure chamber 7b in the monitored area D2 of the wafer W, and the response coefficient C3c is a response coefficient corresponding to the pressure of the pressurized fluid to be supplied to the pressure chamber 7c in the monitored area D3 of the wafer W.
The controller 30 uses the optimization calculations described above to calculate the pressures Pa to Ph of the pressurized fluid to be supplied to each of the pressure chambers 7a to 7h of the elastic membrane 5 and the polishing time Tp, and thus uses these pressures Pa to Ph and the polishing time Tp as the optimized polishing recipe. In the optimized polishing recipe calculated in this manner, the polishing pressures of each of the pressure chambers 7a to 7h is determined utilizing the plurality of measured film thickness values in each of the monitored area D1 to Dm of the wafer W, and as a result, the film thickness profile of wafer W can be precisely controlled.
In one embodiment, an optimized polishing recipe, in which the pressure Pi of the pressurized fluid to be supplied to the retainer chamber 34 is added to the pressures Pa to Ph of the pressurized fluid to be supplied to each of the pressure chambers 7a to 7h in the elastic membrane 5 and the polishing time Tp, may be calculated. In this case, instead of the response matrix C described above, a matrix C as shown in
The response coefficients C1a to Cmh and the response coefficients C1i to Cmi can be determined as follows. First, in the experiments to determine the response coefficients C1a to Cmh, the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h is changed with the pressure of the pressurized fluid supplied to the retainer chamber 34 fixed to a predetermined value to calculate the response coefficients Ca1 to Cmh. Next, with the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h fixed at predetermined values, the pressure of the pressurized fluid supplied to the retainer chamber 34 is changed to obtain the response coefficients C1i to Cmi.
The optimized polishing recipe obtained using the matrix C′ of the response model described above takes into consideration not only variation in the amount of polishing between the monitored areas D1 to Dm due to variations in the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h, but also variation in the amount of polishing between the monitored areas D1 to Dm due to variation in the pressing force of the retaining ring 3 on the polishing pad 33. Therefore, the film thickness profile of wafer W can be controlled more precisely.
In one embodiment, the response model, i.e., the matrix C consisting of the response coefficients C1a to Cmh (or the matrix C′ consisting of C1i to Cmi) and the matrix D (or matrix D′) may be determined by simulations. In this case also, the matrix C (or the matrix C′) and the matrix D (or the matrix D′) obtained by simulations are stored in the controller 30 in advance.
Returning to
Dressing of the polishing pad 33 is performed as follows. The pure water is supplied from the polishing-liquid supplying nozzle 38 onto the polishing pad 33, while the dresser 41 is rotated about the dresser shaft 45. In this state, the dresser 41 is pressed against the polishing pad 33 by the pneumatic cylinder 47 to place the dressing surface 41a in sliding contact with the polishing surface 13a of the polishing pad 33. Further, the dresser arm 48 pivots around the dresser pivot shaft 49 to cause the dresser 41 to oscillate in a radial direction of the polishing pad 33. In this manner, the dresser 41 scrapes slightly the polishing pad 33 to thereby dress (or restore) the surface 33a of the polishing pad 33.
Next, the controller 30 causes the polished wafer W to be transferred to the first cleaning unit 16 and/or the second cleaning unit and to be cleaned, and then the cleaned wafer W to be transferred to the drying unit 20 to be dried. Further, the controller 30 causes the polished wafer W to be transferred to the film thickness measuring device 8 to obtain the film thickness profile of the polished wafer W (see STEP 4 in
Next, the controller 30 corrects the stored response model in order to create an optimized polishing recipe for polishing the next wafer W (see STEP 5 in
The controller 30 calculates an actual polishing amount Rac in each of the monitored areas D1 to Dm of the wafer W from the film thickness profiles before and after polishing of the wafer W which has already been polished. When calculating the actual polishing amount Rac, the controller 30 subtracts film thickness values after polishing T1′ to Tm′, which are representative values of a plurality of measured film thickness values corresponding to each of the monitored areas D1 to Dm of the wafer W after polishing, from the aforementioned initial film thickness values T1 to Tm in each of the monitored areas D1 to Dm, respectively. Each of the polishing film thickness values after polishing T1′ to Tm′ are, for example, an average value of the measured film thickness values in each of the monitored areas D1 to Dm, or each of thickness values at a representative point in each monitored area calculated by interpolation from the measured polishing film thickness values after polishing. The controller 30 subtracts the film thickness values after polishing T1′ to Tm′ from the initial film thicknesses T1 to Tm, respectively, thereby calculating the actual polishing amounts Rac1 to Racm in each of the monitored areas D1 to Dm.
Next, the controller 30 calculates a correction factor K such that the predicted polishing amount R in the above equation (3) and the actual polishing amount Rac satisfy the following equation (3).
Rac=K·R (4)
wherein Rac is a matrix consisting of the actual polishing amounts Rac1 to Racm in each of the monitored area D1 to Dm, and K is a matrix consisting of the correction factors corresponding to each of the monitored area D1 to Dm.
Next, as shown in following equations (5) and (6), the controller 30 calculates and stores a collected response coefficient matrix Cadj and a corrected offset amount matrix Dadj by multiplying the matrices C and D described above by K obtained from equation (4).
Cadj=K·C (5)
Dadj=K·D (6)
Next, the controller 30 causes the next wafer W to be transferred to the film thickness measuring device 8 and to obtain a film thickness profile of the next wafer W before polishing (see STEP 6 in
Next, the controller 30 utilizes a corrected predicted polishing amount Radj, which is calculated from a following equation (7), to calculate the pressures of the pressurized fluid supplied to each of pressure chambers 7a to 7h and the retainer chamber 34, and the polishing time by use of the optimization calculations described above.
Radj=Tp·(Cadj·X+Dadj) (7)
In the equation (7), Radj is a matrix consisting of the predicted polishing amount in each of the monitored areas D1 to Dm, and Tp is the polishing time of the next wafer W.
In this system, the response model corrected based on the film thickness profiles before and after polishing of the previous wafer W is used to create the optimized polishing recipe used for polishing the next wafer W. The film thickness profiles before and after polishing of the previous wafer W are data reflecting the condition (e.g., surface properties of the polishing pad 33) of the polishing unit where the wafer W was actually polished. Therefore, by using the response model corrected based on the film thickness profiles before and after polishing of the previous wafer W to create the optimized polishing recipe for polishing the next wafer W, the film thickness profile of the next wafer W can be controlled more precisely.
Next, the controller 30 causes the next wafer W to be polished with the created optimized polishing recipe (see STEP 8 in
Further, the controller 30 repeats STEPS 5 to 9 shown in
In one embodiment, the controller 30 may be connected to an arithmetic unit 70 (see dotted line in
When polishing a next wafer W, the controller 30 sends the film thickness profile of the next wafer W before polishing to the arithmetic unit 70. The arithmetic unit 70 creates an optimized polishing recipe for polishing the next wafer W based on a target polishing amount, which is the difference between the film thickness profile of the next wafer W before polishing and the target film thickness, and the corrected response model. The arithmetic unit 70 sends the created optical polishing recipe to the controller 30. The controller 30 uses the sent optimized polishing recipe to polish the next wafer W. Further, the controller 30 sends the film thickness profile of the next wafer W after polishing to the arithmetic unit 70, and the arithmetic unit 70 stores the film thickness profiles of the next wafer W before and after polishing, and the optimized polishing recipe used to polish the next wafer W. The arithmetic unit 70 utilizes the film thickness profiles of the next wafer W before and after polishing and the optimized polishing recipe used to polish the next wafer W for correcting the response model to be used to polish a further next wafer W.
The polishing head 37 shown in
The polishing head 37 further includes a rotary ring 71 having a plurality of rollers disposed therein, and a stationary ring 81. The rotary ring 71 is fixed to an upper surface of the retainer ring 3, and is configured to be rotatable together with the retainer ring 3. The stationary ring 81 is located on the rotary ring 71. The rotary ring 71 rotates together with the retainer ring 3, while the stationary ring 81 does not rotate and remains stationary.
The polishing unit 14a includes a plurality of local-load exerting devices for applying a local load to a part of the retainer ring 3. In an example illustrated in the drawing, the polishing unit 14a includes two local-load exerting devices, i.e., a first local-load exerting device 83A and a second local-load exerting device 83B. The local-load exerting devices 83A, 83B are located above the retainer ring 3. The local-load exerting devices 83A, 83B are fixed to the head arm 42 (see
The local-load exerting devices 83A, 83B include pressing members 84A, 84B each for applying a downward local load to the stationary ring 81, bridges 85A, 85B, air cylinders 86A, 86B each for generating a downward force, pressure regulators R10, R11 for regulating pressures of compressed fluids in the air cylinders 86A, 86B, linear guides 87A, 87B, guide roofs 88A, 88B, and unit bases 89A, 89B.
Specifically, the first local-load exerting device 83A includes the first pressing member 84A the first bridge 85A, the first air cylinder 86A, the first pressure regulator R10, the first linear guide 87A, the first guide rod 88A, and the first unit base 89A. The second local-load exerting device 83B includes the second pressing member 84B, the second bridge 85B, the second air cylinder 86B, the second pressure regulator R11, the second linear guide 87B, the second guide rod 88B, and the second unit base 89B.
A piston rod 101a of the first air cylinder 86A is coupled to the first pressing member 84A through the first bridge 85A, and an end portion of the first pressing member 84A is coupled to the stationary ring 81. Therefore, the force generated by the first air cylinder 86A is transmitted to the first pressing member 84A, and the first pressing member 84A applies the local load to a part of the stationary ring 81. Similarly, a piston rod 101b of the second air cylinder 86B is coupled to the second pressing member 84B through the second bridge 85B, and an end portion of the second pressing member 84B is coupled to the stationary ring 81. Therefore, the force generated by the second air cylinder 86B is transmitted to the second pressing member 84B, and the second pressing member 84B applies the local load to a part of the stationary ring 81.
In this embodiment, a combination of the tint air cylinder 86A and the first pressure regulator R10 constitutes a first actuator 90A for regulating the local load applied from the first pressing member 84A to the stationary ring 81, and a combination of the second air cylinder 86B and the second pressure regulator R11 constitutes a second actuator 90B for regulating the local load applied from the second pressing member 84B to the stationary ring 81. In one embodiment, the first actuator 37A and the second actuator 37B may be each composed of a combination of a servomotor, a ball screw mechanism, and a motor driver.
The first pressing member 84A includes two push rods 103a, and the second pressing member 84B includes two push rods 103b. The push rods 103a and the push rods 103b are coupled to the stationary ring 81. The first pressing member 84A is configured to apply the local load to the upstream portion of the stationary ring 81 with respect to the moving direction of the polishing surface 33a of the polishing pad 33, and the second pressing member 84B is configured to apply the local load to the downstream portion of the stationary ring 81 with respect to the moving direction of the polishing surface 33a of the polishing pad 33.
The local-load exerting devices 83A, 83B are fixed to the head arm 42 (see
The local-load exerting devices 83A, 83B have the same construction. The following descriptions relate to the first local-load exerting device 83A, but are applied to the second local-load exerting device 83B as well. The first air cylinder 86A and the first linear guide 87A are mounted to the first unit base 89A. The piston rod 101a of the first air cylinder 86A and the first guide rod 88A are coupled to the first bridge 85A. The first guide rod 88A is vertically movably supported by the first linear guide 87A with low friction. The first linear guide 87A allows the first bridge 85A to move smoothly in the vertical directions without being inclined.
The air cylinders 86A, 86B are coupled to a compressed-gas supply source (see
The pressure regulators R10, R11 can regulate independently the pressures of the compressed gases in the air cylinders 86A, 86B, so that the air cylinders 86A, 86B can generate the forces independently of each other.
The pressure regulators R10, R11 are electrically connected to the controller 30 shown in
The forces generated by the air cylinders 86A, 86B are transmitted to the bridges 85A, 85B, respectively. The bridges 85A, 85B are coupled to the stationary ring 81 through the pressing members 84A, 84B. The pressing members 84A, 84B transmit the forces of the air cylinders 86A, 86, applied to the bridges 85A, 85B, to the stationary ring 81. Specifically, the first pressing member 84A presses a part of the stationary ring 81 with a local load corresponding to the force generated by the fast air cylinder 86A, and the second pressing member 84B presses a part of the stationary ring 81 with a local load corresponding to the fore generated by the second air cylinder 86B.
Each of the local-load exerting devices 83A, 83B is configured to exert the downward local load on a part of the retainer ring 3 through the stationary ring 81 and the rotary ring 71. Specifically, the downward local load is transmitted through the stationary ring 81 and the rotary ring 71 to the retainer ring 3.
The polishing apparatus polishes the wafer W while rotating the rotary ring 71, which is fixed to the retainer ring 3, together with the retainer ring 3 and applying the local load to the stationary ring 81 from the first pressing member 84A or the second pressing member 84B. During polishing of the wafer W, the rotating retainer ring 3 contacts the polishing surface 33a of the polishing pad 33, while pressing the polishing pad 33 outside the wafer W and applying the downward local load on a part of the polishing surface 33a.
Therefore, the wafer W is polished, while the local load is applied from either the first pressing member 84A or the second pressing member 84B to the stationary ring 81, to thereby generate the local repulsive force corresponding to the local load, so that a polishing rate of a portion of the wafer receiving the local repulsive force can be changed. For example, when the local load applied by the first pressing member 84A is to be increased, the controller 30 instructs the pressure regulator R10 to increase the pressure of the compressed gas in the air cylinder 86A, while, when the local load applied by the second pressing member 84B is to be increased, instructing the pressure regulator R11 to increase the pressure of the compressed gas in the air cylinder 86B.
Accordingly, the local loads applied to the retaining ring 3 by the local-load exerting devices 83A, 83B (in this embodiment, corresponding to the pressures of the compressed gas supplied to the air cylinders 86A, 86B) become also factors that affects the film thickness profile of the wafer W after polishing.
Therefore, in this embodiment, the controller 30 (see
The optimized polishing recipe obtained in this manner takes into consideration variation in the amount of polishing between the monitored areas D1 to Dm due to variation in the pressure of the pressurized fluid supplied to each of the pressure chambers 7a to 7h, variation in the amount of polishing between the monitored areas D1 to Dm due to variation in the pressing force of the retainer ring 3 on the polishing pad 33, and variation in the amount of polishing between the monitored areas D1 to Dm due to variation in the local load. Therefore, the film thickness profile of wafer W can be controlled more precisely.
Further, in the corrected response model also, used to calculate the optimized polishing recipe for polishing the next wafer W, the variation in the amount of polishing between the monitored areas D1 to Dm due to the variation in the pressure of the pressurized fluid supplied to each of the pressure chambers 7a to 7h, the variation in the amount of polishing between the monitored areas D1 to Dm due to the variation in the pressing force of the retainer ring 3 on the polishing pad 33, and the variation in the amount of polishing between the monitored areas D1 to Dm due to the variation in the local load are taken into account. Therefore, the film thickness profile of the next wafer W to be polished with the optimized polishing recipe created based on the corrected response model can also be controlled more precisely.
In
The double-dotted graph in
As can be seen from
Further, it has been found that the in-plane uniformities of Cases 2 and 3 are better than the in-plane uniformity of Case 1. Therefore, it can be seen that, by using the response model which is created taking into account at least the changes in the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h and the change in the pressure of the pressurized fluid supplied to the retainer chamber 34, the film thickness profile can be controlled precisely.
As shown in
Each time polishing of the wafer W is performed, the controller 30 sends the film thickness profiles of the wafer W before and after polishing and the optimized polishing recipe used for polishing this wafer W to the arithmetic unit 70. Each time a combination of the film thickness profiles of the wafer W before and after polishing and the optimized polishing recipe are sent from the controller 30, the arithmetic unit 70 corrects the response model using the film thickness profiles of the wafer W before and after polishing and the optimized polishing recipe. Further, the arithmetic unit 70 classifies and accumulates a combination of the film thickness profiles of wafer W before and after polishing and the corrected response model into any one of the plurality of groups.
For classification by use of the film thickness before polishing, an index of conformity with shape, for example, can be used, which is obtained by calculations. Examples of the index of conformity with shape include absolute mean, root mean square, difference in average film thicknesses, correlation coefficient, and GOF (Good of Fitting) value. The index of conformity with shape is an index for determining a shape agreement (similarity) between two film thickness profiles, and can be calculated by any one of the following equations (8) to (11), where the first film thickness profiles are T1 to Tm, the average value of T1 to Tm is Tave, the second film thickness profiles are T′1 to T′m and the average value of T′1 to T′m is T′ave.
Difference of in average film thicknesses=|Tave−T′ave| (10)
GOF is a commonly used index of agreement between two profiles. The smaller the value of absolute mean, root mean square, and difference in average film thicknesses, the higher the shape agreement (shape similarity), while the larger the value of correlation coefficient and GOF, the higher the shape agreement.
The arithmetic unit 70 classifies the film thickness profile by use of at least one of these indices of conformity with shape. Specifically, the arithmetic unit 70 calculates advance the film thickness profiles representing each group, calculates the index of conformity with shape between the film thickness profile before polishing and the film thickness profile representing each group, and classifies the film thickness profile before polishing into a group which has a shape agreement exceeding a preset threshold value, and has the highest shape agreement. The film thickness profiles representing each group can be, for example, average film thickness profiles averaged from the film thickness values at each measurement point of the film thickness profiles classified into each group.
When no group has the shape agreement exceeding the threshold value, i.e., when no group is to be classified based on the index of conformity with shape, the arithmetic unit 70 creates a new group. This process creates a plurality of groups in which film thickness profiles similar to each other are collected.
In one embodiment, a machine learner (not shown) may be provided in the arithmetic unit 70, and the machine learner may be used to classify the film thickness profile of the wafer W before polishing. In this case, the film thickness profile of the wafer W before polishing is input to the machine learner. The machine learner outputs the group to which the input film thickness profile belongs. When the machine learner deter lines that there is no group to which the input film thickness profile belongs, the machine learner outputs an instruction for creating a new group to the arithmetic unit 70.
Next, the controller 30 causes the wafer W to be taken out from the substrate cassette placed on the load port 12 (see
When selecting the group, the arithmetic unit 70 uses the index of conformity with shape described above. Specifically, the arithmetic unit 70 calculates the index of conformity with shape for each group using the received film thickness profiles and the representative film thickness profile of each group, and selects the group which has the shape agreement exceeding the threshold value, and has the highest shape agreement as the group to which the film thickness profile is to belong.
Next, the arithmetic unit 70 creates the optimized polishing recipe using the response model of the group to which the film thickness profile belongs (see STEP 5 in
When, in STEP 4, no group has the shape agreement exceeding the threshold value, the arithmetic unit 70 selects the group which has the threshold value whose index of conformity with shape is highest, and utilizes the response model of this group to create the optimized polishing recipe. Further, the arithmetic unit 70 creates a new group to which this film thickness profile belongs.
Next, the arithmetic unit 70 sends the created optimized polishing recipe to the controller 30, and the controller 30 performs polishing of the wafer W based on the receiving polishing recipe (see STEP 6 in
Next, the controller 30 causes the polished wafer W to be transferred to the first cleaning unit 16 and/or the second cleaning unit 18 and to be cleaned, and then the cleaned wafer W to be transferred to the drying unit 20 and to be dried. Further, the controller 30 causes the polished wafer W to be transferred to the film thickness measuring device 8 to obtain the film thickness profile of the polished wafer W (see STEP 7 in
Processes described in STEPS 9 to 12 in
According to this embodiment, the optimized polishing model for the first wafer is created using the optimized polishing recipe and the response model that belong to the group having a similar film thickness profile to that of the first wafer W. Therefore, the film thickness profile of the first wafer W can be precisely controlled.
Configuration of the polishing unit 14b shown in
During polishing of the wafer W, the film thickness sensor 52 rotates together with the polishing table 35, and obtains the film-thickness signal while sweeping across the surface of the wafer W as shown in symbol A. This film thickness signal comprises an index value that directly or indirectly indicates the film thickness of the wafer W. The film thickness signal varies as the film thickness of the wafer W decreases. The film thickness sensor 52 is coupled to the controller 30, so that the film thickness signal is sent to the controller 30. When the film thickness of the wafer W, indicated by the film-thickness signal, reaches a predetermined target thickness, the operation controller 40 instructs the polishing head 1 and the polishing table 2 to terminate polishing of the wafer W. The controller 30 can obtain the film thickness profile of wafer W from the film thickness signals sent from the film thickness sensor 52.
As semiconductor devices become more highly integrated and densified, wiring composed of a multilayer structure is formed on the wafer W. Therefore, if polishing of the wafer W is executed with one polishing unit until the desired film is exposed, the polishing time becomes longer, resulting in wafer defects due to the rise in polishing temperature and deposition of by-products on the polishing pad, or reducing in a flatness of the surface of the wafer. Depending on a film type of the multilayer structure formed in the wafer W, several polishing processes may be performed over two or more polishing units. For example, polishing a metal film, which is a top layer of wafer W, with the first polishing unit, and then a dielectric film layer formed under the metal film may be polished with the second polishing unit.
If, in a case where several polishing processes are performed sequentially in this manner, the film thickness profile after polishing is obtained with the film thickness measuring device 8 as shown in
As shown in
When the first polishing of the wafer W is terminated, the controller 30 causes a water-polish, which polishes the wafer W while supplying pure water onto the polishing pad 33 on the polishing table 35, to be performed, and obtains, during this water-polishing, the thickness profile of the first polished wafer W by use of the film thickness sensor 52 (see STEP 2 in
Next, the controller 30 causes the first polished wafer W to be transferred to the second polishing unit (e.g., polishing unit 14a), which is other than the first polishing unit, for performing the second polishing (STEP 3 in
Next, the controller 30 causes the second polishing of the wafer W to be performed in accordance with the optimized polishing recipe for the second polishing in the second polishing unit to which the wafer W has been transferred in STEP 3 (see STEP 5 in
Next, the controller 30 causes the next wafer W to be transferred to the first polishing unit, and performs the first polishing of the next wafer W (see STEP 7 in
Next, the controller 30 causes the next wafer W after the first polishing to be transferred to the second polishing unit to performs the second polishing of the next wafer W (see STEP 9 in FIG.16). At this time, the controller 30 corrects the response model for second polishing in order to create the optimized polishing recipe for the second polishing (see STEP 9 in
Specifically, the controller 30 calculates the actual polishing amounts Rac of the second polishing in each of the monitored area D1 to Dm from the thickness profiles before and after the second polishing, of the wafer W that has already been first polished and second polished, and then calculates the correction factor K such that the predicted polishing amount R in the above equation (3) and the actual polishing amount Rac satisfy the above-described equation (4). Next, the controller 30 multiplies the above-described matrices C and D by K obtained from the equation (4) to calculate and store the corrected response coefficient matrix Cadj and the corrected offset amount matrix Dadj through the above-described equations (5) and (6).
Next, the controller 30 calculates, by use of the above-described optimization calculations, the optimized polishing recipe for the second polishing of the next wafer W, including at least the pressures of the pressurized fluid supplied to each of the pressure chambers 7a to 7h and the retainer chamber 34, and the polishing time, from the calculated Cadj and Dadj, and the target polishing amount R′ in the second polishing of the next wafer W.
Next, the controller 30 performs the second polishing of the next wafer W with the optimized polishing recipe for the second polishing, which has been calculated (see STEP 12 in
Further, the controller 30 repeats STEPS 7 to 13 shown in
According to this embodiment, even when several polishing processes are required, the film thickness profile of wafer W can be precisely controlled, while minimizing the reduction in throughput.
In the polishing method shown in the flowchart of
Next, the controller 30 creates the optimized polishing recipe for the first polishing of the wafer W based on the an thickness profile of wafer W before the first polishing and the response model described above (see STEP 2 in
Next, before performing the second polishing, the controller 30 obtains the film thickness profile of the wafer W after the first polishing by use of the film thickness sensor 52, while performing the water-polishing described above (see STEP 5 in
Next, the controller 30 causes the next wafer W to be transferred to the film thickness measurement device 8, and obtains the film thickness profile of the next wafer W before the first polishing (see STEP 7 in
Next, the controller 30 causes the next wafer W to be first-polished with the created optimized polishing recipe (see STEP 10 in
Further, the controller 30 repeats STEPS 7 to 13 shown in
In this embodiment also, the film thickness profile of the wafer W can be precisely controlled while minimizing the reduction in throughput even when several polishing processes are required.
As shown in
Next, the controller 30 creates the optimized polishing recipe for the first polishing of the wafer W, based on the film thickness profile of the wafer W before the first polishing and the response model for the first polishing described above (see STEP 2 in
Next, the controller 30 starts the water-polishing again after terminating of the first polishing, and obtains the film thickness profile of the wafer W after the first polishing during the water-polishing (see STEP 4 in
When the second polishing is terminated, the controller 30 starts the water-polishing, and obtains the film thickness profile of the wafer W after the second polishing during the water-polishing (see STEP 7 in
Next, the controller 30 causes the next wafer W to be transferred to the polishing unit, and obtains the film thickness profile of the next wafer W before the first polishing by use of the film thickness sensor 52 (see STEP 8 in
Next, the controller 30 causes the next wafer W to be first-polished with the optimized polishing recipe for the first polishing, which has been corrected (see STEP 11 in
Next, the controller 30 corrects the response model for the second polishing by use of the optimized polishing recipe for the second polishing of the wafer W and the film thickness profiles before and after the second polishing (see STEP 13 in
According to this embodiment, a plurality of polishing processes can be performed in a single polishing unit, enabling precise control of the film thickness profiles while minimizing the throughput reduction as much as possible. Furthermore, the film thickness measuring device 8 (see
The previous description of embodiments is provided to enable a person skilled in the art to make and use the present invention. Moreover, various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles and specific examples defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the embodiments described herein but is to be accorded the widest scope as defined by limitation of the claims.
Number | Date | Country | Kind |
---|---|---|---|
2021-031850 | Mar 2021 | JP | national |