1. Field of the Invention
The present invention relates to an apparatus and method for polishing an object material while estimating a removal amount of the object material using a model equation.
2. Description of the Related Art
An interlevel dielectric having a low dielectric constant is an essential technology for a high-density multi-level interconnect structure. This is because a smaller distance between layered metal interconnects results in a larger line-to-line capacitance, which causes a delay in signal transmission through the interconnects. Thus, there has recently been a trend to use a low-k material having a low dielectric constant as the interlevel dielectric. The low-k material has an advantage of having a low dielectric constant, but on the other hand, the low-k material has low mechanical strength and is relatively easily removed from a substrate. Thus, in order to prevent removal of the low-k material, a hard mask film may be formed on the low-k material.
When forming a new multilayer structure on the multilayer structure shown in
There are several techniques for monitoring a film thickness during polishing, such as a method using an optical sensor and a method using an eddy current sensor. However, the hard mask film is generally as thin as 50 nm to 60 nm, and this film is an oxide film. Consequently, it is difficult to accurately monitor a change in thickness of the hard mask film using these polishing end point detection techniques.
The present invention has been made in view of the above drawbacks. It is therefore an object of the present invention to provide a polishing apparatus and polishing method capable of polishing an object material while accurately monitoring a change in thickness of the object material.
One aspect of the present invention provides a polishing apparatus including a polishing table for holding a polishing pad having a polishing surface, a motor configured to drive the polishing table, a holding mechanism configured to hold a substrate having an object material to be polished and to press the substrate against the polishing surface, a dresser configured to dress the polishing surface, and a monitoring unit configured to monitor a removal amount of the object material. The monitoring unit is operable to calculate the removal amount of the object material using a model equation containing a variable representing an integrated value of a torque current of the motor when polishing the object material and a variable representing a cumulative operating time of the dresser.
In this specification, the removal amount means an amount by which a thickness of the object material is reduced.
In a preferred aspect of the present invention, the object material comprises a film that belongs to one of levels of a multi-level interconnect structure, and the model equation contains variables representing a level number to which the film belongs.
In a preferred aspect of the present invention, the level number is a level number of a group composed of plural levels having structures similar to each other.
In a preferred aspect of the present invention, the model equation is a multiple regression equation created from a multiple regression analysis on data including removal amounts of the object material on plural substrates polished, integrated values of the torque current, cumulative operating times of the dresser, and level numbers.
Another aspect of the present invention provides a method for polishing a substrate using a polishing apparatus having a polishing pad with a polishing surface, a polishing table holding the polishing pad, a motor configured to drive the polishing table, a holding mechanism configured to hold a substrate having an object material to be polished and to press the substrate against the polishing surface, and a dresser configured to dress the polishing surface. The method includes creating a model equation for calculating a removal amount of the object material, the model equation containing a variable representing an integrated value of a torque current of the motor and a variable representing a cumulative operating time of the dresser, polishing the object material by bringing the object material into sliding contact with the polishing surface, and calculating the removal amount of the object material by substituting the cumulative operating time of the dresser and the integrated value of the torque current of the motor when polishing the object material into the model equation.
According to the present invention, the removal amount can be estimated accurately using the model equation. Therefore, polishing can be stopped at a desired time point.
Embodiments of the present invention will be described below with reference to the drawings.
The inventors have studied effects of a cumulative operating time of a dresser (or a conditioner), which is to perform dressing (conditioning) of a polishing surface of a polishing pad, on a polishing rate (i.e., a removal rate). As a result, the inventors have discovered that there is a correlation between the cumulative operating time of the dresser and the polishing rate.
In general, a dresser has a longer lifetime than a polishing pad. Therefore, it is normal that plural polishing pads are replaced with new polishing pads before a dresser is replaced with a new dresser.
The polishing table 12 is coupled to the motor 30 via a rotational shaft, and is rotatable about its own axis as indicated by arrow. A polishing liquid supply nozzle (not shown) is disposed above the polishing table 12, so that a polishing liquid is supplied from the polishing liquid supply nozzle onto the polishing surface 10a of the polishing pad 10.
The top ring 14 is coupled to a top ring shaft 18, which is coupled to a motor and an elevating cylinder (not shown). The top ring 14 can thus be moved vertically and rotated about the top ring shaft 18. The substrate is attracted to and held on a lower surface of the top ring 14 by a vacuum attraction or the like.
With the above-described structures, the substrate W, held on the lower surface of the top ring 14, is rotated and pressed by the top ring 14 against the polishing surface 10a of the polishing pad 10 on the rotating polishing table 12. The polishing liquid is supplied from the polishing liquid supply nozzle onto the polishing surface 10a of the polishing pad 10. The object material on the substrate W is thus polished in the presence of the polishing liquid between the substrate W and the polishing surface 10a. In this embodiment, the polishing table 12 and the top ring 14 constitute a mechanism of providing relative motion between the substrate W and the polishing pad 10.
The object material is an interconnect metal film (e.g., a Cu film), a barrier film, and a hard mask film which constitute a multi-level interconnect structure on the surface of the substrate W (see
The monitoring unit 53 is configured to acquire a value of a torque current of the motor 30 and to calculate an integrated value of the torque current.
The approximate removal amount can also be obtained by an integrated value of a temperature of the polishing pad 10, instead of the torque current.
In this embodiment, the interconnect film and the barrier film, each of which is a conductive film, are polished while each thickness (i.e., the removal amount) is monitored by the monitoring unit 53 based on the output signal of the eddy current sensor 50. On the other hand, the hard mask film, which is an oxide film, is polished while an estimated removal amount thereof is monitored by the monitoring unit 53. The estimated removal amount is calculated using a model equation which will be discussed below.
The model equation is a relational expression containing variables that represent the cumulative operating time of the dresser 20, the integrated value of the torque current, and a level number to which the hard mask film (the object of polishing) belongs. Specifically, the model equation is expressed as follow.
This model equation is a multiple regression equation, wherein Y is a response variable (or dependent variable) representing the estimated removal amount of the hard mask film, a0 through an are partial regression coefficients, and X1 through Xn are explanatory variables.
In the above model equation, X1 through Xn-2 are dummy variables which are used to quantify a qualitative variable, i.e., a level number to which the hard mask film belongs. Specifically, X1 through Xn-2 are 0 or 1, so that combinations of 0 and 1 represent the level number. For example, when the hard mask film, which is the object to be polished, belongs to a first level, Xi is 1, and X2 through Xn-2 are 0. Similarly, when the hard mask film belongs to a second level, X2 is 1, and X1, X3 through Xn-2 are 0. When the hard mask film belongs to an n−1th level, X1 through Xn-2 are all 0.
In this manner, the total number of dummy variables introduced in the model equation is smaller by one than the total number of levels constituting the multi-level interconnect structure. In this embodiment, the levels are consecutively numbered such that a first level, a second level, a third level, . . . , an n−1th level are allotted in the order from a lower level to an upper level. In the above-described model equation, the variable Xn-1 is a quantitative variable representing the cumulative operating time of the dresser 20, the variable Xn is a quantitative variable representing the integrated value of the torque current, and the partial regression coefficients a0 through an are coefficients given in advance by multiple regression analysis.
When forming the multi-level interconnect structure, the interconnect metal film, the barrier film, the hard mask film, and the like are formed in each level, and these films are polished to form a flat surface. Generally, when polishing the multi-level interconnect structure, a polishing rate (removal rate) slightly varies depending on the level the film belongs to, even if the same kind of film is polished. For example, in a case of polishing a six-level interconnect structure, a polishing rate of a hard mask film in a first level is different from a polishing rate of a hard mask film in a sixth level. In other words, there is a correlation between the polishing rate and the level. Therefore, by reflecting the level number, to which the hard mask film belongs, in the model equation, more accurate removal amount can be estimated.
As an example, when a multi-level interconnect structure is composed of six levels, the above-described model equation (1) is expressed as follow.
Y=a0+a1·X1+a2·X2+a3·X3+a4·X4+a5·X5+a6·X6+a7·X7 (2)
In this equation (2), the variables X1 through X5 are the dummy variables representing what level the hard mask film belongs to, the variable X6 is the quantitative variable representing the cumulative operating time of the dresser 20, and the variable X7 is the quantitative variable representing the integrated value of the torque current.
In this example, when the hard mask film, which is the object to be polished, belongs to the first level, X1 is 1, and X2 through X5 are 0. When the hard mask film belongs to the second level, X2 is 1, and X1, X3 through X5 are 0. When the hard mask film belongs to the third level, X3 is 1, and X1, X2, X4, X5 are 0. When the hard mask film belongs to the fourth level, X4 is 1, and X1 through X3, X5 are 0. When the hard mask film belongs to the fifth level, X5 is 1, and X1 through X4 are 0. When the hard mask film belongs to the sixth level, X1 through X5 are 0. In this manner, the level number, which is the qualitative variable, is quantified.
The partial regression coefficients a0 through an are given by the multiple regression analysis as follows. First, data of the above-described response variables and explanatory variables obtained by polishing multi-level interconnect structures on plural substrates are prepared. More specifically, data including removal amounts (actual removal amounts) of the hard mask films, the level numbers to which these hard mask films belong, the cumulative operating times of the dresser 20, and the integrated values of the torque current used in polishing of the hard mask films are prepared. These data are inputted to the monitoring unit 53. Then, the monitoring unit 53 calculates the partial regression coefficients a0 through an from the data using formulas of the multiple regression analysis. The calculation of the partial regression coefficients may be conducted by another device and the resultant partial regression coefficients may be inputted to the monitoring unit 53. The formulas of the multiple regression analysis are known in the art, as disclosed in “Introduction of Multivariate Analysis” (by Yasushi Nagata, etc., published by SAIENSU-SHA Co. Ltd., Japan).
Next, processing flow for obtaining the removal amount of the hard mask film using the above-described model equation will be described. First, the level number to which the hard mask film (i.e., the object to be polished) belongs is inputted into the monitoring unit 53 from the control unit 54, so that the value (0 or 1) of each of the variables X1 through Xn-2 is determined. Further, the cumulative operating time of the dresser 20 is inputted into the monitoring unit 53 from the controller 54, so that the value of the variable Xn-1 is determined.
During polishing of the hard mask film, the monitoring unit 53 calculates the integrated value of the torque current at certain time intervals, and substitutes the resultant value for the variable Xn of the model equation. Thus, the estimated removal amount, i.e., the response variable of the model equation, increases according to an increase in the value of the variable Xn. When the estimated removal amount reaches a preset target value, the monitoring unit 53 sends a polishing end point signal to the control unit 54. Upon receiving this polishing end point signal, the control unit 54 stops the polishing operation.
After polishing, an actual removal amount is measured using a film-thickness measuring device (not shown in the drawing) installed in the polishing apparatus. The actual removal amount measured is stored as data together with the estimated removal amount calculated, the level number, the cumulative operating time of the dresser 20, and the integrated value of the torque current, in the monitoring unit 53. The monitoring unit 53 calculates a difference between the estimated removal amount and the actual removal amount. If the difference is larger than a first threshold, the monitoring unit 53 recalculates the partial regression coefficients a0 through an from the newly obtained data so as to update (or renew) the model equation. If the difference is larger than a second threshold (>the first threshold), the monitoring unit 53 judges that a polishing failure has occurred, and produces an alarm.
The larger total number of partial regression coefficients requires the larger number of data to be prepared for calculating the partial regression coefficients. In other words, if the total number of partial regression coefficients can be reduced, the data to be prepared can also be reduced. Therefore, plural levels having similar structures may be grouped into a single level in the multi-level interconnect structure. For example, in the six-level interconnect structure, the first level and the second level, which have structures similar to each other, may be grouped into a first level, the third level and the fourth level, which have structures similar to each other, may be grouped into a third level, and the fifth level and the sixth level, which have structures similar to each other, may be grouped into a fifth level. In this case, the above-described equation (2) is expressed as follow.
Y=a0+a1·X1+a2·X2+a3·X3+a4·X4 (3)
In this equation, the dummy variables are X1 and X2. When the hard mask film, which is the object to be polished, belongs to the first level or second level, X1 is 1, and X2 is 0. When the hard mask film belongs to the third level or fourth level, X2 is 1, and X1 is 0. When the hard mask film belongs to the fifth level or sixth level, X1 and X2 are 0. The variable X3 represents the cumulative operating time of the dresser, and the variable X4 represents the integrated value of the torque current.
As described above, according to this embodiment, an accurate removal amount can be estimated. Hence, polishing can be stopped when a desired removal amount is reached.
The previous description of embodiments is provided to enable a person skilled in the art to make and use the present invention. Moreover, various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles and specific examples defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the embodiments described herein but is to be accorded the widest scope as defined by limitation of the claims and equivalents.
Number | Date | Country | Kind |
---|---|---|---|
2007-283039 | Oct 2007 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
6231425 | Inaba et al. | May 2001 | B1 |
6416617 | Yoshida et al. | Jul 2002 | B2 |
6517412 | Lee et al. | Feb 2003 | B2 |
7108580 | Ushio et al. | Sep 2006 | B2 |
7475368 | Angyal et al. | Jan 2009 | B2 |
Number | Date | Country |
---|---|---|
2004-106123 | Apr 2004 | JP |
2004-158506 | Jun 2004 | JP |
2005-342841 | Dec 2005 | JP |
2004-035265 | Apr 2004 | WO |
Number | Date | Country | |
---|---|---|---|
20090111358 A1 | Apr 2009 | US |