1. Field of Invention
The present invention relates to an advanced process control (APC) system and an APC method. More particularly, the present invention relates to an APC system and an APC method utilizing virtual metrology (VM) with a reliance index (RI).
2. Description of Related Art
Run-to-run (R2R) advanced process control (APC) is widely applied in semiconductor and TFT-LCD factories for improving process capability. As defined in the SEMI E133 specification, R2R control is the technique of modifying recipe parameters or the selection of control parameters between runs to improve processing performance. A (process) run can be a batch, a lot, or an individual workpiece, wherein the R2R APC becomes a lot-to-lot (L2L) APC when a run is a lot, and the R2R APC becomes a workpiece-to-workpiece (W2W) APC when a run is a workpiece. A workpiece may represent a wafer for the semiconductor industry or a glass substrate for the TFT-LCD industry. The L2L APC is now widely implemented for dealing with advanced technologies. When L2L control is applied, only one single workpiece in the lot is required to be measured for feedback and feedforward control purposes. However, as device dimensions shrink further, tighter process control is needed. In this case, the L2L control may not be accurate enough and therefore a W2W control becomes essential for critical stages. As a result, each workpiece in the lot should be measured. To fulfill this requirement, a large number of metrology tools will be required and production cycle time will also be increased significantly. Furthermore, metrology delays, which are inevitable when real measurements are performed, will not only cause complicated control problems but also degrade the APC performance.
To resolve the problem mentioned above, virtual metrology (VM) was proposed. Virtual metrology is a technology using a conjecture model to predict metrology variables using information about the state of the process for every workpiece. If the VM conjecture model is fresh and accurate enough, it can generate a VM value within seconds after collecting the complete tool process data of a workpiece. Therefore, this VM value can be applied for real-time W2W control.
Referring to
yk=β0+β1uk+ηk (1)
where yk is the plant output; uk is the control action taken for process run k; β0 is the initial bias of the process; β1 is the process gain; and ηk is the disturbance model input.
A process predictive model Auk is given, where A is a gain parameter (e.g., removal rate for chemical mechanical polishing (CMP)) estimated for the system, whose initial value can be obtained from the actual tool/recipe performance.
Using an EWMA (Exponentially Weighted Moving Average) filter, the model offset or disturbance of the (k+1)th process run is estimated to be
{tilde over (η)}k+1=α(yk−Auk)+(1−α){tilde over (η)}k (2)
where α is an EWMA coefficient ranging between 0 and 1.
The control action of the (k+1)th process run is
uk+1=(Tgt−{tilde over (η)}k+1)/A (3)
where Tgt represents the target value.
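The EWMA loop described by equations (1) and (2) can be sketched as follows. This is a minimal simulation, not the patented implementation: it assumes the control action takes the usual EWMA form u = (Tgt − η̃)/A, an initial disturbance estimate of zero, and Gaussian noise for ηk. The function name and parameter values are hypothetical.

```python
import random


def simulate_ewma_r2r(beta0, beta1, A, tgt, alpha, runs, noise_sd=0.0, seed=0):
    """Simulate an EWMA run-to-run loop on the linear plant of equation (1)."""
    rng = random.Random(seed)
    eta_est = 0.0  # assumed initial disturbance estimate
    outputs = []
    for _ in range(runs):
        # Assumed control law: choose u so that A*u + eta_est equals the target.
        u = (tgt - eta_est) / A
        # Plant output, equation (1); the noise term stands in for eta_k.
        y = beta0 + beta1 * u + rng.gauss(0.0, noise_sd)
        # EWMA disturbance update, equation (2).
        eta_est = alpha * (y - A * u) + (1.0 - alpha) * eta_est
        outputs.append(y)
    return outputs


# Noise-free example: the output starts off target and converges toward Tgt.
ys = simulate_ewma_r2r(beta0=5.0, beta1=1.0, A=1.0, tgt=100.0, alpha=0.35, runs=30)
print(ys[0], ys[-1])
```

A larger α makes the controller react faster to the offset but also passes more measurement noise into the control action, which is why the choice of coefficient matters when the feedback is a predicted rather than a measured value.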
Referring to
When yk is measured by the actual metrology tool 20, it becomes yz, and an EWMA coefficient α1 is used in
{tilde over (η)}k+1=α1(yz−Auk)+(1−α1){tilde over (η)}k (4)
When yk is conjectured or predicted by a VM module 30, it becomes a VM value ŷk, and an EWMA coefficient α2 is used in
{tilde over (η)}k+1=α2(ŷk−Auk)+(1−α2){tilde over (η)}k (5)
Khan et al. pointed out that α1>α2 (usually, depending on the relative quality of virtual metrology data). Now, the controller-gain problem of applying VM is focused on how to set α2, wherein the rule of thumb is that α2 should depend on the quality or reliability of VM and α2<α1. Khan et al. proposed two VM quality metrics to consider incorporating VM quality into the controller gain of a R2R controller 40:
where the correlation coefficient
and σy and σŷ are standard deviations of y and ŷ, respectively.
Nevertheless, both metrics proposed above have the following disadvantages:
As a result, it may not be easy to combine the data quality metrics as in equations (6) and (7) into the R2R model. Hence, there is a need to develop an APC system and an APC method utilizing VM with a reliance index (RI) and a global similarity index (GSI) for effectively considering the data quality of VM into the R2R controller.
An object of the present invention is to provide an APC system and an APC method for effectively considering the data quality of VM into a R2R controller, thereby overcoming the problems of inability to consider the reliance level in the VM feedback loop of R2R control and metrology delays as well as upgrading the APC performance.
According to an aspect of the present invention, an APC system includes a process tool, a metrology tool, a virtual metrology (VM) module, a reliance index (RI) module and a run-to-run (R2R) controller. The process tool is operated for processing a plurality of historical workpieces in accordance with a plurality of sets of historical process data, and performing a plurality of process runs on a plurality of workpieces in accordance with a plurality of sets of process data. The metrology tool is operated for measuring the historical workpieces and a plurality of sampling workpieces selected from the workpieces, thereby providing a plurality of historical measurement data of the historical workpieces and a plurality of actual measurement values of the sampling workpieces which have been processed in the process runs. The virtual metrology module is used for providing a plurality of virtual metrology values of the process runs by inputting the sets of process data into a conjecture model, wherein the conjecture model is built in accordance with a conjecture algorithm by using the sets of historical process data and the historical measurement values, wherein the historical measurement values are the measurement values of the historical workpieces which are manufactured in accordance with the sets of historical process data, respectively. 
The RI module is used for generating respective reliance indexes (RI) of the process runs, wherein each of the reliance indexes (RI) corresponding to the process run is generated by calculating the overlap area between the statistical distribution of the virtual metrology value of the workpiece and the statistical distribution of a reference prediction value of the workpiece, wherein the reference prediction value of the process run is generated by inputting the set of process data into a reference model, wherein the reference model is built in accordance with a reference algorithm by using the sets of historical process data and their corresponding historical measurement values, and the conjecture algorithm is different from the reference algorithm, and the reliance index is higher when the overlap area is larger, representing that the reliance level of the virtual metrology value corresponding to the reliance index is higher. The R2R controller is operated for controlling the process tool to perform the process runs in accordance with the following relationships:
uz+1=g(G1,1,G1,2, . . . ,G1,i,yz)
uk+1=g(G2,1,G2,2, . . . ,G2,i,ŷk)
G2,i=f(RIk)×G1,i
where G2,i=0 or ŷk−1 but not ŷk is adopted for tuning the R2R controller, if RIk<RIT;
f(RIk)=RIk, if RIk≧RIT and k≦C;
f(RIk)=1−RIk, if RIk≧RIT and k>C;
wherein yz represents the actual measurement value of the sampling workpiece which has been processed in the zth process run; uz+1 represents the control action of the (z+1)th process run when yz is adopted; G1,i represents the controller gain used in the R2R controller when yz is adopted, wherein i represents the number of the controller gains used in the R2R controller; ŷk represents the virtual metrology value of the workpiece which has been processed in the kth process run; uk+1 represents the control action of the (k+1)th process run when ŷk is adopted; G2,i represents the controller gain used in the R2R controller when ŷk is adopted; RIk represents the reliance index (RI) of the kth process run; RIT represents the RI threshold value based on a maximal tolerable error limit defined by the errors of the virtual metrology values obtained from the conjecture model; and C stands for a predetermined number of process runs.
In one embodiment, the APC system further includes a global similarity index (GSI) module for generating respective global similarity indexes (GSI) of the process runs by inputting the sets of process data into a statistical distance model, wherein the statistical distance model is built in accordance with a statistical distance algorithm by using the sets of historical process data, wherein G2,i=0 or ŷk−1 but not ŷk is adopted for tuning the R2R controller, if GSIk>GSIT, where GSIk represents the global similarity index (GSI) of the kth process run; GSIT represents a GSI threshold value defined by two to three times of the maximal global similarity indexes of the sets of historical process data.
According to another aspect of the present invention, in an APC method, a step is performed for obtaining a plurality of sets of historical process data used by a process tool for processing a plurality of historical workpieces. Another step is performed for obtaining a plurality of historical measurement data of the historical workpieces measured by a metrology tool. Another step is performed for establishing a conjecture model in accordance with a conjecture algorithm by using the sets of historical process data and the historical measurement values, wherein the historical measurement values are the measurement values of the historical workpieces which are manufactured in accordance with the sets of historical process data, respectively; and establishing a reference model in accordance with a reference algorithm by using the sets of historical process data and their corresponding historical measurement values, wherein the conjecture algorithm is different from the reference algorithm. Another step is performed for enabling a run-to-run (R2R) controller to control the process tool to perform the process runs in accordance with the aforementioned relationships.
In one embodiment, the APC method further includes establishing a statistical distance model in accordance with a statistical distance algorithm by using the sets of historical process data; and enabling the R2R controller to control the process tool to perform the process runs in accordance with the relationship: G2,i=0 or ŷk−1 but not ŷk is adopted for tuning the R2R controller, if GSIk>GSIT, where GSIk represents the global similarity index (GSI) of the kth process run; GSIT represents a GSI threshold value defined by two to three times of the maximal global similarity indexes of the sets of historical process data.
According to another aspect of the present invention, a computer program product is provided and performs the aforementioned APC method when executed.
Hence, with the application of the embodiments of the present invention, the data quality of VM can be effectively considered into the R2R model, thereby overcoming the problems of inability to consider the reliance level in the VM feedback loop of R2R control and metrology delays as well as upgrading the APC performance.
These and other features, aspects, and advantages of the present invention will become better understood with regard to the following description, appended claims, and accompanying drawings where:
Reference will now be made in detail to the embodiments of the present invention, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers are used in the drawings and the description to refer to the same or like parts.
Referring to
For the VM module 120, the RI module 122 and the GSI module 124, a conjecture model, a reference model and a statistical distance model are required to be built. The conjecture model is built in accordance with a conjecture algorithm by using the sets of historical process data and the historical measurement values, wherein the historical measurement values are the measurement values of the historical workpieces which are manufactured in accordance with the sets of historical process data, respectively; the reference model is built in accordance with a reference algorithm by using the sets of historical process data and their corresponding historical measurement values; and the statistical distance model is built in accordance with a statistical distance algorithm by using the sets of historical process data. The conjecture algorithm and the reference algorithm can be a multi-regression (MR) algorithm, a support-vector-regression (SVR) algorithm, a neural-networks (NN) algorithm, a partial-least-squares regression (PLSR) algorithm, or a Gaussian-process-regression (GPR) algorithm. The statistical distance algorithm can be a Mahalanobis-distance algorithm or a Euclidean-distance algorithm. The aforementioned algorithms are merely stated as examples, and certainly other algorithms may be applicable to the present invention. The RI and GSI used in the embodiment of the present invention are described in U.S. Pat. No. 7,593,912 entitled “Method for evaluating reliance level of a virtual metrology system in product manufacturing”, which is incorporated herein by reference. The RI, GSI and VM models used in the embodiment of the present invention are described in U.S. Pat. No. 7,603,328 entitled “Dual-phase virtual metrology method”; and US Patent Publication No. 20090292386 entitled “System and Method for Automatic Virtual Metrology”, which are incorporated herein by reference. It is noted that U.S. Pat. Nos. 7,593,912, 7,603,328 and US Patent Publication No. 20090292386 all have the same assignee as this application.
The VM module 120 is used for providing a plurality of virtual metrology (VM) values of the process runs by inputting the sets of process data into the conjecture model. The RI module 122 is used for generating respective reliance indexes (RI) of the process runs, wherein each of the reliance indexes (RI) corresponding to the process run is generated by calculating the overlap area between the statistical distribution of the virtual metrology value of the workpiece and the statistical distribution of a reference prediction value of the workpiece, wherein the reference prediction value of the process run is generated by inputting the set of process data into the reference model. The RI module 122 mainly uses another algorithm (reference algorithm) to gauge the reliance level of the conjecture algorithm, and thus the conjecture algorithm and the reference algorithm can be any algorithms as long as the conjecture algorithm is different from the reference algorithm. The reliance index is higher when the aforementioned overlap area is larger, representing that the reliance level of the virtual metrology value corresponding to the reliance index is higher. In this embodiment, a RI threshold value (RIT) is based on a maximal tolerable error limit defined by the errors of the virtual metrology values obtained from the conjecture model. The GSI module 124 is used for generating respective global similarity indexes (GSI) of the process runs by inputting the sets of process data into the statistical distance model. The GSI assesses the degree of similarity between any set of process data and the model set of process data (for example, the historical process data). In this embodiment, a GSI threshold value (GSIT) is defined by two to three times of the maximal global similarity indexes of the sets of historical process data.
Hereinafter, the R2R controller 130 is exemplified as an EWMA controller for explanation, but the R2R controller 130 also can be a moving-average (MA) controller, double-EWMA controller (d-EWMA) or a proportional-integral-derivative (PID) controller.
Referring to
α2=RI×α1 (9)
wherein the EWMA coefficient α1 is the same as the α of equation (2).
Equation (9) will be applied when the R2R controller 130 needs a relatively high gain. The situations that need a high controller gain are those in which yk is far from the target value or the production process is relatively unstable. On the contrary, if yk is near the target or the production process is relatively stable, then the controller gain should be small. For generating a small controller gain, the EWMA coefficient α2 also can be set as follows:
α2=(1−RI)×α1 (10)
Equations (9) and (10) are valid only when RI is good enough; in other words, RI should be greater than RIT. If RI<RIT, this VM value cannot be adopted for tuning the R2R controller gain. Further, due to the fact that the GSI is designed to help the RI gauge the reliance level of VM, when GSI>GSIT, its corresponding VM value cannot be adopted, either. In conclusion, if RI<RIT or GSI>GSIT, then α2 is set to be zero (0).
The issue of the R2R controller-gain management in real-production environment whenever a modification is performed on the process tool 100 is considered as follows. In general, the production process of the first lot (just after a modification is performed) is relatively unstable; therefore, the controller gain should be relatively high. After finishing the production of the first lot, the production process will become comparatively stable. In other words, the rest of the lots should have small controller gains.
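In summary, α2 can be set as:
α2=0, if RI<RIT or GSI>GSIT (11)
α2=f(RI,GSI)×α1, if RI≧RIT and GSI≦GSIT (12)
where f(RI,GSI)=RI, if k≦C; and f(RI,GSI)=1−RI, if k>C.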
C stands for a predetermined number of process runs. For a W2W control, C can be 25 for semiconductor industries.
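The selection rule for α2 described above (equations (9) and (10), the RI/GSI thresholds, and the run count C) can be written as a small function. This is an illustrative sketch; the function name, argument names and the numeric values in the usage lines are hypothetical.

```python
def vm_ewma_coefficient(ri, gsi, k, alpha1, ri_t, gsi_t, c=25):
    """Return the EWMA coefficient alpha2 to use when a VM value is fed back.

    ri, gsi     : reliance index and global similarity index of run k
    alpha1      : EWMA coefficient used with actual measurements
    ri_t, gsi_t : RI and GSI thresholds
    c           : number of runs after a tool modification that use the high gain
    """
    if ri < ri_t or gsi > gsi_t:
        # VM value is not reliable enough: do not let it tune the controller.
        return 0.0
    if k <= c:
        # Early runs after a modification: relatively high gain, equation (9).
        return ri * alpha1
    # Later, more stable runs: small gain, equation (10).
    return (1.0 - ri) * alpha1


# Hypothetical values: alpha1 = 0.35, RI threshold 0.7, GSI threshold 9.
print(vm_ewma_coefficient(ri=0.9, gsi=2.0, k=10, alpha1=0.35, ri_t=0.7, gsi_t=9.0))
print(vm_ewma_coefficient(ri=0.9, gsi=2.0, k=40, alpha1=0.35, ri_t=0.7, gsi_t=9.0))
print(vm_ewma_coefficient(ri=0.5, gsi=2.0, k=10, alpha1=0.35, ri_t=0.7, gsi_t=9.0))
```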
Since the R2R controller 130 also can be a MA controller, a d-EWMA controller or a PID controller, a generic form of governing equations is provided as follows:
uz+1=g(G1,1,G1,2, . . . ,G1,i,yz) (13)
uk+1=g(G2,1,G2,2, . . . ,G2,i,ŷk) (14)
G2,i=f(RIk,GSIk)×G1,i (15)
where G2,i=0 or ŷk−1 but not ŷk is adopted for tuning the R2R controller, if RIk<RIT or GSIk>GSIT;
f(RIk,GSIk)=RIk, if RIk≧RIT and GSIk≦GSIT and k≦C;
f(RIk,GSIk)=1−RIk, if RIk≧RIT and GSIk≦GSIT and k>C;
wherein yz represents the actual measurement value of the sampling workpiece which has been processed in the zth process run; uz+1 represents the control action of the (z+1)th process run when yz is adopted; G1,i represents the controller gain used in the R2R controller when yz is adopted, wherein i represents the number of the controller gains used in the R2R controller; ŷk represents the virtual metrology value of the workpiece which has been processed in the kth process run; uk+1 represents the control action of the (k+1)th process run when ŷk is adopted; G2,i represents the controller gain used in the R2R controller when ŷk is adopted; RIk represents the reliance index (RI) of the kth process run; RIT represents a RI threshold value based on a maximal tolerable error limit defined by the errors of the virtual metrology values obtained from the conjecture model; GSIk represents the global similarity index (GSI) of the kth process run; GSIT represents a GSI threshold value defined by two to three times of the maximal global similarity indexes of the sets of historical process data; and C stands for a predetermined number of process runs.
The MA controller and the EWMA controller, which are single-gain controllers, and the d-EWMA controller and the PID controller, which are multiple-gain controllers, are described below.
MA Controller
The (z+1)th run control action, uz+1, of an n-terms MA controller is derived by
uz+1=(Tgtz+1−{tilde over (η)}z+1)/A (16)
where A is a gain parameter (e.g., removal rate for chemical mechanical polishing (CMP)) estimated for the system; Tgtz+1 is the target value of the (z+1)th run; and {tilde over (η)}z+1 is the model offset or disturbance of the (z+1)th run. {tilde over (η)}z+1 of the n-terms MA controller is expressed as:
where yz represents the actual measurement value of the zth run control output; q represents the delay operator, i.e. q−1yz=yz−1; M1=1/n is the controller gain; and
hMA(q)=(1+q−1+ . . . +q−(n−1)) (18)
Then, from equation (16),
In conclusion, the (z+1)th run control action, uz+1, of an n-terms MA controller can be expressed as a function of the actual measurement value of the zth control output, yz, and the controller gain, M1.
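A minimal sketch of an n-terms MA controller is given below. It assumes that the disturbance estimate is the mean of the last n residuals y − A·u, which is consistent with the gain M1=1/n and the operator hMA(q) of equation (18), and that the control action is (Tgt − η̃)/A. The class and method names are hypothetical, and averaging over fewer than n residuals during start-up is a simplification introduced here.

```python
from collections import deque


class MovingAverageController:
    """n-terms moving-average R2R controller (illustrative sketch)."""

    def __init__(self, A, n, eta0=0.0):
        self.A = A
        self.eta0 = eta0                   # estimate used before any feedback
        self.residuals = deque(maxlen=n)   # keeps only the last n residuals

    def update(self, y, u):
        """Store the residual y - A*u of the run that was just measured."""
        self.residuals.append(y - self.A * u)

    def disturbance(self):
        """Mean of the stored residuals (gain 1/n once the window is full)."""
        if not self.residuals:
            return self.eta0
        return sum(self.residuals) / len(self.residuals)

    def next_action(self, tgt):
        """Control action for the next run given its target value."""
        return (tgt - self.disturbance()) / self.A


ctrl = MovingAverageController(A=2.0, n=3)
for y in (11.0, 13.0, 15.0, 17.0):
    ctrl.update(y=y, u=5.0)
print(ctrl.disturbance(), ctrl.next_action(tgt=25.0))
```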
EWMA Controller
The (z+1)th run control action, uz+1, of an EWMA controller can also be expressed as equation (16).
For the EWMA controller, {tilde over (η)}z+1 is derived below.
In conclusion, the (z+1)th run control action, uz+1, of an EWMA controller can be expressed as a function of the actual measurement value of the zth run control output, yz, and the controller gain, α1.
d-EWMA Controller
The (z+1)th run control action, uz+1, of a d-EWMA controller is expressed as:
Referring to equations (20), (21), and (22), {tilde over (η)}z+1 can be expressed as:
Similarly, {tilde over (ρ)}z+1 is derived as:
Finally, uz+1 can be expressed as:
In conclusion, the (z+1)th run control action, uz+1, of a d-EWMA controller can be expressed as a function of the actual measurement value of the zth run control output, yz, and the controller gains, α1,1 and α1,2.
PID Controller
The (z+1)th run control action, uz+1, of a PID controller is expressed as:
In conclusion, the (z+1)th run control action, uz+1, of a PID controller can be expressed as a function of the actual measurement value of the zth run control output, yz, and the controller gains, K1,P, K1,I, and K1,D.
Observing equations (19), (23), (27), and (28), a generic form of the (z+1)th run control action, uz+1, of the MA, EWMA, d-EWMA, and PID R2R controllers can be generated as a function of the actual measurement value of the zth run control output, yz, and the controller gains, G1,1, G1,2, . . . , and G1,i, where i represents the number of gains in the controller.
uz+1=g(G1,1,G1,2, . . . ,G1,i,yz) (29)
For the MA case, i=1 and G1,1=M1; for EWMA, i=1 and G1,1=α1; for d-EWMA, i=2, G1,1=α1,1 and G1,2=α1,2; for PID, i=3, G1,1=K1,P, G1,2=K1,I, and G1,3=K1,D. In fact, equation (29) has been mentioned in equation (13).
When VM is utilized, yz will be replaced by ŷk and the controller gains will be changed to G2,1, G2,2, . . . , and G2,i, where i represents the number of gains in the controller. Therefore, by utilizing VM, the generic form of the (k+1)th run control action, uk+1, is
uk+1=g(G2,1,G2,2, . . . ,G2,i,ŷk) (30)
For the MA case, i=1 and G2,1=M2; for EWMA, i=1 and G2,1=α2; for d-EWMA, i=2, G2,1=α2,1 and G2,2=α2,2; for PID, i=3, G2,1=K2,P, G2,2=K2,I, and G2,3=K2,D. In fact, equation (30) has been mentioned in equation (14).
When VM is adopted as the feedback of the R2R controller, VM's accompanying RI/GSI can be used to tune the controller gains as shown below:
G2,i=f(RI,GSI)×G1,i (31)
In fact, equation (31) has been mentioned in equation (15).
Specifically, for the MA case:
M2=fMA(RI,GSI)×M1 (32)
For the EWMA case:
α2=fEWMA(RI, GSI)×α1 (33)
For the d-EWMA case:
α2,1=fα1(RI,GSI)×α1,1
α2,2=fα2(RI,GSI)×α1,2 (34)
For the PID case:
K2,P=fP(RI,GSI)×K1,P
K2,I=fI(RI,GSI)×K1,I
K2,D=fD(RI,GSI)×K1,D (35)
In conclusion, all of the G1,i controller gains may be assigned as constants or tuned by an adaptive scheme or function. When the actual measurement values (yz) are adopted, G1,i will be designed and assigned accordingly. After G1,i are assigned and if the VM values (ŷk) are adopted to replace yz, then the corresponding G2,i gains can be designed and assigned as shown in equations (31)-(35).
Equations (31)-(35) are valid only when RI and GSI are good enough; in other words, RI should be greater than RIT and GSI should be smaller than GSIT. If RI<RIT or GSI>GSIT, this VM value cannot be adopted for tuning the R2R controller gain. In conclusion, if RI<RIT or GSI>GSIT, then
for the MA case, set {tilde over (η)}k+1={tilde over (η)}k, i.e. ŷk−1 but not ŷk is adopted for tuning the R2R controller;
for the EWMA case, set {tilde over (η)}k+1={tilde over (η)}k or α2=0 (i.e. G2,i=0);
for the d-EWMA case, set {tilde over (η)}k+1={tilde over (η)}k and
for the PID case, set uk+1=uk, i.e. ŷk−1 but not ŷk is adopted for tuning the R2R controller.
The following presents the algorithms related to the RI and explains their operating procedures.
Reliance Index (RI)
Referring to Table 1, n sets of historical data are assumed to be collected, including process data (Xi,i=1, 2, . . . ,n) and the corresponding actual measurement values (yi,i=1, 2, . . . , n), where each set of process data contains p individual parameters (from parameter 1 to parameter p), namely Xi=[xi,1, xi,2, . . . , xi,p]T. Additionally, (m-n) sets of process data in actual production were also collected, but no actual measurement values are available besides yn+1. That is, only the first among (m-n) pieces of the products is selected and actually measured. In the current manufacturing practice, the actual measurement value yn+1 obtained is used to infer and evaluate the quality of the (m-n−1) pieces of the products.
As shown in Table 1, y1, y2, . . . , yn are historical measurement values, and yn+1 is the actual measurement value of the first piece of the products being manufactured. Generally, a set of actual measurement values (yi,i=1, 2, . . . ,n) follows a normal distribution with mean μ and standard deviation σ, namely yi˜N(μ, σ2).
All the actual measurement values can be standardized in terms of the mean and standard deviation of the sample set (yi,i=1, 2, . . . , n). Their standardized values (also called z scores) Zy
wherein yi is the i-th actual measurement value,
The explanation herein adopts a neural-networks (NN) algorithm as the conjecture algorithm for establishing the conjecture model that performs virtual measurement, and uses, for example, a multi-regression (MR) algorithm as the reference algorithm for establishing the reference model that serves as a comparison base for the conjecture model. However, the present invention can also apply other algorithms as the conjecture algorithm or the reference algorithm, such as a support-vector-regression (SVR) algorithm, a partial-least-squares regression (PLSR) algorithm, a Gaussian-process-regression (GPR) algorithm or other related algorithms, provided that the reference algorithm differs from the conjecture algorithm, and thus the present invention is not limited thereto.
When the NN and MR algorithms are utilized, if their convergence conditions both are that SSE (Sum of Square Error) is minimized with n→∞, their standardized predictive measurement values (defined as
respectively) should be the same as the standardized actual measurement value Zy
all represent the standardized actual measurement value, but they have different names due to having different purposes and different estimating models. Hence,
indicate that Zy
with respect to the NN conjecture model differ from the standardized mean-estimating equation ({circumflex over (μ)}Z
with respect to the MR reference model.
The RI is designed to gauge the reliance level of the virtual metrology value. The RI thus should consider the degree of similarity between the statistical distribution Zŷ
Referring to
of the reference prediction value from the reference model (built by such as the MR algorithm). As such, the RI equation is listed below:
and σ is set to be 1.
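Because the RI is described as the overlap area between two distributions whose standard deviation is set to 1, it can be illustrated with a closed-form calculation. The sketch below assumes both the virtual metrology value and the reference prediction value are represented by unit-variance normal distributions centered at their standardized values; under that assumption the overlap area is twice the tail area of either distribution beyond the midpoint between the two centers. The function name is hypothetical.

```python
import math


def reliance_index(z_vm, z_ref):
    """Overlap area of N(z_vm, 1) and N(z_ref, 1), a value between 0 and 1."""
    half_gap = abs(z_vm - z_ref) / 2.0
    # Tail area of a unit normal beyond half_gap is 0.5 * erfc(half_gap / sqrt(2));
    # the overlap consists of two such tails, one from each distribution.
    return math.erfc(half_gap / math.sqrt(2.0))


# Identical predictions give full overlap; the RI shrinks as they move apart.
for gap in (0.0, 0.5, 1.0, 2.0, 4.0):
    print(gap, round(reliance_index(0.0, gap), 4))
```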
The RI increases with increasing overlap area A. This phenomenon indicates that the result obtained using the conjecture model is closer to that obtained from the reference model, and thus the corresponding virtual metrology value is more reliable. Otherwise, the reliability of the corresponding measurement value reduces with decreasing RI. When the distribution Zŷ
Hereinafter, the method for calculating the statistical distribution of the virtual metrology values (Zŷ
In the NN conjecture model, if the convergence condition is to minimize SSE, then it can be assumed that “for given Zx
Before the NN conjecture model is constructed, the process data must be standardized. The equations for standardizing the process data are presented below:
wherein xi,j is the j-th process parameter in the i-th set of process data,
The n sets of standardized process data (Zx
Accordingly, the estimated value of μZ
wherein
Hereinafter, the method for calculating the reference prediction values (Zŷ
The basic assumption of the MR is that “for given Zy
is {circumflex over (μ)}Z
To obtain the MR relationship between the n sets of standardized process data (Zx
The least square method can obtain the estimating equation of βr, {circumflex over (β)}r=[{circumflex over (β)}r0,{circumflex over (β)}r1,{circumflex over (β)}r2, . . . ,{circumflex over (β)}rp]T as
{circumflex over (β)}r=(ZxTZx)−1ZxTZy (49)
Therefore, the MR reference model can be obtained as
Zŷr
i=1,2, . . . ,n,n+1, . . . ,m (50)
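The least-squares estimate of equation (49) can be sketched in plain Python by forming and solving the normal equations. This is an illustration only: it assumes the design matrix carries a leading column of ones for the intercept coefficient, uses a small Gaussian-elimination helper instead of an explicit matrix inverse, and all function names and data are hypothetical.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def fit_mr(z_x, z_y):
    """Return [b0, b1, ..., bp] solving (Zx^T Zx) b = Zx^T Zy."""
    rows = [[1.0] + list(r) for r in z_x]  # assumed intercept column
    p1 = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p1)] for i in range(p1)]
    xty = [sum(r[i] * y for r, y in zip(rows, z_y)) for i in range(p1)]
    return solve_linear(xtx, xty)


def predict_mr(beta, z_row):
    """MR reference prediction for one set of standardized process data."""
    return beta[0] + sum(b * x for b, x in zip(beta[1:], z_row))


# Hypothetical standardized data generated from y = 0.5 + 2*x1 - x2.
z_x = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]]
z_y = [0.5, 2.5, -0.5, 1.5, 3.5]
beta = fit_mr(z_x, z_y)
print(beta, predict_mr(beta, [3.0, 2.0]))
```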
Hence, during the conjecture phase, after inputting a set of process data, its MR estimating value Zŷ
After obtaining the NN estimating equations (Zŷ
After obtaining the RI, the RI threshold value (RIT) must be defined. If RI>RIT, then the reliance level of the virtual metrology value is acceptable. A systematic approach for determining the RIT is described below.
Before determining the RIT, it is necessary to define a maximal tolerable error limit (EL). The error of the virtual metrology value is an absolute percentage of the difference between the actual measurement value yi and ŷNi obtained from the NN conjecture model divided by the mean of all the actual measurement values,
The EL can then be specified based on the error defined in equation (53) and the accuracy specification of virtual metrology (VM). Consequently, RIT is defined as the RI value corresponding to the EL, as shown in
with μ and σ defined in equation (39) and
ZCenter=Zŷ
where σy is specified in equation (38).
The following presents the algorithms related to the GSI and explains their operating procedures.
Global Similarity Indexes (GSI)
When virtual metrology is applied, no actual measurement value is available to verify the accuracy of the virtual metrology value. Therefore, instead of the standardized actual measurement value Zy
The GSI assesses the degree of similarity between any set of process data and the model set of process data. This model set is derived from all of the sets of historical process data used for building the conjecture model.
The present invention may utilize a statistical distance measure, such as Mahalanobis distance, to quantify the degree of similarity. Mahalanobis distance is a distance measure introduced by P.C. Mahalanobis in 1936. This measure is based on correlation between variables to identify and analyze different patterns of sample sets. Mahalanobis distance is a useful way of determining similarity of an unknown sample set to a known one. This method considers the correlation of the data set and is scale-invariant, namely it is not dependent on the scale of measurements. If the data set has high similarity, the calculated Mahalanobis distance will be relatively small.
The present invention uses the calculated GSI (applying Mahalanobis distance) size to determine whether the newly input set of process data is similar to the model set of process data. If the calculated GSI is small, the newly input set is relatively similar to the model set. Thus the virtual metrology value of the newly input (high-similarity) set is relatively accurate. On the contrary, if the calculated GSI is too large, the newly input set is somewhat different from the model set. Consequently, the virtual metrology value estimated in accordance with the newly input (low-similarity) set has low reliance level in terms of accuracy.
The equations to calculate the standardized process data Zx
Assuming that the correlation coefficient between the s-th parameter and the t-th parameter is rst and that there are k sets of data, then
After calculating the correlation coefficients between the standardized model parameters, the matrix of correlation coefficients can be obtained as
Assuming that the inverse matrix (R−1) of R is defined as A, then
Hence, the equation for calculating the Mahalanobis distance (Dλ2) between the standardized λ-th set process data (Zλ) and the standardized model set process data (ZM) is as follows.
Finally, we have
The GSI of the standardized λ-th set process data is, then, equal to Dλ2/p.
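For reference, the quantities named in the preceding steps can be written in the standard textbook form below. This is a sketch only: the 1/(k−1) normalization, the symbol names for the mean and standard deviation, and the observation that the standardized model set ZM reduces to the zero vector are assumptions, and the numbered equations of the original are not reproduced here.

```latex
% Standardization of the j-th parameter of the i-th set (assumed form)
z_{i,j} = \frac{x_{i,j} - \bar{x}_j}{\sigma_{x_j}}

% Correlation coefficient between the s-th and t-th parameters over k sets
r_{st} = \frac{1}{k-1} \sum_{l=1}^{k} z_{s,l}\, z_{t,l}

% Correlation matrix (p x p, unit diagonal) and its inverse
R = \left[ r_{st} \right]_{p \times p}, \qquad A = R^{-1} = \left[ a_{ij} \right]_{p \times p}

% Mahalanobis distance of the lambda-th set; Z_M = 0 after standardization
D_\lambda^2 = (Z_\lambda - Z_M)^T R^{-1} (Z_\lambda - Z_M)
            = Z_\lambda^T A\, Z_\lambda
            = \sum_{j=1}^{p} \sum_{i=1}^{p} a_{ij}\, z_{i\lambda}\, z_{j\lambda}

% Global similarity index
\mathrm{GSI}_\lambda = \frac{D_\lambda^2}{p}
```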
After obtaining the GSI, the GSI threshold (GSIT) should be defined. Generally, the default GSIT is assigned to be two to three times the maximal GSIa (the subscript “a” stands for each historical set during the training phase).
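The GSI calculation and the default threshold rule described above can be sketched as follows. This is a minimal illustration, not the implementation of the invention: the function names, the sample-standard-deviation (k−1) normalization, the small historical data set, and the choice of the factor 3 (from the stated "two to three times" range) are all assumptions made for the example.

```python
from statistics import mean, stdev


def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("correlation matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def fit_model_set(history):
    """Build the model set from k historical rows of p process parameters."""
    k, p = len(history), len(history[0])
    columns = list(zip(*history))
    means = [mean(c) for c in columns]
    stds = [stdev(c) for c in columns]  # sample standard deviation (k - 1)
    z = [[(row[j] - means[j]) / stds[j] for j in range(p)] for row in history]
    # Correlation matrix R of the standardized model parameters.
    corr = [[sum(zr[s] * zr[t] for zr in z) / (k - 1) for t in range(p)]
            for s in range(p)]
    return means, stds, corr


def gsi(sample, means, stds, corr):
    """GSI = D^2 / p, with D^2 = Z^T R^-1 Z for the standardized sample Z."""
    z = [(x - m) / s for x, m, s in zip(sample, means, stds)]
    d2 = sum(zi * xi for zi, xi in zip(z, solve(corr, z)))
    return d2 / len(z)


# Hypothetical historical process data: 6 sets, 2 parameters each.
history = [[1.0, 2.0], [2.0, 1.5], [3.0, 3.5],
           [4.0, 3.0], [5.0, 5.5], [6.0, 5.0]]
means, stds, corr = fit_model_set(history)
train_gsi = [gsi(row, means, stds, corr) for row in history]
gsi_threshold = 3 * max(train_gsi)  # assumed factor within the 2x-3x range
```

A newly input set would then be flagged as dissimilar when `gsi(new_set, means, stds, corr) > gsi_threshold`.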
Referring to
The aforementioned embodiments can be provided as a computer program product, which may include a machine-readable medium on which instructions are stored for programming a computer (or other electronic devices) to perform a process based on the embodiments of the present invention. The machine-readable medium can be, but is not limited to, a floppy diskette, an optical disk, a compact disk-read-only memory (CD-ROM), a magneto-optical disk, a read-only memory (ROM), a random access memory (RAM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), a magnetic or optical card, a flash memory, or another type of media/machine-readable medium suitable for storing electronic instructions. Moreover, the embodiments of the present invention also can be downloaded as a computer program product, which may be transferred from a remote computer to a requesting computer by using data signals via a communication link (such as a network connection or the like).
Hereinafter, illustrative examples are provided and compared for explaining that the embodiment of the present invention is useful and advantageous.
The W2W control of a CMP tool whose periodic maintenance (PM) cycle is 600 pieces (pcs) of wafers is selected as the illustrative example for evaluation and comparison. The simulation conditions and scenarios are listed as follows:
1. yk is the actual removal amount measured from the metrology tool and PostYk is the actual post CMP thickness of run k. The specification of PostYk is 2800±150 Angstrom (Å) with 2800 being the target value denoted by TgtPostY. Therefore, we have
PostYk=PreYk−yk (61)
with
yk=ARRk*uk (62)
where ARRk is the actual removal rate of run k and uk represents the polish time in this example.
The well-known Preston equation, found empirically from glass-polishing experiments in 1927, has been proposed to predict the material removal rate of CMP. According to the Preston equation, the material removal rate is affected by the contact pressure (also denoted as tool stress) distribution at the contact point, the magnitude of the relative velocity (also denoted as tool rotation speed) at the contact point between the wafer and the polishing pad, and a constant representing the effect of the remaining parameters, including the slurry fluid speed, pad property, and so on. Therefore, ARRk is simulated by:
The meanings of Stress1, Stress2, Rotspd1, Rotspd2, Sfuspd1, Sfuspd2, PM1, PM2, and Error are tabulated in Table 2. The Ak in equation (63) is the nominal removal rate, which is empirically simulated by a polynomial curve fitting of parts usage count between PMs (denoted by PU varying from 1 to 600):
Ak=(4×10⁻⁶)×(PU−1)³−(3.4×10⁻³)×(PU−1)²+(6.9×10⁻³)×(PU−1)+(1.202×10³) (64)
2. PostŶk represents the predictive value of PostYk, and then, from equations (61) and (62) we have
ŷk=AR̂Rk*uk (65)
PostŶk=PreYk−ŷk=PreYk−AR̂Rk*uk (66)
where
AR̂Rk=f(Stress, Rotspd, Sfuspd, PU, PU², PU³) (67)
AR̂Rk is the VM value of ARRk with Stress (=Stress1+Stress2), Rotspd (=Rotspd1+Rotspd2), Sfuspd (=Sfuspd1+Sfuspd2), PU, PU², and PU³ as the process parameters. Stress, Rotspd, Sfuspd, PU, PU², and PU³ are adopted as the process parameters on the basis of the Preston equation and equations (63) and (64). The setting values of the simulated process parameters are tabulated in Table 2.
3. The k+1 run control action is derived by
4. When PostYk is measured by an actual metrology tool, then
η̃k+1=α1(yk−Akuk)+(1−α1)η̃k (70)
When PostYk is conjectured or predicted by a VM system, then
For this example, C=25.
5. 1 Lot=25 workpieces, of which the 2nd workpiece is the sampling wafer.
6.
8. Extra random disturbances caused by Sfuspd2 with mean=0 and variance=0.36 are also added at Samples 50, 111, 179, 251, 349, and 503. In other words, the combined variances of Sfuspd2 at Samples 50, 111, 179, 251, 349, and 503 are 1.2+0.36=1.56. With these extra random disturbances, the RI and/or GSI values may exceed their thresholds.
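The relations that survive in the conditions above, namely equations (61), (62), (64), and (70), can be sketched in code as follows. This is an illustrative sketch only: the function names are hypothetical, the numeric inputs for ARR, polish time, and pre-CMP thickness are made-up values rather than the Table 2 settings, and the control action of equation (69) is deliberately left out because it is not reproduced in the text.

```python
def nominal_removal_rate(pu):
    """Equation (64): nominal removal rate A_k as a cubic in parts usage PU."""
    x = pu - 1
    return 4e-6 * x**3 - 3.4e-3 * x**2 + 6.9e-3 * x + 1.202e3


def removal_amount(arr_k, u_k):
    """Equation (62): y_k = ARR_k * u_k, with u_k the polish time."""
    return arr_k * u_k


def post_thickness(pre_y_k, y_k):
    """Equation (61): PostY_k = PreY_k - y_k."""
    return pre_y_k - y_k


def ewma_update(eta_k, y_k, a_k, u_k, alpha):
    """Equation (70): EWMA update of the disturbance estimate."""
    return alpha * (y_k - a_k * u_k) + (1 - alpha) * eta_k


# One run with hypothetical numbers (not taken from Table 2).
a_1 = nominal_removal_rate(1)             # nominal rate right after a PM
y_1 = removal_amount(1250.0, 1.0)         # assumed ARR_1 and polish time
post_y_1 = post_thickness(4000.0, y_1)    # assumed pre-CMP thickness
eta_2 = ewma_update(0.0, y_1, a_1, 1.0, alpha=0.35)
```

When a VM value replaces the measurement, the same update would take ŷk in place of yk and α2 in place of α1, which is the substitution the five cases below vary.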
Five rounds with different random seeds are performed to evaluate and compare the performance. For each round, the simulation results of PreYk, Tgtk, Ak, and ARRk for k=1˜600 are first generated based on the setting values shown in Table 2, equation (68), equation (64), and equation (63), respectively. Then, let α1=0.35 and η̃1=0 to calculate u1, and apply equations (62), (70), (69) and (61) to calculate yk, η̃k+1, uk+1 and PostYk, respectively, for k=1 and 2 in all five cases. For k=3˜600, the control schemes of the five cases differ and are described below:
Case 1: R2R with in-situ metrology
Let α1=0.35. Apply equations (62), (70), (69) and (61) to calculate yk, η̃k+1, uk+1, and PostYk, respectively for k=3˜600.
Case 2: R2R+VM without RI
Let α2=α1=0.35. Apply equations (65), (71), (69), (66) and (61) to calculate ŷk, η̃k+1, uk+1, PostŶk, and PostYk, respectively for k=3˜600.
Case 3: R2R+VM with RI
Let α1=0.35. If RI<RIT or GSI>GSIT, then let α2=0; otherwise, let α2,k=RIk×α1. Apply equations (65), (71), (69), (66) and (61) to calculate ŷk, η̃k+1, uk+1, PostŶk, and PostYk, respectively for k=3˜600.
Case 4: R2R+VM with (1−RI)
Let α1=0.35. If RI<RIT or GSI>GSIT, then let α2=0; otherwise, let α2,k=(1−RIk)×α1. Apply equations (65), (71), (69), (66) and (61) to calculate ŷk, η̃k+1, uk+1, PostŶk, and PostYk, respectively for k=3˜600.
Case 5: R2R+VM with RII(1−RI)
Let α1=0.35. Apply the RII(1−RI) switching scheme as shown in equations (72) and (73) to set α2. Apply equations (65), (71), (69), (66), and (61) to calculate ŷk, η̃k+1, uk+1, PostŶk, and PostYk, respectively for k=3˜600.
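The α2 selection rules of Cases 2 to 4 can be sketched as a single function. This is a hedged illustration: the function name and signature are hypothetical, the default thresholds are the RIT=0.7 and GSIT=9 values used in this example, and Case 5 is omitted because its switching rule depends on equations (72) and (73), which are not reproduced in the text.

```python
def alpha2(case, ri, gsi, alpha1=0.35, ri_t=0.7, gsi_t=9.0):
    """Return the EWMA coefficient alpha2 applied to a VM value.

    case 2: VM without RI      -> alpha2 = alpha1
    case 3: VM with RI         -> alpha2 = RI * alpha1
    case 4: VM with (1 - RI)   -> alpha2 = (1 - RI) * alpha1
    In cases 3 and 4 a low-quality VM value (RI < RI_T or GSI > GSI_T)
    is filtered out by setting alpha2 = 0.
    """
    if case == 2:
        return alpha1
    if case not in (3, 4):
        raise ValueError("only Cases 2, 3 and 4 are covered by this sketch")
    if ri < ri_t or gsi > gsi_t:
        return 0.0
    return ri * alpha1 if case == 3 else (1.0 - ri) * alpha1
```

With α2=0, equation-(70)-style updates leave the disturbance estimate unchanged, so a filtered VM value has no effect on the next control action.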
Both Cpk (Process Capability Index) and MAPEProcess (Mean Absolute Percentage Error; as expressed in equations (74) and (75), respectively) are applied to evaluate and compare the performance of those 5 cases. The Cpk and MAPEProcess values of those 5 cases are tabulated in Tables 3 and 4, respectively.
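The two performance indices can be sketched with their commonly used definitions. This is an assumption-laden illustration: equations (74) and (75) are not reproduced in the text, so the forms below (Cpk from the sample mean and sample standard deviation, MAPEProcess as the mean absolute percentage deviation of PostYk from the target) may differ in detail from the original, and the three thickness values are made up.

```python
from statistics import mean, stdev


def cpk(values, lsl, usl):
    """Process capability index: min(USL - mean, mean - LSL) / (3 * sigma)."""
    mu, sigma = mean(values), stdev(values)
    return min(usl - mu, mu - lsl) / (3 * sigma)


def mape_process(values, target):
    """Mean absolute percentage error of the outputs relative to the target."""
    return 100 * mean(abs(v - target) / target for v in values)


# Hypothetical post-CMP thicknesses against the 2800 +/- 150 Angstrom spec.
post_y = [2790.0, 2800.0, 2810.0]
example_cpk = cpk(post_y, lsl=2650.0, usl=2950.0)
example_mape = mape_process(post_y, target=2800.0)
```

A larger Cpk and a smaller MAPEProcess both indicate outputs that stay closer to the target within the specification limits.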
Observing Tables 3 and 4 and treating Case 1 as the baseline, it is obvious that the performance of Case 2, which does not consider RI/GSI, is the worst. Case 3, which filters out the bad-quality PostŶk (VM) values and lets α2=RI×α1, is the most natural approach and has acceptable performance. The performance of Case 4, which filters out the bad-quality PostŶk (VM) values and lets α2=(1−RI)×α1, is better than that of Case 3 on average except for Round 1. Case 5, which filters out the bad-quality PostŶk (VM) values and applies the RII(1−RI) switching scheme shown in equation (73), fixes the problem of Case 4 in Round 1, and its performance is comparable to that of Case 1 (in-situ metrology).
Simulation Results of Round 1 for those 5 cases are shown in
The RIT and GSIT are set at 0.7 and 9, respectively in this example. The cases that RI<RIT and GSI>GSIT at Sample 50 of Round 1 as well as GSI>GSIT at Sample 349 of Round 1 are enlarged and depicted in
As shown in
Observing
As mentioned above, α2=RI×α1 when PostYk is far from the target value or the production process is relatively unstable. Conversely, if PostYk is near the target or the production process is relatively stable, then α2=(1−RI)×α1.
It will be apparent to those skilled in the art that various modifications and variations can be made to the structure of the present invention without departing from the scope or spirit of the invention. In view of the foregoing, it is intended that the present invention cover modifications and variations of this invention provided they fall within the scope of the following claims and their equivalents.
The present application is based on, and claims priority from, U.S. provisional Application Ser. No. 61/369,761, filed Aug. 2, 2010, the disclosure of which is hereby incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
20120029662 A1 | Feb 2012 | US |
Number | Date | Country | |
---|---|---|---|
61369761 | Aug 2010 | US |