The present disclosure relates generally to semiconductor processing and, more particularly, to a system and method for exception handling during semiconductor processing.
Typically, exceptions in semiconductor processing are regular occurrences. Exceptions may be events that interrupt the normal control flows of the process control system. Examples of these exceptions may be new tape out (NTO), long-time-no-run (LTNR), periodic maintenance (PM), new chamber release, and the like. The impacts of exceptions may result in mismatched parameters of the process controllers from the actual state of the tools or processes. These impacts are generally unaccounted for in the modeling or are typically very difficult to model in the normal control algorithms.
In an attempt to remedy any mismatched parameters, engineers would typically enter control parameters after an exception occurred. The parameters that an engineer would enter were generally determined based on the engineer's experience and judgment. Because of this factor, results from processing after an exception could vary greatly. Also, there is usually some added variation that would occur between different engineers, particularly between an engineer with a high level of experience and a relatively junior engineer.
Further, the handling of the exception generally requires a relatively large amount of time when performed manually by engineers. A determination by an engineer of proper control parameters may take hours. This, in turn, may result in a large maintenance load and reduced productivity. Accordingly, there is a need in the art to overcome these deficiencies.
In accordance with an embodiment, a method for exception handling comprises accessing an exception type for an exception, filtering historical data based on at least one defined criterion to provide a data train comprising data sets, assigning a weight to each data set, and providing a current control parameter. The data sets each comprise a historical condition and a historical control parameter, and the weight assigned to each data set is based on each historical condition. The current control parameter is provided using the weight and the historical control parameter for each data set.
Another embodiment is a computer program product for providing a processing control parameter. The computer program product has a medium with a computer program embodied thereon. The computer program comprises computer program code for filtering historical data when an exception occurs to provide a data train comprising data sets, the data sets each comprising a historical condition and a historical parameter; computer program code for assigning a weight to each data set based on a statistical analysis of the data train; and computer program code for determining a suggested control parameter based on the weights.
A further embodiment is a method for processing control exception handling. The method comprises determining an exception type when an exception occurs; initiating one filter of a plurality of filters and one weighting calculator of a plurality of weighting calculators based on the exception type; filtering historical data by the one filter to provide a data train, the data train comprising data sets, each of the data sets including a historical condition and a historical parameter; and providing a suggested parameter based on the historical parameter for each respective data set and on a weight of each data set determined by the one weighting calculator.
For a more complete understanding of the present embodiments, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, in which:
The making and using of the present embodiments are discussed in detail below. It should be appreciated, however, that the present disclosure provides many applicable inventive concepts that can be embodied in a wide variety of specific contexts. The specific embodiments discussed are merely illustrative of specific ways to make and use the disclosed subject matter, and do not limit the scope of the different embodiments.
Embodiments will be described with respect to a specific context, namely an intelligent manufacturing process control system for exception handling that interfaces with an advance process control (APC) system. Other embodiments may also be applied, however, to other situations beyond exception handling and to other process control systems.
Referring now to
The bus 20 may be one or more of any type of several bus architectures including a memory bus or memory controller, a peripheral bus, or video bus. The CPU 14 may comprise any type of electronic data processor, and the memory 16 may comprise any type of system memory, such as static random access memory (SRAM), dynamic random access memory (DRAM), or read-only memory (ROM).
The mass storage device 18 may comprise any type of storage device configured to store data, programs, and other information and to make the data, programs, and other information accessible via the bus 20. The mass storage device 18 may comprise, for example, one or more of a hard disk drive, a magnetic disk drive, or an optical disk drive.
The video adapter 22 and the I/O interface 22 provide interfaces to couple external input and output devices to the processing unit 12. As illustrated in
It should be noted that the processor based system 10 may include other components. For example, the processor based system 10 may include power supplies, cables, a motherboard, removable storage media, cases, and the like. These other components, although not shown, are considered part of the processor based system 10.
Embodiments may be implemented on the processor based system 10, such as by program code executed by the CPU 14. Some possible implementations of embodiments are discussed herein, but a person having ordinary skill in the art will readily understand the application of the below described processes in the processor based system 12.
The exception type is determined by the scenario function 52 and then provided to kernel 62 from the scenario function 52. The scenario function 52 may be implemented in a number of ways. For example, the scenario function 52 may be implemented on a processor based system, such as processor based system 10 in
The scenario function 52 may also provide to the kernel 62 current conditions and configurations that correspond to the determined exception type, such as the criteria of collecting raw data set, setting filters, and initializing weight calculators. For example, historical data may be collected that are marked as NTO and happened within 30 days, a pattern density filter may be set with a limited range of 0.1, and only a similarity weight calculator may be activated for use (the others may be disabled). The current conditions and configurations may similarly be saved in memory 16 or mass storage 18 for later access.
The kernel 62 comprises a filter 64 and a weighting calculator 66. Similar to the scenario function 52, the kernel 62 may be implemented on a processor based system, such as processor based system 10 in
The kernel 62 may comprise multiple filters and weighting calculators that are triggered by different exception types. For example, one filter and one weighting calculator may be desirable for a NTO exception, but not for a PM exception. Accordingly, another filter and weighting calculator may be used for the PM exception. Embodiments have no limitation on the number of filters and weighting calculators.
The filter 64 accesses historical data 60 and filters the historical data 60 based on a defined criterion or set of criteria to provide a data train to the weighting calculator 66. The historical data 60 may be any set of data compiled from previous exception events. The historical data 60 may include the type of exception, the time of the event, any processing conditions, the result of the processing, control parameters, and the like. The historical data 60 may be saved as an array in a remote database that is accessible through the LAN/WAN 32 and the network interface 26, in the memory 16 or mass storage 18, or any other suitable device. Note that there is no limit to the type or amount of data saved as the historical data 60. The filter 64 may be computer program code executed by the CPU 14 and may remove irrelevant or undesirable data from what is provided to the weighting calculator 66. The filter 64 may be case, module, and/or experienced based and may be a design choice by a programmer. Examples of possible criteria for a filter are a specified target range of the processing, a recipe, a particular product, technology node, time period, and the like. Any set of historical data 60 that fails to meet the criterion or criteria defined by the filter is not passed to the weighting calculator 66. Note that a single criterion or any combination of criteria may be used in the filter. The data train that passes through the filter 64 to be provided to the weighting calculator 66 may be saved in the memory 16 or mass storage 18 in the processor based system 10 for later access by the CPU 14 when the weighting calculator 66 is executed.
The weighting calculator 66 assigns a weight to each data set within the data train to output a suggestion 68. Note that “data set” as used herein refers to all of the historical data relating to a single event, and that “data train” may comprise multiple data sets after the historical data is filtered. The weighting calculator 66 may also be computer program code executed on the CPU 14 and may access data train at any point during the execution.
The weighting calculator 66 may be a statistical analysis of the data train to determine the suggestion 68. The weighting calculator 66 may provide a weight for each data set. Then, the weights for all of the data sets in the data train may be normalized to provide normalized weights. The parameter of each data set corresponding to the parameter for the suggestion 68 may then be multiplied by the corresponding data set's normalized weight. The products of all of the data sets' parameter and normalized weight may then be summed together to provide the suggestion 68. The suggestion 68 is output to the APC parameter 70. How the suggestion 68 is output may depend on how the kernel 62 and the APC system are implemented. The kernel 62 may be implemented in the same processor based system 10 as the APC system, and thus, the suggestion 68 may be saved in memory 16 or mass storage 18 for later access by the APC system. Further, the APC system may be a separate processor based system, and the suggestion 68 may be communicated through the LAN/WAN 32 and network interface 26. How the suggestion 68 is provided to the APC system is not limited to these two examples, and a person having ordinary skill in the art will readily understand other means by which this is accomplished.
The weight assigned to each data set may be accomplished in any number of ways. Some examples of how the weights may be assigned are by using similarity weighting, performance weighting, time weighting, and any combination of these ways. A similarity weighting scheme may be based on inputs and may include a normal distribution based weighting function. A performance weighting scheme may be based on outputs and may include a linear function of the APC performance. A time weighting scheme may be based on the time of the events and may include an exponential distribution based weighting function.
The following examples may clarify these illustrative schemes. First, assume the data train (Data Set No. 1-3) and the Current Data Set shown in Table 1 is provided to the weighting calculator 66 from the filter 64 and from the scenario function 52, respectively.
Similarity weighting may include using a multivariate normal distribution to calculate the weight to be assigned to each data set. A random vector X=[X1 . . . Xk]T may be defined where each condition is a random variable, and each random variable comprises n number of instances. An expected value vector μ=[μ1 . . . μk]T may be defined where the variable ,u, is the expected value of random variable Xi, i.e. μi=E[Xi]. In this example, k=2; X1 is defined by condition 1; X2 is defined by condition 2, and each random variable has n=3. The number of random variables Xk may depend on the number of conditions used in the analysis, and may be one or more without limit. Then, the variance matrix ΣX is calculated. Calculation of the variance matrix ΣX is known in the art, but the equation is identified herein as Equation (1) for ease of reference.
In this example, the variance matrix ΣX is as follows:
With these definitions and information, a similarity weight may be assigned to each of n instances of random vector X , i.e. each data set in the data train. A probability density function (PDF) may be used to assign each weight in a similarity vector ωS, where ωS=[ωS,1 . . . ωS,n]T, particularly a Gaussian normal distribution, such as shown in Equation (2) below.
Using the information provided above in this example, the similarity vector ωS may be solved to provide
The similarity vector ωS may be normalized to obtain a normalized similarity vector
Once the normalized similarity vector
Time weighting may include using an exponential distribution. A time vector T may comprise the time period from the current time for each of n number of instances, i.e. each data set in data train. The time vector T may be described as T=[t1 . . . tn]T. An exponential distribution may be used to assign each weight in a time weight vector ωT, where ωT=[ωT,1 . . . ωT,n]T, such as shown in Equation (4) below.
The decay factor λ may be set at any value and is a design choice. Using the information provided above in this example with λ=1, the time weight vector ωT may be solved to provide
The time weight vector ωT may be normalized to obtain a normalized time vector
Once the normalized time vector
Performance weighting may include using a linear function. A difference vector D may comprise magnitudes of the difference between the target and the post measurement for each of n number of instances, i.e. each data set in data train. It should be noted that the difference may not be a magnitude but the actual signed difference in some instances. One example may be when the concerned process or chamber is for a chemical mechanical polish (CMP), where a negative value indicates that the polishing has removed less than the target, as opposed to removing more than the target depth. The difference vector D may be described as D=[d1 . . . dn]T. A linear function may be used to assign each weight in a performance weight vector ωP, where ωP=[ωP,1 . . . ωP,n]T, such as shown in Equation (6) below.
The coefficients a and b are case dependent upon various considerations, such as process sensitivity and control safety. In this example, a=2 and b=0. Further, q and r are lower and upper weighting boundaries as defined by the control specification. In this example, any performance difference di not within the range q to r will be assigned a weight of zero. Further note that these boundaries may change depending on the processes to which these weights are being applied. In some instances, for example a CMP, it may be desirable to have a third definition for a range where a performance difference di is within a particular negative range. Using the information provided above in this example, the performance weight vector ωP may be solved to provide
The performance weight vector ωP may be normalized to obtain a normalized performance vector
Once the normalized performance vector
Even further, any different weighting schemes that are used may be combined in any manner to determine the suggested APC parameter PSug. For example, the similarity weighting, the time weighting, and/or the performance weighting may be combined in any manner. This may be achieved by multiplying corresponding entries of the normalized similarity vector
Also, any two normalized weight vectors may be multiplied to determine the weight vector ωC.
The combined weight vector ωC may be normalized to obtain a normalized combined vector
Once the normalized combined vector
Using the normalized weighting vectors in the above examples, the following Table 2 may summarize the normalized weighting vectors, the combined weighting vector ωC, and the normalized combined vector
S
T
P
C
Taking the dot product of the APC parameter vector P and the normalized combined vector
Embodiments may achieve advantages over known processes. Embodiments may achieve reduced maintenance loading and reduced calculation time when determining control parameters, as compared to known processes. For example, embodiments may reduce the time required to calculate a parameter from hours to seconds, such as from five hours to ten seconds. Also, embodiments may achieve improved deviations between products. For example, the inventors have experimented with embodiments and have found that for a polysilicon deposition process in an APC system, the coverage rate improved on average approximately 30 percent. The turn rate improved approximately 14 to 32 percent, and the Q-time gain was approximately 24 to 46 percent. Also, the operation rate improved from approximately 35 to 1.5 percent, a 96 percent improvement.
Although the present embodiments and their advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the disclosure as defined by the appended claims. Moreover, the scope of the present application is not intended to be limited to the particular embodiments of the process, machine, manufacture, composition of matter, means, methods and steps described in the specification. As one of ordinary skill in the art will readily appreciate from the disclosure, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed, that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized according to the present disclosure. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps.