Embodiments of the present disclosure generally relates to semiconductor manufacturing, and, more specifically, embodiments of the present disclosure relate to anomaly detection in sensor data retrieved during semiconductor processing.
Manufacturing of electronic devices typically involves performing a sequence of procedures with respect to a substrate such as a silicon substrate, a glass plate, etc. (Such substrates may also be referred to as wafers, whether patterned or unpatterned.) These steps may include polishing, deposition, etching, photolithography, heat treatment, and so forth. Usually a number of different processing steps may be performed in a single processing system or “tool” which includes a plurality of processing chambers. During processing, each chamber in which a procedure is carried out may include a plurality of sensors, with each sensor configured to monitor a predefined metric relating to substrate processing.
Anomaly detection in sensor trace data aids in assessing the overall health of the chamber in which a substrate processing procedure is carried forth. Some technique for anomaly detection in sensor trace data is recipe dependent, i.e., a different method for detecting anomalies in sensor trace data is needed based on the tool used and the process run. As a result, different techniques are needed, which can be costly and time consuming to develop. Therefore, there is a continual need for an improved method of detecting anomalies in sensor data retrieved during semiconductor processing.
Embodiments described herein generally relate to a method, system, and computer readable medium for detecting an anomaly is sensor data generated in a substrate processing apparatus is disclosed herein. A plurality of data sets is received. Each data set corresponds to a sensor in a plurality of sensors disposed in a substrate processing apparatus. Each sensor is configured to monitor a respective substrate processing metric. A first data set is selected. The first data set corresponds to a first substrate processing metric monitored by the first sensor disposed in the substrate processing apparatus. A second data set is selected. The second data set corresponding to a second substrate processing metric monitored by the second sensor disposed in the substrate processing apparatus. The first sensor and the second sensor are defined as a sensor pair. A reference correlation is generated by selecting a subset of values in each data set for each of the first and second data sets. Difference of the remaining data outside of the subset of values compared to the reference in each data set is z-normalized to obtain a z-normalized data set The z-normalized data set is filtered to remove one or more outliers from the z-normalized data set to generate a filtered data set. One or more anomalies are identified in the filtered data set. Process parameters of the substrate processing apparatus are adjusted, based on the one or more identified anomalies from the filtered data set.
So that the manner in which the above recited features of the present invention can be understood in detail, a more particular description of the invention, briefly summarized above, may be had by reference to embodiments, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only typical embodiments of this invention and are therefore not to be considered limiting of its scope, for the invention may admit to other equally effective embodiments.
To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the figures. It is contemplated that elements and features of one embodiment may be beneficially incorporated in other embodiments without further recitation.
It is to be noted, however, that the appended drawings illustrate only exemplary embodiments of this invention and are therefore not to be considered limiting of its scope, for the invention may admit to other equally effective embodiments.
Client device 108 communicates with the controller 102 over the network 105. For example, controller 102 may transmit the sensor data collected from each sensor 106 to the client device 108 for further analysis. In some embodiments, client device 108 is configured to identify anomalies in the received sensor data such that subsequent substrate processing processes may be adjusted. Detection of anomalies in the sensor data aids in assessing the overall health of the processing chamber 104, i.e., how well the processing chamber 104 is performing. Specifically, anomaly detection in the sensor data aids in diagnosing symptoms of the processing chamber 104. In some techniques, anomaly detection suffers from being recipe dependent, resulting in many false positives. Additionally, these detection methods are typically slow to scale to larger data sets, and are expensive to maintain. Client device 108 is configured to provide an improved anomaly technique that result in efficient identification of anomalies in sensor data.
The memory 204 includes program code 216. The program code 216 is configured to carry out the instructions of the end user for substrate processing. For example, an end user may provide the controller 102 with instructions regarding the process parameters of a certain substrate processing process. In another example, the program code 216 may further include instructions for generating each data set for each sensor 106.
The storage 206 may be disk drive storage. Although shown as a single unit, the storage 206 may be a combination of fixed and/or removable storage devices, such as fixed disk drives, removable memory cards, optical storage, network attached storage (NAS), or storage-area-network (SAN). The network interface 208 may be any type of network communications allowing the controller to communicate with other computers via the network 105, such as, for example, client device 108.
Client device 108 includes a processor 252, a memory 254, storage 256, and network interface 258. The processor 252 retrieves and executes programming instructions, such as program code 262, stored in memory 254. For example, the program code 262 is configured to carry out the method discussed below, in conjunction with
The storage 256 may be disk drive storage. Although shown as a single unit, the storage 256 may be a combination of fixed and/or removable storage devices, such as fixed disk drives, removable memory cards, optical storage, network attached storage (NAS), or storage-area-network (SAN). The network interface 258 may be any type of network communications allowing the controller to communicate with other computers via the network 105, such as, for example, controller 102.
At block 304, one or more metrics are measured with the one or more sensors during processing. Continuing with the above example, a first sensor measures a first metric (e.g., rate of gas flow entering the processing chamber) and the second sensor measures a second metric (e.g., rate of gas flow exiting the processing chamber). The first sensor detects a first plurality of measurements related to the first metric. The second sensor detects a second plurality of measurements related to the second metric.
At block 306, controller 102 retrieves the sensor data from each of the one or more sensors. For example, data gathering agent 214 gathers the data related to the first metric stored in the first database from the first sensor, and data gathering agent 214 gathers the data related to the second metric stored in the second database from the second sensor.
At block 308, controller 102 transmits the retrieved sensor data to the client device 108. For example, controller 102 transmits the generated first data set corresponding to the first sensor and the generated second data set corresponding to the second sensor to the client device 108 for further analysis. At block 310, client device 108 receives the sensor data from the controller 102.
At block 310, client device 108 generates a reference correlation using a subset of each retrieved data set. When client device 108 receives the sensor data from the controller 102, continuing with the above example, the client device 108 receives a first set of sensor data and a second set of sensor data. Mathematically, this may be presented as:
S
1
={x
i
},S
2
={y
i},
where xi is a received measurement for each substrate i from i=1 . . . n, from the first sensor (i.e., S1), and where yi is a received measurement for each substrate i from i=1 . . . n, from the second sensor (i.e., S2), and n is the total number of substrates processed in the batch of substrates.
At block 404, within the subset of t substrates, the client device 108 defines a rolling window, w, of substrates. For example, within subset of substrates {σa . . . σt}, client device selects a smaller subset of substrates {σa . . . σb}, where a,b∈{σa . . . σt} and b=a+w. For each smaller subset of substrates, client device 108 calculates a reference correlation coefficient. For example, assume that the subset of t substrates includes 100 substrates, from {σ1 . . . σ100}. Within that subset of t substrates, client device 108 selects a range of 30 substrates. For example, client device 108 starts by selecting substrates σ1-σ30. At block 406, for this range of substrates, {σ1 . . . σ30}, client device 108 calculates an r value that is directed to the reference correlation coefficient. The reference correlation coefficient, generally, is represented as:
Continuing with the above example, to generate r1, the equation would look as follows:
The client device 108 continues to generate ri in a rolling window, by sliding the range of w substrates back and forth within the subset of s substrates. For example, to generate r2, the equation may look as follows:
At block 408, when each ri is calculated, client device 108 generates a reference correlation coefficient mean and reference coefficient range to be leveraged in subsequent blocks of
To normalize the data associated with the substrates outside of the subset of t substrates, client device 108 selects a subset z of remaining substrates and calculates the reference correlation efficient for the subset z. In one embodiment, the cardinality of the subset of z substrates is equal to the cardinality of the subset of w substrates. Continuing with the above example, the cardinality of subset w was chosen to be 30; therefore, the cardinality of the subset of z substrates is also 30. Thus, the first subset of 30 substrates may be substrates σ101-σ130. For each subset z of substrates, the r value would be calculated similarly to block 310 above. For example, for the first set of 30 substrates, the r value may be calculated as follows:
After the rz value is determined, client device 108 calculates the difference compared to the reference correlation efficient determined in block 310. For example:
if ri<min(rref)
then diff=ri−min(rref)
if ri>max(rref)
then diff=ri−max(rref)
else diff=0
Referring back to
The z-normalized correlation difference is de-noised/smoothened by defining a low frequency band path filter in the frequency space. In this way, the isolated correlation difference outlier can be filtered out. The process works as follows: first transfer the z normalized correlation difference of each substrate (Zdiff(r
Then the inverse Fourier transformation is performed on FFT(Zdiff(r
At block 316, the client device 108 identifies one or more anomalies in the z-normalized and filtered data set. For example, an outlier may be defined as that value which exceeds a cutoff amount. The cutoff amount may be a predefined value or may be defined as:
C=mean(SM(Zdiff(r
Accordingly, based on the above definition of a cutoff amount, an anomaly is defined by that value, for which:
SM(Zdiff(r
where rj represents some element, j, in the filtered data set.
In another example, we can use the SM(Zdiff(r
Optionally, at block 318, the client device 108 may analyze the determined outliers. For example, as discussed above, the client device 108 receives a plurality of data sets from a plurality of sensors positioned in a substrate processing apparatus. Client device 108 then selects two or more data sets for the above anomaly detection, and repeats the process on each permutation of two or more data sets. In some embodiments, to determine which sensors appear most problematic, client device 108 may count the sensors that appear in each anomaly detected. Accordingly, the client device 108 alerts a user of which sensors appear most frequently in the anomaly pairwise relations. In another embodiment, to determine which substrates were involved in an anomaly run, the client device 108 calculates an absolute sum of all the anomalies found for a sensor pair. Those absolute sums that exceed the cutoff amount signal to the user that the substrate during that run may be defective.
At block 320, the client device 108 may communicate with the controller 102, indicating how to modify the process parameters of the processing chamber. At block 322, controller 102 changes the process parameters accordingly.
It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the following discussion, it is appreciated that throughout the description, discussions utilizing terms such as “processing” or “computing” or “calculating” or “determining” or “displaying” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission, or display devices.
The present example also relates to an apparatus for performing the operations herein. This apparatus may be specially constructed for the illustrated purposes, or it may comprise a general purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, flash memory, magnetic or optical cards, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, or any type of media suitable for storing electronic instructions, and each coupled to a computer system interconnect.
The structure for a variety of these systems will appear from the description above. In addition, the present examples are not described with reference to any particular programming language, and various examples may thus be implemented using a variety of programming languages.
As described in greater detail herein, embodiments of the disclosure provide a software application through which a user may access a first file storage location, such as a personal web page hosted by a server, on a remote device by a link accessed via a second graphical user interface. The user may upload and/or transfer media files from a second file storage location accessed via the second graphical user interface to the first file storage location. The link may allow a user to send a link via a data network that will allow the user to quickly log in and upload media files from the remote device, which accesses the link. As such, the media files accessible by the remote device may be transferred from the second storage location of the remote device to the first storage location.
While the foregoing is directed to embodiments described herein, other and further embodiments may be devised without departing from the basic scope thereof. For example, aspects of the present disclosure may be implemented in hardware or software or in a combination of hardware and software. One embodiment described herein may be implemented as a program product for use with a computer system. The program(s) of the program product define functions of the embodiments (including the methods described herein) and can be contained on a variety of computer-readable storage media. Illustrative computer-readable storage media include, but are not limited to: (i) non-writable storage media (for example, read-only memory devices within a computer such as CD-ROM disks readable by a CD-ROM drive, flash memory, ROM chips or any type of solid-state non-volatile semiconductor memory) on which information is permanently stored; and (ii) writable storage media (for example, floppy disks within a diskette drive or hard-disk drive or any type of solid-state random-access semiconductor memory) on which alterable information is stored. Such computer-readable storage media, when carrying computer-readable instructions that direct the functions of the disclosed embodiments, are embodiments of the present disclosure.
It will be appreciated to those skilled in the art that the preceding examples are exemplary and not limiting. It is intended that all permutations, enhancements, equivalents, and improvements thereto that are apparent to those skilled in the art upon a reading of the specification and a study of the drawings are included within the true spirit and scope of the present disclosure. It is therefore intended that the following appended claims include all such modifications, permutations, and equivalents as fall within the true spirit and scope of these teachings.
While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.