During fabrication of semiconductor devices, semiconductor wafers are processed by a variety of mechanical apparatuses. As an example, during a chemical-mechanical planarization (CMP) process, a CMP apparatus may be utilized to process a wafer. The CMP apparatus may include a plurality of moving or movable components (e.g., a rotatable platen, a polishing head, a pad conditioner, and slurry sprinkler) which operate in coordination with one another to process the wafer.
Many semiconductor processes require very precise movements and positioning of the mechanical components. Even very small deviations from the correct positioning and movements of the components can result in defects in the semiconductor wafer that is undergoing processing.
Aspects of the present disclosure are best understood from the following detailed description when read with the accompanying figures. It is noted that, in accordance with the standard practice in the industry, various features are not drawn to scale. In fact, the dimensions of the various features may be arbitrarily increased or reduced for clarity of discussion.
The following disclosure provides many different embodiments, or examples, for implementing different features of the provided subject matter. Specific examples of components and arrangements are described below to simplify the present disclosure. These are, of course, merely examples and are not intended to be limiting. For example, the formation of a first feature over or on a second feature in the description that follows may include embodiments in which the first and second features are formed in direct contact, and may also include embodiments in which additional features may be formed between the first and second features, such that the first and second features may not be in direct contact. In addition, the present disclosure may repeat reference numerals and/or letters in the various examples. This repetition is for the purpose of simplicity and clarity and does not in itself dictate a relationship between the various embodiments and/or configurations discussed.
Further, spatially relative terms, such as “beneath,” “below,” “lower,” “above,” “upper” and the like, may be used herein for ease of description to describe one element or feature's relationship to another element(s) or feature(s) as illustrated in the figures. The spatially relative terms are intended to encompass different orientations of the device in use or operation in addition to the orientation depicted in the figures. The apparatus may be otherwise oriented (rotated 90 degrees or at other orientations) and the spatially relative descriptors used herein may likewise be interpreted accordingly.
In various embodiments, the present disclosure provides systems, apparatuses, and methods in which an irregular mechanical motion of a component (such as a component of a CMP apparatus) may be recognized or determined during operation.
Embodiments provided herein include mechanical motion irregularity prediction systems and methods for predicting an irregular motion of one or more mechanical components in a semiconductor processing apparatus based on sensed signals associated with one or more motion-related parameters associated with the one or more mechanical components. In some embodiments, spectral images are generated based on the sensed signals, and the spectral images include frequency and time information associated with the sensed signals. Machine learning techniques may be utilized to analyze the spectral images, which analysis may be based at least in part on historical spectral images stored in a spectral image database.
In various embodiments, the irregular motions of one or more components of a semiconductor processing apparatus may be predicted during operation of the apparatus, for example, while processing a semiconductor wafer. The one or more components of the semiconductor processing apparatus may be automatically stopped based on the predicted irregular motions, thereby preventing or reducing any damage from occurring to the semiconductor wafer being processed.
The polishing pad 120 may be attached to the platen 110, for example, the polishing pad 120 may be secured to an upper surface of the platen 110. The polishing pad 120 may be formed of any material that is hard enough to allow the abrasive particles in the slurry 142 to mechanically polish the wafer 160, which is operably positioned at a polishing location between the polishing head 130 and the polishing pad 120. On the other hand, polishing pad 120 is soft enough so that it does not substantially scratch the wafer 160 during the polishing process. The polishing pad 120 may be made of polyurethane or any other suitable materials.
During the CMP process, the platen 110 rotates along a direction of rotation D1 at any of various suitable speeds. For example, the platen 110 may be rotated along the direction of rotation D1 by any mechanism, such as a motor, or the like, which in turn rotates the polishing pad 120 in the direction of rotation D1. The polishing head 130 may apply a force along a direction D2 which pushes the wafer 160 in the direction D2 downward toward and against the polishing pad 120, such that a surface of the wafer 160 in contact with the polishing pad 120 may be polished by the slurry 142.
The polishing head 130 may include a wafer carrier 132 that positions the wafer 160 on the polishing pad 120 at the polishing location. For example, the wafer 160 may be disposed underneath the wafer carrier 132 and may be brought into contact with the polishing pad 120.
For further planarization of the wafer 160, the polishing head 130 may rotate (e.g., in the direction D1, as shown or the reverse direction), causing the wafer 160 to rotate, and move on the polishing pad 120 at the same time, but various embodiments of the present disclosure are not limited in this regard. The wafer carrier 132 may be securely attached to the polishing head 130 and the wafer carrier 132 may rotate along with the polishing head 130. In some embodiments, as shown in
While the CMP apparatus 100 is in operation, slurry 142 flows between the wafer 160 and the polishing pad 120. The slurry dispenser 140, which has an outlet over the polishing pad 120, is used to dispense the slurry 142 onto the polishing pad 120. The slurry 142 includes reactive chemicals that react with the surface layer of the wafer 160 and abrasive particles for mechanically polishing the surface of the wafer 160. Through the chemical reaction between the reactive chemicals in the slurry and the surface layer of the wafer 160, and the mechanical polishing, at least some of the surface layer of the wafer 160 is removed.
As the polishing pad 120 is used, the polishing surface tends to glaze, which can reduce the removal rate and overall efficiency of the CMP apparatus 100. The pad conditioner 150 is arranged over polishing pad 120, and is used to condition the polishing pad 120 and to remove undesirable by-products generated during the CMP process.
The pad conditioner 150 may include a pad conditioner base 151, a pad conditioner arm 152, and a pad conditioner head 153. The pad conditioner base 151 may be any base structure, or may be secured to any base structure, and may generally be fixed in its position. The pad conditioner arm 152 may be attached to the pad conditioner base 151, and the pad conditioner head 153 may be attached to an end of the pad conditioner arm 152 that is opposite the pad conditioner base 151. The pad conditioner arm 152 may be rotatable, for example, about a pivot or joint at which the pad conditioner arm 152 is connected to the pad conditioner base 151. For example, a mechanism such as a motor, actuator, or the like may be operatively coupled to the pad conditioner base 151 or the pad conditioner arm 152 and may be move the pad conditioner arm 152 and the attached pad conditioner head 153 such that the pad conditioner head 153 is movable along a third direction D3. The third direction D3 may be, for example, an arc or segment of an arc that may be defined by rotating the pad conditioner arm 152 and pad conditioner head 153 about a pivot point at which the pad conditioner arm 152 is attached to or otherwise rotatable about the pad conditioner base 151. The third direction D3 may represent travel of the pad conditioner head 153 along the arc in any direction, such as toward the left or toward the right as shown in
A conditioning disk 154 is mechanically coupled to the pad conditioner head 153. For example, the conditioning disk 154 may be attached to the pad conditioner head 153. The conditioning disk 154 may extend outwardly from (e.g., in a downward direction) the pad conditioner head 153, such that the conditioning disk 154 may be brought into contact with the top surface of the polishing pad 120 when the polishing pad 120 is to be conditioned, for example, during use of the CMP apparatus 100. The conditioning disk 154 generally includes protrusions or cutting edges that can be used to polish and re-texturize the surface of the polishing pad 120. In some embodiments, the exposed surface (e.g., the lower surface) of the conditioning disk 154 is formed of or includes a diamond grit material which is used to condition the polishing pad 120. Such a conditioning disk may sometimes be referred to as a “diamond disk.” In some embodiments, the conditioning disk 154 may be formed of other suitable materials such as scouring materials, bristles, or the like.
During the conditioning process, the polishing pad 120 and conditioning disk 154 are rotated, so that the protrusions, cutting edges, grit material, scouring material, or the like of the exposed lower surface of the conditioning disk 154 move relative to the surface of polishing pad 120, to polish the surface of the polishing pad 120. The conditioning disk 154 may be rotated along the first direction D1 of rotation, or in an opposite direction. For example, the conditioning disk 154 may be rotated in a clockwise direction or in a counterclockwise direction.
Any additional features or components may be included in the CMP apparatus 100, which may include, for example, any additional features or components of a CMP apparatus that may be known by those skilled in the field of semiconductor processing tools or CMP apparatuses. In some embodiments, one or more additional pad conditioners 150 may be included in the CMP apparatus 100, such that multiple conditioner disks may be utilized concurrently or alternately to polish the surface of the polishing pad 120. In some embodiments, the CMP apparatus 100 includes a pump (not shown), such as a pump for creating a vacuum or negative pressure between the wafer carrier 132 and the wafer 160 for securing the wafer 160 to the wafer carrier 132 during operation of the CMP apparatus 100. In some embodiments, the CMP apparatus 100 includes one or more motors (not shown), such as motors for moving any of the various components of the CMP apparatus 100 during use.
The CMP apparatus 100 includes one or more sensors 170, which may be positioned at various locations on or within various components of the CMP apparatus 100. For example, as shown in
In some embodiments, the one or more sensors 170 are operable to sense motion-related parameters associated with the one or more components of the CMP apparatus. In some embodiments, the one or more sensors 170 may include any one or more of: a torque sensor, an acceleration sensor, a gyroscope, a vibration sensor, a pressure sensor, a temperature sensor, or a humidity sensor.
As will be discussed in further detail later herein, the various parameters associated with the components of the CMP apparatus 100 which are sensed by the one or more sensors 170 may be analyzed to detect irregularities in motion of the various components of the CMP apparatus 100. Irregular or abnormal motion of components of the CMP apparatus 100 can lead to undesirable effects of the processing of the wafer 160, such as various defects which may be result from over polishing or under polishing the wafer 160 due to the irregular motion of components of the CMP apparatus 100.
As shown in
A first layer 274 may be formed on the substrate 272, and the first layer 274 may be a layer of any material utilized in the manufacture of semiconductor devices. For example, in some embodiments, the first layer 274 may be a first dielectric layer; however, embodiments provided herein are not limited thereto. In various embodiments, the first layer 274 may be a conductive layer, a semiconductor layer, or any other layer of material.
A second layer 276 may be formed on the first layer 274, and the second layer 276 may be a layer of any material utilized in the manufacture of semiconductor devices. For example, in some embodiments, the second layer 276 may be a second dielectric layer; however, embodiments provided herein are not limited thereto. In various embodiments, the second layer 276 may be a conductive layer, a semiconductor layer, or any other layer of material.
One or more first electrical features 282 may be formed in the wafer 260, and the first electrical features 282 may be any electrical features formed in the manufacture of semiconductor devices. In the example shown in
The wafer 260 may further include one or more second electrical features 284, which may be any electrical features formed in the manufacture of semiconductor devices. In the example shown in
Before polishing a surface (e.g., the upper surface) of the wafer 260, the wafer 260 has a certain thickness, which is later reduced due to the polishing. For example, as shown in
As shown in
As shown in
In contrast, referring now to
Referring again to
As shown in
The sensors 170 may be located on or within the first and second mechanical components 12, 14 and configured to sense one or more motion-related parameters associated with the first and second mechanical components 12, 14. In various embodiments, the sensors 170 may be any of the sensors 170a-170g illustrated in
The semiconductor processing apparatus 10 is shown in
The motion-related sensors 170 and the additional sensors 180 may be high-sensitivity sensors which are operable to sense high-sensitivity signals with high-resolution data, which may be analog or digital data. In some embodiments, one or more of the motion-related sensors 170 may be a vibration sensor having an accuracy equal to or less than about 10 μg. That is, the vibration sensor may be capable of sensing motions (e.g., vibrational accelerations) equal to or less than about 10 μg. In some embodiments, the motion-related sensors 170 or the additional sensors 180 may be high-resolution sensors having data that is output or converted to digital data at a resolution equal to or greater than 24 bits. In some embodiments, the additional sensors 180 include a temperature sensor having an accuracy equal to or less than 0.1° C.
As shown in
The motion-related sensors 170 and additional sensors 180 are communicatively coupled to the signal processing circuitry 410 so that the signal processing circuitry 410 receives signals output by the motion-related sensors 170 and additional sensors 180 that are indicative of the sensed parameters of the various components of the apparatus 10, such as sensed parameters associated with the first and second mechanical components 12, 14. The motion-related sensors 170 and additional sensors 180 may be communicatively coupled to the signal processing circuitry 410 by any suitable communications network. The communications network may utilize one or more protocols to communicate via one or more physical networks, including local area networks, wireless networks, dedicated lines, intranets, the Internet, and the like.
In some embodiments, the communications network includes one or more electrical wires which communicatively couple the motion-related sensors 170 or the additional sensors 180 to the signal processing circuitry 410. For example, as shown in
The signal processing circuitry 410 may be or include any electrical circuitry configured to perform the signal processing techniques described herein. In some embodiments, the signal processing circuitry 410 may include or be executed by a computer processor, a microprocessor, a microcontroller, or the like, configured to perform the various functions and operations described herein with respect to the signal processing circuitry. For example, the signal processing circuitry 410 may be executed by a computer processor selectively activated or reconfigured by a stored computer program, or may be a specially constructed computing platform for carrying out the features and operations described herein. In some embodiments, the signal processing circuitry 410 may be configured to execute software instructions stored in any computer-readable storage medium, including, for example, read-only memory (ROM), random access memory (RAM), flash memory, hard disk drive, optical storage device, magnetic storage device, electrically erasable programmable read-only memory (EEPROM), organic storage media, or the like.
The signal processing circuitry 410 receives and processes signals output by the motion-related sensors 170 and the additional sensors 180. In some embodiments, the signal processing circuitry 410 includes an analog-to-digital (ADC) converter 412, which converts analog signals (e.g., as may be received from the motion-related sensors 170 and the additional sensors 180) into digital signals. The digital signals, for example, as output by the ADC 412, may be processed by fast Fourier transform (FFT) circuitry 414 which transforms the sensing signals (e.g., in digital form) from the time domain into the frequency domain, applying any suitable FFT algorithm or technique. FFT algorithms for performing transformation of a signal from its original domain (e.g., the time domain) to a representation in the frequency domain are well known within the field of signal processing, and any such known FFT algorithm may be utilized by the FFT circuitry 414. Transforming signals received from any of the motion-related sensors 170 or the additional sensors 180 into the frequency domain may yield certain spikes of activity (e.g., detected motions, vibrations, etc.) at certain frequencies or within certain frequency bands. This may be the result, for example, of motions caused by various different components (e.g., a pump, a fan, a motor, a wobbling or vibration of the platen, the pad conditioner, the polishing head, or any other component), and the different motions may have different frequencies which may be separately detected and identified in the frequency domain.
The signal processing circuitry 410 may calculate or generate a frequency spectrum for each received sensing signal, for example, using the FFT circuitry 414. The frequency spectrum for each received sensing signal may be generated based on samples having a particular sampling period (e.g., period of time) in the time domain. That is, each of the signals may be analyzed as clips having some period of time, for example, 1 second, 500 ms, 10 ms, 1 ms, or less than 1 ms. Each of these clips of data sensed by the motion-related sensors 170 or additional sensors 180 may then be processed by the FFT circuitry 414 to obtain frequency spectrums for the clips.
The signal processing circuitry 410 may generate spectral images for signals received from each of the motion-related sensors 170 or additional sensors 180, and the spectral images may be generated based on the frequency spectrums output by the FFT circuitry 414 and the time domain information associated with each of the frequency spectrums (e.g., the time period for each of the clips over which the signal data is transformed to the frequency domain).
The signal processing circuitry 410 may further include window circuitry 416, which may process the outputs of the FFT circuitry 414 (e.g., frequency spectrum data associated with certain time-domain sampling clips of the sensor outputs). The window circuitry 416 may apply any window function to the frequency spectrums. As is known within the field of signal processing, a window function may be utilized in spectrum analysis, for example, to provide better resolution and distinguishability among a plurality of frequency components (e.g., vibrations or motions having different frequencies which may be apparent in the frequency spectrum generated based on the sensing signals sensed by a particular sensor).
In some embodiments, the window circuitry 416 is configured to apply a Hamming window to the frequency spectrum output by the FFT circuitry 414. The Hamming window is a known window function that is commonly used in narrow band applications. By applying a Hamming window using the window circuitry 416, particular frequency components of interest are retained in the spectral images, and the resolution and distinguishability of the frequency components of interest may be improved.
The spectral images 500 thus visually represent the frequency spectrum of the sensed data in a temporal manner. That is, the frequency spectrum obtained at a first time (e.g., at the left side of the x-axis) may be different from the frequency spectrum obtained at a later second time (e.g., moving to the right side of the x-axis). The amplitude of the frequency components in the frequency spectrum may be represented in the spectral images 500 by any suitable indicia. For example, in the spectral image 500 illustrated in
Referring again to
In some embodiments, the defect prediction circuitry 420 includes memory which stores instructions for performing one or more of the features or operations described herein, and the defect prediction circuitry 420 may be operable to execute instructions stored, for example, in the memory to perform the functions of the defect prediction circuitry 420 described herein. The memory may be or include any computer-readable storage medium, including, for example, read-only memory (ROM), random access memory (RAM), flash memory, hard disk drive, optical storage device, magnetic storage device, electrically erasable programmable read-only memory (EEPROM), organic storage media, or the like.
The defect prediction circuitry 420 may receive spectral images 500 from the signal processing circuitry 410. The defect prediction circuitry 420 analyzes the spectral images 500 to predict or determine irregularities in motion of the various components of the semiconductor processing apparatus 10, for example, based on a comparison of the received spectral images 500 with past data or analysis of the received spectral images 500 by a machine learning model that is trained with past data (e.g., past spectral images 500) indicative of irregular motions of one or more mechanical components of the semiconductor processing apparatus 10. In some embodiments, the defect prediction circuitry 420 may further predict or determine a status or a remaining operational lifetime of one or more mechanical components of the semiconductor processing apparatus 10 based on the analysis of the spectral images 500.
In some embodiments, the defect prediction circuitry 420 may predict or determine irregular motions, status, or remaining operational lifetime of the mechanical components by employing one or more artificial intelligence or machine learning techniques, which in some embodiments may be implemented at least in part by machine learning circuitry 430. Some or all of the determinations described herein that are made by the defect prediction circuitry 420 may be performed automatically by the defect prediction circuitry 420, for example, in response to receiving spectral images 500 from the signal processing circuitry 410. The machine learning circuitry 430 may be included as part of the defect prediction circuitry 420 (as shown), or may be remotely located and communicatively coupled with the defect prediction circuitry 420. The machine learning circuitry 430 may predict or determine the irregular motions, status, or remaining operational lifetime of the mechanical components of the semiconductor processing apparatus 10 by using past data (e.g., the machine learning circuitry 430 may be trained based on past data) which indicates motions of the mechanical components that are known to be irregular (e.g., past spectral images for a mechanical component that is known to indicate irregular motions), a known status of the mechanical components and its associated irregular motions (e.g., past spectral images for a mechanical component that is known to be failing or defective), or a known remaining operational lifetime of the mechanical components and its associated motions (e.g., spectral images for a mechanical component that is known to have failed within some period of time, such as 1 month later), and the machine learning circuitry 430 may compare the received spectral images 520 with the past data to predict or determine the irregular motions, status, or remaining operational lifetime of the mechanical components based on similarities or deviations from the past data or from a trained model contained within, managed by, or otherwise accessible to the machine learning circuitry 430.
“Artificial intelligence” is used herein to broadly describe any computationally intelligent systems and methods that can learn knowledge (e.g., based on training data), and use such learned knowledge to adapt its approaches for solving one or more problems, for example, by making inferences based on a received input, such as the received spectral images. Machine learning generally refers to a sub-field or category of artificial intelligence, and is used herein to broadly describe any algorithms, mathematical models, statistical models, or the like that are implemented in one or more computer systems or circuitry, such as processing circuitry, and which build one or more models based on sample data (or training data) in order to make predictions or decisions.
The defect prediction circuitry 420 or the machine learning circuitry 430 may employ, for example, neural network, deep learning, convolutional neural network, Bayesian program learning, support vector machines, and pattern recognition techniques to solve problems such as predicting or determining irregular motions, status, or remaining operational lifetime of mechanical components of a semiconductor processing apparatus. Further, the defect prediction circuitry 420 or the machine learning circuitry 430 may implement any one or combination of the following computational algorithms or techniques: classification, regression, supervised learning, unsupervised learning, feature learning, clustering, decision trees, or the like.
As one example, an artificial neural network may be utilized by the defect prediction circuitry 420 or the machine learning circuitry 430 to develop, train, or update one or more machine learning models which may be utilized to predict or determine the irregular motions, status, or remaining operational lifetime of mechanical components. An example artificial neural network may include a plurality of interconnected “neurons” which exchange information between each other. The connections have numeric weights that can be tuned based on experience, and thus neural networks are adaptive to inputs and are capable of learning. The “neurons” may be included in a plurality of separate layers which are connected to one another, such as an input layer, a hidden layer, and an output layer. The neural network may be trained by providing training data (e.g., past data or past spectral images which are indicative of irregular motions, status, or remaining operational lifetime of the mechanical components) to the input layer. Through training, the neural network may generate and/or modify the hidden layer, which represents weighted connections mapping the training data provided at the input layer to known output information at the output layer (e.g., classification of received sensing data as representative of irregular motions, a status, or a remaining operational lifetime of the mechanical components). Relationships between neurons of the input layer, hidden layer, and output layer, formed through the training process and which may include weight connection relationships, may be stored, for example, as one or more machine learning models within or otherwise accessible to the machine learning circuitry 430.
Once the neural network has been sufficiently trained, the neural network may be provided with non-training data (e.g., new spectral images 500 received during operation of the semiconductor processing apparatus 10) at the input layer. Utilizing irregular motion knowledge (e.g., as stored in the machine learning model, and which may include, for example, weighted connection information between neurons of the neural network), the neural network may make determinations about the received spectral images 500 at the output layer. For example, the neural network may predict or determine the irregular motions, status, or remaining operational lifetime of the mechanical components.
Employing one or more computationally intelligent and/or machine learning techniques, the defect prediction circuitry 420 may learn (e.g., by developing and/or updating a machine learning algorithm or model based on training data) to predict or determine the irregular motions, status, or remaining operational lifetime of the mechanical components, and in some embodiments, the defect prediction circuitry 420 may make some predictions or determinations based at least in part on knowledge, inferences or the like developed or otherwise learned through training of the machine learning circuitry 430.
The machine learning circuitry 430 may be implemented in one or more processors having access to instructions, which may be stored in any computer-readable storage medium, which may be executed by the machine learning circuitry 430 to perform any of the operations or functions described herein.
In some embodiments, the machine learning circuitry 430 is communicatively coupled to a spectral image database 442, which may be stored, for example, in any computer-readable storage medium. The spectral image database 442 may include information that associates sensed parameters (e.g., as sensed by a motion-related sensor 170 or an additional sensor 180) with irregular motions, status, or remaining operational lifetime of the mechanical components. In some embodiments, the spectral image database 442 stores a plurality of historical (e.g., past) spectral images having known outcomes or otherwise representing a known irregular motion, status, or remaining operational lifetime of one or more mechanical components of the semiconductor processing apparatus 10.
In some embodiments, the machine learning circuitry 430 may be trained based on the historical spectral images stored in the spectral image database 442. That is, the historical spectral images may be provided as training data for training the machine learning circuitry 430, and the algorithm or machine learning model contained within or accessible to the machine learning circuitry 430 may be updated or modified based on the historical spectral images stored in the spectral image database 442, so that the trained machine learning circuitry 430 may predict or determine irregular motions, status, or remaining operational lifetime of the mechanical components.
In some embodiments, the training data (e.g., historical spectral images stored in the spectral image database 442) may be or include labeled training data from which the machine learning circuitry 430 or the defect prediction circuitry 420 may learn to predict or determine irregular motions, status, or remaining operational lifetime of the mechanical components. The labeled training data may include labels indicating that one or more of the spectral images stored in the spectral image database represents, for example, irregular motions, status, or remaining operational lifetime of the mechanical components.
During use of the semiconductor processing apparatus 10, the motion-related parameters sensed by the motion-related sensors 170 or the additional sensors 180 are processed by the signal processing circuitry to generate spectral images 500. The spectral images 500 may then be analyzed by the defect prediction circuitry 420 or the machine learning circuitry 430 to predict or determine irregular motions, status, or remaining operational lifetime of any of the mechanical components of the semiconductor processing apparatus 10. The defect prediction circuitry 420 or the machine learning circuitry 430 may analyze the received spectral images 500, for example, by comparing the received spectral images 500 with historical spectral images stored in the spectral image database 442 which are known to be associated with irregular motions or the like. In some embodiments, the defect prediction circuitry 420 or the machine learning circuitry 430 may analyze the received spectral images 500 by utilizing a trained machine learning model, such as a neural network or the like.
In some embodiments, the defect prediction circuitry 420 or the machine learning circuitry 430 may include or access a plurality of machine learning models, with each such machine learning models being trained based on sensor data of a particular type (e.g., a torque sensor, an acceleration sensor, a gyroscope, a vibration sensor, a pressure sensor, a temperature sensor, or a humidity sensor) and provided from a particular location (e.g., on or within the polishing head 130, the platen 110, the slurry dispenser 140, the pad conditioner base 151, the pad conditioner arm 152, the pad conditioner head 153, the conditioning disk 154, a motor, a pump, or any other component within the CMP apparatus 100, or any other mechanical component of any semiconductor processing apparatus).
In some embodiments, the defect prediction circuitry 420 or the machine learning circuitry 430 may analyze sensor data received from a plurality of different sensors of the semiconductor processing apparatus 10 in a combined manner. For example, spectral images 500 may be generated for sensor data received from each of a plurality of different sensors 170, 180 of the semiconductor processing apparatus 10. Each of the different spectral images 500 may be according a particular weight or coefficient value, for example, by the machine learning circuitry 430 (which may be a neural network, in some embodiments). The weighted spectral images 500 may then be combined into a single spectral image which concurrently represents the sensor data from all of the separate sensors 170, 180, and the combined spectral image may be compared with a machine-learning model to predict or determine irregular motions, status, or remaining operational lifetime of any of the mechanical components of the semiconductor processing apparatus 10.
In some embodiments, the irregular mechanical motion detection system 400 may include hold circuitry 480 which is communicatively coupled to the defect prediction circuitry 420 and the semiconductor processing apparatus 10 and is configured to automatically hold or stop one or more mechanical components (such as the first or second mechanical components 12, 14) of the semiconductor processing apparatus 10, for example, upon receiving an indication from the defect prediction circuitry 420 that the motion of the one or more mechanical components is irregular and should therefore be stopped. The hold circuitry 480 may be, for example, a controller or control circuitry which may be included within the semiconductor processing apparatus 10 or remotely located from the semiconductor processing apparatus 10, and which is configured to control operations of the semiconductor processing apparatus 10. The hold circuitry 480 may further provide a defect indication (e.g., a visual or audible indication) which may be utilized to alert maintenance personnel to inspect the predicted defective component or a wafer which is being processed by the predicted defective component.
At 602, the method includes receiving sensing signals indicative of motion-related parameters of one or more components of a semiconductor processing apparatus. The sensing signals may be provided, for example, from any motion-related sensor 170 which may be located on or within any mechanical component of a semiconductor processing apparatus. For example, the sensors 170 may be sensors included in the CMP apparatus 100 illustrated in
At 604, the received sensing signals are transformed to frequency spectrum data. For example, the FFT circuitry 414, which may be included as part of the signal processing circuitry 410, may apply a FFT algorithm to transform the received sensing signals to frequency spectrum data as previously described herein. In some embodiments, the sensing signals are first converted to digital sensing signals, for example, by the analog-to-digital converter 412, and then the digital sensing signals are transformed to frequency spectrum data. In some embodiments, the signal processing circuitry 410 may apply a window function (e.g., by the window circuitry 416) as part of the transforming the sensing signals to frequency spectrum data at 604.
At 606, spectral images 500 are generated based on the received sensing signals and the frequency spectrum data. For example, the spectral images 500 may include the frequency spectrums generated by the FFT circuitry 414 and may further include time domain information associated with each of the frequency spectrums (e.g., the time period for each of the clips over which the signal data is transformed to the frequency domain). The spectral images 500 may thus provide a visual representation of the frequency spectrum data for the sensing signals in a temporal manner.
At 608, the defect prediction circuitry 420 or the machine learning circuitry 430 predicts or determines irregular motions of the one or more components of the semiconductor processing apparatus. Analyzing the spectral images to predict irregular motions at 608 may include comparing the spectral images 500 generated at 606 with one or more historical spectral images stored, for example, in the spectral image database 442. In some embodiments, machine learning models or algorithms are utilized to receive the generated spectral images 500 (e.g., as input to a neural network) and to predict irregular motions of the one or more components of the semiconductor processing apparatus (e.g., as an output of the neural network).
At 610, a status or remaining operational lifetime of the one or more components of the semiconductor processing apparatus is predicted. This may be performed, for example, by the defect prediction circuitry 420 or the machine learning circuitry 430 based on analysis of the spectral images 500, as previously described herein.
At 612, a wafer defect is predicted, for example, by the defect prediction circuitry 420 or the machine learning circuitry 430 based on analysis of the spectral images 500. The wafer may be a wafer that is currently undergoing processing by the semiconductor processing apparatus, such as a wafer undergoing CMP processing by the CMP apparatus 100. The prediction of a wafer defect at 612 may be based on the prediction of irregular motions at 708. For example, if the defect prediction circuitry 420 or machine learning circuitry 430 predicts or determines that a motion of a component of the semiconductor processing apparatus is irregular, this may indicate a defective operation of that component. The defective operation of the component thus makes it likely that the processed wafer will also have a defect as a result of the defective operation of the component. As an example, the defect prediction circuitry 420 or machine learning circuitry 430 may determine, based on the signals received from a sensor 170f positioned on the pad conditioner head 153, that the motion of the conditioning disk 154 is irregular or abnormal (e.g., defective operation). The irregular motion of the conditioning disk 154 may cause an edge profile of the semiconductor wafer to be thinner than it should be due to an overpolish condition. Accordingly, the defect prediction circuitry 420 or machine learning circuitry 430 may predict or determine the presence of a defect in the semiconductor wafer based on the predicted or determined defect of the component of the semiconductor processing apparatus.
If a wafer defect is predicted at 612, then in some embodiments, the method may include automatically holding or stopping one or more components of the semiconductor processing apparatus at 614. For example, hold circuitry 480 may receive an indication of a defective condition or of the prediction of a wafer defect from the defect prediction circuitry 420, and the hold circuitry 480 may control one or more components of the semiconductor processing apparatus, thereby holding or stopping the one or more components.
At 616, feedback is provided to the machine learning circuitry 430, such as a machine learning model which may be included as part of or otherwise is accessible to the machine learning circuitry 430. The feedback may be used, for example, as training data to further train the machine learning model. The feedback may indicate, for example, that a particular generated spectral image indicates irregular motions (e.g., based on the prediction at 608), a particular status (e.g., a status of normal, abnormal, based on the prediction at 610), or a remaining useful lifetime (e.g., will likely fail within one month, one week, one day, etc., based on the prediction at 610) of the one or more components of the semiconductor processing apparatus. The spectral image, as well as a result of the predictions at 608 or 610 may be provided together as training data, and may be stored in the spectral image database 442, for further training the machine learning circuitry 430 or machine learning model.
Embodiments of the present disclosure provide several advantages, and provide technical solutions to technical problems that are present, for example, within the field of semiconductor processing apparatuses, systems, and methods. For example, embodiments of the disclosure are operable to predict or determine irregular motions of one or more mechanical components of a semiconductor processing apparatus. This provides a significant advantage over conventional systems in which such irregular motions cannot be predicted, which results in failures and can lead to scrapping of the semiconductor wafer. This results in increased costs and reduced profits. Moreover, some defects which may be formed in semiconductor devices formed from a wafer that has undergone processing by the apparatus may not be detected in some cases until various additional processes have been performed. This results in further losses in terms of costs and time expended performing the additional processes on a defective wafer. However, embodiments of the present disclosure can avoid or reduce such losses by predicting the irregular motions of one or more components of the semiconductor processing apparatus, and the operation of the apparatus can be stopped to avoid damage to the wafer.
Embodiments of the present disclosure further facilitate significant improvements over conventional semiconductor processing systems, apparatuses, and methods as some embodiments of the present disclosure are capable of predicting a status (e.g., beginning to breakdown, but not yet outside of a particular tolerance range) or a remaining operational lifetime (e.g., will likely fail within one month, one week, one day, etc.) of the components of a semiconductor processing apparatus. This allows defects to be avoided, for example, by enabling maintenance personnel or the like to monitor the status of the components, and to repair the components before reaching a state at which the irregular motions of the components will damage the wafer.
According to one embodiment, a mechanical motion irregularity prediction system includes one or more motion sensors that are configured to sense motion-related parameters associated with at least one mechanical component of a semiconductor processing apparatus. The one or more motion sensors output sensing signals based on the sensed motion-related parameters. The mechanical motion irregularity prediction system further includes defect prediction circuitry that is configured to predict an irregular motion of the at least one mechanical component based on the sensing signals
According to another embodiment, a method is provided that includes sensing, by at least one motion sensor, motion-related parameters associated with at least one mechanical component of a semiconductor processing apparatus. Spectral information is generated by signal processing circuitry, and the spectral information is generated based on the sensing signals. Defect prediction circuitry predicts an irregular motion of the at least one mechanical component based on the spectral information.
According to yet another embodiment, a chemical-mechanical polishing (CMP) apparatus is provided that includes a rotatable platen, a polishing pad on the rotatable platen, a polishing head, a pad conditioner, a first motion sensor, and defect prediction circuitry. The polishing head is configured to carry a semiconductor wafer and to selectively cause the semiconductor wafer to contact the polishing pad. The pad conditioner includes a pad conditioner head and a conditioning disk coupled to the pad conditioner head, and the conditioning disk is configured to selectively contact the polishing pad. The first motion sensor is configured to sense a first motion-related parameter associated with at least one of the rotatable platen, the polishing pad, the polishing head, or the pad conditioner. The defect prediction circuitry is configured to predict an irregular motion of the at least one of the rotatable platen, the polishing pad, the polishing head, or the pad conditioner based on the sensed first motion-related parameter.
The foregoing outlines features of several embodiments so that those skilled in the art may better understand the aspects of the present disclosure. Those skilled in the art should appreciate that they may readily use the present disclosure as a basis for designing or modifying other processes and structures for carrying out the same purposes and/or achieving the same advantages of the embodiments introduced herein. Those skilled in the art should also realize that such equivalent constructions do not depart from the spirit and scope of the present disclosure, and that they may make various changes, substitutions, and alterations herein without departing from the spirit and scope of the present disclosure.
The various embodiments described above can be combined to provide further embodiments. These and other changes can be made to the embodiments in light of the above-detailed description. In general, in the following claims, the terms used should not be construed to limit the claims to the specific embodiments disclosed in the specification and the claims, but should be construed to include all possible embodiments along with the full scope of equivalents to which such claims are entitled. Accordingly, the claims are not limited by the disclosure.
Number | Name | Date | Kind |
---|---|---|---|
6424137 | Sampson | Jul 2002 | B1 |
6431953 | Carter | Aug 2002 | B1 |
20030045009 | Tanaka et al. | Mar 2003 | A1 |
20060063383 | Pattengale, Jr. | Mar 2006 | A1 |
20100035525 | Deshpande et al. | Feb 2010 | A1 |
20100159804 | Sampurno | Jun 2010 | A1 |
20120021671 | McReynolds | Jan 2012 | A1 |
20120088439 | Duescher | Apr 2012 | A1 |
20120142254 | Jiang | Jun 2012 | A1 |
20130260646 | Lischka et al. | Oct 2013 | A1 |
20140134924 | Shinozaki | May 2014 | A1 |
20150224620 | Takahashi | Aug 2015 | A1 |
20170190020 | Kobayashi et al. | Jul 2017 | A1 |
20180071889 | Huang et al. | Mar 2018 | A1 |
20180093360 | Shinozaki | Apr 2018 | A1 |
20180093363 | Shinozaki | Apr 2018 | A1 |
Number | Date | Country |
---|---|---|
1345084 | Apr 2002 | CN |
1314096 | May 2007 | CN |
100380103 | Apr 2008 | CN |
102956521 | Mar 2013 | CN |
105571638 | May 2016 | CN |
105881194 | Aug 2016 | CN |
106463381 | Feb 2017 | CN |
106652999 | May 2017 | CN |
200915407 | Apr 2009 | TW |
WO-9738292 | Oct 1997 | WO |
WO 2010017000 | Feb 2010 | WO |
WO 2018074091 | Apr 2018 | WO |
Number | Date | Country | |
---|---|---|---|
20230330803 A1 | Oct 2023 | US |
Number | Date | Country | |
---|---|---|---|
62752598 | Oct 2018 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 16431957 | Jun 2019 | US |
Child | 18339099 | US |