This disclosure pertains to, inter alia, control systems having particular utility in governing the motions and positions achieved by positioning devices such as, but not limited to, respective stages for holding and moving reticles and substrates in microlithography systems.
Many precise industrial processes require machinery in which workpieces, process tools, measurement tools, and the like are accurately positioned and moved, usually while embodying a high degree of automation. An example category of such machinery includes various microlithography systems widely used in the semiconductor-device and micro-electronics industries for transferring images from a pattern-defining reticle onto a semiconductor wafer or other suitable substrate during semiconductor processing. In modern microlithography tools, the need to achieve extraordinarily accurate positioning and movements is critical, requiring these tools to achieve position and motion accuracies of their stages in the nanometer range.
A typical microlithographic exposure apparatus includes an illumination source, a reticle-stage assembly that retains a reticle (or pattern master), an optical assembly, a wafer-stage assembly that retains the substrate, a measurement system, and a control system. The wafer-stage assembly includes a wafer stage that retains the lithographic substrate (such as a semiconductor wafer), and a respective mover assembly that precisely positions the wafer stage and the wafer. Somewhat similarly, the reticle-stage assembly includes a reticle stage that retains the reticle, and a respective mover assembly that positions the reticle stage and the reticle. The control system independently directs current to these mover assemblies to generate forces causing motion of the wafer stage and reticle stage along respective “trajectories.”
The sizes of the images and features within the images transferred onto the substrate (termed generally a “wafer” herein) from the reticle are extremely small. Accordingly, precise positioning of the wafer and the reticle relative to the optical assembly is critical to the manufacture of high-density semiconductor devices. Typically, multiple identical microcircuits are formed on each wafer. Hence, during manufacture of the microcircuits, the wafer stage and/or the reticle stage are cyclically and repetitiously moved to follow an intended trajectory.
During movements of the stages, respective “following-errors” of the wafer stage and/or the reticle stage can occur. A following-error is the difference between the intended trajectory of the stage and an actual trajectory of the stage at a specified time. A following-error can arise due to lack of complete rigidity in the components of the microlithography tool, which is manifest as a slight time delay between the instant in which current is directed to the mover assembly and the instant in which the stage exhibits the corresponding motion.
Alignment errors can occur even if the stages are properly positioned relative to each other. For example, periodic vibration disturbances of various mechanical structures of the microlithography tool can occur. Examples include oscillations and/or resonances of the optical assembly or supporting structures. These oscillations and/or resonances can significantly degrade relative alignment between the stages and the optical assembly. As a result of following-errors and/or the vibration disturbances, the achievable precision with which micro-devices can be manufactured is compromised.
Conventional approaches to reducing following-errors include feedback control of stage motion. In a stage system under feedback control, during movement of one of the stages a measurement system periodically provides data on the current position of the stage. This data is utilized by a controller to adjust the level of current to the mover assembly of the stage in an attempt to achieve the intended trajectory of the stage. Unfortunately, feedback control is not entirely satisfactory, and the control system does not always precisely move the stage along its intended trajectory.
Also, the movable portion of a stage inherently has mass, usually substantial mass. Regardless of applicable tolerances, controlling positions and motions of a stage involves dealing not only with disturbances originating outside the stage but also with disturbances originating in motions (including accelerations and decelerations) of the stage mass itself. No control system is perfect; each has limitations such as following-error and/or synchrony of relative stage motions. The goal of control systems used with these stages is to achieve a level of stage position and motion control sufficient to meet extremely demanding specifications. As specifications progressively tighten, the need for more accurate and precise control follows apace.
In light of the above, there is a need for control systems and methods that improve the accuracy and precision of stage positioning and movement.
The need articulated above is satisfied by apparatus and methods as disclosed herein. The apparatus and methods are particularly useful for and/or during automatic calibrations or automatic tunings of a high-precision system, particularly of one or more stages in a high-precision system. A “stage” is an assembly used for holding and moving a thing (e.g., a pattern-defining reticle, a mask, a tool, a workpiece, etc.) with high accuracy and precision. The stage is capable of moving the thing in at least one degree of freedom of motion, usually more than one degree of freedom. The calibration and tuning can be directed to a single stage or of multiple stages, including synchronization of the stages' movements relative to each other. The calibration and tuning are advantageous not only for immediate use of the high-precision system but also for correcting changes in system performance over time. Thus, auto-calibration and/or auto-tuning is one way in which to ensure the system works at optimum performance over time. The calibration and/or tuning can be performed during initial set-up of the system and periodically thereafter.
The subject methods and apparatus achieve control of stages and the like faster than conventionally. More specifically, the subject methods, being iterative, achieve faster control convergence than conventional methods.
According to a first aspect of the subject apparatus and methods, a stage assembly is provided. An embodiment of such a stage assembly comprises a first stage and first and second controllers (or first and second portions of a single controller). The first controller is coupled to control the first stage in a feedback-control manner, according to at least one respective parameter vector. The second controller is coupled to control the first stage in a feed-forward-control manner, according to at least one respective parameter vector. The first and second controllers are programmed to perform iterative feedback tuning (IFT), including minimization of a cost-function that is a function of at least one respective parameter vector from each of the first and second controllers. The second controller can be coupled to receive data including trajectory of the first stage, and the first controller can be coupled to receive data including following-error of the first stage. A particularly suitable application of this stage assembly is a reticle stage or a wafer stage, as used in a microlithography system. Alternatively, the stage assembly can be used wherever a finely controlled stage is needed or useful, such as (but not limited to) various high-precision measuring, positioning, and processing tools.
IFT is an iterative method that obtains control measurement data from “experiments” rather than relying upon lengthy numerical calculations. IFT is also gradient-based, wherein gradients of control parameters are iteratively measured with the aim of minimizing a cost function. Hence, IFT achieves faster and more accurate convergence, and hence more accurate control, than conventional control methods. For example, one gradient experiment can apply to substantially all parameters being controlled by the particular controller.
The stage assembly can further comprise a second stage and a third controller (or third portion of a controller). If the first stage is a reticle stage, the second stage can be a substrate stage, for example, of a microlithography system. Alternatively, the first stage can hold a tool and the second stage can hold a workpiece. The third controller is coupled to receive data regarding at least respective position errors of the first and second stages and is programmed to synchronize movement of the first and second stages, according to at least one respective parameter vector. The third controller can be further programmed to perform IFT, cooperatively with the first and second controllers, including minimization of a cost-function that is also a function of the at least one respective parameter vector of the third controller. The cost-function can include a synchronization error pertaining to the motion of at least one of the first and second stages relative to the other and a control output of the first controller. This control output desirably is a function of the at least one respective parameter vector of each of the first, second, and third controllers. In this three-controller embodiment, the synchronization error can be a function of the at least one respective parameter vector from each of the first, second, and third controllers. The synchronization error and the control output desirably each have respective weighting functions in the cost-function.
Three-controller embodiments can have one or more of several control connections. In a first example, the output of the third controller can be input to the first controller. In a second example, the output of the second controller is summed with an output of the first controller. In a third example, the output of the third controller is connected such that a difference of a trajectory of the first stage and the output of the third controller is input to the first controller. In the third example the input to the first controller is a following-error that is a function of the at least one respective parameter vector of each of the first, second, and third controllers.
Each iteration of IFT performed by one or more of the controllers includes a first experiment directed to an evaluation of a cost-function. The IFT iteration also includes respective experiments directed to measurements of gradients of the at least one respective parameter vector of each of the first, second, and third controllers.
An IFT cost-function desirably includes the following-error of the first stage and a control output of the first controller. The control output is a function of the at least one respective parameter vector of each of the first and second controllers. The following-error and the control output can have respective weighting functions in the cost-function.
According to another aspect, methods are provided for controlling movement of at least a first stage. One embodiment of the methods comprises coupling a first controller (or first portion of a controller) to the first stage in a feedback-control manner to provide a control command to the first stage and to receive data regarding a following-error of the first stage. Outputs (called “first outputs”) are produced by the first controller; a first output is a function of at least one respective parameter vector of the first controller. A second controller is coupled to the first controller to control the first stage in a feed-forward manner in cooperation with the first controller. Outputs (called “second outputs”) are produced by the second controller; a second output is a function of at least one respective parameter vector of the second controller. Using the first and second controllers, multiple iterations of IFT are performed to minimize a cost-function that is a function of the at least one respective parameter vectors for each of the first and second controllers, thereby IFT-tuning the first and second outputs.
The method can further comprise inputting the second controller with data regarding the trajectory of the first stage. The method can further comprise combining the IFT-tuned first and second outputs to produce a force command, and routing the force-command to the first stage. The results of the multiple IFT iterations can be applied to tune the feedback and feed-forward control of the first stage, as reflected in the force-command routed to the first stage from the first and second controllers. Alternatively, the results of iterations of the IFT can be applied to tune the feedback and feed-forward control of the first stage.
Each iteration of IFT in this embodiment desirably includes evaluating the cost-function and measuring a gradient of the at least one parameter vector of the first controller and a gradient of the at least one parameter vector of the second controller. Based on the evaluation of the cost-function and the measurements of the gradients, the cost-function is minimized. The outputs of the first and second controllers can be tuned based on the minimized cost-function.
The method can further comprise coupling a third controller to the first controller and the first stage, and inputting the third controller with data concerning position errors of the first stage and position errors of a second stage to which motions of the first stage are to be synchronized. The third controller is used, in cooperation with the first and second controllers and according to at least one respective parameter vector of the third controller, for controlling synchronization of the first and second stages. IFT can also be performed with the third controller, in cooperation with the first and second controllers, including minimization of a cost-function that is also a function of the at least one parameter vector of the third controller. With IFT being also performed by the third controller, each iteration of IFT can further include measuring a gradient of the at least one parameter vector of the third controller. Based on the evaluation of the cost-function and the measurements of the gradients, control is achieved by minimizing the cost-function, and tuning the outputs of the first, second, and third controllers based on the minimized cost-function.
According to another aspect of the disclosure, precision systems are disclosed. An embodiment of such a system comprises a process assembly, a first stage, and a first controller (or first portion of a controller). The first stage is movable relative to the process assembly, and the first controller is coupled to control the first stage in a feedback-control manner, according to a respective transfer-function and at least one respective parameter vector. The system also includes a second controller (or second portion of a controller) coupled to control the first stage in a feed-forward-control manner, according to a respective transfer-function and at least one respective parameter vector. The first and second controllers are programmed to perform IFT including minimizing a cost-function that is a function of at least one respective parameter vector from each of the first and second controllers.
Embodiments of the system can further comprise a second stage that is movable relative to the process assembly. A third controller (or third portion of a controller) is coupled to receive data at least regarding respective position errors of the first and second stages and to control synchronous movement of the first and second stages, according to a respective transfer-function of at least one respective parameter vector of the third controller. The third controller can be programmed to perform IFT, in cooperation with the first and second controller, including minimization of a cost-function that is also a function of the at least one respective parameter vector of the third controller.
According to yet another aspect, methods are provided for controlling movement of at least a first stage. In an embodiment, a trajectory for the first stage is established. According to the trajectory, a control command is directed to the first stage to cause corresponding motion of the first stage. As the first stage moves according to the control command, position of the first stage is measured. In a feedback-control manner, the measured stage position is compared with the trajectory, and a respective position-error of the first stage is determined. From the position-error of the first stage, a first processed output is produced that is a function of at least a first parameter vector. A second processed output is produced that is a function of at least a second parameter vector different from the first parameter vector. The second processed output is fed-forward and combined with the first processed output. The first and second processed outputs are tuned by performing at least one iteration of IFT to minimize a cost-function that is a function of at least the first and second parameter vectors. The first stage is moved according to the tuned first and second processed outputs. The first position error of the first stage can include a following-error, and the second processed output cam be calculated from data concerning at least the trajectory of the first stage.
The method can further comprise determining a position error of the first stage relative to an expected position that otherwise would result from the trajectory. A position error of a second stage is determined, wherein motion of the second stage is to be synchronized with motion of the first stage. A third processed output is produced that is a function of at least a third parameter vector different from the first and second parameter vectors. The first stage is caused to move according to the first, second, and third processed outputs. The first, second, and third processed outputs can be tuned by performing at least one IFT iteration to minimize a cost-function that is a function of at least the first, second, and third parameter vectors. The first stage is moved according to the tuned first, second, and third processed outputs. The first stage can be a reticle stage, mask stage, or tool stage. The second stage is a substrate stage (or workpiece stage) placed relative to the first stage. The first and second stages can move substantially synchronously relative to each other.
The foregoing and additional features and advantages of the invention will be more readily apparent from the following detailed description, which proceeds with reference to the drawings.
This disclosure is set forth in the context of representative embodiments that are not intended to be limiting in any way.
Further alternatively, for example, the stage assembly 220 can be used as the reticle-stage assembly 18 in the exposure apparatus 10 of
The stage assembly 220 includes a stage base 202, a coarse-stage mover assembly 204, a coarse stage 206, a fine stage 208, and a fine-stage mover assembly 210. The configuration of the components of the stage assembly 220 can be varied as required. For example, in
The stage base 202 is generally rectangularly shaped. Alternatively, the stage base 202 can be another shape. The stage base 202 supports some of the components of the stage assembly 220 above the mounting base 30 illustrated in
The configuration of the coarse-stage mover assembly 204 can be varied to suit the movement requirements of the stage assembly 220. In one embodiment, the coarse-stage mover assembly 204 includes one or more movers, such as rotary motors, voice-coil motors, linear motors utilizing a Lorentz force to generate a driving force, electromagnetic actuators, planar motors, or other force actuators.
The coarse-stage mover assembly 204 moves the coarse stage 206 relative to the stage base 202 along the X-axis, along the Y-axis, and about the Z-axis (collectively “the planar degrees of freedom” x, y, and θz, respectively). Additionally, the coarse-stage mover assembly 204 can be configured to move and position the coarse stage 206 along the Z-axis, about the X-axis and/or about the Y-axis relative to the stage base 202 (z, θx, and θy, respectively). Alternatively, for example, the coarse-stage mover assembly 204 can be configured to move the coarse stage 206 with fewer than three degrees of freedom.
In
In
The first mover component 212 can be maintained above the second mover component 214 using vacuum pre-load type air bearings (not shown). With this configuration, the coarse stage 206 is movable relative to the stage base 202 with three degrees of freedom (x, y, and θz). Alternatively, the first mover component 212 could be supported above the second mover component 214 by other ways, such as guides, a rolling-type bearing, or by the magnetic levitation forces. Further alternatively or in addition, the coarse-stage mover assembly 204 can be configured to be movable with up to six degrees of freedom (x, y, z, θx, θy, θz). Further alternatively, the coarse-stage mover assembly 204 can be configured to include one or more electromagnetic actuators.
The control system 224 directs electrical current to one or more of the conductors in the conductor array. The electrical current through the conductors causes the conductors to interact with the magnetic field of the magnet array. This generates a force between the magnet array and the conductor array that can be used to control, move, and position the first mover component 212 and the coarse stage 206 relative to the second mover component 214 and the stage base 202. The control system 224 adjusts and controls the current level for each conductor to achieve the desired resultant forces. In other words, the control system 224 directs current to the conductor array to position the coarse stage 206 relative to the stage base 202.
The fine stage 208 includes an object holder (not shown) that retains the object 200. The object holder can include a vacuum chuck, an electrostatic chuck, or clamp.
The fine-stage mover assembly 210 moves and adjusts the position of the fine stage 208 relative to the coarse stage 206. For example, the fine-stage mover assembly 210 can adjust the position of the fine stage 208 with six degrees of freedom (x, y, z, θx, θy, θz). Alternatively, for example, the fine-stage mover assembly 210 can be configured to move the fine stage 208 with only three degrees of freedom. The fine-stage mover assembly 210 can include one or more rotary motors, voice-coil motors, linear motors, electromagnetic actuators, or other type of actuators. Further alternatively, the fine stage 208 can be fixed to the coarse stage 206.
In
One of the actuator pairs 226 (one of the horizontal movers 216) is mounted so that the attractive forces produced thereby are substantially parallel with the X-axis. Two of the actuator pairs 226 (two of the horizontal movers 216) are mounted so that the attractive forces produced thereby are substantially parallel with the Y-axis. Three actuator pairs 226 (the vertical horizontal movers 216) are mounted so that the attractive forces produced thereby are substantially parallel with the Z-axis. With this arrangement: (a) the horizontal movers 216 can make fine adjustments to the position of the fine stage 208 along the X-axis, along the Y-axis, and about the Z-axis (x, y, and θz, respectively), and (b) the vertical movers 218 can make fine adjustments to the position of the fine stage 208 along the Z-axis, about the X-axis, and about the Y-axis (z, θx, θy, respectively).
Alternatively, for example, two actuator pairs 226 can be mounted parallel to the X-direction, and one actuator pair 226 can be mounted parallel to the Y-direction. Further alternatively, other arrangements of the actuator pairs 226 can be utilized.
In one embodiment, the measurement system 22 (
The stage assembly 220D or the stage assembly 220 (
For illustrative purposes,
During tuning, a desired trajectory is made and the respective data on position and velocity, for example, of the stage are saved. These data can be applied to the control of subsequent trajectories. It will be understood that the above merely describes an example, and the “similarity” between the actual trajectory of an iteration and the actual trajectory of the previous iteration may be more general. After tuning, for instance, the velocity and shot-size of the stage may be changed.
An embodiment of a stage-control system is shown in
Depending upon the particular application, either the synchronization controller H(z) or the feed-forward controller GFF(z) is optional. Alternatively, both can be present, as shown. These controllers H(z), GFF(z), as present, as well as the feedback controller C(z) are programmed to perform iterative feedback tuning (IFT) described in detail below.
The following discussion is presented in the context of the feedback controller C(z) controlling position and movement of a stage. But, this discussion is generally applicable to other controllers as well.
A control system in general operates to control the positioning of the plant (in this case a stage). An intended trajectory r(t) of the stage P is established based on the desired path of the stage. The intended trajectory r(t) is relative to at least one axis, such as along the X-axis, along the Y-axis, and/or about the Z-axis (θz), for example. The intended trajectory r(t) may also include components about the X-axis (θx), about the Y-axis (θw), and/or along the Z-axis, or any combination thereof.
The intended trajectory r(t) is compared with the actual trajectory of the stage P to determine whether the stage is properly positioned. The actual trajectory is determined by a measurement system (e.g., item 22 in
After the control law determines the current, the current is distributed as a “force command” u(t) to the one or more mover assemblies of the stage P, as appropriate. The mover assemblies then move the stage P, causing it to follow more accurately the intended trajectory r(t). Data on the position of the stage, or object thereon, is then compared with a corresponding position based on the trajectory r(t) to increase positioning accuracy.
Referring again to
Iterative Feedback Tuning (IFT) is an adaptive control technique that originally was developed for tuning of feedback-control filters. Hjalmarsson, “Iterative Feedback Tuning—An Overview,” Int. J. Adaptive Control and Signal Processing 16:373-395 (2002). IFT is an iterative technique in which the parameters of a controller are successively updated using data from closed-loop “experiments” and proceeding to an updated “law” for the control parameters. Controller parameters are updated by minimizing a specified cost-function using gradient-search techniques, and the gradients are estimated from data obtained from iterative experiments. Controller parameters are changed iteratively with the goal of improved performance in subsequent iterations. In other words, IFT is a gradient-based optimization technique in which controller parameters are updated according to the gradient of the cost-function and according to Hessian Matrices (approximate second-order gradients). No plant model or other explicit model of the system to be controlled is required, and the tuning algorithm can be executed without interrupting closed-loop control. In contrast to other gradient-based methods, in IFT the gradient of the cost-function in each iteration is directly measured from only one or a few additional experiments.
Feedback control involving IFT is performed using a feedback controller controlling at least one control parameter, which can be designated ρ. The cost-function for the IFT is a function of the parameter vector
Shown in
The cost function J(
wherein e1(t,
In IFT, the cost function is minimized with respect to the parameter vector
The solution can be obtained using a stochastic approximation algorithm, and requires, for each iteration, obtaining the quantities e(t,
and unbiased estimates of
These are evaluated in the second experiment.
To estimate the direction for the next optimization step, a gradient-measurement experiment is performed, using the error in the evaluation of the cost-function J(
The parameter gradients of control and output,
respectively, can be evaluated (either on-line or off-line) from the control signal u2(t) and output signal y2(t), respectively, obtained via a “gradient filter”
is a vector. The vector describes the parameter gradients of the control filter that appear in the gradient filter:
Upon completing the gradient-measurement experiment, the control parameter
where the step size γi>0, and Ri is an appropriate positive definite matrix such as a Hessian matrix. The gradient of the cost-function may be estimated as follows with the measured parameter gradients
The Hessian Matrix (second-order gradient of the cost-function) is approximated as described below with the measured parameter gradients,
After updating the control parameters, the first and second experiments are repeated in one or more subsequent iterations until a termination condition for the optimization is satisfied.
The following analytical gradient operations for output and control, respectively, are useful for understanding IFT as utilized herein. The plant output position in the first experiment 1 is driven by the feedback controller as follows:
which is abbreviated
The equations above indicate that the gradients are directly measurable with a redefined reference from the same closed-loop system.
A list of control gradient filters,
is provided in Table 1, below.
In Table 1 the subscripts represent polynomial orders.
Based on the above, application of IFT to feed-forward control and synchronization control is described below.
In this embodiment IFT is used not only in feedback control but also in feed-forward control of stage positioning and movement. The movement can be step-wise, continuous (e.g., scanning), or a combination of these.
The general closed-loop system is shown in
The cost function to be minimized is described as follows, including both weighted error and control command:
in which e1(t,
A Gauss-Newton gradient method is used to minimize the cost-function. Parameter updating from iteration i to iteration i+1 depends upon step size, the gradients of the cost-function, and the Hessian matrix:
in which the step size 1≧γi>0. The gradient of the cost-function may be estimated as set forth below, using
from experiment 2:
The Hessian matrix (second-order gradient of the cost-function) is approximated below with the measured
from experiment 2:
The stage output position in experiment 1 is driven by the feedback and feed-forward controllers:
The associated parameter-gradient vector is derived as follows:
If r2(t) is defined as:
The feedback control in experiment 1 is as follows:
u
fb
1(t,
and its associated parameter vector is:
which can be simplified as follows:
To estimate the direction for the next optimization step, a gradient-measurement experiment (experiment 2) is performed, which can be configured based on Equations (14), (15), (16), and (17). As a reference, the error in experiment 1 is used, as illustrated in
The parameter gradients of control and output (
respectively) can be evaluated (either on-line or off-line) from the control and output signals obtained in this experiment through the “gradient filter”
is a vector:
After updating the control parameters with Equations (b), (c), and (d), experiments 1 and 2 can be repeated until the termination condition for the optimization is satisfied.
A list of control gradient filters is as provided in Table 1, above.
In this example of the first embodiment, the feedback filter was tuned to improve settling performance of the stage undergoing a scanning motion. A reticle stage was used, having a mass of 10 kg, 2000-Hz bandwidth amplifiers, and four-samples delay (1 Ts=96×10−6 sec. For cost-function evaluations, the stage is caused to execute two 33-mm shots at 2.8 m/s scanning velocity and an average acceleration of 10×g, with a 2 msec settling time. The position trajectory was delayed by four samples to synchronize the system delay.
An evaluation was performed of the following cost-function, expressed as a weighted mean-square sum of the following-error and the feedback force:
wherein λ=1×10−16. The magnitude ratio between the following-error and the feedback force is approximately 10−8, so the value of λ stated above can be used as the weighting between the square sums of the feedback force and the following-error in the cost-function. With selected weighting, and over multiple iterations, the following-error converged and the closed-loop system remained stable.
In this embodiment IFT is used for both feedback control and feed-forward control of a first stage (e.g., reticle stage) as well as synchronization control of the first stage with a second stage (e.g., wafer stage). In synchronization control, motion of the reticle stage is coordinated with motion of the wafer stage, and vice versa.
The first IFT experiment involves evaluation of the cost-function J(
Here, the wafer-stage following-error ews1(t) through the target filter H(z,
In a lithography system in which the magnification factor of the imaging-optical system is ¼, the synchronization error esynch1(t,
e
synch
1(t,
Since the reticle-stage trajectory is four times the wafer trajectory, r1(t)=4rws1(t), the synchronization error on the reticle may be further expressed as below:
For synchronization control, an exemplary cost-function to be minimized over multiple iterations is:
in which t is time; esynch1(t,
As illustrated in
A Gauss-Newton gradient-evaluation method can be used to minimize the cost-function J(
where the step size 1≧yi>0. The gradients of the cost-function may be calculated as follows:
The Hessian Matrices used for determining the next-step direction in Equations (24A), (24B), and (24C) are approximated as follows:
The parameter gradient vectors of output position and feedback force command,
in the foregoing gradients and Hessian matrices are measured later, via the gradient filters
in additional experiments.
Based on the reticle-stage output position in the first experiment,
the output gradient with respect to the feedback-control parameters is as follows:
In the gradient-measurement experiment for the feedback controller, the reference and output for the controller can be directly measured as follows:
According to Equations (28), (29), and (30), the output gradient with respect to feedback-control parameters may be directly measured:
A derivation of the feedback-control gradient with respect to the feedback parameter starts with the following:
u
fb
1(t,
After some simple manipulations, the feedback-force gradient with respect to feedback-control parameters is derived as follows:
Based on Equations (29), (30), (31), and (32), the gradient-measurement experiment with respect to the feedback parameters is described below.
In the second experiment gradient measurements are obtained with respect to the feedback-control parameters
Outputs from the gradient filters are the respective gradients
To avoid an excessive excitation force in the associated gradient experiment, the feed-forward control has been separated into two portions. The two portions are a default portion ĜFF (z) with fixed parameters and a delta portion ΔGFF(z,
G
FF(z,
Based on the reticle-stage output position in the first experiment,
the output gradient with respect to the feed-forward control parameters is:
Similarly, based on the feedback-control force in the first experiment:
u
fb
1(t,
the associated feedback-control gradient with respect to the feed-forward control parameters is:
With Equation (36), the output gradient (Equation (38)) may be further expressed as follows:
The second experiment provided a gradient measurement of the feedback parameters
Based on the reticle-stage output position in the first experiment:
the output gradient with respect to the target filter parameters is:
Similarly, based on the feedback-control force in the first experiment:
u
fb
1(t,
the associated feedback-control gradient with respect to target filter parameters is:
With Equation (41), the feedback-control gradient (Equation (43)), may be further expressed as below:
In the fourth experiment, gradient measurements are obtained with respect to the synchronization filter parameters φ. Referring to
provide respective outputs
The gradient filter used in each of experiments 2-4 is actually a vector of filters:
A list of control filters is provided in Table 1, above. In each iteration of IFT the experiments 1-4 are executed to evaluate the cost-function and the associated gradients, to update the controller parameters. The iterations are continued until the parameters converge or the terminating condition is met. Updating of the parameters
For simplicity in this example, it is assumed that the reticle-stage feed-forward control is well-tuned beforehand. Here, a reticle-stage AFC filter and a target notch filter attenuate the wafer-stage following-error vibrations at approximately 80 Hz and 300 Hz, respectively. The simulation is performed in the discrete time domain with a sample period of 96×10−6 second. Broad-band noise (±1 nm) is added to both the reticle-stage and wafer-stage error data.
A two-shot trajectory was used to evaluate the cost-function, the averaged mean square of the synchronization error of two exposures:
Experiment 2 of the IFT was a gradient evaluation for the reticle-stage AFC filter parameters. The reticle-stage AFC filter used the same frequency in the numerator and denominator terms, with a fixed ratio ra (=1×10−6 here) between their corresponding damping ratios. Hence, only two terms, wa and da, were tuned. The transfer functions for the AFC filter and the associated gradient filters are listed below.
The reticle-stage error recorded in experiment 1 is used as a reticle-stage reference while both the feed-forward control and filtered wafer-stage input are off.
Experiment 3 was a gradient evaluation for the target notch-filter parameters. For simplicity, the target notch-filter used the same frequency in the numerator and denominator terms, with a fixed ratio rn (=1×10−6) between their corresponding damping ratios. Hence, only two terms, wn and dn were tuned. The transfer functions for the target notch-filter and associated gradient filters are provided below:
To estimate the gradient associated with notch-filter parameters, the wafer-stage error obtained in experiment 1 was used as the input for the target notch-filter, while the reticle stage was regulated.
With the evaluated gradients and Hessian matrices, the control parameters were updated with step size λ=0.5. The AFC and notch filters used here were of positive parameters (in continuous time domain) for stability. A lower bound was set for each parameter. If the parameter value for the next step was out of its lower bound, it was updated as the average of the lower bound and the current step value.
Plots of cost function, AFC frequency, AFC damping history, notch-frequency, and notch-damping with increased iteration are shown in
In this embodiment, IFT is used for feed-forward control, along with feedback control, of a stage.
The first experiment is directed to cost-function evaluation. To avoid a high-force excitation during gradient-evaluation experiments (so that the following-error can remain small), the default feed-forward gain, ĜFF (z), which provides the major force, can be fixed. This leaves the delta feed-forward gain ΔGFF (z,
The goal is to minimize this cost-function. The Gauss Newton gradient method (i.e., Gauss Newton approximation of the Hessian of J(
of the cost-function, and the Hessian matrix R:
Desirably, the step size in each iteration is 1≧γi>0. The gradients of the cost function are calculated as follows:
and the Hessian matrices (approximate second-order gradients) for the next iteration step in Equation (47) may be approximated as follows:
A key goal is to define the feed-forward parameter gradient vector of the output position,
This term can be derived as follows. As shown in
The associated parameter-gradient vector is derived as follows:
If we define:
Based on Equations (52) and (53), the gradient-measurement experiment (experiment 2) is designed as follows, with respect to feed-forward parameters.
According to Equation (52), the stage position is regulated at an arbitrary set-point r2 (i.e., r2=0), in which the “2” superscript denotes experiment 2. The delta feed-forward control is applied with the same trajectory r1(t) as in experiment 1. See control diagram of
The parameter gradient vector
is measured as a filtered output position with gradient filters
where:
For a simple trajectory-based delta feed-forward control,
ΔGFF(s)=ksnaps4+kjerks3+kaccs2+kvels+kpos (55)
the gradient filters
are listed in Table 2, below.
For discrete time implementation, Tustin and zero-order-hold conversion may be applied to the above gradient filters for trajectories with trapezoidal and Euler integrators respectively.
This example is a simulation model in which a reticle stage system was used. The simulation was conduced with a discrete-time model. To verify the tuning effectiveness explicitly, the system modeling and timing were carefully treated to provide an analytical optimization solution for feed-forward control. For instance, the plant was a 2nd order rigid-body with 2nd amplifier dynamics, converted to a discrete-time model using the Tustin-bilinear method:
The trajectory utilized trapezoidal (bilinear) integrators. The trajectory output position was delayed by the same four samples as the system delay. Therefore, the overall acceleration feed-forward gain was expected to be the inverse dynamics of the plant.
The feed-forward control gains were separated into two portions, default feed-forward gains and delta feed-forward gains. The same default feed-forward gain was used all the time while tuning the delta feed-forward gains.
The feed-forward control was roughly tuned to achieve a 270-Hz closed-loop bandwidth (see
For evaluation of the cost function, the stage was run two “shots” (
In each iteration, after completing the cost-function evaluation (experiment 1), a gradient-evaluation experiment was executed with the stage position being regulated (with a fixed position setpoint) while applying the delta feed-forward control. With the measured gradient values, the delta feed forward parameters were updated using Equations (47), (48), and (49), which concluded the tuning iteration. The same procedure (experiments 1 and 2) was repeated over multiple iterations with updated delta feed-forward parameters until the tunings converged.
To see the step-size effect, several sets of tunings were executed with various respective step sizes.
Results obtained with a tuning process having an ideal step size (γi=1) for all iterations (i=0, 1, 2, . . . , 9) are shown in
With step sizes deviating from γi=1, convergence slowed, but rapid (exponential) convergence was still seen.
It will be understood that the “optical assembly” 16 can include optical and mechanical components. But the assembly 16 in other precision system embodiments may not have any optical components. The assembly 16 can be any of various “process assemblies” or process tools relative to which at least one of the stages 18, 20 positions an object being carried by the stage.
The control system 24 utilizes a position-compensation system that improves the accuracy in the control and relative positioning of at least one of the stage assemblies 18, 20. The control system 24 can include multiple controllers, including stage-motion controllers programmed to perform IFT as they iteratively control motion of one or more of the stage assemblies.
The exposure apparatus 10 is useful as a lithography tool that transfers a pattern (not shown) of an integrated circuit or other micro-device from a reticle 26 onto a substrate (“wafer”) 28. The exposure apparatus 10 rests on a mounting base 30, e.g., the ground, a base, a floor, or other supporting structure.
There are a number of different types of lithography tools. For example, the exposure apparatus 10 can be used as scanning-type photolithography system that exposes the pattern from the reticle 26 onto the wafer 28 with the reticle 26 and the wafer 28 moving synchronously. In a scanning-type lithography tool, during exposures the reticle 26 is moved perpendicularly to an optical axis of the optical assembly 16 by the reticle-stage assembly 18, and the wafer 28 is moved perpendicularly to the optical axis of the optical assembly 16 by the wafer-stage assembly 20. Meanwhile, scanning of the reticle 26 and the wafer 28 occurs. Synchronous motions of the reticle and wafer are achieved while their respective stage assemblies are being controlled as described above.
Alternatively, the exposure apparatus 10 can be a step-and-repeat type of lithography tool that exposes the wafer 28 while the reticle 26 and the wafer 28 are momentarily stationary. In step-and-repeat exposure, the wafer 28 is in a constant position relative to both the reticle 26 and the optical assembly 16 during exposure of an individual field on the wafer. Between consecutive exposure steps, the wafer 28 is moved using the wafer-stage assembly 20 perpendicularly to the optical axis of the optical assembly 16 to bring the next field of the wafer 28 into position relative to the optical assembly 16 and the reticle 26 for exposure. By repeating this sequence, images of the pattern defined by the reticle 26 are sequentially exposed onto the fields of the wafer 28.
Use of the exposure apparatus 10 provided herein is not limited to a lithography tool for integrated-circuit manufacturing. The exposure apparatus 10, for example, can be used as an LCD photolithography system that exposes a pattern of a liquid-crystal display device onto a rectangular glass plate, for example, or a photolithography system for manufacturing a thin-film magnetic head. Alternatively, the exposure apparatus 10 can be a proximity photolithography system that exposes a pattern from a mask to a substrate with the mask being located close to the substrate without the use of the optical assembly 16.
The apparatus frame 12 is rigid and supports the components of the exposure apparatus 10. The apparatus frame 12 illustrated in
The illumination system 14 includes an illumination source 34 and an illumination-optical assembly 36. The illumination source 34 emits a beam of light energy. The illumination-optical assembly 36 guides the beam of light energy from the illumination source 34 to the optical assembly 16. The beam illuminates selectively different portions of the reticle 26 and exposes the wafer 28. In
The illumination source 34 can be a high-pressure mercury lamp (producing, for example, g-line or i-line ultraviolet light), a KrF excimer laser, an ArF excimer laser, or a F2 excimer laser, or an x-ray source. Alternatively, the illumination source 34 can produce a charged-particle beam such as an electron beam. An electron beam can be produced by, for example, a thermionic-emission type lanthanum hexaboride (LaB6) source or a tantalum (Ta) cathode. Furthermore, in the case in which an electron beam is used, either a mask can be used or a pattern can be directly formed on the substrate without using a mask or reticle.
The assembly 16 typically is an optical assembly that, for example, projects and/or focuses the light energy passing through the reticle 26 to the wafer 28. Depending upon the design of the exposure apparatus 10, the image formed by the assembly 16 on the wafer can be magnified or reduced relative to the corresponding pattern on the reticle. Hence, the assembly 16 is not limited to a reduction system. It can alternatively be a 1× or a magnification system.
Whenever far-UV light such as light from an excimer laser is used for exposure, glass materials such as quartz and fluorite that transmit far-UV light can be used in the assembly 16. Whenever exposure using light from an F2 excimer laser, extreme UV, or X-ray source is used, the assembly 16 can be catadioptric or reflective (the reticle desirably is a reflective type). Whenever an electron beam is used, the assembly 16 includes electron optics such as electron lenses and deflectors. The optical path for an extreme UV beam or electron beam should be in a vacuum.
Examples of catadioptric (reflective-refractive) optical systems are discussed in U.S. Pat. Nos. 5,668,672 and 5,835,275. In these cases, the reflecting optical device can be a catadioptric optical system incorporating a beam-splitter and a concave mirror. U.S. Pat. No. 5,689,377 also discusses a catadioptric optical system incorporating a concave mirror, etc., but without a beam-splitter. As far as is permitted by law, the disclosures in these U.S. patents are incorporated herein by reference.
The reticle-stage assembly 18 holds and positions the reticle 26 relative to the assembly 16 and the wafer 28. Somewhat similarly, the wafer stage assembly 20 holds and positions the wafer 28 with respect to the projected image of the illuminated portions of the reticle 26. The stage assemblies 18, 20 are controlled in a manner as discussed above and are configured as described in more detail below.
In photolithography systems, when linear motors (see U.S. Pat. Nos. 5,623,853 and 5,528,118) are used in a reticle-stage assembly 18 and/or in a wafer-stage assembly 20, the linear motors can be either an air-levitation type employing air bearings or a magnetic-levitation type using Lorentz force or reactance force. Additionally, the stage can move along a guide, or it can be a guideless type of stage. As far as is permitted by law, the disclosures in these U.S. patents are incorporated herein by reference.
Alternatively, the reticle stage and/or wafer stage can be driven by a planar motor. A planar motor drives the stage by an electromagnetic force generated by a magnet unit having two-dimensionally arranged magnets and an armature-coil unit having two-dimensionally arranged coils in facing positions. With this type of driving system, either the magnet unit or the armature-coil unit is connected to the stage and the other unit is mounted on the moving-plane side of the stage.
Movement of the stages as described above generates reaction forces that can affect performance of the exposure system. Reaction forces generated by motion of the wafer stage can be mechanically transferred to the floor (ground) by using a frame member as discussed in U.S. Pat. No. 5,528,100. Additionally, reaction forces generated by motion of the reticle stage can be mechanically transferred to the floor (ground) using a frame member as discussed in U.S. Pat. No. 5,874,820. As far as is permitted by law, the disclosures in these U.S. patents are incorporated herein by reference.
Typically, multiple integrated circuits or other micro-devices are produced on a single wafer 28. The process may involve a substantial number of repetitive, identical, or substantially similar movements of portions of the reticle-stage assembly 18 and/or the wafer-stage assembly 20. Each such repetitive movement is also referred to herein as an iteration, iterative movement, or cycle, as defined in greater detail below.
The measurement system 22 monitors movement of the reticle 26 and the wafer 28 relative to the assembly 16 or some other reference. With this information, the control system 24 controls the reticle-stage assembly 18 to precisely position the reticle 26 and the wafer-stage assembly 20 to precisely position the wafer 28 relative to the assembly 16. For example, the measurement system 22 can utilize multiple laser interferometers, encoders, and/or other measuring devices.
One or more sensors 23 can monitor and/or receive information regarding one or more components of the exposure apparatus 10. For example, the exposure apparatus 10 can include one or more sensors 23 positioned on or near the assembly 16, the frame 12, or other suitable components. Information from the sensor(s) 23 can be provided to the control system 24 for processing. In the embodiment illustrated in
The control system 24 receives information from the measurement system 22 and other systems and controls the stage assemblies 18, 20 to precisely and synchronously position the reticle 26 and the wafer 28 relative to the assembly 16 or other reference. The control system 24 includes one or more processors, filters, and other circuits for performing its functions, as discussed above.
An exposure apparatus according to the embodiments described herein can be built by assembling various subsystems in such a manner that prescribed mechanical accuracy, electrical accuracy, and optical accuracy are maintained. To maintain the various accuracies, prior to and following assembly, every optical system is adjusted to achieve its specified optical accuracy. Similarly, every mechanical system and every electrical system are adjusted to achieve their respective specified mechanical and electrical accuracies. The process of assembling each subsystem into an exposure system includes mechanical interfaces, electrical-circuit wiring connections, and air-pressure plumbing connections between each subsystem, as required. Also, each subsystem is typically assembled prior to assembling an exposure apparatus from the various subsystems. After assembly of an exposure apparatus from its various subsystems, a total adjustment is performed to make sure that accuracy and precision are maintained in the exposure apparatus. It is desirable to manufacture an exposure apparatus in a clean room in which temperature and cleanliness are controlled.
Microelectronic devices (such as, but not limited to, semiconductor devices) may be fabricated using the apparatus described above. An exemplary fabrication process is shown in
Upon completion of pre-processing steps, post-processing steps may be implemented. In step 1315 a layer of photoresist is applied to the wafer. Then, in step 1316, an exposure apparatus is used to transfer the circuit pattern defined on the reticle to the wafer. Transferring the circuit pattern of the reticle to the wafer generally includes executing a scanning motion of a reticle-scanning stage. In one embodiment, scanning the reticle-scanning stage includes accelerating a fine stage with a coarse stage, then accelerating the fine stage substantially independently from the coarse stage.
After transfer of the circuit pattern on the reticle to the wafer, the exposed wafer is developed in step 1317. After development of the wafer, parts thereof other than residual photoresist, e.g., the exposed material surface, may be removed by etching. Finally, in step 1319, unnecessary photoresist remaining after etching is removed. Multiple circuit patterns may be formed on the wafer by repeating the pre-processing and post-processing steps.
While the invention has been described above in connection with representative embodiments and examples, it will be understood that the invention is not limited to those embodiments and/or examples. On the contrary, it is intended to encompass all modifications, alternatives, and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims.
This application claims priority to and the benefit of U.S. Provisional Patent Application No. 61/147,719, filed on Jan. 27, 2009, which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
61147719 | Jan 2009 | US |