The present invention relates generally to photolithography, and specifically to simulation of aerial images produced by projecting a mask onto a target surface.
Photolithography is an essential tool in reproduction of fine patterns on a substrate, and is very widely used in production of microelectronic devices. As the design rules used in such devices become ever finer, mask designers must increasingly resort to reticle enhancement technologies, such as the use of serifs, assist lines and phase shift masks, in order to project the desired pattern onto the device substrate. The aerial image that is actually formed on the substrate is a complex function of the characteristics of the illumination source and optics that are used in the lithographic process and of diffraction and interference effects caused by the structures on the mask itself. Mask designers need simulation systems that model these effects in order to predict the pattern that will be formed on the substrate by particular arrangements of mask features.
Simulation of the aerial image is complicated by the fact that practical lithography systems use partially-coherent illumination. For optical systems that are nearly paraxial, the intensity of the aerial image at the image plane with partially-coherent illumination of the mask is given by the well-known Hopkins formula:
Here g is the transmission function of the mask; h is the mutual coherence function of the illumination source (typically the Fourier transform of the condenser aperture function); and K is the point spread function (PSF) of the projection system. The function h can be expressed in terms of a hermitian matrix. Formula (1) is a quadruple integral, taken over the two-dimensional space Ξ in the mask plane. Accurate, direct computation of this integral is computationally heavy and requires precise knowledge of the optical system properties, as expressed by K and h. In some formulations, the properties of the optical system are expressed more compactly in the form of a kernel function W=K·h.
In order to reduce the complexity of aerial image simulation, a number of authors have suggested using optimal coherent decomposition (OCD) to approximate the optical properties of the partially-coherent imaging system as an incoherent sum of a finite number of coherent imaging systems. The aerial images that would be formed by each of the coherent imaging systems are computed and summed together to give the total, simulated aerial image. Pati et al. provide a useful overview of OCD methods in “Exploiting Structure in Fast Aerial Image Computation for Integrated Circuit Patterns,” IEEE Transactions on Semiconductor Manufacturing 10:1 (February, 1997), pages 62-73, which is incorporated herein by reference. This article describes the use of “basis” (or building block) images, which correspond to certain types of integrated circuit patterns, in order to compute aerial images by OCD.
Von Bunau et al. describe a related method of OCD in “Optimal Coherent Decompositions for Radially Symmetric Optical Systems,” Journal of Vacuum Science and Technology B15:6 (November/December, 1997), pages 2412-2416, which is incorporated herein by reference. The authors show that for optical systems that are radially symmetrical, the point spread functions and pupil functions corresponding to each term in the OCD expansion are separable in polar coordinates. They derive analytical expressions for the angular dependence of these terms and an integral equation for the radial dependence.
A number of methods for aerial image simulation have been described in the patent literature. For example, U.S. Pat. No. 6,223,139, whose disclosure is incorporated herein by reference, describes a method for kernel-based fast aerial image computation for a large-scale design of integrated circuit patterns. The method is based on determining an appropriate sampling range and sampling interval for use in generating simulated aerial images of a mask pattern, so as to enhance computation speed without sacrificing accuracy. U.S. Patent Application Publication 2002/0062206, whose disclosure is also incorporated herein by reference, describes a method for fast aerial image simulation, using a kernel that is calculated based on an orthogonal pupil projection of the parameters of the optical projection system onto a basis set. A vector is calculated based on an orthogonal mask projection of the parameters of the mask onto the basis set, and the field intensity distribution in the image plane is then calculated using the kernel and the vector.
The references cited in the Background of the Invention describe methods for simulating an aerial image assuming that the properties of the optical system used to project the image are known. Even when the optical system design is precisely known, however, the actual projection system may vary from the design specifications due to factors such as manufacturing tolerances and imprecise alignment of the optical components. Therefore, even if the Hopkins equation (or its eigenfunctions) were computed with high precision, the resulting simulated aerial image for a given mask might still deviate significantly from the actual aerial image created by projecting the mask using the optical system in question.
In response to this deficiency, embodiments of the present invention provide methods and systems for calibrating an aerial image simulator so as to model the actual properties of a given optical projection system. In some embodiments, the optical system is used to project a reference aerial image of a predetermined reference mask, and this image is captured electronically, typically using an electronic imaging camera. (In the context of the present patent application and in the claims, the term “mask” should be understood to comprise substantially any sort of object carrying a pattern that can be projected onto a target plane, and the term “camera” should be understood to refer to any suitable type of imaging device.) The reference aerial image is then processed in order to extract a set of expansion functions, which approximate the eigenfunctions of the kernel function W that represents the optical properties of the actual optical system in question.
The expansion functions that are determined in this manner account for the actual aberrations and misalignment of the optical system, as well as any aberrations or other defects of the camera. These expansion functions may then be used in accurate computation of simulated aerial images of other masks projected by this optical system. Thus, the expansion functions implicitly represent a calibration of the optical system for purposes of aerial image simulation. This implicit calibration obviates the need for direct measurement of the actual aberrations and misalignment.
In some embodiments, the optical system in question is part of a mask inspection system, which captures and processes aerial images of a mask under inspection in order to detect microscopic defects in the mask. The actual aerial image of the mask under inspection is compared to a simulated image that is computed based on the mask design and on the expansion functions that were determined based on the reference mask, as described above. Significant differences between the actual and simulated images can be attributed to defects in the mask under inspection. Because the expansion functions accurately represent the performance of the actual optical system (including the camera), the threshold for identifying a difference as “significant” can generally be set lower than in mask inspection systems known in the art.
There is therefore provided, in accordance with an embodiment of the present invention, a method for generating a simulated aerial image, including:
forming a reference aerial image of a first mask using an optical system;
capturing and processing the reference aerial image so as to generate a set of expansion functions representative of the optical system; and
computing the simulated aerial image of a second mask by applying the expansion functions to a design of the second mask.
In a disclosed embodiment, the first mask includes a pseudo-noise pattern. The pseudo-noise pattern may be characterized by a transmission function g that approximates a condition:
∫∫g({right arrow over (x)}−{right arrow over (z)}1)·g({right arrow over (x)}−{right arrow over (z)}2)·g({right arrow over (x)}−{right arrow over (z)}3)·g({right arrow over (x)}−{right arrow over (z)}4)d{right arrow over (x)}=Gδ({right arrow over (z)}1−{right arrow over (z)}3, {right arrow over (z)}2−{right arrow over (z)}4),
wherein G is a constant.
Typically, processing the reference aerial image includes finding the set of expansion functions so as to minimize a difference between the reference aerial image formed using the system and a simulation of the reference aerial image computed using the expansion functions. In disclosed embodiments, finding the expansion functions includes determining a kernel function based on properties of the optical system, and estimating a series of eigenfunctions of the kernel function. In some of these embodiments, finding the expansion functions includes expressing each of the expansion functions as a linear combination of the estimated eigenfunctions multiplied by respective expansion coefficients, and determining the expansion coefficients so as to minimize the difference between the reference aerial image formed using the system and the simulation of the reference aerial image.
In one embodiment, determining the expansion coefficients includes computing a gradient of the difference with respect to the expansion coefficients, and optimizing the expansion coefficients using the gradient. In another embodiment, determining the expansion coefficients includes defining a hermitian matrix based on the expansion coefficients, and applying a linear regression analysis to the difference between the reference aerial image formed using the system and the simulation of the reference aerial image in order to compute elements of the hermitian matrix.
In a disclosed embodiment, capturing the reference aerial image includes capturing the image using a camera having optical properties, and processing the reference aerial image includes determining the expansion functions so as to take into account the optical properties of the camera.
In some embodiments, the method includes forming and capturing an actual aerial image of the second mask using the optical system, and comparing the actual aerial image to the simulated aerial image so as to evaluate the second mask. Comparing the actual aerial image to the simulated aerial image may include detecting a difference between the actual aerial image to the simulated aerial image, and identifying a defect in the second mask if the difference is greater than a predetermined threshold.
There is also provided, in accordance with an embodiment of the present invention, apparatus for modeling an optical system, including:
a camera, which is adapted to capture a reference aerial image of a first mask, which image is formed using the optical system; and
an image processor, which is adapted to process the reference aerial image so as to generate a set of expansion functions representative of the optical system, and to compute a simulated aerial image of a second mask by applying the expansion functions to a design of the second mask.
There is additionally provided, in accordance with an embodiment of the present invention, apparatus for mask inspection, including:
an optical system, which is adapted to form aerial images of masks, the aerial images including a reference aerial image of a first mask and an actual aerial image of a second mask;
a camera, which is adapted to capture the aerial images formed by the optical system; and
an image processor, which is adapted to process the reference aerial image so as to generate a set of expansion functions representative of the optical system, and to compute a simulated aerial image of a second mask by applying the expansion functions to a design of the second mask, and to compare the actual aerial image to the simulated aerial image so as to evaluate the second mask.
There is further provided, in accordance with an embodiment of the present invention, a computer software product for modeling an optical system, the product including a computer-readable medium, in which program instructions are stored, which instructions, when read by a computer, cause the computer to process a reference aerial image of a first mask, which image is formed using the optical system, so as to generate a set of expansion functions representative of the optical system, and to compute a simulated aerial image of a second mask by applying the expansion functions to a design of the second mask.
The present invention will be more fully understood from the following detailed description of the embodiments thereof, taken together with the drawings in which:
System 20 comprises an illumination source 26, which emits radiation, typically comprising visible, ultraviolet or infrared radiation. A condenser lens 28, followed (in this example) by an aperture 30, focuses the light from source 26 through mask 22. A projection lens 32, having an aperture 34, focuses an aerial image of mask 22 onto plane 24. Typically, lenses 28 and 32 include complex, multi-element lenses.
In a disclosed embodiment, system 20 is used for inspection of mask 22, for purposes of detecting mask defects. For this purpose, an electronic imaging camera 36, such as a video camera or other two-dimensional array camera, or a line scan camera, captures the actual aerial image formed at plane 24 with high resolution. An image processor 37 also generates a simulated aerial image, based on the known design of mask 22, which is stored in a design database 38. The mask design determines the transmission function g({right arrow over (x)}). The simulated aerial image is computed, based on g({right arrow over (x)}) and on a set of expansion functions representing the optical characteristics of system 20. These expansion functions are computed using novel calibration techniques that are described hereinbelow. Image processor 37 compares the actual aerial image to the simulated aerial image in order to detect discrepancies between the two, which may be indicative of defects in mask 22.
Image processor 37 typically comprises a general-purpose computer, which performs the functions described in the present patent application under the control of suitable software. The software may be downloaded to the computer in electronic form, over a network, for example, or it may alternatively be provided on tangible media, such as optical or magnetic media or non-volatile memory. Alternatively or additionally, at least some of the functions of the image processor may be performed by dedicated or programmable hardware components, such as a digital signal processor.
wherein Φi are the denormalized eigenfunctions of the kernel. Equation (2) assumes that the PSF (K) of the optical system is position-independent over the area of interest in target plane 24. It also assumes that the influence of imperfections in camera 36 on images captured by the camera can be expressed as a linear filtering operation in the target plane:
C({right arrow over (z)})=Ch*I({right arrow over (z)}) (3)
wherein Ch is an impulse reaction of the filter, and the operator “*” represents convolution. Based on these assumptions, K can be modified to include the influence of the camera together with the projection optics, while the kernel W remains hermitian. Equation (1) can then be rewritten:
wherein I now represents the actual electronic image captured by camera 36. The derivation of equation (4) is given in greater detail in Appendix A below.
As noted earlier, it is practically very difficult to directly compute the exact eigenfunctions Φi. Therefore, the method of
Based on the estimated eigenfunctions, an initial simulated aerial image Ĩ is computed, at an image estimation step 42, using equation (4):
The transmission function g in this case refers to a reference mask, having a known pattern without defects. An actual, reference aerial image I of this reference mask is projected and captured by system 20, at an image capture step 44. The set of expansion functions {tilde over ({circumflex over (Φ)})}i is then found, at an optimization step 46, so as to minimize the difference between the actual aerial image captured at step 44 and a simulated aerial image based on the expansion functions:
Here the norm ∥·∥ ideally represents the Chebyshev norm (L28 ). Alternatively, for greater ease of analytical development, the norm
may be used. Note that L2n approaches the Chebyshev norm as n approaches infinity.
The formulation of the expansion functions in equation (6) does not guarantee that these expansion functions will be the eigenfunctions of W. Nevertheless, if the reference mask transmission function g is sufficiently representative of the characteristics of other masks that are to be projected by system 20, the expansion functions given by equation (6) can serve as a good approximation of the eigenfunctions. A series of 20-30 expansion functions of this sort (corresponding approximately to the first 20-30 eigenfunctions of W) is typically sufficient to give simulated aerial images that match the corresponding actual images to within one gray level unit.
∫∫g({right arrow over (x)}−{right arrow over (z)}1)·g({right arrow over (x)}−{right arrow over (z)}2)·g({right arrow over (x)}−{right arrow over (z)}3)·g({right arrow over (x)}−{right arrow over (z)}4)d{right arrow over (x)}=Gδ({right arrow over (z)}1−{right arrow over (z)}3, {right arrow over (z)}2−{right arrow over (z)}4) [equation (7)]
Here G is a constant, and {right arrow over (z)}j (j=1, 2, 3, 4) are arbitrary vectors. Mask 48 is shown here only by way of example. Other sorts of reference masks may also be used and are considered to be within the scope of the present invention.
Returning now to step 46 and equation (6), various methods may be used to determine the expansion functions. One possibility is to find the expansion functions iteratively, in decreasing order of the corresponding eigenvalues:
The computation of equations (9) and (10) is repeated iteratively until the entire set of expansion functions is found. At each step, the next expansion function is found so as to minimize the remainder of the image. This method is conceptually straightforward, but computationally complex because of the large dimensions of the eigenfunctions. Alternatively or additionally, the equations may be solved by manipulating discretized values of the functions in the equations, or by manipulating parameters in a parameterized expansion of the functions.
For example, the expansion functions may be expressed as linear combinations of the initial estimates of the eigenfunctions, {tilde over (Φ)}0k, which were found at step 40:
Here the expansion coefficients aik are scalar products given by aik=<{tilde over (Φ)}0k, {tilde over (Φ)}i>. This approach makes use of the orthogonality of the eigenfunctions and assumes that the subspace spanned by the initial estimates of the eigenfunctions is close to that spanned by the true eigenfunctions Φk. As the initial estimates of the eigenfunctions {tilde over (φ)}0k approach the true eigenfunctions Φk, the matrix of coefficients a approaches the identity matrix. Equation (5) may be re-expressed in terms of the aik coefficients as:
wherein “∘” designates element-wise multiplication (so that each element in the resulting matrix is the product of the corresponding elements in the multiplicand matrices), and Ψk≡g*{tilde over (Φ)}0k. In other words, Ψk represent precomputed “images” of the reference mask, which are multiplied and summed to give the complete simulated image.
Equation (6) may now be restated and solved, at step 46, in terms of the aik coefficients. Various methods may be used in order to find the optimal set of coefficients. One method is to express equation (6) as a parametrized minimization problem in L2n space:
wherein Va≡∥Da∥2n, and Da≡I({right arrow over (z)})−{right arrow over (I)}a({right arrow over (z)}). The gradient d of the minimized form Va can be expressed as a complex nxn matrix with elements given by:
The gradient elements may then be expressed in terms of the aik coefficients as follows:
The derivation of equation (15) is given in greater detail in Appendix B.
Equation (13) may now be solved by optimization methods known in the art, such as the method of steepest descent, conjugate gradient method or other gradient-based method, using the gradient given by equation (15). For example, using the steepest descent method, the aik coefficients may be found iteratively according to the following procedure:
â(0)=1 (identity matrix)
â(i+1)=â(i)−μ·d (16)
wherein μ is a convergence factor. Another alternative is to find a hessian matrix of Va using equation (15), and then to minimize Va on this basis.
Alternatively, the aik coefficients may be found indirectly using a linear regression model. For this purpose, we define a matrix b=a·ah, wherein ah is the hermitian transpose of a, i.e., the matrix elements of b are given by
We can then restate equation (12) as follows:
Because the matrix b is hermitian, equation (17) can be rewritten as
This simplified expression uses the following substitutions:
p=k·n+1, 1= . . . k, P=n*(n+1)/2
up=2·Re(Ψk∘Ψ1*) for k≠1
up=Ψk∘Ψk* for k=1
vp=−2·Im(Ψk∘Ψ1*) for k≠1
vp=0 for k=1
Note that the functions up and vp can be precomputed based on the design properties of the reference mask and optical system, before the calibration procedure of
Using the formulation of equation (18), equation (6) can be expressed as parameterized minimization problem in the space of the Chebyshev norm (L∞):
Standard methods of linear regression may be used to calculate Re(b) and Im(b). The matrix a can then be obtained from b, at step 46, using the definition given above, i.e., b=a·ah. For example, since b is hermitian, a can be found as a matrix made up of the denormalized eigenvectors of b.
Image processor 37 compares the actual image to the simulated image, pixel by pixel, at an image comparison step 54. Where differences occur in the pixel gray levels, the image processor examines the difference between the actual and simulated images (at the level of individual pixels or groups of pixels), in order to determine whether the difference exceeds a predetermined threshold criterion, at a thresholding step 56. If all the differences between the actual and simulated images are below threshold, image processor 37 reports that the mask under inspection is free of defects, at an approval step 58. Otherwise, if any differences in excess of the threshold are found, the image processor 37 reports the existence of a possible defect at the location on the mask corresponding to the image pixel at which the difference was found, at a defect reporting step 60. Additional automated and/or operator-initiated inspection and image processing steps may be performed at these possible defect locations in order to determine the cause of the deviation in the actual image.
Although the embodiment of
It will thus be appreciated that the embodiments described above are cited by way of example, and that the present invention is not limited to what has been particularly shown and described hereinabove. Rather, the scope of the present invention includes both combinations and subcombinations of the various features described hereinabove, as well as variations and modifications thereof which would occur to persons skilled in the art upon reading the foregoing description and which are not disclosed in the prior art.
The PSF (K) in general is not spatially shift-invariant in the target (image) plane. Nevertheless, it is reasonable to assume that the PSF can be decomposed into the product of a slow position-dependant component Ks and a fast position-independent component Kf. Then equation (1) can be rewritten as follows:
wherein K2({right arrow over (x)}1, {right arrow over (x)}2)≡Kf({right arrow over (x)}1)·Kf({right arrow over (x)}2). We assume that the working area is small enough so that Ks is constant. This constant coefficient and the index f will be omitted in the subsequent development. Equation (20) can then be rewritten in the following way:
As noted above, we assume that the influence of the camera on the PSF can be expressed as the operation of a linear filter applied in the target plane, as given by equation (3). Applying this definition to equation (20), omitting the leading coefficient, and changing the order of integration, gives:
Now substituting {right arrow over (z)}−{right arrow over (y)}={right arrow over (v)}, we obtain:
If we substitute this equation into equation (22), we obtain a result identical in form to equation (20), with the substitution of K2C for K2. Note that both K2c and K2 are hermitian functions. Thus, the influence of the camera does not change the form of equation (21), either.
Equation (21) can now be rewritten as follows:
wherein W({right arrow over (x)}1, {right arrow over (x)}2)≡K2({right arrow over (x)}1,{right arrow over (x)}2)·h({right arrow over (x)}1−{right arrow over (x)}2). As noted above, both K2 and h are hermitian, and thus W({right arrow over (x)}1,{right arrow over (x)}2) is hermitian, as well. Therefore, W can be expanded in the manner given above in equation (3). Substituting this expansion into equation (24) gives the expression of equation (4).
Based on the definitions of Va and Da in equation (13), equation (14) may be restated as follows:
From equation (12):
The partial derivatives in this equation may be simplified by using the identities:
Substituting these identities into equation (26), and performing some elementary transformations, gives the following result:
Substituting this result into equation (25) gives the expression for dnm in equation (15).
This application is related to another U.S. patent application, filed on even date, entitled “Simulation of Aerial Images,” which is assigned to the assignee of the present patent application and whose disclosure is incorporated herein by reference.