 
                 Patent Application
 Patent Application
                     20070143733
 20070143733
                    This invention relates to the field of fabrication of integrated circuits and, more particularly, to a method of compensating photomask data for the effects of etch and lithograph processes.
 In the fabrication of integrated circuits, layers of semiconductor devices are patterned using lithography and etch processes. Both of these processes distort the pattern. The pattern on the photomask can be changed in order to compensate for the combined distortion of lithography and etch processes. The photomask data set describes a pattern PMASK DATA which comprises a union of polygons. The photomask data set is transferred on to the wafer, to form the pattern PWAFER, by a composition of pattern transformations: 
PWAFER=TETCH(TLITHO(TMASK(PMASK DATA)))   (1) 
 TMASK is a transformation that maps the photomask data set to a pattern that is etched into a layer on the photomask. TMASK includes effects of software calculated dose adjustments, electron or laser beam interaction with a e-beam resist or a photoresist on the photomask, resist blur, development of the resist, and etching of the photomask. 
TLITHO is a transformation that maps the pattern etched on the photomask to a pattern that is formed in a photoresist layer deposited on a wafer. TLITHO includes the effects of optical image formation, photo-reactions and catalytic reactions in the photoresist, dissolution of the photoresist in a developer solution.
TETCH is a transformation that maps the pattern that is formed in the photoresist on a wafer to the pattern that is etched in a layer underlying the photoresist. TETCH includes all steps of the etch process, such as resist-trim and hard-mask-open processes of gate-etch. Gate poly-silicon etch is usually preceded by etching of a silicon dioxide or silicon oxi-nitride hard mask.
 Beale et al. (SPIE Vol. 5040, p. 1202-1209, 2003), which is incorporated herein by reference, proposed compensating the photomask data in multiple stages according to: 
PMASKDATA=TMASK−1(TLITHO−1(TETCH−1(PTARGET)))   (2) 
 In equation (2), PTARGET is the pattern desired on the wafer, TETCH−1 is the inverse of the transformation TETCH, and TLITHO−1 is the inverse of the transformation TLITHO. The method of Equation (2) is also called multi-step process proximity correction (PPC) (see: Choi et al., Proc SPIE, Vol 5040, p. 1176, 2003, which is incorporated herein by reference). The process is called multi-step or tandem correction because it performs a sequence of inverse transformations. The method first starts with the final target pattern PTARGET and obtains the target for the lithography inversion, which is TETCH−1 (PTARGET). It then proceeds to find the target for the mask writing, which is TLITHO−1(TETCH−1(PTARGET)). 
According to one approach, both PTARGET and TETCH−1 (PTARGET) are represented by a collection of polygons. However, representing TETCH−1 (PTARGET) by polygons is counter productive because etch correction results in segmentation and movement of edges of the polygons in PTARGET which results in a lithography target that is not lithography friendly. That is, TETCH−1 (PTARGET) obtained using such an approach is not in the range-space of the transformation TLITHO. Such an approach runs the risk of creating edge segments that are either too small or not optimal for lithography. It is among the objects of the present invention to address these and other limitations of prior art approaches, and to improve photomask fabrication.
In accordance with an embodiment of the invention, a method is provided for synthesizing a photomask data set from a given target layout, including the following steps: (a) providing a set of target polygons for the target layout; (b) fitting a smooth curve to a target polygon of said set of target polygons, the curve having a set of etch-target points; (c) moving the etch target points according to a model of an etch process to produce a set of lithography-target points; and (d) synthesizing a photomask data set based on a model of a lithography process and the set of lithography-target points. The steps (b) and (c) are performed for other polygons of said set of polygons. The target region may be applicable to a region of the chip, and the method can be repeated for one or more further regions of the chip. In a preferred embodiment, the smooth curve of step (b) comprises an etch target curve on which said etch target points lie, and the etch target curve substantially matches the target polygon, except at its vertices.
In an embodiment of the invention, the model of an etch process of step (c) is determined by a technique that includes calculating how much of a region above a wafer is visible from a point on the sidewall of an etched feature in the wafer. This step can utilize an algorithm selected from the group consisting of a Overmars-Welzl Algorithm, a Ghosh-Mount Algorithm, and a Lee Algorithm, for constructing a visibility graph in a plane. In a form of this embodiment, the model of an etch process of step (c) is determined by a technique that includes calculating a convolution of a point-spread-function with a photoresist pattern density.
In an embodiment of the invention, the model of an etch process of step (c) has adjustable parameters, and the method further includes the following steps: printing a test pattern in photoresist on a wafer thereby forming a photoresist pattern; measuring critical dimensions of the photoresist pattern; etching a pattern into a layer on a wafer; measuring critical dimensions of the etched pattern; and selecting the adjustable parameters of the model of the etch process according to the measured critical dimensions of the photoresist pattern and the measured critical dimensions of the etched pattern. In this embodiment, the selecting step further comprises substantially minimizing a weighted sum of squares of differences between predictions of the model of the etch process and measured critical dimensions.
In an embodiment of the invention, the step (d) of synthesizing a photomask data set includes: calculating an image intensity at a plurality of lithography target points according to the model of the lithography process; calculating a figure-of-demerit comprising a sum of differences between the calculated image intensities and predetermined values, wherein the sum is evaluated over said plurality of lithography target points; and adjusting the photomask data in a way that decreases the said figure-of-demerit. In a form of this embodiment, the step of calculating a figure-of-demerit further comprises calculating how much a resist edge moves in response to a change in a lithography exposure dose. The calculation of a figure-of-demerit can further comprise calculating how much a resist edge moves in response to a change in a lithography defocus parameter. The step of calculating a figure-of-demerit can further comprise calculating an image slope.
In an embodiment of the invention, the step (d) of synthesizing a photomask data set further comprises calculating edge placement errors at a plurality of lithography target points. In a form of this embodiment, the step of synthesizing a photomask further comprises calculating how much a resist edge is displaced in response to a change in an exposure dose or defocus.
Another embodiment of the invention is a photomask made using the methods of the invention as described herein.
Further features and advantages of the invention will become more readily apparent from the following detailed description when taken in conjunction with the accompanying drawings.
  
  
  
  .a-3.f show graphs of measurements, and best fitting model, of the difference between line widths after-etch and before-etch, for several different line and space widths. 
  
 According to certain embodiments, TETCH−1 (PTARGET) is obtained without segmentation or movement of polygons. According to certain embodiments, compensation for the effects of etch and lithography processes comprises the following high-level steps, which are explained in reference to 
These steps are explained in more detail below.
 A computational model of the etch process will next be described. Upon etching, the edge of a pattern is displaced in the plane of the wafer, in the direction that is normal (perpendicular) to the edge, by a lateral etch bias ΔEdge, which can be positive or negative. The mathematical model for the lateral etch bias is:  
 The term “lateral etch bias,” or “etch bias” for short, refers to changes in the dimensions in the plane of the wafer. It is not to be confused with etch depth or a bias in the etch depth. In equation (3a), ΩSKY, h, l is:  
 ΩSKY, h, l is specific to a pattern, and to an observation point (x0, y0,-h) on a sidewall of a pattern that is being etched. According to this model, the pattern that is being etched, like a well-trimmed maze, has a planar top and vertical side-walls, as illustrated in 
Similarly, reaction products that are formed at the bottom (floor) of trenches that are being etched can sputter on to the sidewalls and form a polymer that protects the sidewall from further etching. The rate of such sputtering is governed by a similar “solid angle of the floor.” The term αN+1ΩSKY is intended to approximately account for the combined effect of the plasma particles incident from the sky and the polymer deposition by particles sputtered from the floor. In an embodiment hereof, Equation (3a) is used. In a preferred embodiment, the depth parameter h is adjusted by fitting the predictions of the model to measured etch biases. Alternatively, the depth h can be set to be a fixed fraction of the etch depth, such as half of the etch depth.
The function (resist pattern) (x,y) in Equation (3a) takes the value 1 where there is resist on the wafer, and 0 where the resist has been removed. The symbol * indicates two-dimensional convolution. The two-dimensional convolution is evaluated at the point (x0,y0). The summation on the right-hand-side of Equation (3) is a point-spread-function that accounts for pattern density effects in etching the wafer. Changes in the pattern density affect the rate at which reactants are consumed and reaction products are generated. Local variations in the densities of reactants and reaction products are partially equalized by diffusion processes which are not instantaneous. The point-spread-function is a linear superposition of Gaussians with length parameters σ1, σ2, . . . , σN, which describes a combination of diffusion processes with various diffusion rates. According to certain embodiments, σ1, σ2, . . . , σN are assigned logarithmically spaced values between the minimum and maximum diffusion lengths that can be probed by the test patterns that are used to calibrate the etch model. In an alternative embodiment, the parameters σ1, σ2, . . . , σN are determined by minimizing the difference between predictions of the model and measured etch biases of test patterns. The amplitudes, or coefficients, α0, α1, α2l, . . . , αN+1 are adjustable, and a priori unknown, parameters of the etch model. The coefficient αN+1 is the weight of the solid angle term, which accounts for deposition or ablation of material on the sidewall. Each of the coefficients α0, α1, α2, . . . , αN+1 can be positive or negative. The coefficient α0 represents a constant etch bias.
In an embodiment of the invention, Equation (3a) indicates the rate of lateral etch at an instance. The shape of the etched pattern evolves with time. The parameters in Equation (3a), such as the depth parameter, h, are time dependent. As the pattern is etched, the right-hand-side of Equation (3a) is updated and the evolution of the etch pattern is calculated by time-stepping.
In a preferred embodiment, the lateral etch is evaluated in one step, wherein the parameters on the right-hand-side of Equation (3a) represent the geometry of the resist pattern. This embodiment facilitates fast calculation of the etched pattern given the resist pattern.
In another preferred embodiment, the lateral etch is evaluated in one step, where the parameters on the right-hand-side of Equation (3a) represent the geometry of the final etched pattern. This embodiment facilitates fast calculation of the resist pattern before etching, given the final etch pattern.
Calibration of the etch model will next be described. A multitude of test patterns are printed on a wafer to calibrate the etch model. According to certain embodiments, the test patterns include line gratings. Dimensions of the photoresist pattern on the wafer are measured by electron microscopy (CD-SEM), or atomic force microscopy, or optical scatterometry. Then, the wafer is etched following the same etch process that will be used in the manufacturing of a semiconductor device. The dimensions of the etched patterns are measured. CD-SEM measurements on photoresist patterns shrink the photoresist. Therefore, the before-etch and after-etch CD-SEM measurements are not taken at the same wafer-coordinates. Identical copies of each test pattern are provided in close proximity to each other, such as less than 1 mm distance. A test pattern is used for the before-etch measurement. A twin of the same test pattern is used for the corresponding after-etch measurement.
 The etch bias ΔEdge is calculated from the difference of the resist and post-etch measurements. For example, for line gratings: 
ΔEdge=(CDETCH−CDLITHO)/2   (4) 
 CDLITHO and CDETCH are dimensions of the same feature in a test pattern before and after etching, respectively. They are also know as the develop inspection (DI) and final inspection (FI) critical dimensions, respectively. The parameters α0, α1, α2, . . . , αN+1 of the model are obtained from ΔEdge for a plurality of test targets with various line and space widths. The parameters α0, α1, α2, . . . , αN+1 are obtained by linear regression, i.e., by solving a system of linear equations in the least-squares sense. For line-space targets, the linear system of equations is as follows:  
 In Equation (5), the index m labels a multitude of line grating test targets. Each line grating has different combination of line and space widths. The parameters of the line gratings are as follows: 
Km,1+Km,2+1=number of lines in the mth line grating target 
Km,1+1=line number in the mth grating on which the measurement is taken 
Lm=line width in the mth line grating target 
Sm=space width in the mth line grating target 
Pm=Lm+Sm=pitch of the mth line grating target   (6) 
 The first N coefficients in Equation (5) are convolutions of Gaussian kernels with the line grating:  
 The last term in Equation (5) is the solid angle of the sky seen from a point on the side wall of the Km,1+1st line in the mth line grating target, a distance h from the top of the line (for l=0):  
  
Once equation (5) is solved, the model is calibrated and it can be used to predict etch bias of arbitrary 2-dimensional patterns. According to certain embodiments, the model is verified on one-dimensional and two-dimensional test patterns that were not used to calibrate the model. Verification is done by recording CD-SEM images or CD measurements of the verification pattern before and after etching, and comparing the etch bias to the predictions of the model.
In another embodiment of the invention, the model parameters are calibrated using measurements taken on before-etch and after-etch measurements on test targets that comprise any combination of: lines-and-spaces, dots (posts) of various diameters, holes of various diameters, end of a line, end of a slot (trench), an array of line-ends, an array of slot-ends, bending lines, bending trenches.
The target layout is generated by circuit layout, routing, timing, and possibly, manufacturability considerations. The target layout is described by the union of a set of polygons. The polygons are typically specified by the coordinates of their vertices and stored in a file in GDSII or OASIS format. The target layout is input to the method of the invention as described herein.
 The generation of etch target points will next be described. Target polygons most commonly have 90° angles, in some cases 135° angles, at their vertices. Other angles, especially acute angles, are less common but possible. Such sharp corners are impossible to produce reliably with a combination of photolithography and etching. According to certain embodiments hereof the corners of the target polygons are rounded to render them realizable. A practical advantage of rounding corners is: the direction that is perpendicular to the edge of a polygon is not well defined at the vertices of a polygon, but the normal direction is always unambiguous for a smooth closed curve that doesn't intersect itself. There are many ways of rounding the corners of a polygon: one approach is to replace the corners of the polygon with segments of circles or ellipses that are tangent to the edges of the polygon. A preferred embodiment of generating etch target points comprises the following steps which are explained with reference to 
 This method replaces a target polygon 110 with a set of etch-target points 130 that lie on a smooth curve 120. The smooth curve 120 is not explicitly defined but it is represented by points p1(K), p2(K), . . . (130 in 
 What is meant by curvature of the smooth curve at a point p on the curve is that the magnitude of the vector d2p(s)/(ds)2 where p(s) is the parametric representation of a point on the curve, wherein the parameter s is the path length along the curve measured from an arbitrary point on the curve. In practice, since we only have a set of points p1, p2, . . . , pN on the smooth curve, we numerically calculate the curvature at pn according to:  
 In this description, pn refers to one of many points on a curve that are sequentially labeled. The points pn−1 and pn+1 are adjacent to pn. It is understood that pn+1 stands for p1 if n=N is the last point. Similarly, pn−1 is understood to stand for pN if n=1 is the first point. 
 The obtaining of litho target points from etch target points will next be described. Etch target points 130 are selected on etch target curves 120. The outward unit normal {right arrow over (n)} of each curve is calculated at each target point 130. The outward normal vector (180 in 
 The unit vector {circumflex over (z)} is normal to the plane of the wafer. The normal vector {right arrow over (n)}m at a point pm on a smooth, closed curve that passes through the points p1, p2, . . . , pN is numerically calculated as follows:  
 The litho target point 140 corresponding to an etch target point 130 is found by moving from the etch target by −{right arrow over (n)}ΔEdge, wherein ΔEdge is the etch bias calculated by the etch model. The solid angle and the convolutions in (3a) are calculated for the pattern defined by the etch target points. 
The calculation of ΩSKY,h,l, which is one of the terms that make up ΔEdge, requires identifying the integration domain “SKY” in the integration (3.b). The integration domain, which is specific to a target point, is the set of points (x,y,0) such that, the ray starting from the target point (x0,y0,-h) and going through a point (x, y,0) in the integration domain reaches the sky (plasma) without intersecting the 3-dimensional pattern that is being etched. Identifying the integration domain, which is a subset of the z=0 plane, is closely related to the problem of determining the visibility map of a set of non-intersecting line segments in the plane. The top-view of the etched pattern, i.e. the two dimensional etch-target curve in the plane of the wafer, is a set of polygons in the plane. An etch target point, or the projection of (x0,y0,-h) on to the z=0 plane, is a point on the etch-target curve. The visibility map of the etch-target curves (polygons) is determined by the Overmars-Welzi algorithm (see: Overmars, M. H. and E. Welzl, Construction of Sparse Visibility Graphs, Technical Report RUU-CS-87-9, Department of Computer Science, University of Utrecht, 1987) in a preferred embodiment. In other embodiments, the visibility map is determined by either one of: Ghosh-Mount algorithm (Ghosh, S. K. and D. M. Mount, “An Output Sensitive Algorithm For Computing Visibility Graphs,” SIAM J. Comput. Vol. 20, No. 5, pp. 888-910, October 1991), Lee Algorithm (Lee, D. T., “Proximity And Reachability In The Plane,” Ph.D. Thesis and Tech Report ACT-12, Coordinated Science Laboratory, Univerity of Illinois at Urbana-Champaign, Urbana, Ill., 1978), or the so-called Naive-Algorithm (Kitzinger, J., “The Visibility Graph Among Obstacles: A Comparison Of Algorithms,” M. S. Thesis, Department of Computer Science, University of Mexico, 2003). For each target point, the integration domain is divided up into triangles where one vertex of each triangle is the target point. The integral ΩSKY,h,l is then analytically calculated over each triangle.
 The corresponding litho target point 140 is displaced by −{right arrow over (n)}ΔEdge from 130 where ΔEdge is the etch bias calculated by the etch model. The reasoning behind this correction is that: 
TETCH−1=(I+E)−1≅I−E   (12) 
 If etch transformation amounts to a small perturbation E on the identity map I, then its inverse can be approximated by I−E. A more accurate approximation to TETCH−1 can be obtained by iterating the etch correction, i.e., using a series expansion in (12). Alternatively, TETCH−1 can be calculated by solving a minimization problem:  
 where PLITHO is the pattern defined by litho target points 140, and PETCH is the pattern defined by etch target points 130. The norm of difference between two patterns can be the square root of the sum of squares of distances between points 130, and the corresponding points predicted by the etch model TETCH(PLITHO). 
The photomask data synthesis will next be described. The litho target points 140 are used to express the optimization goals of for the optimization process that implements TLITHO−1. For details of this calculation, see, for example, the following copending U.S. patent applications, which are all assigned to the same assignee as the present application: U.S. patent application Ser. No. 11/203,498, filed 13 Aug. 2005; U.S. patent application Ser. No. 11/203,505, filed 13 Aug. 2005; U.S. patent application Ser. No. 11/203,330, filed 13 Aug. 2005; and U.S. patent application Ser. No. 11/203,329, filed 13 Aug. 2005, all incorporated herein by reference.
 According to certain embodiments, the mask data comprises a set of polygons whose edges are parallel to the x, y, x+y, or x−y axes. This is because mask writers, and the software that handle the data for mask writers, work most efficiently under these conditions. The polygons in the mask data set are optimized by segmenting their edges and moving (displacing) the segments is a way that preserves their angular orientation. According to certain embodiments, there is no pairing of edge segments of a polygon and the target points. The edge segments in the mask are adjustable, or independent, variables of optimization. The target points are used to define the optimization goal. For example, the optimization in an embodiment minimizes a figure of demerit F1m of the photomask data m:  
 In Equation (14), I(xj, yj; zq) denotes a lithographic latent image intensity at the location (xj,yj) in the wafer-plane when the defocus is zq. The defocus refers to the axial displacement of the wafer from a position of best focus. The symbol t denotes a threshold, which is equal to the ratio: (dose-to-clear/dose) where “dose” refers to the light exposure dose (energy per unit area) applied to a photoresist, and “dose-to-clear” indicates the minimum dose necessary to clear a positive photoresist or not clear a negative photoresist. A positive photoresist dissolves if I(xj,yj;zq)>t, and it does not dissolve if I(xj,yj;zq)<t, at the location (xj,yj) in the wafer-plane when the defocus is zq. The inequalities are reversed for a negative photoresist. The point (xj,yj) is the ith litho target point, one of the points 140 in 
 Alternatively, the photomask data can be designed by minimizing a figure-of-demerit F2m of the photomask data set m:  
 Minimizing F2m forces the resist edge to go through the litho target points (xj,yj); j=1, . . . , M for focus values z1, . . . , zQ. The image-slope ∂I/∂n(xj,yj; zq) is the directional derivative of the image in the direction that is perpendicular to the target edge (150 in 
 The edge placement error is normalized with the edge placement tolerance tolj for the jth target point:  
 Edge placement error depends on the exposure dose. If the exposure dose is increased by Δdose, such that Δdose/dose<<1, then the edge placement error changes as follows:  
 If the sum of └EPEjq2(+Δdose)+EPEjq2(−Δdose)┘/tolj2 over all target points is minimized, then the figure-of-demerit F2m in Equation (15) is derived as follows:  
 This determines the value of the parameter a as:  
 As an alternative approach, the figure-of-demerit F2m in Equation 15 is derived by introducing edge-displacement (ED) induced by dose-variation:  
 If the sum of └EPEjq2+EDjq2(Δdose)┘/tolj2 over all target points is minimized, then again the figure-of-demerit F2m in Equation (15) is derived, with the value of the parameter α given by Equation (17). 
In order to minimize the figure-of-demerit (14), the litho target points 140 are needed. To minimize the figure-of-demerit (15), the litho target points and unit normal vectors (unit vectors in the plane of the wafer, and that are perpendicular to a smooth curve 150 that goes through the litho target points) at the litho target points are needed. Most notably, a polygonal litho target is not needed; and there is no need to segment a polygonal litho target.
According to certain embodiments, there is a photomask data set that comprises polygons. The method starts with an initial guess of the photomask data set. The polygons in the photomask data set are segmented and the positions of the segments are optimized to minimize a figure-of-demerit such as (14) or (15). The figure-of-demerit is expressed in terms of some quality of the lithographic image at the litho target points. There is no pairing, or one-to-one correspondence, of the litho target points and the segments of the polygons comprising the photomask data set.
In the foregoing specification, embodiments of the invention have been described with reference to numerous specific details that may vary from implementation to implementation. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.
Priority is claimed from U.S. Provisional Patent Application 60/723,563, filed Oct. 3, 2005, and said Provisional patent Application is incorporated herein by reference.
| Number | Date | Country | |
|---|---|---|---|
| 60723563 | Oct 2005 | US |