Semiconductor processing for forming integrated circuits requires a series of processing steps. These processing steps include the deposition and patterning of material layers such as insulating layers, polysilicon layers, and metal layers. The material layers are typically patterned using a photoresist layer that is patterned over the material layer using a photomask or reticle. Typically, the photomask has alignment targets or keys that are aligned to fiduciary marks formed in the previous layer on the substrate. However, as the size of integrated circuit features continues to decrease, it becomes increasingly difficult to measure the overlay accuracy of one masking level with respect to the previous level. This overlay metrology problem becomes particularly difficult at submicrometer feature sizes where overlay alignment tolerances are reduced to provide reliable semiconductor devices. One type of overlay measurement is known as diffraction based overlay (DBO) metrology, which may be empirically based or model based.
A fundamental issue with process-control equipment is move-acquire-measure (MAM) time. The empirically based DBO process typically requires the acquisition of spectra from a minimum of six pads on a sample in order to determine the overlay error along the X and Y axes. If each of these spectra were to be acquired sequentially by the metrology tool, a minimum of six stage moves per measurement and a minimum of six integrations for the camera would be required. Given currently available technologies, such a measurement sequence would require a MAM-time of approximately 3 seconds or more.
By acquiring the spectra simultaneously from all of the pads, the MAM-time could be greatly reduced, e.g., approximately 1 second, thereby reducing the cost of ownership of the metrology tool. Thus, parallel acquisition of the spectra from all of the DBO target pads is desired.
Spectra for diffraction based overlay (DBO) in orthogonal directions, i.e., along the X-axis and Y-axis, are acquired in parallel. A broadband light source produces unpolarized broadband light that is simultaneously incident on X-axis and Y-axis DBO targets. A polarization separator, such as a Wollaston prism or planar birefringent element, receives diffracted light from the X-axis and Y-axis DBO targets and separates the TE and TM polarization states of the diffracted light. A detector simultaneously detects the TE and TM polarization states of the diffracted light for both the X-axis DBO target and the Y-axis DBO target as a function of wavelength.
Diffraction based overlay (DBO) metrology is based on the measurement of the diffraction of light from a number of alignment pads.
An error in the alignment of the top diffraction grating with respect to the bottom diffraction grating of a DBO target 10X produces change in the resulting diffracted light with respect to perfectly aligned top and bottom diffraction gratings. Using a number of alignment pads and comparing the resulting diffraction signal from each alignment pad, the overlay error can be determined, which is sometimes referred to as empirical DBO (eDBO) measurement. In eDBO measurements, the DBO target 10X includes a pre-programmed shift between two or more of the alignment pads, illustrated as x1, x2, x3, and x4 in pads A, B, C, and D in
The gratings 12 and 16 used as pads for the DBO process have rulings that are transverse to the direction of the overlay error that they are intended to measure, i.e., target 10X measures overlay error in the X direction and target 10Y measures overlay error in the Y direction, as illustrated in
Parallel X and Y acquisition requires that half of the pads are measured with one linear polarization while the other half is measured with the orthogonal polarization simultaneously for optimal sensitivity. Parallel X and Y acquisition may be achieved by supplying unpolarized light to the sample and separating the two polarizations states of interest between the last beam-splitter surface in the optical system and the detector. For example, a Wollaston prism or a plane piece of birefringent material may be used as a polarization separator.
Parallel acquisition of spectra from all pads greatly reduces the MAM-time, compared to sequential acquisition. Moreover, parallel acquisition of spectra is advantageous as errors caused by light source instabilities are minimized. In comparison, sequential acquisition results in light source instabilities adding noise to the measurement and decreasing precision as a consequence.
The image of the pads 102 is separated into its two orthogonal polarization components O and E using a polarization separator 104, such as a Wollaston prism or a plane piece of birefringent material. The two differently polarized images 106O and 106E are separated in the direction that the pads are separated, i.e., along the direction of the row pads, to provide a relatively simple optical system. However, if desired, any direction of separation that does not cause the spectra of one polarization to overlap the spectra from the other polarization may be used, e.g., the separation may be at 45 degrees to the direction that the pads are separated provided that the distance of separation was adequate. The composite image of the two differently polarized images 106O and 106E is passed through a spectrometer 108 to an array detector 110.
The array detector 110 is illustrated as illuminated by light that is polarized in the horizontal plane on the left of the array detector 110 and light that is polarized vertically on the right side of the array detector 110. Thus, the TE light is associated with the pads at region 110XTE and at region 110YTE of the array detector 110, and TM light is associated with pads at region 110YTM and at region 110XTM of the array detector 110. Thus, the TE and TM polarization states from the X DBO targets and the Y DOB targets is detected simultaneously as a function of wavelength. The measurements from regions 110XTE and at region 110YTE of the array detector 110, labeled as measurements K and N, are used for the 0° acquisition and measurements from regions 110YTM and at region 110XTM of the array detector 110, labeled as measurements L and M, are used for the 180° acquisition. Thus, the TE and TM polarization states from the X DBO targets and the Y DOB targets is detected simultaneously as a function of wavelength.
The above notion suggests that there should be no scattering of light from outside the region where the detector is back-projected into the detector by non-specular processes. In practice, it is expected that there will be some level of contamination of the light reaching the detector by light that had not interacted with the desired part of the wafer. If the non-specular processes are strongly localized, then such a component to the signal could disrupt the measurement. However, if the processes merely add a slowly varying signal to the required signal, then it is not likely that the measurement will be affected. Under these circumstances, a light source 202 such as a Kohler illumination system is desirable. Alternatively, “critical illumination” may be used instead of Kohler illumination. As is well known, with critical illumination an extended source is projected so that an image of the source is conjugate with the sample. A featureless source (or as featureless as possible), e.g., an opal diffuser, is used so as to disrupt the signal as little as possible. Disruption due to variation of intensity of the different parts of the source that are projected onto different pads of the target may be calibrated using a blank wafer or blank target mounted on the stage that holds the wafer. The use of an extended source illuminating a substantial part of the wafer is desirable for pattern-recognition purposes. The selection of the metrology light source for pattern-recognition is important in reducing MAM-time as there is no need to switch sources within the measurement cycle.
The metrology device includes a beam splitter 210 that receives light from the light source 202 after passing through appropriate optical system, illustrated as lens 208. The light is illustrated as reflected by the beam splitter 210 towards an objective 220, such as a Schwarzschild objective with an NA of, e.g., approximately 0.3. The light is focused on the target area 232 of the sample 230, which includes the multiple diffraction pads for both the X and Y axes in a row. The diffracted light is received by the objective 220 and is illustrated as being transmitted through beam splitter 210 and through a second beam splitter 212, which may be used to direct a portion of the light to another optical system, e.g., for pattern recognition and/or focusing. The light is transmitted through a polarization separator 240, such as a Wollaston prism or a planar birefringent element, to produce two differently, e.g., orthogonally, polarized images of the target area 232 of the sample 230. The light is received by a spectrometer 250 by passing through a rectangular aperture 251, after which it is reflected by a mirror 252 to a wavelengths separator, such grating 254 or a prism, and is received by a detector 256, which is coupled to a computer 300. Of course, other geometries of spectrometer 250 are possible, e.g., the mirror 252 is merely illustrative and is not necessary component of a spectrometer, moreover, additional components may be included if desired.
The detector 256 may be, e.g., a back-thinned camera of 256×256 pixels with 24 μm pixels. The appropriate size of the detector 256 is based on the size and number of pads in the overlay target, as well as the characteristics of the optical system, including the object 220 and polarization separator 240. For example, with the use of four pads per target, with pads of 25 μm square, the two targets (X and Y) will have a total length of 2×4×25 μm. Margins the size of approximately one or two pads are used, and thus, the target size is approximately 25 μm×10 would be a good estimate, i.e., 250 μm long×25 μm wide. Using an objective 220 with a magnification of 10×, then the image of the two targets at the input of the spectrometer 250 would be e.g., 2.5 mm×0.25 mm. After splitting by the polarization separator 240, the displacement of the E-ray at the entrance aperture of the spectrometer would be about 3 mm. Thus a feature of 3 mm+2.5 mm would be projected onto the input slit of the spectrometer. A back-thinned camera of 256×256 pixels with 24 μm pixels, has a dimension of approximately 6.14 mm×6.14 mm. In order to collect wavelength information from 250 nm to 1000 nm (750 nm range) at a resolution of 5 nm, a total of 750/5=150 data points are required. Thus, a 256×256 pixel camera is a suitable detector. It may not be necessary to have a back-thinned camera, as any camera with appropriate spectral sensitivity may be used.
[The computer 300 may include a processor 302 with memory 304, as well as a user interface including e.g., a display 308 and input devices 310. A computer-usable medium 312 having computer-readable program code embodied may be used by the computer 300 for causing the processor to control the metrology device 200 and to perform the functions including the analysis described herein. The data structures and software code for automatically implementing one or more acts described in this detailed description can be implemented by one of ordinary skill in the art in light of the present disclosure and stored, e.g., on a computer usable medium 312, which may be any device or medium that can store code and/or data for use by a computer system such as processor 302. The computer-usable medium 312 may be, but is not limited to, magnetic and optical storage devices such as disk drives, magnetic tape, compact discs, and DVDs (digital versatile discs or digital video discs). A communication port 314 may also be used to receive instructions that are used to program the computer 300 to perform any one or more of the functions described herein and may represent any type of communication connection, such as to the internet or any other computer network. Additionally, the functions described herein may be embodied in whole or in part within the circuitry of an application specific integrated circuit (ASIC) or a programmable logic device (PLD), and the functions may be embodied in a computer understandable descriptor language which may be used to create an ASIC or PLD that operates as herein described.
The sensitivity to overlay at any given wavelength for either polarization is expected to be a strong function of wavelength. The sensitivity is part of the measurement and is provided by the third (and fourth if there is one) pad with the programmed offset. The reported measurement may include an average of the measurements at all wavelengths and polarizations weighted by the sensitivities at those wavelengths and polarizations.
The measured signal from the detector may be in the form of a wave. The signal 324 must be sensitive to the overlay error in the pads A, B, C, and D, however, the difference in signal from one pad to another is not expected to be great. For example, the difference in the signal between a pad and the surrounding material is likely to exceed the difference in the signals between different pads. Accordingly, the gap between the pads may be reduced to zero.
The measurement of the overlay error is a comparison of the relative signal levels produced by pads A, B, C and D, for each wavelength. The absolute signal level is not the metric of interest. Thus, a small DC component added to the signal will have little or no effect on the measurement. It is noted, however, that a large DC component will add to the Poisson noise and will also reduce the total number of “signal” photons that can be collected, and is therefore discouraged.
In many applications, such as scatterometry and metrology of materials, thin films, etc., it is important to know the polarization state of the incident beam. In the present embodiment, sensitivity is expected to be predominantly with radiation with a TE polarization state. Sensitivity in the TM radiation is not expected to correlate to the sensitivity in the TE radiation, i.e., the signal from TM radiation could have a positive or negative coefficient when the TE radiation has a positive coefficient. Thus, it is desirable to ensure that there is good polarization separation. Nevertheless, good results may be achieved even with the TE radiation slightly polluted with TM radiation, provided that the TM signal was less than, e.g., 10% of the TE signal.
Any optic with a finite NA will modify the polarization state of skew rays, which is a fundamental geometrical effect, not just a matter of optical design issues. Skew rays are produced by any optical component with a finite numerical aperture. Thus, it is not possible with ordinary polarizing components to project an incoming beam of light having a polarization state parallel to gratings on a pad if the optical component has a finite numerical aperture. Accordingly, in any practical system, it is expected that a few percent of the signal at the detector will not be pure TE or TM polarization states. However, as noted above, the addition of a small amount of signal that displays no sensitivity to overlay is not likely to be detrimental provided that the magnitude of the minority signal remains small.
Although the present invention is illustrated in connection with specific embodiments for instructional purposes, the present invention is not limited thereto. Various adaptations and modifications may be made without departing from the scope of the invention. Therefore, the spirit and scope of the appended claims should not be limited to the foregoing description.
This application claims priority under 35 USC 119 to U.S. Provisional Application No. 61/449,041, filed Mar. 3, 2011, entitled “Diffraction Based Overlay”, which is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
61449041 | Mar 2011 | US |