This invention is related to the field of white light interferometry or vertical scanning phase-shift interferometry. White light interferometry (WLI) is a technique that uses an interferometer to profile surfaces. A set of intensity frames is acquired from a test surface sequentially by changing the optical path difference (OPD) between the test surface and a reference surface. Either the test surface or the reference surface may be moved with respect to one another to effect this change. With a spectrally broad band or white light illumination, the temporal intensity distribution recorded at each pixel in the intensity frame in WLI has a localized interferogram whose contrast is a maximum at OPD=0, decreases rapidly as OPD increases, and diminishes at OPD greater than the coherence length of illumination.
Although most commercial interferometers using WLI are capable of measuring surfaces with nanometer precision, the technique has been used to profile opaque surfaces or surfaces without a transparent thin film on them. The reason for this is that multiple reflections produced from a test surface with transparent thin films create multiple superimposed interferograms. As a result, known methods in WLI lose the ability to profile surfaces with such superimposed interferograms.
This invention is directed to a method for measuring the top surface of an object with single or multiple transparent thin films on all or parts of the measuring area using WLI.
FIGS, 2, 3, 4 and 5 are schematic diagrams of different interferometers used in practicing embodiments of the invention;
Reference now should be made to the drawings, in which the same reference numbers are used throughout the different figures to designate the same or similar components.
The reflected light beams from both the topmost surface 34 and underneath surfaces of the test piece 32 (where a transparent thin film or multiple transparent thin, films of the type shown in
The step-by-step positioning for each frame of analysis is effected by the computer 20 in synchronization with the operation of the camera 40 by means of a suitable pusher or drive mechanism 50. The pusher 50 is illustrated in each of
It should be noted that instead of moving the test piece or object 32 with respect to the reference flat 30, the pusher 50 could be mechanically coupled (by a coupling not shown) to the reference flat 30 to move that surface relative to the surfaces of the test piece 32. Either the object/test piece 32 or the reference surface 30 may be moved in parallel planes with respect to one another to produce the repeated measurements or vertical scanning for each of the positions over which the complete scan is made. As is well known, the use of WLI allows the entire image field to be captured in one instant without the need for scanning apertures, resulting in profiling with high accuracy over a large range.
The method of this invention may be practiced with a variety of different interferometric systems.
In the Mirau interferometer of
All of the four different interferometer systems-which are described above in conjunction with
In white light interferometry, a temporal intensity distribution I(z) is defined as a series of intensities recorded at a single pixel in a series of intensity frames. The pixel is representative of a specific location on a measuring surface. The series of intensity frames comprises set of temporal samples of white light phase shifting interferometry data that is acquired from a measuring surface at selected positions of the measuring surface relative to a reference surface. I(z) can be expressed as:
where z is the distance of the reflecting point from the focus, h, is the distance of the reference mirror from the focus, k is the wavenumber, θ0 is related to the numerical aperture of the objective or N.A.=sin θ0, φ (k) is the reflectance phase of the measuring surface 32, and F(k)=S(k)R(k) where S(k) is the spectrum of white light illumination and R(k) is the reflectance of the measuring surface.
For a measuring surface without a transparent thin film on the top, it may be rewritten as:
I(n)=α+m(n−τ)cos(φ+ωsn). for n=0,1,2, . . . ,N−1 (2)
Where a is the background intensity, N is the total number of temporal intensity samples, m is the function representing the fringe modulation or envelope for the subject pixel of the sample, τ is the position effected by means of the pusher 50 producing the temporal sample intensity frame having the peak of the intensity envelope, n is the position effected by means of the pusher 50 to produce the selected temporal sample intensity frame, also known as the frame number,
A typical temporal intensity distribution of Equation (2) is depicted in
Where si and ci are the weights for the sampling intensity at the i-th frame. When R(n)∝ the fringe modulation m, R(n) may be expressed as:
M is the number frames used in the algorithm and C is a constant that depends on the phase shifting algorithm used.
Any one of several different known phase shifting algorithms can be used in conjunction with the method of this invention. By way of example, if a Hariharan 5 frame phase-shifting algorithm is used, si and ci are:
si=0, s2=2, s3=0, s4=−2, s5=0;
ci=−1, c2=0, c3=2, c4=0, c5=−1;
For this algorithm, the constant C is close to 4 and Equation (3) and Equation (4) become:
Once Φ(n) is found, the desired phase φ can be obtained in Equation (3) as long as R(n)≠0. In other words, it is not necessary to achieve the surface height information φ by selecting Φ(n) where R(n) is the maximum.
Other phase shifting algorithms also exist with different expressions to calculate Φ(n) and R(n), such as least squares, Fourier transforms, and others. They also can be used in this invention.
For a measuring surface with multilayer transparent thin films on the top, Equation (1) may be rewritten as:
Where a is the background intensity, N is the total number of temporal samples, K is the total number of reflective-surfaces, K is one of the reflective surfaces, mk(n) is the function representing the envelope for the temporal fringe related to k-th reflective surface only, τk is the position of the envelope peak of the K-th reflective surface, n is the sampling position or frame number, ωs is the phase shift, and φk carries the height information related to k-th reflective surfaces only. A typical temporal intensity distribution I(n), and its associated overall modulation envelope R(n) for this type of measuring surface is depicted in
As can be seen by comparison of
Substituting Equation (7) into Equation (3) or Equation (5) gives:
Substituting Equation (7) into Equation (4) or Equation (6) gives:
The top surface phase, say Φ1(n), may be, found if there exists a temporal position nc such that:
Thus, the phase Φ(nc) calculated by Equation (8) is equal to Φ1(nc). Consequently, the top surface m1 height information related to Φ1 is achieved. With a similar approach, the surface height or phase of any other underneath surfaces m2, m3, m4 . . . may be found, for example, Φm(nc), if there exists a temporal position nc such that:
ΣRk(nc)sin Φk(nc)=ΣRk(nc)cos Φk(nc)=0 for all k except k=m.
Where m is a reflective surface in which the expression of Equation 11 is non-zero, such a desired temporal top surface position nc exists for each pixel in various applications. It should be noted that the desired position nc for measuring the surface with transparent thin films may be neither at the maximum position of R(n), nor at the first local maximum of R(n). This is illustrated in
When all or part of the measuring surface has one or more transparent layers of thin films, and where the top layer or thin film is thick, the amplitude peaks of the fringe envelope related to different reflective surfaces are well separated. An example of this is shown in
In the example of a measuring surface without a thin transparent film as shown in
For a measuring surface with one layer of relatively thicker transparent film with two reflective surfaces, m1, m2, the calculating position nc is found at the maximum amplitude peak of R(n). If, as shown in
For purposes of illustration, the threshold value in the diagram of
When all or part of the measuring surface has one or more transparent layers of thin films, and where the amplitude peaks of the fringe envelope related to different reflective surfaces are not well separated, such as the surfaces represented by
The offset from R(n) may be determined using an algorithm which is a function of the known characteristics of the transparent thin films of the sample being measured, properties of the top surface of the sample, envelope fringes, or the phase under the fringe envelope.
One way to determine n, as an offset from an amplitude peak R(n) is to determine the phase Φ(n) in accordance with a phase shifting algorithm such as shown in Equation 3, and select nc at a minimum or maximum of the phase Φ(n) within the fringe envelope R(n).
As can be seen in both
Other ways of determining the offset are to develop a theoretical model R0(n), given knowledge of the composition of the layers and approximate thickness of each layer. When R(n) is developed from 1(n), and a position P of the amplitude peak of the known R(n) is selected, the model R0(n) and the known R(n) may be used to develop and offset O, which may then be subtracted from the position P.
To determine the top surface calculating position n, as an offset of R(n) with known thicknesses and n&k of the transparent films, one can generate two temporal intensity distributions at a pixel by Equation (1), one with a full stack of the transparent films (see
In the case of
Once PB is selected, an offset OB is derived by the above-described described modeling method, and the calculating position nG is found at the position defined by the value equal to PB minus OB. In other words, nG Is the location before PB that is offset from PB by the amount of the offset OB. As with
In the case of
It should be noted that, although the plots for some or all of the fringe envelopes for boundaries m1, m2, m3 are shown in
Once the calculating position is found, the phase Φ(nG) is extracted by:
Where SG and Ci are the weights of the sampling intensity of the I-th frame, M is the total number of frames used in the algorithm, and K is the total number of reflective surfaces such that for nG:
In a situation where the measuring surface has a flat top surface and has part of the measuring area where the top surface can be reliably measured, such as a surface having a portion without thin film, the reliable calculating positions over the field of view are established at the portion without the thin film. Typically, these are areas over a metal portion, such as the metallization 12 shown in
Another situation exists where all or part of the measuring surface has a single transparent layer of thin film, such as the wafer surface represented in
It is apparent that the above techniques all are substantially the same for calculating nc, but they may be employed with different types of objects. For example, in measuring the top surface of a CMP pattern wafer, the technique of finding a reliable or unambiguous measuring surface, as described above, is employed. In a CMP pattern wafer, not all of the top surface is covered with a transparent thin film. Parts of the measuring surface are exposed metals, such as copper. The top surface position can be determined unambiguously at these area because of the substantially greater intensity of the intensity distribution wherever such an uncovered metal surface exists.
CMP wafers typically are covered with a very thin transparent film of a known thickness. Thus, where an exposed top metal surface position is unambiguously determined, the interferograms for the different surfaces of the thin film can be related to the interferogram for such a top surface, and unambiguously determined.
It should be noted that the phase Φ1(nc) found by above techniques, see Equation (8), includes β1 (the phase change on reflection of the first reflective surface) and Φ1(nc-τ1) (the phase change of the top fringe envelope). These error sources from the top layer can be removed if the properties of thin films and surface materials are known. These properties include the complex index of refraction of the material N, where N=n+ki, and the real part n is known as the refraction index, and the imaginary part k is known as the extinction index.
In summary, the method for producing correct height information in a white light interferometer involves the steps of first acquiring N frames of data while the white light fringe pattern is scanning through the field of view with the frames having a plurality of pixels. Then, R(n) is calculated by equation (4) for every pixel. In addition, the desired top calculating positions nc are determined for every pixel. The phase Φ(n) is extracted by equation (3) or equation (5) at the desired temporal positions nc. The error resulting from the envelope phase is removed; and the phase map is converted to a height map. In addition, the phase map may be unwrapped using any conventional method; and it is possible to remove the phase on reflection at each pixel if its, related material property is known. This latter step is optional and may be used if desired.
The foregoing description is to be considered illustrative and not as limiting. Various changes and modifications will occur to those skilled in the art for performing substantially the same function, in substantially the same way, to achieve substantially the same result without departing from the true a scope of the invention as defined in the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
5355221 | Cohen et al. | Oct 1994 | A |
5398113 | de Groot | Mar 1995 | A |
5633715 | Ai et al. | May 1997 | A |
7102761 | De Lega | Sep 2006 | B2 |
7106454 | De Groot | Sep 2006 | B2 |
7139081 | De Groot | Nov 2006 | B2 |
20020135775 | De Groot et al. | Sep 2002 | A1 |
20040189999 | De Groot | Sep 2004 | A1 |
20050057757 | De Lega | Mar 2005 | A1 |
20050068540 | De Groot | Mar 2005 | A1 |
20050073692 | De Groot | Apr 2005 | A1 |
20050078318 | De Groot | Apr 2005 | A1 |
20050078319 | De Groot | Apr 2005 | A1 |
20050088663 | De Groot | Apr 2005 | A1 |
20050280829 | Unruh et al. | Dec 2005 | A1 |
20060262321 | De Groot | Nov 2006 | A1 |
20070046953 | De Groot | Mar 2007 | A1 |
20070081167 | De Groot | Apr 2007 | A1 |
20070097380 | De Groot et al. | May 2007 | A1 |
Number | Date | Country | |
---|---|---|---|
20070008551 A1 | Jan 2007 | US |