Method of controlling plasma distribution uniformity by superposition of different constant solenoid fields

Information

  • Patent Application
  • 20090250335
  • Publication Number
    20090250335
  • Date Filed
    April 07, 2008
    16 years ago
  • Date Published
    October 08, 2009
    15 years ago
Abstract
A method for processing a workpiece in a plasma reactor having a set of n coils includes constructing, for each one of the n coils, a set of plasma distributions for discrete values of coil current in a predetermined current range. The distributions are grouped, each group having one distribution for each of the n coils, and being a unique set of n distributions. A combined plasma distribution is computed from each group of distributions. The variance of each combined distribution is computed. The method further includes finding an optimum one of the combined distributions having an at least nearly minimum variance, and identifying the n coil currents associated with the optimum distribution. During plasma processing of the workpiece, currents through the coils are maintained at levels corresponding to the n coil currents associated with the one combined distribution.
Description
BACKGROUND

Plasma processing of workpieces such as semiconductor wafers to form nanometer-sized thin film features requires precise control over plasma uniformity. Improving device performance requires decreasing feature sizes, which increases requirements for plasma ion density distribution uniformity across the surface of a workpiece or wafer. Using two axially displaced solenoidal coils over a plasma reactor chamber, plasma distribution can be changed by changing the D.C. currents applied to the coils.


Plasma ion density distribution non-uniformity has been reduced to as low as 5% (the measured variance or standard deviation) by choosing the D.C. currents in the overhead solenoidal coils. The problem is that nonuniformity must be reduced even further, and it has not seemed possible to reduce the uniformity below 5%.


SUMMARY

A method is provided for processing a workpiece in a chamber of a plasma reactor having a set of n solenoidal electromagnet coils. The method includes constructing, for each one of the n coils, a set of plasma distributions for discrete values of coil current in a predetermined current range. The method further includes defining different groups of the distributions, each group having one distribution for each of the n coils, each group being a unique set of n distributions. A combined plasma distribution is computed from each group of distributions. The variance of each combined distribution is computed. The method further includes finding an optimum one of the combined distributions having an at least nearly minimum variance, and identifying the n coil currents associated with the optimum distribution. During plasma processing of the workpiece, currents through the coils are maintained at levels corresponding to the n coil currents associated with the one combined distribution.


In one embodiment, constructing the set of plasma distributions for discrete values of coil current is carried out by measuring, for each of the n coils, a plasma distribution at each one of a small set of widely spaced values of coil current spanning the range, determining the change in plasma distribution for a predetermined incremental change ΔI in coil current, and then synthesizing plasma distributions at finely spaced values of coil current lying between the widely spaced values by interpolating between the measured distributions at intervals of ΔI. In one implementation, the plural predetermined plasma density distributions are two-dimensional.





BRIEF DESCRIPTION OF THE DRAWINGS

So that the manner in which the exemplary embodiments of the present invention are attained and can be understood in detail, a more particular description of the invention, briefly summarized above, may be had by reference to the embodiments thereof which are illustrated in the appended drawings. It is to be appreciated that certain well known processes are not discussed herein in order to not obscure the invention.



FIG. 1A is a simplified block diagram of a plasma reactor system in accordance with one embodiment.



FIG. 1B depicts a simplified implementation of a process controller of the reactor of FIG. 1A.



FIG. 2 is a graph depicting the behavior of plasma distribution non-uniformity as a function of overhead coil current.



FIG. 3 is a graph depicting the radial components of center-high and center-low plasma distributions and a composite distribution obtained by their superposition.



FIG. 4 is a graph depicting the azimuthal components of different plasma distributions and a composite distribution obtained by their superposition.



FIG. 5A is a graph representing a two-dimensional plasma distribution obtained by a first set of D.C. currents applied to the overhead coils.



FIG. 5B is a graph representing a two-dimensional plasma distribution obtained by a second set of D.C. currents applied to the overhead coils.



FIG. 5C is a graph representing a net plasma distribution corresponding to a measured etch rate distribution obtained by switching the coil currents between the two sets of currents corresponding to the distributions of FIGS. 5A and 5B for a predetermined duty cycle.



FIG. 5D is a graph depicting the separate radial components of the plasma distributions of FIGS. 5A, 5B and 5C.



FIG. 6 is a block flow diagram of a simplified process implemented by the process controller of the reactor of FIG. 1A.



FIG. 7 is a block flow diagram of a comprehensive process including an optimization search method which the process controller of FIG. 1A may be programmed to execute.



FIGS. 8A and 8B constitute a block diagram depicting a method in accordance with another embodiment.



FIGS. 9A, 9B, 9C and 9D are graphs depicting interpolations employed in carrying out certain portions of the method of FIGS. 8A and 8B.





To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the figures. It is contemplated that elements and features of one embodiment may be beneficially incorporated in other embodiments without further recitation. It is to be noted, however, that the appended drawings illustrate only exemplary embodiments of this invention and are therefore not to be considered limiting of its scope, for the invention may admit to other equally effective embodiments.


DETAILED DESCRIPTION

Referring to FIG. 1A, a plasma reactor includes a chamber 100 defined by side walls 102, a ceiling 104 and a workpiece support 106 within the chamber for supporting a workpiece or wafer 108 to face the ceiling 104. A plasma source power applicator, which may be adapted to couple power such as RF plasma source power into the chamber is provided. The plasma source power applicator may be any suitable form, such as a coil antenna (not shown) overlying the ceiling 104, an electrode formed by the ceiling 104 as shown in FIG. 1A, a toroidal plasma source or other sources such as a microwave source, a Helicon source, etc. In FIG. 1A, the ceiling 104 is formed of metal to provide an electrode as the RF plasma source power applicator, and an insulating ring 110 separates the ceiling electrode 104 from the side wall 102. An RF source power generator 112 provides RF plasma source power through an impedance match element 114 to the ceiling electrode 104. An RF bias power generator 116 provides RF plasma bias power through another impedance match element 118 to an electrode 120 within the workpiece support 106. A pair of inner and outer solenoidal electromagnet coils 122, 124 overlie the reactor chamber 100, the coils 122, 124 being of different diameters and at different axial locations, as shown in FIG. 1A. In the embodiment of FIG. 1A, the inner coil 122 is disposed at a higher axial location than the outer coil 124, although an opposite arrangement may be employed. Also, the number of solenoidal coils may exceed two. Furthermore, while the solenoidal coils 122, 124 are depicted as being mutually coaxial and coaxial with the axis of symmetry of the reactor chamber 100, other arrangements not involving such symmetries may be employed.



FIG. 2 depicts the behavior of the non-uniformity or variance, a, in plasma ion distribution (vertical axis) as a function of D.C. currents Iinner, Iouter (x and y horizontal axes) in the two coils 122, 124. At a low current level in each coil, the plasma ion distribution tends to be highly non-uniform, the non-uniformity corresponding to a center high distribution, such as the center-high radial distribution 300 in the graph of FIG. 3. At a high current, the plasma ion distribution tends to be high non-uniform, the non-uniformity corresponding to a center-low distribution, such as the center-low distribution in 305 in the graph of FIG. 3. At some intermediate current in each coil, the non-uniformity is minimum. The location of the trough or ideal operating point of minimum non-uniformity in FIG. 2 is typically difficult or impractical to locate. Therefore, in one embodiment, the ideal behavior at the minimum uniformity or trough in the graph of FIG. 2 is obtained by switching the coil currents between two sets of values corresponding to two distributions A1 and A2 fairly near but on opposite sides of the trough. The net effect over time corresponds to a superposition of the center-high and center-low distributions 300, 305 of FIG. 3, resulting in an intermediate distribution 310 that is neither center-high nor center-low and therefore more uniform.


The switching of the coil currents Iinner, Iouter to switch the plasma between the distributions A1 and A2 is performed by a programmed process controller 130 of the reactor of FIG. 1A. In one embodiment, the process controller 130 includes a microprocessor programmed to perform the methods described below in this specification. A simplified representation of the controller 130 is depicted in FIG. 1B, in which the controller 130 governs current flow from two sources 132, 134 of respective current pairs that produce the plasma ion distributions A1 and A2. The distributions A1 and A2 are depicted in FIG. 3 as radial distributions (functions of radius r). The controller 130 further has a switching element 136 that switches the respective coils 122, 124 between the corresponding output pairs of the two sources. The switching element 136 may be programmable to spend a duty cycle, a1, connected to the A1 current source 132 and a duty cycle, a2, connected to the A2 current source 134, where






a
1
+a
2=1





and






a
2=|1−a1|.


The controller 130 generates a time-weighted superposition of the two plasma distributions A1 and A2 (the distributions 300, 305 of FIG. 3) to produce the intermediate distribution 310, which may be defined as the time-weighted superposition or combination






A
comb
=a
1
A
1
+a
2
A
2


The time-weights or coefficients a1 and a2 can be chosen to minimize the non-uniformity or variance in Acomb.


The distributions A1 and A2 may be two-dimensional, so that what is depicted in FIG. 3 are their radial components. For the two-dimensional case, the azimuthal components of the distributions A1 and A2 are depicted in FIG. 4 as functions of angle θ.


More than two solenoidal coils may be employed. More than two different plasma distributions may be included in the time-weighted superposition or combination Acomb.



FIGS. 5A-5D depict a working example. In FIG. 5A, the D.C. coil currents Iinner, Iouter are set to produce a center-low two-dimensional plasma ion density distribution A1 depicted in FIG. 5A. Specifically, Iinner=−8 amps and Iouter=+10 amps. In FIG. 5B, the D.C. coil currents Iinner, Iouter are set to produce a center-high two-dimensional plasma ion density distribution A2 depicted in FIG. 5B. Specifically, Iinner=0 amps and Iouter=0 amps. The distribution A1 had a deviation between maximum and minimum density values of 7.7% and a variance σ=4.7%. The distribution A2 had a deviation between maximum and minimum density values of 5.9% and a variance σ=2.5%. The time duration or weighting coefficient a1 was 38.8% while the time duration or weighting coefficient a2 was 61.2%. The resulting effective combined distribution Acomb depicted in FIG. 5C had a maximum-to-minimum deviation of only 3.9% and a very low variance σ=1.8%. FIG. 5D compares the radial components of the two-dimensional distributions A1 (solid line), A2 (dashed line) and Acomb (thick line). The plasma distributions A1, A2, and Acomb were obtained by measuring etch rate distributions across the surfaces of test wafers.


A method for carrying out an embodiment is depicted in FIG. 6. The programmable process controller 130 of FIG. 1A may be programmed to carry out the method of FIG. 6. In this case, the controller 130 may include machine-readable media storing instructions for carrying out the steps of FIG. 6. In the method of FIG. 6, the reactor is provided with plural solenoidal coils capable of generating different plasma ion density distributions by changing the D.C. currents through the coils to different values (block 600 of FIG. 6). Two different plasmas distributions (A1 and A2) are chosen, tending to have different shapes that may compensate for non-uniformities inherent in each other (block 605). Unknown time weighting coefficients a1, a1 are defined and a combined time weighted plasma distribution Acomb=a1 A1+a2 A2 is defined (block 610). A search is made to find the set of time weighting coefficients a1, a2 that minimizes plasma distribution variance or maximizes uniformity (block 615). How this search process is performed is discussed below in this specification. During plasma processing, the processor 130 changes the coil currents between the coil current pairs that generate the different distributions or states A1 and A2 so that each state lasts for a time period corresponding to the respective coefficient a1, a2 (block 620).



FIG. 7 depicts a method employing any number (two or more) of different plasma distributions that are two dimensional. The programmable process controller 130 of FIG. 1A may be programmed to carry out the method of FIG. 7. In this case, the controller 130 may include machine-readable media storing instructions for carrying out the steps of FIG. 7. The method of FIG. 7 includes a method for optimizing the time weighting coefficients a1 and a2. First, two (or more) different two-dimensional (2-D) plasma distributions are chosen (block 700 of FIG. 7). Each of these distributions may be designated as Aj(r,θ) in cylindrical coordinates relative to the surface of the workpiece or wafer. The index or subscript “j” identifies a particular one of the chosen distributions. Preferably, the different distributions have mutually complementary behaviors. Each distribution Aj(r,θ) is produced by a different pair of known coil currents Iinnerj, Iouterj which are stored in a memory of the controller 130. Unknown time weighting coefficients aj are defined and a combined time weighted plasma distribution Acomb=a1 A1+a2 A2+ . . . is defined (block 710). An average plasma density value Aave is defined as a function of all the chosen Aj (block 715), which in one embodiment may be in accordance with the following equation:








A
ave

=




j
=
1

n









0
R





0

2

π





a
j

·

A
j

·


r

·


θ














where aj is the time duration of plasma distribution Aj and R is the radius of the wafer to be plasma processed.


A variance function is defined as the standard deviation of Acomb from Aave which is a function of the chosen distributions Aj's, their unknown time weighting coefficients aj's and Aave (block 720). This variance function in one embodiment may be defined in accordance with the following equation:






σ
=


[


1

A
ave






0
R





0

2

π





1
R




(





j
=
1

n








a
j



A
j



-

A
ave


)

2





r

·






θ






]


1
/
2






This formula is used by the controller 130 to search for an optimum set of time weighting coefficients aj that minimizes the variance function a (block 725 of FIG. 7). Such as search may be constructed by the skilled worker in view of the foregoing teachings using standard mathematical programming practices. A number of mathematical programs are readily available that the skilled worker can employ to find the optimum values of the time weighting coefficients, the aj's.


After the optimum time weighting coefficients have been found, the solenoidal coil currents are switched between the sets of currents corresponding to the chosen distributions Aj such that the time spent in a particular plasma distribution Aj is proportional to its time weighting coefficient aj (block 730). This switching operation may be performed in any one of the following modes.


In a first mode, the coil currents are switched between sets of currents defining successive chosen distributions Aj (block 732). That is, the currents are switched between states in mutually exclusive duty cycles.


In a second mode, one of the coil currents is maintained at a constant level another coil current is switched between different values (block 734).


In a third mode, the plasma is switched to between two chosen distributions by reversing the polarities of the coil currents (block 736).



FIGS. 8A and 8B constitute a flow diagram illustrating a method in which the coil currents Iinner, Iouter are held constant rather than being switched, and a search is made for the optimum pair of constant coil currents Iinner′, Iouter,′ that produces an ideal plasma distribution A′ having the least variance or non-uniformity. The programmable process controller 130 of FIG. 1A may be programmed to carry out the method of FIGS. 8A and 8B. In this case, the controller 130 may include machine-readable media storing instructions for carrying out the steps of FIGS. 8A and 8B.


Referring to FIGS. 8A and 8B, in block 800, a set of plasma distributions A1 is constructed for all discrete values of Iinner in a predetermined range. (The subscript “1” refers to the inner coil.) This is carried out as follows. First, in block 802, a reduced number of plasma distributions A1 are measured at a small set of widely spaced values of Iinner spanning the chosen range. One example of this step is depicted in FIG. 9A, in which the chosen range is −24 to +24 amps, and the values of Iinner occur in steps of ΔIinner=4 amps, so that only twelve measurements are taken. Each of the twelve measurements is carried out by etching a test wafer while holding the current on the inner coil at one of the twelve values of Iinner and then deducing the two-dimensional plasma distribution Al from the etch depth distribution on the test wafer, and storing the corresponding two-dimensional distribution A1. The result is a set of twelve measured inner coil plasma distributions A1. Then, in block 804, a measurement is made to determine the change ΔA1in plasma distribution A1 for a predetermined incremental change ΔIinner=4 amps in the coil current Iinner. This determination may be made while Iinner=0. In one embodiment, it is assumed that the distribution change ΔA1 is the same regardless of location within the range −24 amps to +24 amps. The distribution change ΔA1 may be found by subtracting any two measured plasma distributions A1 generated by inner coil currents that differ by 4 amps. For example, ΔA1=A8 amps−A16 amps. In the example depicted in FIG. 9A, ΔA1=A0 amps−A2 amps. This measurement requires the etching of two test wafers at constant inner coil currents of 0 amps and 2 amps respectively.


The twelve measured distributions at the twelve inner coil current values of FIG. 9A and the plasma distribution change ΔA1 are used to construct all the remaining A1's at the eighteen remaining current values depicted in FIG. 9B (block 806 of FIGS. 8A-8B). This construction is performed by interpolating between the twelve measured A1's of FIG. 9A at intervals of ΔIinner by adding (or subtracting) the appropriate multiple of ΔA1 from each distribution A1.


Next, in block 820 of FIGS. 8A-8B, a set of plasma distributions A2 are measured for all discrete values of Iouter in a predetermined range (e.g., −24 amps to +24 amps). (The subscript “2” refers to the outer coil current). This is carried out as follows. First, in block 822, a reduced number of outer coil current plasma distributions A2 are measured at a small number (e.g., twelve) of widely spaced values of Iouter spanning the chosen range. One example of this step is depicted in FIG. 9C, in which the chosen range is −24 amps to +24 amps, and the values of Iouter occur in steps of ΔIouter=4 amps, so that only twelve measurements are taken. Each of the twelve measurements is carried out by etching a test wafer while holding the current on the inner coil at one of the six values of Iouter and then deducing the two-dimensional plasma distribution A2 from the etch depth distribution on the test wafer, and storing the corresponding two-dimensional distribution A2. Then, in block 824, a measurement is made to determine the change ΔA2 in A2 for a predetermined incremental change ΔIouter=4 amps in the coil current Iouter. In one embodiment, it is assumed that the change ΔA2 is the same regardless of location within the range −24 amps to +24 amps. The change ΔA2 may be found by subtracting any two measured distributions A2 generated by coil currents that differ by 4 amps. For example, ΔA2=B8 amps−B16 amps. In the example depicted in FIG. 9C, ΔA2=B0 amps−B2 amps. This measurement requires the etching of two test wafers at constant outer coil currents of 0 amps and 2 amps respectively.


The twelve measured distributions A2 at the twelve outer coil current values of FIG. 9C and the distribution change ΔA2 are used to construct all the remaining A2's at the eighteen remaining current values depicted in FIG. 9D (block 826 of FIGS. 8A-8B). This construction is performed by interpolating between the measured twelve A2's of FIG. 9C at intervals of ΔIouter by adding (or subtracting) ΔA2 from each distribution.


In block 830 of FIGS. 8A-8B, a set of combined plasma distributions C are constructed as sums of all possible pairings of A1's with A2's, where each C is defined as C=A1+A2. For each C, an average distribution Aave is computed as the average plasma density of C (block 835). This computation may be carried out in one embodiment in accordance with the following equation:







A
ave

=




j
=
1

n









0
R





0

2

π




·

A
j

·


r

·


θ









where dr is an incremental radius, dθ is an incremental angle in cylindrical coordinates and R is the radius of the workpiece, and j runs from 1 (inner coil) to 2 (outer coil).


In block 840, a variance function is defined as the standard deviation of C from Aave. The variance function may be defined in one embodiment in accordance with the following equation:






σ
=


[


1

A
ave






0
R





0

2

π





1
R




(





j
=
1

n







A
j


-

A
ave


)

2





r

·






θ






]


1
/
2






The foregoing equations use the more general notation in which j is the index of each coil running from 1 to n. In the foregoing example, there are only two coils, an inner coil and an outer coil, so that n=2. However, in the more general case, the number of coils, n, may be any integer greater than 1.


In block 845, the processor 130 computes the variances a for all possible combinations of n plasma distributions Aj and stores the results in memory, and then searches the memory for the particular “optimum” combination of n Aj's for which the variance function σ is minimum. In block 850, the processor 130 looks up in memory for the n coil currents corresponding to the optimum combination of n Aj's, and chooses those currents as the optimum coil currents. In the present example employing only and inner coil and outer coil, n=2, and each combination of distribution is a sum of a pair of distributions A1+A2 produced by corresponding coil currents Iinner, Iouter. The processor 130 searches the results of the foregoing search for the coil current pair Iinner′, Iouter′ corresponding to the particular combination distribution A1+A2 having the minimum variance σ. In block 855, a wafer or workpiece is processed in the plasma reactor by constantly maintaining the coil currents at the designated optimum values Iinner′, Iouter′.


While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.

Claims
  • 1. A method of processing a workpiece in a chamber of a plasma reactor having a set of n electromagnet coils, comprising: for each one of said n coils, constructing a set of plasma distributions for discrete values of coil current in a predetermined current range;defining different groups of said distributions, each said group comprising one distribution for each of said n coils, each group comprising a unique set of n distributions;computing a combined plasma distribution from each said group of distributions;computing the variance of each combined distribution and finding an optimum one of said combined distributions having an at least nearly minimum variance;identifying the n coil currents associated with said optimum distribution;flowing a process gas into the chamber and generating a plasma in the chamber; andmaintaining currents through said coils corresponding to said n coil current associated with said one combined distribution.
  • 2. The method of claim 1 wherein said constructing a set of plasma distributions for discrete values of coil current comprises, for each of said n coils: measuring a plasma distribution at each one of a small set of widely spaced values of coil current spanning the range;determining the change in plasma distribution for a predetermined incremental change ΔI in coil current; andsynthesizing plasma distributions at finely spaced values of coil current lying between said widely spaced values by interpolating between the measured distributions at intervals of ΔI.
  • 3. The method of claim 1 wherein said plural predetermined plasma density distributions are two-dimensional.
  • 4. The method of claim 1 wherein n=2 and said coils comprise an inner coil and an outer coil.
  • 5. Electronically-readable storage media storing instructions for carrying out the method of any one of claims 1 or 2.
  • 6. A method of processing a workpiece in a chamber of a plasma reactor having a set of n electromagnet coils, wherein n is an integer, said method comprising: for each one of said n coils, constructing a set of plasma distributions Aj for discrete values of coil current Ij spanning a predetermined current range, where j refers to the one coil and ranges from 1 to n;defining different groups of said distributions Aj, each one of said groups comprising one distribution for each of said n coils, each group comprising a unique set of n distributions;computing a combined plasma distribution C from each one of said groups of n distributions Aj;defining a variance function as the standard deviation of C relative to an average value;searching for the optimum distribution C′ for which the variance function is minimized;looking up the components Aj′ of the optimum distribution C′ and looking up the coil currents Ij′ associated with those components;introducing a workpiece into the chamber, flowing a process gas into the chamber and generating a plasma in the chamber; andfor each of said coils, maintaining the D.C. coil current Ij′ in the jth coil.
  • 7. The method of claim 6 wherein said constructing a set of plasma distributions for discrete values of coil current comprises, for each of said n coils: measuring a plasma distribution at each one of a small set of widely spaced values of coil current spanning the range;determining the change in plasma distribution for a predetermined incremental change ΔI in coil current; andsynthesizing plasma distributions at finely spaced values of coil current lying between said widely spaced values by interpolating between the measured distributions at intervals of ΔI.
  • 8. The method of claim 6 wherein said plural predetermined plasma density distributions are two-dimensional.
  • 9. The method of claim 1 wherein n=2 and said coils comprise an inner coil and an outer coil.
  • 10. The method of claim 2 wherein said defining a variance function is preceded by computing said average value.
  • 11. The method of claim 10 wherein said average value is defined as:
  • 12. The method of claim 11 wherein said variance function is defined as: