The disclosure relates to wafer inspection methods and systems.
In general, semiconductor structures are built with minimum structure sizes or critical dimensions of down to about 5 nanometers, and devices with smaller critical dimensions are being developed. The fabrication of such semiconductor structures may involve about 1000 fabrication steps, starting with a blank wafer, to form an array of semiconductor dies, each semiconductor die including the semiconductor structures. The fabrication steps can include, for example, about 100 lithography steps. In modern manufacturing lines, up to 200 wafers may pass each lithography step per hour.
In order to obtain a high yield of semiconductor structures of close to 100%, it is typically desirable to closely monitor variations in any fabrication step that may indicate process variations leading to defects. Therefore, high-speed in-line metrology is used between different fabrication steps or is integrated into the fabrication steps. This metrology is sometimes also referred to as wafer inspection. Metrology tools are used to detect indications of process variations or defect candidates within a structure after specified fabrication step. Typical silicon wafers used in manufacturing of semiconductor structures have diameters of up to 12 inches (300 mm). With the small structure sizes, defect candidates of the order of the critical dimensions are desirably identified in a very large area in a short time.
For wafer inspection, usually some kind of image of the wafer surface is acquired, using techniques like scanning electron microscopy (SEM) or x-ray diffraction. Given the large area of the wafer compared to the feature size, this results in huge amounts of data to be analyzed.
Some approaches to analyze these huge amounts of data include the use of machine learning techniques, but these present various challenges. In particular, machine learning techniques typically involve samples and time for training. Also, while object detection algorithms have been used for various applications, for example for self driving cars or facial recognition, applying such algorithms to wafer inspections presents a challenge.
The disclosure seeks to provide wafer inspection methods and systems which help to inspect a plurality of wafers in short time even with small feature sizes on large wafers.
In a general aspect, a method for wafer inspection includes: acquiring an image of a processed wafer; converting the image to a polygonal chain representation; converting the polygonal chain representation to a feature vector list; comparing the feature vector list to a further feature vector list obtained based on reference data for the wafer; and determining defects in the wafer based on the comparison.
A processed wafer as used herein, may refer to a partially processed or fully processed wafer, or, in other words, a wafer in any stage during or after the front-end processing of the wafer. By converting the image into a polygonal chain representation and then converting the same to a feature vector list, a fast comparison with feature vectors based on design data is possible to detect defects. Moreover, as the design is used as reference data, new defect types of patterns can be easily detected.
Acquiring the image may include acquiring the image by multiple beam-scanning electron microscopy. Multi-beam scanning electron microscopy allows to obtain high resolution images in short time.
The polygonal chain representation may include closed polygonal chains, for example only closed polygonal chains.
The feature vector list may include one or more of convex corners, concave corners, edges and line ends.
Converting to a polygonal chain representation may include comparing image elements of the acquired image to a threshold.
Converting the acquired image to a polygonal chain representation may also include performing a contour extraction, performing a corner detection or performing a line end extension.
Using such image processing techniques allows for an efficient processing.
The method may further include converting the reference data to the further feature vector list.
Converting the reference data to the further feature vector list may include aligning the reference data to the acquired image or registering the reference data with the acquired image. In this way, based on the design data a reference fitting the acquired image may be obtained. Aligning the reference data to the acquired image or registering the reference data with the acquired image may be performed by modifying (e.g. performing a transformation like a coordinate transformation, a rotation, a translation or a size transformation) the reference data, the acquired image or both.
The reference data may be selected form the group consisting of design data, a reference wafer and a reference chip.
Converting one or more of converting the image to a polygonal chain representation, converting the polygonal chain representation to a feature vector list, and the reference data to the further feature list may be based on machine learning.
Each of the one or more of converting the image to a polygonal chain representation, converting the polygonal chain representation to a feature vector list, and the reference data to the further feature list may alternatively or additionally be based on an image analysis.
In a general aspect, a wafer inspection system includes an image acquisition device configured to acquire an image of a processed wafer, and an evaluation device. The evaluation device is configured to: convert the image to a polygonal chain representation; convert the polygonal chain representation to a feature vector list; compare the feature vector list to a further feature vector list obtained based on reference data for the wafer; and determine defects in the wafer based on the comparison.
The image acquisition system may include a multi-beam scanning electron microscope.
The device, in particular the evaluation device, may be configured to perform any of the methods discussed above.
In a general aspect, a method includes: converting an image of a processed wafer to a polygonal chain representation; converting the polygonal chain representation to a first feature vector list; comparing the first feature vector list to a second feature vector list obtained based on reference data for the wafer; and determining defects in the wafer based on the comparison.
In a general aspect, a method includes: converting a polygonal chain representation of an image of a processed wafer to a first feature vector list; comparing the first feature vector list to a second feature vector list obtained based on reference data for the wafer; and determining defects in the wafer based on the comparison.
In a general aspect, a system includes a first device configured to acquire an image of a processed wafer. The system also includes a second device configured to: convert the acquired image to a polygonal chain representation; convert the polygonal chain representation to a first feature vector list; compare the first feature vector list to a second feature vector list obtained based on reference data for the wafer; and determine defects in the wafer based on the comparison
In the following, various embodiments will be described in detail referring to the attached drawings. These embodiments are to be understood as examples only and are not to be construed as limiting in any way.
Features from different embodiments may be combined to form further embodiments. Variations or modifications described with respect to one of the embodiments may also be applicable to other embodiments and will therefore not be described repeatedly.
Embodiments as discussed herein may be employed for in-line metrology wafer inspection during manufacturing of semiconductor devices. An example for such a manufacturing of semiconductor devices as an application environment for various embodiments is illustrated in
In
After certain fabrication steps, the wafers are subjected to in-line wafer inspection at 14. In the in-line wafer inspection, methods and devices as explained further below with reference to
If the defect map of the wafer indicates defects, i.e. when defects are detected, the wafers where the defects are detected may be provided to an at-line wafer defect review and classification at 17. “At-line” indicates that the wafers in this case are taken out of the usual production process for further inspection. In particular, in the review at 17, the locations identified in wafer defect map may be reviewed in order to verify and classify the indications of process variations or defects. The determination of the presence or absence of defects at 17 may be carried out by comparing the image data to data previously gathered for a similar section of another object (die-to-die), or it may be carried out by comparison to a corresponding portion of a reference database (die-to-database) or a design data (die-to-CAD). All data may be handled and controlled in databases, including defect databases forming a collection of representative defects, CAD databases collecting information about ideal or representative structures, and process recipes. As a result, at 15 feedback instructions to the fabrication may be given, for example to modify fabrication parameters to counter process variations, or also instructions for example to do maintenance due to possible malfunctioning components in a corresponding fabrication system.
These steps are repeated until at 18 all layers of the front-end processing are completed. Following this, at 19 wafer probe testing may be performed, where for example structures on the wafer are contacted electrically by probes to perform test measurements. This concludes the front-end processing.
After the front-end processing at 11, back-end processing 12 follows where the wafers are diced into separate chips, and the chips are packaged. More testing of the semiconductor devices manufactured may occur during the back-end processing 12.
As already discussed in the introductory portion, for large semiconductor wafers and small structure sizes in the in-line wafer inspection at 14, huge amount of data have to be gathered and analyzed. Embodiments discussed in the following referring to
At 20, the method includes performing an image acquisition of a wafer to be inspected. To this end, the system of
A recent development in the field of charged particle microscopes CPM is the MSEM, a multi-beam scanning electron microscope. In an MSEM, the wafer is irradiated by an array of electron beams, including for example 80 up to 10000 electron beams, as primary radiation. Each electron beam is typically separated by a distance of between 1 and 200 micrometers from its next neighboring electron beam. For example, the MSEM can have 100 separated electron beams, arranged on a hexagonal array, with the electron beams separated by a distance of 10 μm. These electron beams are scanned in parallel over an object, forming an image patch of for example 110 μm diameter. After acquisition of the image patch, a substrate or wafer stage is moved to a next patch position and the image of the next patch is obtained by again scanning of the electron beam array. Thereby, a high resolution images with below 5 nm resolution can be formed by stitching multiple image patches together. It is also possible to acquire high resolution images for specific locations on a wafer, for example for the above mentioned PCMs or critical areas only. With an MSEM, a fast scanning of a wafer surface is possible, and therefore, it is well suitable for wafer metrology with a high throughput and with high resolution of down to few nm, for example 5 nm. The throughput may depend on resolution and the number of beamlets. For 100 beamlets, typical examples of throughput are 3.5 sq mm/min (square-millimeter per minute), or up to 10 sq mm/min. With increasing number of beamlets, e.g. with 100×100 beamlets, the throughput can go up to more than 300 sq mm/min, or even more than 500 sq mm/min, or even exceed 1000 sq mm/min.
However, uses of techniques as described herein are not limited to an MSEM as image acquisition device 30, but other image acquisition devices of high throughput, for example as described above, may also be used, as long as the resulting image has a sufficiently high resolution, i.e. a sufficiently high pixel density, to capture the smallest relevant detail, for example the smallest occurring defects or deviations from an intended structure of the wafer which should be detected by the method of
An example for an acquired image is shown in
This image is then processed further using the method of
At 21, the image acquired at 20 is processed to be converted to a polygonal chain representation. A polygonal chain is a connected series of line segments. It should be noted that in the context of the present application, such a polygonal chain may also include only a single line segment. In some case, the polygonal chain is closed, such that the last line segment of the series is connected to the first line segment. In this case, a polygon is formed. In some embodiments, only closed polygonal chains, i.e. polygons are used. For example, when designing semiconductor chip layouts, in many tools used for design only polygons are used. In such cases, what seem to be lines in a design may for example be formed by thin elongate rectangles. To match such designs, also at 21 and subsequent steps polygons may be used. However, in other embodiments also open polygonal chains may be used. For conversion to polygonal chains, the image acquired at 20 may be subjected to a series of image processing steps. In other words, shapes present in the image are processed to resemble actual polygonal chains, as they are present in a design as will be explained later.
As a first step, the image in some embodiments is subjected to a thresholding for black and white conversion to separate the foreground which is then converted to polygonal chains from the background. For example, in image 50 of
For example, the image may be provided as a greyscale image. In thresholding, the gray scale image is converted to a binary image by pixel thresholding and thereby a data reduction is achieved. After selection of a threshold value, all pixels having a gray level value which is below the selected threshold value is set to 0 (black e.g. background) and all the pixels having a gray level value which is equal to or greater than the threshold value are classified as 1 (white, e.g. foreground). Further details of thresholding may be found at https://www.geeksforgeeks.org/matlab-converting-a-grayscale-image-to-binary-image-using-thresholding/
Then, image processing techniques like contour extraction, corner detection and extension (for example to form closed polygonal chains) may be performed. Generally, various feature extraction techniques may be used, in particular polygon feature extraction techniques. An overview over feature extraction techniques may be found on the English Wikipedia page “feature extraction”, retrieved 15:18, May 16, 2019, from https://en.wikipedia.org/w/index.php?title=Feature extraction&oldid=877129337.
Examples for contour extraction may be found in Image Contour Extraction Method based on Computer Technology from Li Huanliang, 4th National Conference on Electrical, Electronics and Computer Engineering (NCEECE 2015), 1185-1189 (2016).
In this way, Manhattan polygonal chains become visible. The term “Manhattan polygonal chains” refers to polygonal chains which have only right angles. They are also referred to as rectilinear polygonal chains.
Several examples for the conversion into a polygonal chain representation, in particular a Manhattan polygonal chain representation, are shown in
In embodiments, for the conversion, contours extracted from the image are converted to the closest Manhattan polygon structure. In some embodiments, this may be achieved using image processing algorithms like contour approximation or convex hull extraction. Contour extraction approximates a contour shape to another shape with less number of vertices depending upon a specified precision. In convex hull extraction, the polygon chains are checked for convexity defects and convex forms are removed. As a result, reduced polygon chains of convex shape are extracted.
For the example image 50, the result of the conversion to a polygonal chain representation is shown at 51 in
It should be noted that in embodiments this is still an image, which, however, has been processed to exhibit substantially only polygonal chain features.
Turning to
This conversion to a feature list is illustrated in
As can be easily seen, by conversion to this list the amount of data may be reduced significantly compared to full image data.
Furthermore, at 24 in
An example reference wafer or chip is designated with reference numeral 56 in
At 26 in
In embodiments where the feature lists are provided using a (co-ordinate{x,y}, feature vector) implementation as discussed above, the feature vector lists generated from the design and the wafer image are aligned. Thus the comparison now becomes even easier and also may give the accurate location of the defect to nm precision in some embodiments.
For example, by the comparison symbolized by a minus at 511 in
At 27 in
Obtaining feature vectors at 25 from the design data provided at 24 may be performed in various manners. Examples are shown in
Implementations of the subject matter and the functional operations described in this specification can be implemented in digital electronic circuitry, in tangibly-embodied computer software or firmware, in computer hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them. Implementations of the subject matter described in this specification can be implemented as one or more computer programs, i.e., one or more modules of computer program instructions encoded on a tangible program carrier for execution by, or to control the operation of, a processing device. Alternatively or in addition, the program instructions can be encoded on a propagated signal that is an artificially generated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal that is generated to encode information for transmission to suitable receiver apparatus for execution by a processing device. A machine-readable medium can be a machine-readable storage device, a machine-readable storage substrate, a random or serial access memory device, or a combination of one or more of them.
The term “processing device” encompasses all kinds of apparatus, devices, and machines for processing information, including by way of example a programmable processor, a computer, or multiple processors or computers. The apparatus can include special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application specific integrated circuit) or RISC (reduced instruction set circuit). The apparatus can also include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, an information base management system, an operating system, or a combination of one or more of them.
A computer program (which may also be referred to as a program, software, a software application, a script, or code) can be written in any form of programming language, including compiled or interpreted languages, or declarative or procedural languages, and it can be deployed in any form, including as a stand alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program may, but need not, correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or information (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.
The processes and logic flows described in this specification can be performed by one or more programmable computers executing one or more computer programs to perform functions by operating on input information and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application specific integrated circuit) or RISC.
Computers suitable for the execution of a computer program include, by way of example, general or special purpose microprocessors or both, or any other kind of central processing unit. Generally, a central processing unit will receive instructions and information from a read only memory or a random access memory or both. The essential elements of a computer are a central processing unit for performing or executing instructions and one or more memory devices for storing instructions and information. Generally, a computer will also include, or be operatively coupled to receive information from or transfer information to, or both, one or more mass storage devices for storing information, e.g., magnetic, magneto optical disks, or optical disks. However, a computer need not have such devices. Moreover, a computer can be embedded in another device, e.g., a mobile telephone, a smartphone or a tablet, a touchscreen device or surface, a personal digital assistant (PDA), a mobile audio or video player, a game console, a Global Positioning System (GPS) receiver, or a portable storage device (e.g., a universal serial bus (USB) flash drive), to name just a few.
Computer readable media (e.g., one or more machine readable hardware storage devices) suitable for storing computer program instructions and information include all forms of non volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto optical disks; and CD ROM and (Blue Ray) DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.
To provide for interaction with a user, implementations of the subject matter described in this specification can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball, by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input. In addition, a computer can interact with a user by sending documents to and receiving documents from a device that is used by the user; for example, by sending web pages to a web browser on a user's client device in response to requests received from the web browser.
Implementations of the subject matter described in this specification can be implemented in a computing system that includes a back end component, e.g., as an information server, or that includes a middleware component, e.g., an application server, or that includes a front end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back end, middleware, or front end components. The components of the system can be interconnected by any form or medium of digital information communication, e.g., a communication network. Examples of communication networks include a local area network (“LAN”) and a wide area network (“WAN”), e.g., the Internet.
The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. In another example, the server can be in the cloud via cloud computing services.
As can be seen from the above explanations, numerous variations and modifications are possible, and it is evident that the scope of the disclosure is not limited by the specific embodiments.