This invention relates in general to face recognition and in particular to light invariant face recognition.
Face recognition systems are used for the identification and verification of individuals for many different applications such as gaining entry to facilities, recognition of people to personalize services such as in a home network environment, and locating wanted individuals in public facilities. The ultimate goal in the design of any face recognition system is to achieve the best possible classification (predictive) performance. Depending on the use of the face recognition system it may be more or less important to make sure that the comparison has a high degree of accuracy. In a high security application such as identifying wanted individuals, it is very important that identification is achieved regardless of minor changes in the captured image vs. the stored image.
The process of face recognition typically requires the capture of an image, or multiple images, of a person, processing the image or images and then comparing the image with stored images. If there is a positive match between the stored image and the captured image the identity of the individual can either be found or verified. U.S. Pat. No. 6,292,575 describes such a system and is hereby incorporated by reference.
The processing of the images includes normalization. Normalization insures that the size of the face in the captured image is relatively the same size as in the stored images. This normalization is an attempt at controlling differences that might occur in a captured image which are not found in a stored image (and visa-versa). These differences cause false negative identification results when the two faces are compared but when in actuality the same person appears in both images.
Many systems also control the lighting of the captured image to insure that the lighting will be similar to the lighting of the stored images. Once the individual is positioned properly the camera takes a single or multiple pictures of the person and a comparison is made to stored pictures. A problem with these systems is that for many applications, such as for security applications, sometimes the lighting of the captured image is different than the lighting used for the stored image. When the lighting is different in the captured image than the lighting in the stored image there is a chance a false negative identification result. Since it is not always possible to have the same lighting, a system is needed which will take into account the different lighting.
Accordingly, it is an object of the invention to provide a system that performs face identification/verification which is less susceptible to errors caused by light variation.
This object is achieved by taking random samples of pixels within the face and performing comparisons with the same random samples in the stored images. The comparison is performed using almost any form of face recognition. The RANSAC algorithm is used to perform robust matching. The RANSAC algorithm helps eliminate the possibility of having shadowed regions of the face causing false negative results.
In one embodiment of the invention instead of a purely random sampling, a weighted sampling is used which is weighted in the areas of the face which are least likely to be affected by lighting. A determination is made as to which areas of the face are more likely to be affected by lighting. The samplings are then weighted in the areas where there is a lower probability of light changes. These weighted samples are then compared to the like pixels in the stored images using face recognition to determine similarity.
Other objects and advantages will be obvious in light of the specification and claims.
For a better understanding of the invention reference is made to the following drawings:
a shows the selection of a line through a set of points in the presence of outliers.
b shows the selection of a line through a set of points in the presence of outliers using the RANSAC algorithm.
a shows images of random pixel samples of size 1000, for an image of size 64×72.
b shows random pixel samples for real images of size 133×100, and the random sample being 40% of the image.
In typical face recognition programs pixels of a captured image are compared to pixels of stored images using face recognition methods. If a certain percentage of pixels match the same pixels in a stored image, the images are deemed to match and a verification or identification has occurred. There are many known face recognition methods which determine similarity between images. The term “match” as used herein can mean a probability of a match, an exact match, or enough pixels match above a predetermined threshold that identification is probable, or the similarity index found after performing face recognition on these samples is above a certain threshold. The problem is that if a portion of the face in the captured image has a shadow from a light source, and the image of this person in the stored database is shadow-free, when the pixels of the two images are compared it is likely that those pixels within the shadow will not match the corresponding pixels in the stored image. This is because there will be a large percentage of non-matching pixels due to uneven lighting rather than due to the faces in the images being from different people. If a certain percentage of pixels must match to be deemed a proper identification, the large percentage of non-matching pixels will cause a false negative match.
It has been found that face recognition can be performed on portions of a face rather than an entire face. The present invention chooses random samples of pixels within the face but these random samples are chosen as part of an algorithm which ignores “outliers”, i.e. data points that lie outside of a majority of the data points. The RANSAC algorithm is a robust estimation algorithm that avoids “outliers” from the chosen samples from causing incorrect matching results and it is used in a preferred embodiment of the instant invention. Other such algorithms can also be used such as the least median of squares (LmedS). The RANSAC algorithm, as explained below in more detail, chooses pixels randomly and detects “outliers” which in the present invention are the shadowed regions.
In a preferred embodiment of the invention the captured image is compared to the stored images using a known face recognition algorithm and if no match is found, then the captured image is randomly sampled and the random sample of pixels Tj is compared to the same random sample in the stored images using a face recognition algorithm. The number of random samples selected is determined by the computing power and the degree of accuracy that is needed for a particular application. In a preferred embodiment, anywhere from 100 to 1000 or more sets of samples are chosen. Face recognition is performed on all samples. The sample that provides the best result (i.e. the sample that most closely matches the same sample in a stored image) is chosen and the result of the sample comparison is compared to a threshold. If the sample of the captured image matches the same sample in a stored image then the stored image is deemed to show the same person as in the captured image. The stored image that provides this match has a higher probability of being the same face as in the captured image since it is likely that one of the samples will be relatively free of outliers. This results in fewer false negatives since the random sample is less likely to include the portions of the face that are affected by lighting.
To perform general random sampling of pixels in an image, we represent an image as a vector, i.e., each pixel is assigned a unique number as shown in Table 1.
For the example given in Table 1, to create a sample of 100 random pixels we create a sample of random numbers between 1 and 4616, and then just choose pixels corresponding tot he 100 random numbers.
In a preferred embodiment of the present invention, random sampling is used in the form of RANSAC which is applied to perform face recognition as follows:
Assume that the facial image in question is formed of 1000 pixels and that 50 of them are affected by light changes. Further assume that only 100 facial pixels 10 can be selected to perform face recognition. If all 100 pixels are from the 950 pixels that are not affected by light changes, then there should be good recognition results. The probability that none of the selected 100 pixels are affected by light changes is given by
If it is necessary to guarantee that at least one sample is outlier free with the probability of 0.99, then the number of samples X that needs to be chosen can be obtained from
As explained above, the sets of regions are chosen randomly; for a large enough X it is likely that at least one set of regions will have consistent lighting that will match well with the corresponding set of regions in the database. If there is a high similarity, then there is a match between the captured image and the stored image with high similarity.
Referring back to
Weighted Sampling for Light Invariant Recognition
While in uniform sampling all the pixels have the same probability of being chosen, in weighted sampling, some pixels are more likely to be chosen than the others. Weighted sampling has also been studied extensively in literature, and we will explain the difference between it and uniform sampling through the following example.
Let us assume that we need to choose one number between 1 and 5 (array X), but that sampling is not uniform but weighted, as shown in Table 2.
One way to perform weighted sampling using the weights set forth in Table 2 would be to form an auxiliary array Y={1,2,2,3,3,3,4,4,5,5}
This array has 10 elements and, if uniform sampling is performed on the array Y it is equivalent to a weighted sampling of X. In this example, 2 is two times as likely to occur as 1; and 3 is most likely to occur with the probability of 0.3.
Weighted sampling for light-invariant face recognition is simply an extension of the use of the RANSAC algorithm in face recognition. In the previous case, sampling is performed uniformly, i.e. each pixel of the set Tj is chosen with equal probability 1/(# of pixels in the face). In weighted sampling some pixels are more likely to be chosen than the others based on the following criteria.
Assume there is a face that has been photographed for 100 different illuminations.
Compare each pixel pj in the original facial image (i.e. with uniform illumination) with the same pixel in the images obtained under different illuminations and count how many times the difference between them is below some given threshold. Let us denote this number nj, and let
The pixels with higher nj are more useful as they are less likely to be affected by illumination changes, so these pixels result in higher accuracy if they are part of the comparisons. In other words, using these pixels one is more likely to find a comparison that is illumination invariant. Accordingly in the weighted-sampling embodiment of this invention, these “light invariant pixels” are chosen more frequently as part of Tj. The probability of a pixel being selected in the weighted sample is not uniform but is given by the formula
The algorithm now proceeds the same way as with the RANSAC illumination invariant recognition described before.
Alternatively, to compute weights for sampling, computer graphics can be used. A 3D model of a head is created with light sources placed in different directions. Many computer graphics packages can then turn the 3d image back into 2 dimensions with the different light sources creating different effects on the 2 dimensional images. One can then determine which areas of the face have a higher likelihood of changing with different light sources. The portions of the face that have a low likelihood of changing can then receive a higher weight or a higher probability of being chosen than the pixels with a high likelihood of changing. Examples of programs that perform this type of ray tracing are POV-Ray, or OpenGL.
While there has been shown and described what is considered to be preferred embodiments of the invention, it will, of course, be understood that various modification and changes in form or detail could readily be made without departing from the spirit of the invention. It is therefore intended that the invention be not limited to the exact forms described and illustrated, but should be constructed to cover all modifications that may fall within the scope of the appended claims.
| Filing Document | Filing Date | Country | Kind | 371c Date |
|---|---|---|---|---|
| PCT/IB03/06108 | 12/17/2003 | WO | 6/16/2005 |
| Number | Date | Country | |
|---|---|---|---|
| 60435247 | Dec 2002 | US |