The present invention relates generally to the field of image recognition. More specifically, it relates to a method and an apparatus for facial image acquisition and recognition, wherein an active near infrared (NIR) light within invisible light spectrum is applied to illuminate a target face.
Face recognition is a biometric technology in which the technology related to computers, image processing, and pattern recognition is also involved to perform person identification based on facial images. Recently, especially after 9.11 terror attacks, many countries in the world have attached a great importance to their public security. Accordingly, face recognition technology has been greatly noticed much more than ever before.
Biometric authentication refers to a class of high tech recognition technologies that use human biometric traits to carry out person verification and identification. Biometric traits of a person, such as fingerprint, palm print, iris, deoxyribonucleic acid (DNA), are unique and stable for the individual; they cannot be duplicated, stolen and forgotten. Because each person's characteristics are distinct from others, it is possible to accurately identify a person by using his/her unique biometrics. Existing biometric recognition methods generally include face recognition, fingerprint recognition, sound recognition, palm print recognition, signature recognition, eye iris, retina recognition and so on.
As compared to other recognition technologies, face recognition technique is of many advantages such that it is natural, simple and convenient, easy to operate, user friendly, contactless, and non-intrusive, etc. It can complete the recognition task without incurring much disturbance. With this technology, people no longer need to worry about touching his fingerprint on the fingerprint device, or talking to the microphone, or looking into an iris scanner required by conventional recognition in the prior art. A face can be recognized when a person show his face to the camera. Therefore, the face recognition technology can be widely applied to access control, machine readable traveling documents (MRTD), e-passport, anti-terrorism, ATM, computer logon, safe cabinet, time attendance, and so on.
Typical face recognition applications include the following modes:
Identification (1:N match) to determine a person's ID: A system (1) acquires the face image data, (2) extracts facial features or record from the image, (3) compares it with all or part of the records of enrolled persons in database to calculate the similarity scores, and (4) produce a sorted list based on the similarity score. Finally, the system outputs the persons ID corresponding to the top most similarity if the top most similarity is above an acceptance threshold; otherwise concludes that the person is not identified.
Verification (1:1 match) to verify whether the claimant. In this case, the system needs just to compare the facial record extracted from the image with that of the claimed person to give the similarity score. The system either accepts the claimant if the similarity score is above an acceptance threshold, or reject if otherwise.
Surveillance: Using the techniques of face image acquisition and face recognition to track a person in the surveillance area and determines his location.
Monitoring: To discover the faces in the surveillance area, far or near, regardless of their locations, track them and separate them from the background, compare the facial features with those in the database. The entire process is automatic, continuous, and real-time.
The above application modes can be widely applied in the following domains:
Personnel identification and indexing: These can be used in computer/network security, bank services, smart card, access control, frontier control, etc.
ID card: This can be used in voter registration, ID card, passport, driver's license, work identification and so on.
Computer information safeguarding system: This uses the facial features to recognition user, safeguards the computer information.
Crime suspect recognition system: This system stores face pictures and recognizes faces in analyzing incidents.
Long-distance person identification: This is applied in surveillance, monitoring, TV, traffic control, enemy-friend recognition and so on.
A face recognition process is illustrated in
Image acquisition module 10: It captures face image or video images through image acquisition equipment (for example video camera, digital camera and so on), then, then sends these images or video to a computer.
Feature extraction module20: Residing in a computer processor, this module examines the input image, detects the face, locate facial features such as eyes and mouth, normalize the face in pose and illumination, and extracts face features (face code).
Feature matching module30: Also residing in the computer, it compares the face features extracted from the input image information (face code) with those stored in the database40, and find the best matched one.
Obviously, the face feature database should be set up before the face recognition process. Therefore, as shown in
Both enrollment and recognition (Parts A and B) include the image acquisition and feature extraction modules. Of these, the face recognition part has an additional feature matching (comparing) module, while the face enrollment part has a data saving module.
Face feature extraction process 20 is composed of several steps: face detection or tracking 201, facial feature localization and face normalization 202, face feature extraction (face code generation) 203. The face detection finds the face in the input image or video image sequence, so that the face is separated from the background; the face tracking tracks detected the faces in video image sequence, face normalization or alignment uses localized facial landmarks (eyes and/or mouth) to normalize the geometry of the face to a standard pose and normalize the lighting to a standard illumination condition, face feature extraction calculates the face code from normalized face image.
Face matching 30 compares the face code from the input with those of the enrolled persons in the database 40, one by one in turn, computes the similarity matching scores, and gives a decision for verification or identification after referring to a similarity threshold.
To achieve reliable and accurate accuracy, face recognition should be performed based on intrinsic factors of the face only, mainly of 3D shape and reflectance of the facial surface. Variations brought about by extrinsic factors, including hairstyle, eyeglasses, expression, posture, and environmental lighting, should be reduced or eliminated in order to achieve high performance.
Most of existing face recognition technologies are based on visible light images. Such technologies have difficulties in adapting to changes in environmental lighting: Changes in lighting cause changes in facial features; therefore, their accuracy deteriorates when the lighting of the face recognition environment differs from that of the face enrollment environment, for example, US Patent US2001/003102A1.
The research shows that the difference of facial image for same person by light change is much bigger than that of different persons. (Sees also Yael Adnin, Yael Moses and Shimon Ullman, “Face recognition: The problem of compensating for changes in illumination direction”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, No. 7, 1,997, pp. 712-732). Existing face recognition technology depends on “passive” light source, that is, environmental light sources. Unfortunately, in real application, the environmental lights vary, and are not controlled. A change in environment light changes the captured facial image dramatically. This in turn significantly changes extracted face features, and causes significant drop in recognition accuracy.
Suppose for each point Pi, there is a vector ni=(nx,ny,nz)T, nTi is a unit vector, that is ∥n∥=1; Assume that the light source is point source, the direction is s=(sx,sy,sz), then we have the Lambertian imaging equation model, the gray scale Ii of Pi can be written as:
I
i=ρi(x, y)ni(x, y)T·s (1)
where i=1, 2, . . . , k, k is the number of pixels of a face image
It can be seen from the above equation that the facial image formation is related to the reflection and 3-D shape of the face surface, and the illumination. These are the three essential factors in the facial image formation process. The first two terms are related with the intrinsic characteristic of the face itself, and also the important information for face recognition; the last term, illumination, is the extrinsic factor, and also the primary factor which affects face recognition performance.
Although the light intensity ∥s∥ also affects the gray scale of facial images, this kind of influence can be adjusted using a simple linear transform. The top-most factor that affects the face recognition performance is the incidence angle of the light relative to the face surface vector. Assume that θi is the angle between the incident light ray and the face surface vector at Pi (θi ε [0, π]), the light intensity ∥s∥=1, then Equation (1) maybe expressed as follows:
I
i=ρi(x, y)cos θi (2)
where, i=1,2, . . . ,k; k is the number of pixels of a face image.
From equation (2), we can see that when is changes as a result of a change in the illumination direction, the facial image changes accordingly. It can also be illustrated by a correlation analysis: Given two facial images lighted from the left side and from the right side, respectively, the correlation coefficient of resulting images is generally a negative number; this means that the two images are completely different by the pixel values, even though of the same person.
In real applications, the environment lightings generally differ from place to place, and a face recognition system has to adapt to different environmental lightings. However, current face recognition technology mixes both intrinsic and extrinsic factors in the imaging and hence cannot adapt well to the environment This is why the best face recognition system can only achieve 50% accuracy (see also NIST 2002 Human Face Recognition Vendor Tests Evaluation Report (P. J. Phillips, P. Grother, R. J Micheals, D. M. Blackburn, E Tabassi, and J. M. Bone. March 2003).
Although there are many methods for compensation and normalization of illumination for face recognition, they are not very effective (see: P. N. Belhumeur, David J. Kriegman, “What is the set of Images of an Object Under All possible Lighting Conditions?”, IEEE conf. On Computer Vision and Pattern Recognition”, 1,996; Athinodoros S. Georghiades and Peter N. Belhumeur, “Illumination cone models for recognition under variable lighting: Faces”, CVPR, 1,998; Athinodoros S. Georghiades and Peter N. Belhumeur, ” From Few to many: Illumination cone models for face recognition under variable lighting and pose”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 23, No. 6, pp 643-660, 2,001; Amnon Shashua, And Tammy Riklin-Raviv, “The quotient image: Class-based re-rendering and recognition with varying illuminations”, Transactions on Pattern Analysis and Machine Intelligence, Vol. 23, No. 2, Pp 129-139, 2,001; T. Riklin-Raviv and A. Shashua. “The Quotient image: Class based recognition and synthesis under varying illumination” In Proceedings of the 1,999 Conference on Computer Vision and Pattern Recognition, Pages 566-571, Fort Collins, Colo., 1,999; Ravi Ramamoorthi, Pat Hanrahan, “On the relationship between radiance and irradiance: Determining the illumination from images of a convex Lambertian object”, J. Opt. Soc. Am., Vol. 18, No. 10, 2,001; Ravi Ramamoorthi, “Analytic PCA Construction for Theoretical Analysis of Lighting Variability in Images of a Lambertian Object”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24, No. 10, 2002-10-21; Ravi Ramamoorthi and Pat Hanrahan, “An Efficient Representation for Irradiance Environment Maps”, SIGGRAPH 01, Pages 497-500, 2,001; Ronen Basri, David Jacobs, “Lambertian Reflectance and Linear Subspaces”, NEC Research Institute Technical Report 2000-172R; Ronen Basri and David Jacobs, Lambertian Reflectance and Linear Subspaces, IEEE Transactions on Pattern Analysis and Machine Intelligence, Forthcoming; Terence Sim, Takeo Kanade, “Illuminating the Face”, CMU-RI-TR-01-31, Sep. 28, 2001, etc) Among these methods, some requires 3-D modeling of faces, while some assumes known facial shapes. These limitations reduce the applicability. Moreover, the computational cost is very high.
There have been several face recognition patents, most of them using visible lights and for applications. One is Chinese patent ZL99117360.X. There, it is about how to implement the face recognition for access control and time attendance, without much attention paid to face image acquisition, and influence of skin complexion and light changes. The recognition accurate rate of this method under the lighting changes is still low. These limit its applications.
US Patent (US2001/0031072A1) disclosed a device using VISIBLE light sources to actively illuminate the face for face recognition. The device uses visible light as active light sources and hence inherits problems existing in current visible light image based face recognition; further, the visible light are intrusive to human eyes especially; this is especially true when the active lights should be strong enough to override environmental lightings, as is the case in US2001/0031072A1. That patent did not publicize how to use INVISIBLE infrared lights as active light sources to illuminate the face for facial image acquisition and recognition, nor is there any information there about how to setup infrared light sources and infrared filters for better face image acquisition and recognition.
There have also been iris recognition techniques for accurate biometric identification, such as used in Iridian Corporation's products. Disadvantages of such technology include complexity of iris image acquisition devices, and inconvenience of use. These limit the applications. Chinese patent ZL99110825.6 has also disclosed portable iris equipment. This equipment is limited by the similar disadvantages.
The object of the present invention is to provide a method and an apparatus for facial image acquisition and/or facial image recognition that can overcome one or more problems existing in the prior art, such as the accuracy of face recognition is deteriorated due to changes of environmental lightings. The present invention aims to solve the problems of prior art by using a non-intrusive and user-friendly means, and to achieve accurate and fast face recognition.
A further object of the present invention is to provide a method and an apparatus for face image acquisition, wherein an active near infrared (NIR) light is used to illuminate the face during the acquisition of face images. The method and apparatus can significantly reduce unfavorable influence caused by variable environmental lights.
A further object of the invention is to provide a method and an apparatus for face recognition in which eyes and face in NIR facial images acquired with illuminating of active NIR light are localized by detecting specular highlight reflections in eyes under illuminating of active lightings. The present method can lead to accurate and fast face recognition.
The present invention provides a face recognition method, comprising the following steps:
providing an active infrared light to illuminate a target face when a user approaches an image capturing unit, wherein said active infrared light mounted around lens of an image capturing unit is near infrared (NIR) radiation light sources in invisible light spectrum,
capturing a plurality of facial images from a target face illuminated by said active NIR light sources, and sending a NIR facial image to a data processing unit;
localizing said face and/or eyes of said face, and cropping a portion of said facial image from said NIR facial image by said data processing unit;
extracting facial feature from said portion of said facial image;
comparing facial feature with that of previously extracted and stored in a facial image database;
outputting a recognition result obtained from said comparing step.
Said face recognition method is provided, wherein a NIR filter is disposed on said image capturing unit for cutting off visible light radiation while allowing the NIR light radiation to pass through, so as to improve NIR face image acquisition.
Said face recognition method is provided, further comprising the steps of:
detecting specular highlight reflections in eyes in said NIR face image to localize eye positions and thereby localize said face.
Said face recognition method is provided, further comprising the steps of:
judging whether eyes and/or face is successfully localized after sending at least one facial image to a data processing unit; if yes, going forward to the next step of cropping a portion of said facial image, otherwise repeating the localizing step until eyes and/or face is successfully localized.
The present invention further provides a facial image acquisition method, comprising the steps of:
providing a plurality of active infrared lights to illuminate a target face, wherein said active infrared light mounted around lens of an image capturing unit is a near infrared (NIR) light in invisible spectrum;
providing an image capturing unit for capturing NIR images of said target face, and sending/storing said NIR face images to a data processing unit used for localizing and recognizing said target face;
wherein the total energy of said active NIR light plus said environmental lightings on entire area of said target face is greater than that of environmental lightings on entire area of said target face by at least twice times.
Said facial image acquisition method is provided, wherein a NIR filter is disposed on said image capturing unit for cutting off a visible light radiation while allowing a NIR light radiation to pass through, so as to improve NIR facial image acquisition.
The present invention further provides a facial image acquisition apparatus used for realizing a facial image acquisition method, comprising an active NIR light and an image capturing unit;
Said active NIR light is mounted around lens of said image capturing unit to illuminate a target face;
Said image capturing unit captures NIR images of said target face illuminated by said active NIR light, and sends said NIR images to a subsequent data processing unit.
Said facial image acquisition apparatus is provided, wherein a NIR filter is disposed on said image capturing unit for cutting off visible light radiation while allowing the NIR light radiation to pass through, so as to improve NIR face image acquisition.
Said facial image acquisition apparatus is provided, wherein the spectrum range of said active NIR light is between 740 nm-1700 nm; said NIR optical filter is an NIR optical coating or an NIR optical glass disposed on the surface or inside of said lens.
Said facial image acquisition apparatus is provided, wherein said active NIR light comprises a plurality of constant NIR lights, or a plurality of flash NIR lights, or the combination thereof.
Said facial image acquisition apparatus is provided, wherein the direction of said active NIR light is approximately parallel to axis of said lens.
Said facial image acquisition apparatus is provided, wherein the total energy of said active NIR light plus said environmental lightings on entire area of said target face is greater than that of environmental lightings on entire area of said target face by at least twice times.
Said facial image acquisition apparatus is provided, wherein said image capturing unit includes an NIR optical filter of band-wavelength-pass or long-wavelength-pass type.
The present invention further provides an facial image recognition apparatus used for realizing the above facial image recognition method, comprising an active infrared lighting, an image capturing unit and a data processing unit;
wherein said image capturing unit includes a lens; and said active infrared light comprises a plurality of active NIR lights used for illuminating a target face and mounted around said lens;
said image capturing unit is used for capturing facial images and sending at least one facial image to said data processing unit;
said data processing unit comprises a PC or an embedded processor in which image processing software is installed, used for receiving images from said image capturing unit and localizing eyes and face in said facial images, and extracting facial features in said localized facial area, and comparing the extracted features with that of previously stored in a facial image database.
Said facial image recognition apparatus is provided, wherein the spectrum range of said active NIR light is between 740 nm-1700 nm; said active NIR light comprises a plurality of constant NIR lights, or a plurality of flash NIR lights, or the combination thereof.
Said facial image recognition apparatus is provided, wherein the direction of said active NIR light is approximately parallel to axis of said lens.
Said facial image recognition apparatus is provided, wherein said image capturing unit includes an NIR optical filter of band-wavelength-pass or long-wavelength-pass type, and it is used to suppress visible lights while allowing NIR lights to pass through so as to achieve better NIR imaging effect.
Said facial image recognition apparatus is provided, wherein said data processing unit includes a means for detecting specular highlight reflection in each eyes in said NIR face image, it is used for localizing said eyes and face through localizing the positions of a highlight spots.
Said facial image recognition apparatus is provided, wherein there is a displaying device for displaying facial images, used for adjusting the position of the target face in vertical and horizontal directions; said displaying device is a mirror or an LCD (liquid crystal displace), mounted in such a way that its surface normal is co-axis to said lens.
Said facial image recognition apparatus is provided, wherein said active NIR light can be controlled by a power switch, a proximity sensor switch or an RFID controlled switch.
The present invention can effectively overcome a main problem existing in current visible light image based face recognition methods and systems that their accuracy drops because of the unfavorable impact of uncontrolled environmental lighting on facial images, and therefore can increase the recognition accuracy under uncontrolled environmental lighting.
The above advantages are realized by the invented NIR face image acquisition method and device wherein active NIR lights, strong enough to override environmental lighting, are used to illuminate the face during image capturing and at the same time visible lights in the uncontrolled environment are suppressed using an NIR optical filter. Therefore, the invention leads to stable imaging properties and hence high recognition accuracy under different lighting environments.
Moreover, the invented face image acquisition method and apparatus are user-friendly because the active NIR lights are in the invisible spectrum and cause no disturbance to human eyes.
The advantages are further realized by the method and apparatus for the NIR facial image acquisition and recognition, wherein highlight specularities in the eyes are located quickly and accurately. The facial feature template extracted based on accurate eye localization can represent the face accurately and hence lead to high recognition accuracy.
a is a procedure for an embodiment of a face recognition method in
b is a diagram of an image acquisition and data processing modules for a system in
a is a schematic diagram of a face image acquisition in the face recognition based MRTD system in
b is a schematic diagram of a face recognition in a face recognition based MRTD system in
Detailed embodiments of the present invention are disclosed herein, with an illustrative drawings and an exemplar embodiment:
First, the active lights 421illuminate on the face area 410, the camera 422 (which can be a web camera, a CCTV camera, or specialized infrared camera) captures an image of the face 410; the acquired image is transmitted to the data processing unit where face image recognition takes place.
a reveals an embodiment of a face recognition apparatus given in the present invention, including the following steps:
Step 100, start a face image acquisition system 420;
Step 110, when human body approaches the system 420, an infrared sensor is triggered on, and the active lights 421 illuminate the face area;
Step 120, the camera 422 captures images of the face area illuminated by the active lights 421;
Step 130, the camera 422 sends at least one face image to the data processing unit (such as a PC or an embedded data processor) 430;
Step 140, the data processing unit 430 finds the face from the image and locates the positions of the eyes and/or face;
Step 150, if the eye/face localization is successful, execute step 160; Otherwise, execute step 130;
Step 160, crop the face area from the image;
Step 170, extract facial feature template;
Step 180, compare the extracted facial feature template with those stored in the face template database;
Step 190, output recognition result.
In the above steps, the total energy of the active lighting 421 and the environmental lighting 427 on the face area is greater than twice that of environmental lighting. For example, if the strength of the environment lighting is 30 LUX, and that of the active lighting is 120 LUX then the strength of the active lighting is 4 times that of the environmental lighting.
In
However, because NIR lights are in the invisible spectrum, human eyes are insensitive to them, and the active infrared lights cause minimum disturbance to the human; meanwhile, an NIR optimal filter 412 can be added into the cameras, to cut off visible lights in the environmental lighting, so as to further reduce the influence of environmental lighting; therefore, NIR lights are the most suitable type of active lights.
In any embodiment of the present invention, whatever type of active lights are used to illuminate the face, the relative position between the active lights and the camera should be relatively fixed, and the angle between the direction of the active lighting and the axis of the camera lens should be in a sharp angle.
Refer to
When infrared lighting is used, an infrared optical filter can be mounted on the camera lens, so as to cut off the shorter wavelength visible lights, and to further reduce the influence of environmental lights. For the present invention, the preferred infrared lights are of near infrared in the wavelength range of 740 nm-1700 nm.
When an infrared optical filter is used, the filter can be either band-pass or long-pass type. For example, when the infrared lights are 850 nm LEDs, a band-pass filter could be chosen, such that it has the central wavelength of 850 nm to allow infrared ray of around 850 nm to pass while cutting of ray of wavelengths shorter than 800 nm and longer than 900 nm; or a long-pass filter could be chosen, such that it allows infrared ray of wavelength longer than 800 nm to pass, while cutting off ray of wavelengths shorter than 800 nm.
In
In
In
Refer to
I
i=ρi(x, y)ni(x, y)T·(s1+s2) (3)
where i=1,2, . . . ,k;
If the strength of the active lighting S1 is much greater than that of the environmental lighting S2, i.e. ∥S1∥>>∥S2∥, then equation (3) can be approximated by:
I
i≈ρi(x, y)ni(x, y)T·S1 (4)
where i=1,2, . . . ,k;
If in the process of face recognition, a further constraint is imposed, namely, the relative position between the face and the camera is un-changed and so is the angle between the facial surface normal and active light direction, then according to equation (4), the acquired image is determined by the intrinsic properties of the face (ie, facial surface albedo and facial surface normal), nearly regardless of environmental lighting. Facial images acquired in such as way is most stable and best for face recognition.
Refer to
In
Step 300, start an image enrollment system;
Step 310, the passenger hands in the travel document 502 when the body approaches to within about 50 cm from the counter 500. The infrared sensor switch turns on the active lights (near infrared LEDs) to illuminate the face area;
Step 320, the passenger moves his head so that he can see his face in the middle of the mirror, so that the active light camera with an optical filter can take pictures of the face;
Step 330, the camera captures at least one image and send it to the data processing unit (or a PC);
Step 340, the data processing unit locates the two highlight spots from the image;
Step 350, if two highlights are detected, execute S360, otherwise, execute S330;
Step 360, crop the face area from the image, based on the two detected highlight spots;
Step 370, extract facial feature template(s);
Step 380, store the extracted facial template(s).
b discloses further details of face image acquisition and processing, including the following steps:
Step 200, start a face recognition apparatus;
Step S210, the passenger hands in the travel document 502 when the body approaches to within about 50 cm from the counter 500. The infrared sensor switch turns on the active lights (near infrared LEDs) to illuminate the face area;
Step 220, the passenger moves his head so that he can see his face in the middle of the mirror, so that the active light camera with an optical filter can take pictures of the face;
Step 230, the camera captures at least one image and send it to the data processing unit (or a PC);
Step 240, the data processing unit locates the two highlight spots from the image;
Step 250, if two highlights are detected, execute S360, otherwise, execute S230;
Step 260, crop the face area from the image, based on the two detected highlight spots;
Step 270, extract facial feature template; Step S280, compare the extracted facial template with those stored in the database;
Step 290, output recognition result.
In real applications, the face enrollment system and the face recognition system can be built into one combined system. The difference is that the latter does not include the enrollment phase. The custom inspector checks the documents against the enrolled passenger, associate the personal information with the enrolled facial image, and test whether the person can be verified his identity successfully by the system.
In the embodiment shown in
Further, the imaging system of the present invention can be on a motion platform, to be an elevator-pan-tilt-zoom camera unit. Such a device can track the people, control the active lights, and capture face images. It also caters for people of different heights.
The present invention can enable face recognition in the complete darkness without environmental lighting.
The present has further advantages such as being highly accurate and stable, compact low in cost, autonomous, convenient to use in various applications and for installation and maintenance.
New characteristics and advantages of the invention covered by this document have been set forth in the foregoing description. It will be understood, however, that this disclosure is, in many respects, only illustrative. Changes may be made in details, particularly in matters of shape, size, and arrangement of parts, without exceeding the scope of the invention. The scope of the invention is, of course, defined in the language in which the appended claims are expressed.
Number | Date | Country | Kind |
---|---|---|---|
200310121340.1 | Dec 2003 | CN | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/CN04/00482 | 5/14/2004 | WO | 00 | 6/9/2006 |