The present invention relates generally to the field of face detection and tracking. More specifically, the present invention relates to face detection and tracking in a wide field of view.
Surveillance systems are being used with increasing frequency to detect and track individuals within an environment. In security applications, for example, such systems are often employed to detect and track individuals entering or leaving a building facility or security gate, or to monitor individuals within a store, hospital, museum or other such location where the health and/or safety of the occupants may be of concern. More recent trends in the art have focused on the use of facial detection and tracking methods to determine the identity of individuals located within a field of view. In the aviation industry, for example, such systems have been installed in airports to acquire facial scans of individuals as they pass through various security checkpoints, which are then compared against images contained in a facial image database to determine whether an individual poses a security threat.
Current facial detection and tracking systems typically rely on the use of one or more pan-tilt-zoom (PTZ) cameras to track individuals located within a wide field of view. Such devices can include an optical system operatively coupled to a number of drive motors that permit the operator to zoom-in on the details of an individual, or to monitor a larger area from multiple camera angles. In certain designs, each of the cameras within the system can be connected to a computer equipped with image processing software and/or hardware that can be used to process images received from the cameras in order to detect the identity of the individual.
Due to the high resolution often necessary to accurately detect facial features, many prior-art facial detection and tracking systems lack the ability to both detect and track individuals within a wide field of view while simultaneously acquiring information sufficient to perform facial recognition. In systems employing PTZ cameras, for example, the ability of the camera to effectively track motion within a wide field of view is often limited by the speed and accuracy of the positioning mechanism employed. If, for example, the individual is located within a moving vehicle or is otherwise moving quickly through the image field, such cameras may not be able to adequately cover the entire image field while still providing sufficient resolution to abstract features from the individual's face. In some cases, the inability of the camera to accurately track individuals moving through the image field can also prevent multiple individuals from being detected and/or tracked simultaneously within a wide field of view.
The present invention relates generally to face detection and tracking systems and methods in a wide field of view. A facial detection and tracking system in accordance with an illustrative embodiment of the present invention can include a wide field of view camera for detecting one or more objects within a wider field of view, and at least one narrower field of view camera for obtaining a higher-resolution image of at least one object located within a subset space of the wider field of view. The narrower field of view cameras can, in some embodiments, be arranged in an array or pattern that, when seamed together, covers the entire field of view without the need for a positioning and/or zoom mechanism. In certain embodiments, the narrower field of view cameras can be overlapped slightly to facilitate the detection of objects moving from one subset space to the next.
In some illustrative embodiments, the face detection and tracking system can employ one or more tri-band imaging (TBI) cameras to detect and analyze various facial features utilizing a combination of low band near-IR light, high band near-IR light, and/or visible light. A near-IR illuminator can be provided to generate near-IR light on the individual, which can then be sensed by the one or more TBI cameras to determine the presence of skin and/or to detect various facial features. In certain embodiments, an adjustment module can also be provided for adjusting the amount of luminance emitted from the near-IR illuminator, if desired.
An illustrative method for detecting and tracking an individual within a wide field of view can include the steps of detecting an object using a wide field of view camera, determining the subset space location of the object within the wide field of view, tasking one or more narrower field of view cameras covering the subset space location to acquire one or more higher-resolution images of the object, and then processing the higher-resolution images to obtain one or more parameters relating to the object. In certain illustrative embodiments, the one or more narrower field of view cameras can be configured to obtain facial images of a tracked individual, which can then be compared against a facial image database to determine the identity of the individual. Various processing routines can be employed to detect and confirm the presence of skin and/or to detect one or more facial features related to the individual.
The following description should be read with reference to the drawings, in which like elements in different drawings are numbered in like fashion. The drawings, which are not necessarily to scale, depict illustrative embodiments and are not intended to limit the scope of the invention. Although examples of various steps are illustrated in the various views, those skilled in the art will recognize that many of the examples provided have suitable alternatives that can be utilized. Moreover, while several illustrative applications are described throughout the disclosure, it should be understood that the present invention could be employed in other applications where facial detection and tracking is desired.
To detect one or more facial features of the individual 14 as they move through the wide field of view in the general direction indicated, for example, by arrow 24, the PTZ camera 12 can be configured to pan and/or tilt in a direction towards the individual's face 22 and initiate an optical-zoom or telephoto mode, wherein the PTZ camera 12 zooms-in on the area surrounding the individual's face 22. In certain designs, for example, the PTZ camera 12 can include a vari-focus optical lens that can be adjusted to concentrate the PTZ camera 12 on a particular space within the wide field of view in order to provide a higher-resolution image of the face 22 sufficient to perform facial recognition of the individual 14. In other designs, digital techniques can also be employed to adjust the resolution of the PTZ camera 12, such as, for example, by altering the resolution of a charge coupled device (CCD) or other such optical device within the PTZ camera 12.
The PTZ camera 12 can be configured to monitor the wide field of view until an object of interest has been detected, or, in the alternative, can be configured to scan various subset spaces within the wide field of view until such motion is detected. In the latter case, the PTZ camera 12 can be programmed to scan an area in some predefined or random path until an object of interest is detected. Once an individual 14 or other object of interest has been detected, the PTZ camera 12 can then be configured to focus on the individual 14 and acquire an image of the individual's face 22 in the higher-resolution, telephoto mode.
Because of the time required for the positioning mechanism to pan and/or tilt towards the individual 14 and to zoom-in on the individual's face 22, many PTZ cameras 12 are limited in their ability to track individuals quickly moving through a wide field of view. If, for example, the individual 14 is positioned inside a moving vehicle or is otherwise moving through the image field at a rapid rate, the PTZ camera 12 may not be able to adequately track the individual while still providing a steady image necessary to perform facial recognition. In those systems in which the PTZ camera 12 is configured to scan the environment in a predefined or random path, the particular path traveled by the individual 14 through the wide field of view may even escape detection by the PTZ camera 12 altogether.
As can be further seen by reference to dashed lines 18′ and 20′ in
The wide field of view camera 28 can be configured to continuously operate in a wide-angle mode to constantly track objects of interest within a wide field of view. As can be seen in
In certain embodiments, the wide field of view camera 28 can be configured to operate in a low-resolution mode sufficient to detect and/or track an object of interest within the wide field of view while conserving power. The resolution capability of the wide field of view camera will depend on a number of factors, including, for example, the viewing angle of the camera, the pixel density of the optical system employed, and the various characteristics of the surrounding environment. While the illustrative wide field of view camera 28 depicted in
Each of the narrower field of view cameras 30,32 can be directed and/or focused on a subset space of the wide field of view for obtaining a facial image of the individual 34. As shown in
In use, the narrower field of view cameras 30,32 can be configured to provide a higher-resolution image of the individual's face 56 to detect and analyze various facial features of the individual 34 that cannot be resolved with a wider field of view camera. As with the wide field of view camera 28, each of the narrower field of view cameras 30,32 can be fixed in position, covering a subset field of view that does not change significantly as the individual 34 moves from one field of view to another. In operation, this arrangement permits each narrower field of view camera 30,32 to track the individual 34 without having to first pan, tilt, and/or zoom-in on the individual 34. Moreover, since each of the narrower field of view cameras 30,32 remains fixed during tracking, the ability of the system to accurately track objects of interest is not limited by the accuracy and/or speed of the positioning mechanism employed.
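By way of illustration only, the resolution advantage of a fixed narrower field of view camera can be estimated with a simple pinhole-camera approximation. The sketch below is not taken from the disclosure: the function name, the 90 degree and 15 degree viewing angles, the 640-pixel sensor width, the 10 meter distance, and the 0.16 meter face width are all hypothetical values chosen for the example.

```python
import math


def pixels_across_target(h_fov_deg, h_pixels, distance_m, target_width_m):
    """Approximate how many horizontal pixels span a target facing the camera.

    Hypothetical helper: assumes an ideal pinhole camera with no lens distortion.
    """
    # Width of the scene covered by the sensor at the given distance.
    scene_width_m = 2.0 * distance_m * math.tan(math.radians(h_fov_deg) / 2.0)
    # Share of the sensor's horizontal pixels that fall on the target.
    return h_pixels * target_width_m / scene_width_m


# Assumed example values: same sensor, same distance, different viewing angles.
wide = pixels_across_target(90.0, 640, 10.0, 0.16)
narrow = pixels_across_target(15.0, 640, 10.0, 0.16)
print(f"wide: {wide:.1f} px across the face, narrow: {narrow:.1f} px")
```

Under these assumed values the wide-angle view places only about five pixels across the face, while the narrower view places roughly thirty-nine, which is consistent with using the wide field of view camera for detection and the narrower cameras for facial analysis.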
In certain embodiments, the narrower field of view cameras 30,32 can be overlapped slightly to facilitate the detection and tracking of objects as they move from one subset space 44,50 to the next. In the illustrative embodiment of
The various cameras 28,30,32 forming the facial tracking and detection system 26 can be physically separated from each other at various locations within the environment, or can comprise a single camera unit including multiple cameras. In the latter case, for example, each of the cameras 28,30,32 can be disposed within a housing or inset within a wall, ceiling or other desired structure. In the illustrative embodiment of
While the illustrative embodiment of
As can be further seen in
In certain embodiments, the narrower field of view cameras 64 can comprise tri-band imaging (TBI) cameras, which use low band near-IR light, high band near-IR light, and visual light to analyze, detect, and match an individual's face. Such devices typically utilize a near-infrared light spectrum to scan facial images by sensing the IR light reflected from the individual's face. The ability to detect such reflected IR light avoids a characteristic problem inherent in many conventional visual spectrum systems, which attempt to analyze non-facial portions of the image during facial recognition. Moreover, since TBI cameras also utilize IR spectrum light to detect the presence of the individual's face, such devices are not as susceptible to environmental conditions such as glare through a window or windshield, inclement weather (e.g. fog, haze, rain, etc.), nighttime conditions, etc. that can affect the ability of the system to acquire clear image signals.
A near-IR illuminator 76 can be provided for generating near-IR light in both the low and high near-IR spectrums, if desired. In certain applications, for example, the near-IR illuminator 76 can be utilized to direct light towards the individual to obtain a clearer image of the individual's face during nighttime, or when other conditions exist. Since the near-IR light is outside of the visible spectrum, such light is not detectable by the naked eye, and therefore does not alert the individual that he or she is being detected and/or tracked.
As can be seen in
In the illustrative embodiment of
The narrower field of view cameras 80 can each be configured to recognize various facial features within a particular range of X-coordinates and Y-coordinates that covers a subset space within the wide field of view. In some embodiments, the narrower field of view cameras 80 can be configured to cover the entire space covered by the wide field of view camera 78, allowing the system 76 to acquire higher-resolution images of individuals and/or objects at all locations within the wide field of view. The ranges covered by each narrower field of view camera 80 can be discrete (i.e. with no overlap between adjacent fields), or, in the alternative, can be overlapped by some desired amount. In certain embodiments, each of the narrower field of view camera elements 80 can comprise a tri-band imaging (TBI) camera or other such device for detecting and analyzing facial features using multiple light spectrums (e.g. near-IR light, visible light, UV light, etc.).
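One possible way to represent the coordinate ranges described above is sketched below. The sketch is illustrative only: the `SubsetSpace` class, the `build_grid` and `cameras_covering` functions, the 640 by 480 coordinate frame, the two-by-two layout, and the 20-unit overlap are all hypothetical and are not specified by the disclosure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SubsetSpace:
    """X-Y range covered by one narrower field of view camera (hypothetical)."""
    camera_id: int
    x_min: float
    x_max: float
    y_min: float
    y_max: float

    def contains(self, x, y):
        return self.x_min <= x <= self.x_max and self.y_min <= y <= self.y_max


def build_grid(width, height, cols, rows, overlap=0.0):
    """Tile the wide field of view with cols x rows subset spaces.

    Each cell is widened by `overlap` on every side and clipped to the frame.
    """
    cell_w, cell_h = width / cols, height / rows
    spaces = []
    for r in range(rows):
        for c in range(cols):
            spaces.append(SubsetSpace(
                camera_id=r * cols + c,
                x_min=max(0.0, c * cell_w - overlap),
                x_max=min(width, (c + 1) * cell_w + overlap),
                y_min=max(0.0, r * cell_h - overlap),
                y_max=min(height, (r + 1) * cell_h + overlap),
            ))
    return spaces


def cameras_covering(spaces, x, y):
    """Return the ids of every camera whose subset space contains (x, y)."""
    return [s.camera_id for s in spaces if s.contains(x, y)]


grid = build_grid(640, 480, cols=2, rows=2, overlap=20)
print(cameras_covering(grid, 100, 100))  # interior of one subset space
print(cameras_covering(grid, 320, 100))  # inside an overlapped seam
```

With a non-zero overlap, a location near a seam is reported by more than one camera, which is one way the overlapped arrangement could support following an object from one subset space to the next.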
The TBI cameras 92,94,96,98 can be configured to operate simultaneously in a coordinated fashion to track and detect individuals 91 as they move from one subset space to the next. As can be seen in
As can be further seen in
The narrower field of view cameras 92,94,96,98 can each be configured to cover a discrete subset space within the wide field of view, or can be overlapped by some desired amount. In the latter case, the narrower field of view cameras 92,94,96,98 can be tasked to focus on different facial features of the individual 91. In certain embodiments, for example, one of the narrower field of view cameras (e.g. camera 92) could be tasked to provide a general scan of the individual's face whereas an adjacent narrower field of view camera (e.g. camera 94) could be tasked to provide a retinal scan of the individual 91. The various images acquired by each of the narrower field of view cameras 92,94,96,98 can then be processed via the computer 100 to determine the identity of the individual 91 and/or to compute various other parameters relating to the individual 91 (e.g. velocity, direction of travel, height, orientation, etc.).
Turning now to
Once one or more higher-resolution images are obtained from the narrower field of view cameras 80, an image processing routine or algorithm can be initiated to extract various features from the acquired images, as indicated generally by reference to block 128. At this stage, facial features related to the individual's nose, eyes, mouth, skin color, eyebrows, facial size, etc. may be obtained to perform facial recognition on the individual, or to determine some other desired parameter related to the individual. As indicated generally by reference to block 130, one or more parameters relating to the individual can then be outputted and further analyzed, if desired. As indicated generally by return arrow 132, the system 76 can be configured to update the X-Y coordinates and repeat the image processing as the individual moves through the wide field of view, allowing the system 76 to task different narrower field of view cameras 80 to track the individual, if necessary.
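The repeating sequence of updating the X-Y coordinates, tasking the covering cameras, extracting features, and outputting parameters can be sketched as a simple loop. This is a minimal sketch under stated assumptions rather than the disclosed implementation: the `track` function, the dictionary of camera ranges, and the `acquire` and `extract` callables are hypothetical stand-ins for the camera interface and the image processing routine.

```python
def track(positions, camera_ranges, acquire, extract):
    """Task narrower field of view cameras as an object moves.

    positions:     iterable of (x, y) coordinates reported for the object
                   by the wide field of view camera (assumed input).
    camera_ranges: dict mapping camera id -> (x_min, x_max, y_min, y_max).
    acquire:       hypothetical callable, camera id -> higher-resolution image.
    extract:       hypothetical callable, image -> extracted parameters.
    """
    results = []
    for x, y in positions:
        # Select every camera whose subset space covers the current location.
        tasked = [cid for cid, (x0, x1, y0, y1) in camera_ranges.items()
                  if x0 <= x <= x1 and y0 <= y <= y1]
        # Acquire images from the tasked cameras and extract parameters.
        params = [extract(acquire(cid)) for cid in tasked]
        results.append((tasked, params))
    return results


# Stub camera interface and feature extractor, for illustration only.
ranges = {0: (0, 340, 0, 480), 1: (300, 640, 0, 480)}
path = [(100, 200), (320, 200), (500, 200)]
results = track(path, ranges,
                acquire=lambda cid: f"img{cid}",
                extract=lambda img: img.upper())
for tasked, params in results:
    print(tasked, params)
```

In this stubbed run the object starts in the range of the first camera, passes through the overlapped region where both cameras are tasked, and ends in the range of the second camera.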
Once an image input is received from each narrower field of view TBI camera, a series of operations can then be performed to isolate the skin in the images from other surface elements, as indicated generally by reference to block 140. Such skin detection step 140, for example, can be performed to verify that the tracked object is not wearing a mask or other such covering that would prevent the system from accurately recognizing the individual's face.
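The disclosure does not spell out the operations used to isolate skin. One commonly described approach, assumed here purely for illustration, subtracts a weighted high band near-IR image from the low band near-IR image and thresholds the result, on the assumption that skin reflects more strongly in the low band than in the high band. The `skin_mask` function, the weight, and the threshold below are hypothetical.

```python
def skin_mask(low_band, high_band, weight=1.0, threshold=0.2):
    """Mark pixels whose weighted band difference exceeds a threshold.

    low_band, high_band: equally sized 2-D lists of reflectance values
    in the range 0..1 (assumed input format).
    Returns a 2-D list of booleans, True where skin is suspected.
    """
    return [
        [(lo - weight * hi) > threshold for lo, hi in zip(row_lo, row_hi)]
        for row_lo, row_hi in zip(low_band, high_band)
    ]


# Tiny 2x2 example images with made-up reflectance values.
low = [[0.8, 0.5], [0.7, 0.3]]
high = [[0.2, 0.5], [0.6, 0.05]]
mask = skin_mask(low, high)
print(mask)
```

A covering such as a mask would be expected to respond differently across the two bands than exposed skin, so pixels falling on the covering would tend to fail the threshold test in this sketch.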
As can be further seen with respect to block 142, a face detection step may also be performed to acquire various facial features that can be later used to determine the identity of the tracked individual as well as other desired parameters relating to the individual. Upon detecting various features of the individual's face at step 142, the information obtained can then be compared against a facial image database containing a number of previously stored facial images, as indicated generally by reference to block 144. If a match is found, as indicated generally by reference to block 146, the result can be outputted at block 148, informing the operator that a match has been obtained along with the identification of that individual. Alternatively, if no match is found, the system can be configured to alert the operator that the individual is not recognized within the facial image database.
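The comparison against the facial image database can be illustrated with a nearest-neighbor search. The disclosure does not specify how the comparison is carried out, so the sketch below assumes, hypothetically, that each face is reduced to a numeric feature vector and that a match is declared when the Euclidean distance to the closest stored vector falls within a threshold. The `match_face` function, the feature values, and the threshold are illustrative only.

```python
import math


def match_face(features, database, max_distance=0.5):
    """Return the identity of the closest stored face, or None if no match.

    features: sequence of numbers describing the detected face (assumed).
    database: dict mapping identity -> stored feature sequence (assumed).
    """
    best_id, best_dist = None, math.inf
    for identity, stored in database.items():
        dist = math.dist(features, stored)  # Euclidean distance
        if dist < best_dist:
            best_id, best_dist = identity, dist
    # Report a match only when the closest entry is close enough.
    return best_id if best_dist <= max_distance else None


# Made-up database entries and query vectors.
db = {"A": (0.1, 0.9, 0.3), "B": (0.8, 0.2, 0.5)}
known = match_face((0.12, 0.88, 0.31), db)
unknown = match_face((5.0, 5.0, 5.0), db)
print(known if known is not None else "not recognized")
print(unknown if unknown is not None else "not recognized")
```

Returning `None` for the unmatched case corresponds to the branch in which the operator is alerted that the individual is not recognized within the facial image database.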
As can be further seen by reference to block 158, one or more feature images can also be extracted from the two near-IR images of steps 152 and 154 using a multi-band feature extraction scheme. In certain embodiments, for example, the two near-IR images obtained from steps 152 and 154 can be utilized in conjunction with visible light, UV light, radar, or some other desired wavelength spectrum to abstract various features from the individual's face (e.g. nose, eyes, mouth, skin color, eyebrows, facial size, etc.) that can be later used to perform facial recognition on the individual.
Next, as indicated generally with reference to block 160, the images acquired from the skin detection step 156 and multi-band feature extraction step 158 can be processed to determine the identity of the individual. In certain embodiments, for example, a series of generalized Hough transforms or model-sized algorithms can be performed, providing an approximation of features such as the location of the eyes, eyebrows, nose, and/or mouth. From this processing step 160, a final video facial image can be produced, as indicated generally by reference to block 162. Other parameters such as the identity of the individual can also be outputted at this step 162, if desired.
Having thus described the several embodiments of the present invention, those of skill in the art will readily appreciate that other embodiments may be made and used which fall within the scope of the claims attached hereto. Numerous advantages of the invention covered by this document have been set forth in the foregoing description. It will be understood that this disclosure is, in many respects, only illustrative. Changes can be made with respect to various elements described herein without exceeding the scope of the invention.
Number | Date | Country |
---|---|---|
20070092245 A1 | Apr 2007 | US |