1. Field of the Invention
This invention relates generally to an imaging system in a vehicle for detecting the presence of a person and, more particularly, to an imaging system for use in a vehicle that identifies a human face, and uses this information for vehicle security and convenience purposes.
2. Discussion of the Related Art
The National Highway Traffic Safety Administration requires passenger side airbags on all passenger cars manufactured after 1998. Airbags have saved thousands of lives, but safer airbag deployment systems can save even more lives. Proposals have been submitted by the National Highway Traffic Safety Administration to require advanced airbags that are safer and more efficient. Thus, there is a need in the art for an advanced airbag deployment system that determines airbag deployment depending on the detection, identify and tracking of a person in the passenger seat of the vehicle.
Future airbag deployment systems will be able to identify and track a person in the passenger seat of the vehicle. Based on this information, the system will provide signals whether to fire the airbag, fire the airbag at low velocity or fire the airbag at normal high velocity during a crash event, depending on whether a person is seated in the passenger seat, the size of the person and the position of the person relative to the airbag deployment door. In one example, if a person is detected in the passenger seat, and is some small distance (for example, within three inches) from the airbag door, then the airbag does not fire during a crash event. If a person is detected in the passenger seat, and is close (for example, between three and eight inches) to the airbag door, the airbag is fired at a lower velocity during a crash even. If the person is detected in the passenger seat, and far enough away (for example, more than eight inches) from the airbag door, then the airbag is fired at normal high velocity during a crash event.
Present passenger sensing systems typically include ultrasonic, weight, infrared and/or electromagnetic sensors to detect a passenger for airbag deployment. Ultrasonic sensing systems sense the motion of objects within the passenger seat area and determine whether the object is moving closer or farther away from the sensor location. However, ultrasonic sensors don't identify the nature of the object, and thus can't tell if it is a person or some other object, such as a bag of groceries. Similar to ultrasonic sensors, microwave sensors employed in active doppler radar systems can track objects, but cannot identify human presence. Weight sensors identify objects in the passenger seat based on applied pressure, but don't consider the passenger's location relative to the airbag. Passive IR sensors acquire thermal images of the passenger seat, but these systems are very costly. Present active IR sensors sense the relative location of the passenger relative to the airbag, but cannot identify human presence. Electromagnetic systems include LC resonant circuits where body capacitance is used to detect presence and identify objects, but these systems can not track the objects.
U.S. Pat. No. 5,835,613 issued to Breed et al., Nov. 10, 1998, discloses a vehicle interior monitoring system that claims to identify, locate and monitor persons in the passenger compartment of the vehicle. The monitoring system employs infrared emitters that illuminate the interior of the vehicle, and charge couple device (CCD) arrays that detect the radiation. Outputs from the CCD arrays are analyzed by computational devices that employ pattern recognition algorithms to classify, identify or locate the content or objects in the passenger seat. The pattern recognition system for determining vehicle occupants disclosed in the '613 patent employs complicated software that must learn the shape of an individual in all kinds of lighting situations under various conditions. Additionally, employing pattern recognition in this manner is limited in its ability to track the individual as he or she moves around in the passenger seat. Further, the ability to identify and track humans by general pattern recognition is questionably unreliable. Pattern recognition cannot identify who the person is, only detect an object's shape.
What is needed is an improved airbag deployment sensor and system that is able to effectively identify and track a person in the passenger seat of a vehicle. It is therefore an object of the present invention to provide such a sensing system for vehicle passenger airbag deployment.
In accordance with the teachings of the present invention, an imaging system for providing vehicle security and convenience features is disclosed that employs face recognition software and algorithms to identify and track a person. The system includes infrared emitters that emit an infrared signal along a predetermined field-of-view, and an infrared sensor, such as a CMOS sensor used as a video signal array, that receives reflected infrared illumination from objects in the field-of-view. A processor, including the face recognition software, is employed to detect human faces. Once a face is detected, it can be compared to a data base to identify the person.
Various applications of the imaging system for providing driver convenience and security include determining driver identification as the driver approaches the vehicle for automatically unlocking the door and starting the vehicle; determining if a potential thief is in the vehicle by face recognition; and providing driver seat adjustment, rear and side mirror adjustment and steering wheel adjustment.
Additional objects, advantages and features of the present invention will become apparent from the following description and appended claims, taken in conjunction with the accompanying drawings.
The following discussion of the preferred embodiments directed to an airbag deployment system that employs human facial feature recognition is merely exemplary in nature, and is in no way intended to limit the invention or its applications or uses.
According to one embodiment of the present invention, a technique for detecting, identifying and tracking a person in the passenger seat of a vehicle is disclosed. As discussed above, it is sometimes desirable to deploy a passenger side airbag in a vehicle at low velocity, or not deploy the airbag, during a crash event, depending on whether a person is occupying the passenger seat of the vehicle, the proximity of the person to the airbag door, and the size of the person. According to the present invention, the identification and tracking of the person is determined by face recognition software, in particular, software that recognizes and tracks the person's eyes and other facial features. Tracking can take place that way at many head angles and poses. The software algorithm would use calibrated face and feature separation to estimate range. The tracking of a localized area on the human face allows more image frames to be acquired per second, permitting more frequent tracking of the person. The more frequent tracking of the facial features will prevent acquiring a blurry image because the image is updated more frequently.
Various software is known in the art that processes data from video data patterns received from an object being analyzed, and determines whether the object has a face. For example, such software includes the Visionics Face-It software, well known to those skilled in the art. It is stressed that the present invention is not limited to any particular facial feature mapping function, but can include any known algorithm, suitable for the purposes described herein, for recognizing facial features, whether it be two-dimensional or three-dimensional, that are then also to be used for ranging functions, as well. Further, according to the present invention, ranging algorithms are used in combination with the know face recognition software.
As will be discussed in detail below, the present invention employs infrared radiation reflected off of objects in the passenger side of the passenger compartment of a vehicle that is received by an electronic video camera. The video camera generates the electrical signals and image used by the face recognition software to determine the presence identify and tracking of the person.
A video camera and IR LED illuminator unit 26 is mounted on the instrument panel 22 at a location suitable for the purposes described herein.
A filter 36 is provided over the camera 34 to filter out radiation that is not within the desirable infrared range. The filter 36 can be any filter suitable for the purposes described herein, such as a TiO2 filter or a polarizing filter. The filter layers and thickness can be chosen to transmit the IR image to the detector, but to reflect the visible image away from the detector. The polarizing filter can be used to reduce visible light to the detector using electro-optical polarization that passes the IR wavelengths, but strongly attenuates the non-IR wavelengths.
In this embodiment of the present invention, a single camera is used to acquire and monitor the range of the person 16. The software employed to perform this function utilizes two separate locations on the occupant's face to provide the ranging. In a preferred embodiment, it is the person's eyes that are detected to provide the triangulation for ranging purposes. However, as will be appreciated by those skilled in the art, other facial features of the person 16 can also be used, such as the person's ears, etc. Also, the software algorithm allows the person's head size to be determined so that both eyes do not need to be in view to track the person after he has been acquired. Additionally, the software can be used to view other parts of the person's body, such as the person's torso, in combination with the facial feature or head size detection.
Because human facial features are unique, a database can be used to store specific information, such as eye-to-eye separation, about a person, so that the software can particularly identify that person. This is important, in one example, so that the system can identify children and fifth-percent females, and inhibit air bag firing for these people, as required by government mandated regulations. Also, being able to specifically identify a person improves the ranging accuracy of the system because the system knows that person's eye separation or other specific facial features.
The acquisition and tracking software needs to be calibrated for a particular seat position, airbag position, vehicle makeup, etc.
A calibration procedure, described below, can be used to determine ΔY, ΔX and ΔΦ. There is a trade off between measuring a large number of calibration points and linearly interpolating between them or measuring a small number of calibration points and non-linearly interpolating between them. In theory, calibrating with a large number of points would not require a model for estimating that is described herein. The cost of making the large number of measurements necessary for a brute force calibration and the possibility that out of range operation of the camera 34 or software might occur unnoticed with brute force calibration must be considered before using that approach. The approach used herein uses a small number of calibration tests in order to form a model for interpolation.
From
x=−ΔX+ρ sin(θ+Δθ)cos(φ+Δφ) (1)
y=−ΔY+ρ sin(θ+Δθ)sin(φ+Δφ) (2)
This assumes that θ+Δθ is fixed during the calibration. Rearranging equations (1) and (2) yields:
Making eye coordinate readout measurements at fixed x+ΔX, and then taking the slope of the tangent with respect to changes in y yields:
Knowing x and the result on the right side of equation (4), ΔX can be determined. Knowing ΔX, equation (3) and the measured data can be used to determine ΔY. Then, using ΔX and ΔY, equation (3) and the data, Δφ can be determined. Using equation (3), and the fact that:
eye—separation—distance=yrt
the modified monocular equation follows from equation (6) for defining the objective parameter x, or the eye to airbag distance.
The calibration table consists of a lookup table of the SDK determined eye coordinate as a pixel value linked to the associated ray slope, as viewed in the camera coordinate system. Equation (6) can be simplified in terms of ray slopes (tangents of singular angles), so that the calibration table can be used directly to determine the eye to airbag distance from monocular operation. Employing the tangent trigonometry identity in equation (7) below,
and applying this identify to equation (6), leaves the modified monocular equation in a form that can be used by direct look up of tangent/slopes from the calibration lookup table. This equation is given in equation (8) below.
In one embodiment, the infrared LEDs 30 are continuously on to provide the reflected radiation received by the camera 34. However, typically some kind of filtering or signal processing must be done to correct for the problems caused by direct sunlight on the camera 34 that make it through the filter 36. Particularly, the system 50 needs to be able to distinguish between shadows caused by sunlight and actual edges of features on the occupant 16. According to the present invention, a frame differencing technique is employed that synchronously pulses the LEDs 30 on for a predetermined period of time and a predetermined number of frames of video data, and then off for a predetermined period of time over the same number of frames of video data. Then, the frames of data are subtracted from each other so that the frames without IR illumination can be subtracted from the frames with IR illumination, and the background can be eliminated. The detector is electronically shuttered synchronously with the pulses to provide exposure control. The frame differencing technique described herein is used in conjunction with the infrared pulsing to achieve the desired result. In other words, the frame differencing is synchronized to the infrared pulses.
The concept of frame differencing is the time-aperture, pixel-level storage of images using natural illumination only and natural illumination with added infrared illumination. Frame differencing allows these images to be subtracted to mitigate the effects of strong visible illumination. The set-up includes a neutral density filter that sets the IR illumination, plus worst case background to maximize analog-to-digital converter input. Face recognition requires that the worst case analog-to-digital range for the differenced image be 5 to 6 bits. The visible light would fit within the remaining range allowed by the analog-to-digital converter. The image differencing is either performed in the analog domain, where two pixel level capacitors are charged, one at each illumination level, or in the digital domain where a RAM memory of the digitized pixel output is taken at each illumination. The frame differencing acts to subtract out the illumination effects of the visible illumination, and to improve the image contrast. The frame differencing function can be performed in the acquisition/high bandwidth generation mode or in the narrow bandwidth track mode using pulsed LED illumination. The number of the electrons from the pulsed IR light source must be 10 times greater than the photon noise of the ambient illumination. Here, the noise of the ambient illumination is the square root of two times the number of electrons within the sun intensity, because two image frames are being acquired for every one IR image received.
The two storage sites 94 and 98 are summed in a summation amplifier 102. The difference between the two storage sites 94 and 98 is then digitized by an analog-to-digital converter 104, and represents the frame of data where the ambient light has been removed. The readout of data takes about 10 ms, then at the next time period, the next pulse from the cluster 28 occurs. The complete frame differencing process can be performed on a single chip in CMOS where the pixel array 90 and the storage sites 94 and 98 are together. In an alternate embodiment, the frame differencing is performed at a different time period at an off-chip site, where the storage sites 94 and 98 are RAM.
The frame differencing technique of the invention can be described in the following manner. The variables are defined as ambient illumination I(x,y), direct ambient illumination T(x,y), scene reflectance R(x, y), and modulated source as L when on, and 0 when off. The response of the camera 34 is proportional to the product of reflectance and illumination.
S(x,y,OFF)=k*(I(x,y)*R(x,y))+T(x,y)
S*(x,y,ON)=k*((L+I(x,y))*R(x,y))+T(x,y)
D(x,y)=S(x,y,ON)−S(x,y,OFF)=KL*R(x,y)
This difference scene has much smaller dynamic range than the simple image S(x,y,OFF). The same benefit can be derived by reading out the frame with LED, then reading the frame without LED and subtracting the frames external to the camera 34. The penalty is increased dynamic range required to avoid saturation.
L must be much greater than the direct ambient illumination response I. Thus, I is made as small as possible by using a narrow bandpass filter aligned in frequency to L. The raw sampling rate has to be twice the requirement set by the object tracking because two frames are differenced to get one frame to supply the face recognition software. The LEDs 30 have to be much faster. The IR radiation source has to be modulated such that all emission is during the time when all detectors are active. If the integration times of all pixels in the detector are not aligned, the available time for the source to be ON is reduced by the worst case misalignment.
The face recognition system described above for identifying and tracking a person in a vehicle passenger seat can be extended to other types of vehicle systems for other purposes.
In one embodiment, according to the present invention, a sensor or camera is pointed towards the passenger and driver's seat that takes an image of the driver and the passenger's facial feature. Using the facial feature software, the contours of the face and eye coordinates of the driver and the passenger are compared to images stored in a computer data base or lookup table. If the software makes a match of the human face with one stored in the data base, the driver and/or passenger airbag will be adjusted so that that person's characteristics are used to personalize airbag deployment. This will help the airbag to automatically adjust for low weight, young persons, short drivers, etc.
According to the invention, a drowsy driver application involves identifying when the driver is falling asleep at the wheel and waking the driver up. Initially, at regular time intervals, the face recognition software acquires the facial features such as eyes, or any combination of 2-D or 3-D facial/head/torso feature coordinates of the driver. Depending on the new coordinates of the driver's facial features, the software will send a warning to the driver that he or she is falling asleep. This warning will be based on the frequency of the driver's head nods, i.e., the back and forth head motion, whether the driver's eyes are closed, etc. In particular, the percentage of time that the eyes are closed during a fixed time interval is used to determine if the driver is asleep. Preliminary studies have shown that when a driver's eyes are shut more than 70% of the time over two seconds, the driver is drowsy and may be falling asleep. In addition, the back and forth motion of the driver's head or the driver's eyes looking at a cell phone will be utilized to determine if the driver is paying attention to the road ahead. The face recognition software can sense the difference in the driver's eye coordinates and this information can be relayed back to the driver. This information returns to the driver in the form of a mechanical motion, or an electrical stimulus or the production of a loud signal to wake the driver.
Similar to the drowsy driver application discussed above, drug use could cause the driver to fall asleep. Presently, many drivers use caffeine or speed to keep them awake over a long trip. However, when the drug wears off, the driver's response time diminishes to critical changes in road or driving conditions. Thus, monitoring of the driver's pupils will help detect a driver using drugs when the drugs stop working. Basically, the facial feature recognition software determines how often the iris of the eye is blocked by the driver's eyelid to determine whether the driver is falling asleep.
The facial feature recognition system described herein can also be used for driver identification. When the driver enters the vehicle, a camera pointing outward placed near the driver's door will take an image of the driver's facial features. Using the facial feature recognition software, the contours of the facial features such as eyes, or any combination of 2-D or 3-D facial/head/torso feature coordinates of the perspective driver are compared to images stored in the software data base. If the software makes a match of the face with one stored in the data base, the doors will open and the vehicle can be automatically started. Otherwise, the vehicle will remain in the off position. In an alternate embodiment, after the driver gains access to the vehicle and is in the driver seat, the camera can be used to identify the driver for purposes of starting the vehicle. Additionally, in a parental control situation, the driver can be identified as a young person, where the algorithm sets the control of the vehicle to a maximum speed for that driver's use.
The driver ID concept can be extended to driver convenience features using the face recognition software as well. Particularly, the driver's human facial features can be used to recall the driver's last seat position, rear and side mirror adjustment, steering wheel setting, radio setting, etc. In this example, the vehicle's computer can store the driver's last seat position, and rear and side mirror adjustment with the driver's human facial features. Subsequently, whenever the camera acquires the driver's facial features and this image matches a stored driver's personal preference, the seat and mirrors can be automatically repositioned to the desired locations.
The use of the specific imaging and processing techniques of the present invention allows the presence detection of people of all ages and sizes, whether conscious or asleep, in the interior of a vehicle with very high reliability. This detection is effective at all times, for stationary or moving vehicles, whether the engine is running or not, without being perturbed by external stimulus to the vehicle activities so that determination of dangerous human interior situations is enabled to generate warning alerts locally or to be communicated. This includes situations of abandoned children, medical emergency, captive occupants, sleeping occupants, etc.
In severe automobile collisions, it is sometimes difficult to determine what caused the accident. In one example, before the crash event, the exterior forward looking camera can make pre-crash images. These images will be relayed back to the driver in the form of an electrical or mechanical warning so that the driver can make adjustments to avoid a collision. In the event of a vehicle collision, the images acquired will be stored so at a later date, the pre-crash sensing conditions will be available to make a post crash injury assessment. After the crash, a crash notification signal can be automatically transmitted in the form of a radio wave signal which will alert the police and fire department of the crash location.
To keep the driver's attention on the road and hands on the steering wheel, the driver's facial gestures, such as movement of facial features, can be used to perform tasks previously performed manually. For example, a driver's facial feature command can be compared to an existing stored facial feature that will cause the vehicle to automatically change the automobile controls without the driver taking his or her eyes off the road. Some examples include control of automobile accessories, such as turning on and off the radio, turning on and off the vehicle lights, turning on and off the windshield wipers or blowing the horn.
Further, a camera can be positioned in the vehicle to monitor intrusion. In this example, a camera takes an image anywhere inside of the vehicle passenger compartment. If a person is detected in the backseat of the vehicle, the camera will take an image of the person's facial features. If the facial features match one stored in the computer, then no alarm will go off. Otherwise, an alarm can be sounded to alert the driver that someone is in the back seat. The camera can provide a recording of the intruder for later identification, and provide detection of human presence after the vehicle is locked.
The foregoing discussion discloses and describes merely exemplary embodiments of the present invention. One skilled in the art will readily recognize from such discussion and from the accompanying drawings and claims, that various changes, modifications and variations can be made therein without departing from the spirit and scope of the invention as defined in the following claims.
Number | Name | Date | Kind |
---|---|---|---|
4625329 | Ishikawa et al. | Nov 1986 | A |
4648052 | Friedman et al. | Mar 1987 | A |
4720189 | Heynen et al. | Jan 1988 | A |
4975969 | Tal | Dec 1990 | A |
5008946 | Ando | Apr 1991 | A |
5293427 | Ueno et al. | Mar 1994 | A |
5331544 | Lu et al. | Jul 1994 | A |
5348005 | Merrick et al. | Sep 1994 | A |
5581630 | Bonneau, Jr. | Dec 1996 | A |
5729619 | Puma | Mar 1998 | A |
5748780 | Stolfo | May 1998 | A |
5829782 | Breed et al. | Nov 1998 | A |
5835613 | Breed et al. | Nov 1998 | A |
5844573 | Poggio et al. | Dec 1998 | A |
5845000 | Breed et al. | Dec 1998 | A |
5850470 | Kung et al. | Dec 1998 | A |
RE36041 | Turk et al. | Jan 1999 | E |
6002326 | Turner | Dec 1999 | A |
6060989 | Gehlot | May 2000 | A |
6079738 | Lotito et al. | Jun 2000 | A |
6092192 | Kanevsky et al. | Jul 2000 | A |
6108437 | Lin | Aug 2000 | A |
6144366 | Numazaki et al. | Nov 2000 | A |
6252978 | Grantz | Jun 2001 | B1 |
6323761 | Son | Nov 2001 | B1 |
6326613 | Heslin et al. | Dec 2001 | B1 |
6400835 | Lemelson et al. | Jun 2002 | B1 |
6498970 | Colmenarez et al. | Dec 2002 | B1 |
20020097145 | Tumey et al. | Jul 2002 | A1 |
Number | Date | Country |
---|---|---|
4416507 | Nov 1995 | DE |
19645175 | May 1998 | DE |
0669227 | Aug 1995 | EP |
0924123 | Jun 1999 | EP |
1 167 126 | Jan 2002 | EP |
0038949 | Jul 2000 | WO |