1. Technical Field
One aspect of the present invention generally relates to a method and system for detecting objects, such as pedestrians and animals, using far infrared images.
2. Background Art
Many pedestrian detection systems have been proposed to mitigate pedestrian injuries resulting from the operation of vehicles. The typical pedestrian detection system includes one or more sensors mounted to the front of the vehicle for sensing conditions relating to the space in front of the vehicle to obtain data, which is transmitted to an onboard image processing unit. The image processing unit processes the collected data to detect whether a pedestrian occupies the space in front of the vehicle. If a pedestrian is detected, then the processing unit sends a signal to a vehicle warning system to alert the driver so that measures can be taken to avoid contact between the vehicle and the pedestrian. Moreover, the signal can be transmitted to an autonomous braking system to trigger autonomous braking of the vehicle.
One aspect of the present invention generally relates to a method and system for detecting targets, such as pedestrians or animals, using far infrared (IR) images generated from a far IR camera. In certain embodiments, the far IR camera collects thermal energy data related to the temperature of sensed objects and surroundings that can be converted into a far IR image. Pedestrians and other targets have a detectible footprint in the far IR image. Pedestrians are usually warmer than their surroundings, thereby providing the highly detectible footprint. Conversely, if the pedestrian is cooler than their surroundings, which may occur in desert areas, a highly detectible footprint is still generated.
In at least one embodiment, a relatively low resolution far IR camera is utilized. The use of a low resolution far IR camera with the methodology of certain embodiments of this invention provides a relatively low cost solution for detecting targets. According to one application, the low resolution far IR camera can be mounted to the front end of a vehicle so that the IR camera, in combination with a detection system, can be utilized to detect pedestrians occupying the space in front of the vehicle and in the far IR camera's field of view.
An object detection system is disclosed in at least one embodiment. The system includes a far IR sensor operable to sense thermal radiation of objects and surroundings in a field of view and to generate a far IR image in response thereto, and an image processing device operable to receive and process the far IR image to detect the presence of one or more objects in the field of view. The image processing device can be configured to process the far IR image by generating an initial threshold image based on the far IR image and an initial threshold value, iteratively obtaining a number of successive threshold images based on the far IR image and a number of successively increased threshold values, and determining the presence of one or more objects in the field of view based on the initial threshold image and the number of successive threshold images.
The features of one or more embodiments of the present invention which are believed to be novel are set forth with particularity in the appended claims. One or more embodiments of the present invention, both as to its organization and manner of operation, together with further objects and advantages thereof, may best be understood with reference to the following description, taken in connection with the accompanying drawings which:
a, 8b, 8c, 8d, 8e, 8f and 8g depict a series of successive threshold images according to an embodiment of the present invention;
a, 12b, 12c, 12d, 12e, 12f, 12g, and 12h depict a series of successive threshold images according to an embodiment of the present invention;
a, 16b, 16c, 16d, 16e, 16f, 16g, and 16h depict a series of successive threshold images according to an embodiment of the present invention;
Turning to the drawings,
It should be appreciated that the far IR camera 12 can be disposed at other positions on the vehicle, such as a lateral side of the vehicle or the vehicle rear end, according to different implementations of the present invention.
The far IR camera 12 senses thermal radiation of objects and/or surroundings in the field of view 56 within the detection plane 58 and generates an image relating to the temperature of the sensed objects and/or surroundings. As depicted in
In at least one embodiment, the far IR camera is capable of sensing thermal radiation in the 5-12 micron wavelength band and generating an image related to the temperature of sensed objects and/or surroundings. Far IR cameras suitable for use with one or more embodiments of the present invention are available from FLIR Systems, Indigo Operations, of Santa Barbara, Calif. It should be appreciated that other IR cameras can be utilized in accordance with one or more embodiments of the present invention.
The far IR camera correlates thermal energy emitted from an area in the field of view 56 within a cell of the detection plane 58. Each cell can be represented by a width dimension (x) and a height dimension (y). The radiation sensed at any point in the detection plane 60 can be represented by Px,y. Therefore, the data gathered by the far IR camera 12 includes a two-dimensional array of pixels, identified by a spatial coordinate (Px,y) corresponding to the image sensed by the far IR camera 12.
In at least one embodiment, the image generated by the far IR camera 12 is a gray scale image consisting of a two-dimensional array of gray scale pixels. The lowest level of thermal radiation, that is the relatively coldest temperature, can be represented by a gray scale value of 0 (black), while the highest level of thermal radiation, that is the warmest temperature, can be represented by a gray scale value of 255 (white). The relationship of the gray scale value and the temperature may vary depending on the application, but often a linear or logarithmic scale relationship is utilized.
The image dimensions can vary depending upon implementation of the present invention. The far IR camera may have a 320 x pixel dimension by a 240 y pixel dimension, i.e. a 320×240 resolution camera. In certain embodiments, an 80×60 resolution camera can be utilized in combination with the image processing device 14 to detect objects. In some applications, an 80×60 resolution camera, or lower resolution, is preferred because of the relatively low cost of the camera, coupled with the capability of the methods of one or more embodiments of the present invention to utilize a low resolution camera to detect objects. In yet other embodiments, the dimensions of the image can be selected from an x dimension of 60, 80, 160, 240 and 320 and the y dimension of 20, 60, 80, 120 and 240.
The detection system 10 also includes an image processing device 14 configured to receive the image signal transmitted by the IR camera 12. The image processing device 14 is also configured to detect one or more objects, for example, targets, in the image.
The image processing device 14 can generate object detection output, including, but not limited to, the identification and/or position of one or more objects of thermal interest, and/or the identification and position of one or more targets. In at least one embodiment, the detection output can be transmitted to other modules in a vehicle computer system to trigger a visual and/or audio warning alerting the driver to the presence of one or more targets in the space in front the vehicle.
In at least one embodiment, the detection system 10 also includes a vision sensor 18 and/or a radar sensor 20. The vision sensor 18 can be configured to obtain data using a vision technology that can be utilized to detect the presence of one or more objects. The data output by the vision sensor 18 can be transmitted to the image processing device 14 for detecting one or more objects using one or more pattern matching techniques. The radar sensor 20 can be configured to transmit a signal that is returned by reflecting off of an object. The radar sensor 20 can collect the return signal. The data relating to the transmitted and return signals can be transmitted to a threat detecting device 16 for detecting the presence of an object and for determining the distance between the radar sensor and the object. The threat detecting device 16 can generate object detection output, as depicted in
In block 104 of flowchart 100, the top and bottom of the image 150 is cropped. A top region of the image 150 can be cropped to eliminate a portion of the image 150 that represents the sky 154. A bottom region 156 of the image 150 can be cropped to eliminate a portion of the image 150 that may be obscured by the vehicle hood line. In at least one embodiment, the top 79 rows of the 320 rows of pixels are discarded and the bottom 51 rows of the 320 rows are discarded. The range of discarded top rows can be from 60 to 100, while the range of discarded bottom rows can be from 40 to 60, depending on the implementation of the present invention.
In block 106, the image 150 is thresholded using a threshold value. The gray scale value of each image pixel is compared to the threshold value to obtain a threshold pixel for each image pixel. If the gray scale value of the image pixel is greater than or equal to the threshold value, then the gray scale value of the threshold pixel is set to the maximum value of 1. If the gray scale value of the image pixel is less than the threshold value, then the gray scale value of the threshold pixel is set to the minimum value of 0.
In block 108, the threshold image 200 is searched for one or more connected components. According to the connected components processing technique, if a white pixel has a neighbor pixel that is white, then than the two pixels are connected. Using this technique, all the pixels that are connected to each other can be connected to obtain connected components. While the image 150 is two-dimensional, the image 150 can be embedded in three-dimensional space for purposes of convenience. Pixels in the same component are given the same z value. Black is assigned the value 0 and the connected components are assigned increasing integer values of z starting with the value of 1 for the first component that is located. In at least one embodiment, the connected components are located by searching the upper left hand corner of the image to the lower right hand corner of the image.
In block 110, the connected components are ordered by magnitude, i.e. the number of points in each of the connected components. Table 1 contains the magnitude of each value in three-dimensional representation 250.
The component having the value 1 has the largest magnitude, i.e. 3229. In at least one embodiment, the component having the largest magnitude represents an area of thermal interest, i.e. an area in which the temperature differential and the surrounding background is relatively large.
In block 112, the data of the component threshold image 300 is stored for later use in determining the presence of an object. The data can be stored in a database (not shown) of the detection system 10.
The steps represented in blocks 106, 108, 110 and 112 are repeated one or more times using successively increasing threshold values, as depicted in block 114. These steps are otherwise referred to as the repeated increasing threshold steps.
a, 8b, 8c, 8d, 8e, 8f, and 8g depict examples of a series of largest component threshold images 350, 352, 354, 356, 358, 360 and 362, respectively, utilizing threshold values 170, 180, 190, 200, 210, 220, and 240, respectively. While the threshold value was incremented 10 for most iterations of the repeated increasing threshold steps, it should be appreciated that the threshold value increment can be varied depending on the implementation of the present invention. In certain embodiments, the threshold increment value can be in the range of 1 to 20, and in other embodiments, the threshold increment value can be in the range of 1 to 10.
In block 116, the centroid angle for each largest component threshold image is calculated for each threshold value. The cetroid value can be computed as a two-dimensional pixel value Cx,y using a method known to one skilled in the art. The x value of Cx,y can be converted into an angle value within the field of view. If the field of view is 50 degrees and the x dimension (width) of the image is 320, then x=0 pixels corresponds to −25 degrees and x=319 pixels corresponds to 25 degrees. Using these field of view and x dimension values, the field of view degree is calculated as −25+(x/319)*50. In more general terms, the cetroid angle can calculated using the following equation:
A=B+(x/C)*D (1)
where
A=centroid angle
B=negative boundary of the field of view
x=x pixel of the centroid
C=pixel width of the image
D=total degrees of the field of view
In block 118, it is determined whether a convergence condition is met by the centroid angle for each largest component threshold image as a function of threshold value.
In at least one embodiment, the following algorithm is utilized to determine convergence. Each successive centroid is based on a different successive threshold value as defined above. (x1,y1) represents the x and y coordinates of a first centroid and (x2,y2) represents the x and y coordinates of a successive second centroid. p1 represents the number of points in the first centroid and p2 represents the number of points in a successive centroid. The distance (d) between the first centroid and the second successive centroid is defined as:
d=SQRT((144*(x1−x2)*(x1−x2))+25*(y1−y2)*(y1−y2)/(5*p2)) (2)
In at least one embodiment, if the distance is less than 1.2 between the first centroid and the successive second centroid (otherwise referred to as adjacent centroids), then the adjacent centroids are considered close. In other embodiments, if the distance is less than 1.5, then the adjacent centroids are considered close. In yet other embodiments, if the distance is less than 1.0, then the adjacent centroids are considered close.
In at least one embodiment, the convergence condition is met if at least three adjacent close centroids are identified.
In block 120, the number of pixels in each of the largest component threshold images is compared to the threshold value.
In block 122, the presence of an object of thermal interest is determined based the results of steps 118 and 120. In at least one embodiment, if a centroid angle convergence is identified through step 118 between thresholds bounded by first and second threshold values and curve 452 includes a constant portion 456 within a substantial portion of the threshold boundary, e.g. at least 80% (in other embodiments at least 70%), then an object of thermal interest exists. With respect to
The method 100 set forth in
The centroid angle of each largest component threshold image is calculated using the description set forth above for block 116.
The number of pixels in each of the largest component threshold images is compared to the threshold value using the description set forth above for block 118.
Table 2 compares the data for image 150 (with a pedestrian) to image 500 (without a pedestrian). Column heading 160, 180 and 200 represent the pixel count of the largest component threshold image for threshold values of 160, 180 and 200.
The top (20 rows) and bottom (16 rows) of gray scale image 700 were cropped (block 104). The gray scale image 700 was thresholded to obtain a number of threshold images based on threshold values of 160, 170, 180, 190, 200, 210, 220 and 240, as described above with respect to block 106. Each threshold image was subjected to steps 108, 110 and 112 of
The centroid angle of each largest component threshold image is calculated using the description set forth above for block 116.
The number of pixels in each of the largest component threshold images is compared to the threshold value using the description set forth above for block 118.
Turning back to
As required, detailed embodiments of the present invention are disclosed herein. However, it is to be understood that the disclosed embodiments are merely exemplary of the invention that may be embodied in various and alternative forms. Therefore, specific functional details described herein are not to be interpreted as limiting, but merely as a representative basis for the claims and/or as a representative basis for teaching one of ordinary skill in the art to variously employ the present invention.
While the best mode for carrying out the invention has been described in detail, those familiar with the art to which this invention relates will recognize various alternative designs and embodiments for practicing the invention as defined by the following claims.
Number | Name | Date | Kind |
---|---|---|---|
4859852 | Genna et al. | Aug 1989 | A |
6670910 | Delcheccolo et al. | Dec 2003 | B2 |
6728334 | Zhao | Apr 2004 | B1 |
6759949 | Miyahara | Jul 2004 | B2 |
6815680 | Kormos | Nov 2004 | B2 |
6956469 | Hirvonen et al. | Oct 2005 | B2 |
7024292 | Remillard et al. | Apr 2006 | B2 |
7574031 | Dehmeshki | Aug 2009 | B2 |
7646902 | Chan et al. | Jan 2010 | B2 |
20030053674 | Armato et al. | Mar 2003 | A1 |
20030218676 | Miyahara | Nov 2003 | A1 |
20040146917 | Cork et al. | Jul 2004 | A1 |
20040228529 | Jerebko et al. | Nov 2004 | A1 |
20040252870 | Reeves et al. | Dec 2004 | A1 |
20050110621 | Hahn et al. | May 2005 | A1 |
20050111757 | Brackett et al. | May 2005 | A1 |
20050157929 | Ogasawara | Jul 2005 | A1 |
20050231339 | Kudo | Oct 2005 | A1 |
20050240411 | Yacoub | Oct 2005 | A1 |
20050276447 | Taniguchi et al. | Dec 2005 | A1 |
20060083428 | Ghosh et al. | Apr 2006 | A1 |
20060147101 | Zhang et al. | Jul 2006 | A1 |
20060177125 | Chan et al. | Aug 2006 | A1 |
20060181678 | Stark et al. | Aug 2006 | A1 |
20060206243 | Pawlicki et al. | Sep 2006 | A1 |
20070007436 | Maksymowicz | Jan 2007 | A1 |
20070081712 | Huang et al. | Apr 2007 | A1 |
20070148665 | Cork et al. | Jun 2007 | A1 |
20080002887 | Revow | Jan 2008 | A1 |
20080170763 | Begelman et al. | Jul 2008 | A1 |
20090153659 | Landwehr et al. | Jun 2009 | A1 |
Number | Date | Country | |
---|---|---|---|
20080197284 A1 | Aug 2008 | US |