The technical field of the present invention relates to traffic information sensing and detection.
Traffic information detection systems using existing video image detection systems (VIDS) have many advantages. For example, such detection systems typically use video cameras as sensors and thus have wide-area detection capabilities. Usually, one camera can cover several traffic lanes, which is difficult to achieve with other sensors such as radar or inductive loops. While images generated by video camera sensors allow for efficient detection of shock waves and other spatial traffic parameters, such as density, queue lengths, and speed profiles, this is typically not the case for data generated by other conventional sensors. In addition, a VIDS provides ancillary information such as traffic on road shoulders, stopped vehicles, lane changes, speed variations between vehicles, and traffic slowdowns in the opposite direction. As the size of camera sensors decreases and the processing capability of processors increases, it is becoming more and more common to employ traffic information detection systems with VIDS.
Another disadvantage of the prior art traffic detection systems is that the sensors cannot identify traffic lanes automatically, thus requiring operating personnel to manually obtain lane information from sample images during installation. The lane information is also transmitted back to the traffic management center for subsequent detection. For a traffic detection system that includes thousands of sensors, such activities involve an enormous amount of labor. Moreover, once the position or orientation of a sensor is changed, the lane information has to be re-configured; otherwise erroneous detection results will be generated. Thus, the lack of automatic lane identification capability in the prior art sensors has brought great inconvenience to traffic information detection.
A traffic information sensor system includes an image acquisition device to generate video images of a traffic scene. The system also includes an image detection device to extract traffic information from the video images in connection with lane position information of the traffic scene. The system also includes an information transfer device to transmit the extracted traffic information instead of the video images to a traffic management center.
In the accompanying drawings, like reference symbols refer to like devices.
The image acquisition device 210 that is coupled to the image detection device 220 can be, for example, a video camera, a CMOS (Complementary Metal Oxide Semiconductor) video camera, a CCD (Charge Coupled Device) video camera, or a WebCam camera, and is operable to acquire digital video images of the traffic scene (e.g. a road) continuously.
The image detection device 220 is, for example, an embedded processor, an application specific integrated circuit, a system-on-chip or a general-purpose computer that is programmed to execute particular image detection software instructions for (I) performing detection on the images acquired by the image acquisition device 210, (II) extracting traffic information of interest (e.g. the average speed in each lane, and the number of vehicles passing each lane in a given period of time) in connection with lane position information of the traffic scene, and (III) transferring the extracted traffic information to the information transfer device 230 in the form of a digital data signal.
Referring again to
In
Referring back to
The lane position information in this embodiment may be set in advance, or may be obtained from the video images by the sensor.
The lane localization module 421 localizes the lanes by use of various known methods, e.g. by analyzing vehicle tracks or road markings in the images. For example, lanes can be found by a least squares polynomial fit to vehicle movement tracks in the images, as disclosed in José Melo, Andrew Naftel, Alexandre Bernardino and José Santos-Victor, “Viewpoint Independent Detection of Vehicle Trajectories and Lane Geometry from Uncalibrated Traffic Surveillance Cameras”, ICIAR 2004, LNCS 3212, pp. 454-462, 2004. Alternatively, lanes can be found by identifying road markings on the lanes in the images, as disclosed in Andrew H. S. Lai and Nelson H. C. Yung, “Lane Detection by Orientation and Length Discrimination”, IEEE Transactions on Systems, Man, and Cybernetics—Part B: Cybernetics, Vol. 30, No. 4, August 2000.
Here, as an example, accumulation of difference images between adjacent frames is used to perform lane localization and lane discrimination. This calculation utilizes the fact that most vehicles follow their path and do not change lanes while they pass through the camera's field of view. Specifically, the lane localization module 421 includes two sub-modules, namely an activity map generation sub-module 4211 and a lane discrimination sub-module 4212. First, the activity map generation sub-module 4211 computes the difference between two adjacent frames and detects the active pixels whose difference values are above a threshold, the active pixels indicating the movements of the vehicles. Accumulation of such active pixels during the initialization period forms an activity map. Thereafter, the lane discrimination sub-module 4212 determines the middle lines of the lanes and the boundaries between the lanes by finding the local minimum and maximum values in the activity map. The operations of the aforementioned two sub-modules will be described in more detail with reference to
The flow chart in
At 52, sub-module 4211 performs accumulation. The activity map is incremented at the positions of the active pixels. At 53, it is determined whether or not the map has converged. If so, the activity map is output at 54 and the present processing ends. If it has not converged, the process proceeds back to 51 for the processing of the next frame. The activity map is deemed to have converged when the positions of the active pixels generated by a number of successive frames are substantially the same as the positions of the active pixels already on the activity map.
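As a minimal illustrative sketch only, the activity map generation of steps 51 to 54 could be implemented along the following lines in Python using OpenCV and NumPy. The function name, the difference threshold, and the simple convergence test (no new positions becoming active for a fixed number of consecutive frames) are assumptions made for illustration and are not prescribed by the embodiment.

```python
import cv2
import numpy as np

def build_activity_map(frames, diff_threshold=25, stable_frames=30):
    """Accumulate inter-frame differences into an activity map (a sketch)."""
    activity = np.zeros(frames[0].shape[:2], dtype=np.float32)
    prev = cv2.cvtColor(frames[0], cv2.COLOR_BGR2GRAY)
    unchanged = 0
    for frame in frames[1:]:
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        # Step 51: active pixels are those whose inter-frame difference
        # exceeds the threshold (moving vehicles).
        active = cv2.absdiff(gray, prev) > diff_threshold
        occupied_before = activity > 0
        # Step 52: increment the activity map at the active pixel positions.
        activity[active] += 1.0
        # Step 53: crude convergence test -- no new positions have become
        # active for `stable_frames` consecutive frames (an assumption).
        if np.array_equal(occupied_before, activity > 0):
            unchanged += 1
        else:
            unchanged = 0
        if unchanged >= stable_frames:
            break
        prev = gray
    # Step 54: output the converged activity map.
    return activity
```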
The flow chart of
After obtaining the binary map (at 61), at 62 the Hough transform is employed to detect the lines converging toward the vanishing point (u0, v0), thus obtaining line equations of the middle lines of the lanes and the boundary lines between the lanes. After detecting the lines (at 62), at 63 the vanishing point (u0, v0) is estimated by the sub-module 4212 using a least squares optimization algorithm.
At 64, the accurate positions of the boundaries and the middle line of each of the lanes are further estimated by using the (u0, v0) obtained at 63. Specifically, the activity map is sampled along the lines connecting the vanishing point (u0, v0) and each pixel uj in the bottom row of the image. The activity values along each line are averaged, thus creating a one-dimensional signal that is a function of u. The peaks of the signal indicate strong traffic activity (the middles of the respective lanes) and the valleys indicate the absence of vehicle activity (the lane boundaries); the accurate positions of the boundaries and middle line of each lane can therefore be obtained by detecting the peak and valley positions and then connecting them with (u0, v0).
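The sampling and peak/valley detection of step 64 might look as follows; this is a hedged sketch under simplifying assumptions (one sample per image row along each ray, and naive local-extrema detection), not the definitive implementation of the embodiment.

```python
import numpy as np

def lane_profile(activity, u0, v0):
    """Average activity along rays from the vanishing point (u0, v0) to each
    bottom-row pixel u; peaks suggest lane middles, valleys lane boundaries."""
    h, w = activity.shape
    profile = np.empty(w, dtype=np.float32)
    for u in range(w):
        # Sample the ray from (u0, v0) to (u, h - 1), one sample per row.
        vs = np.arange(int(max(v0, 0)), h)
        t = (vs - v0) / max(h - 1 - v0, 1e-6)
        us = np.clip(np.round(u0 + t * (u - u0)).astype(int), 0, w - 1)
        profile[u] = activity[vs, us].mean()
    return profile

def local_extrema(profile):
    """Return indices of local maxima (lane middles) and minima (boundaries)."""
    maxima = [u for u in range(1, len(profile) - 1)
              if profile[u] > profile[u - 1] and profile[u] > profile[u + 1]]
    minima = [u for u in range(1, len(profile) - 1)
              if profile[u] < profile[u - 1] and profile[u] < profile[u + 1]]
    return maxima, minima
```

Each detected peak or valley position u can then be joined to (u0, v0) to obtain the corresponding middle line or boundary line of a lane.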
The sub-modules 4211 and 4212 may equally use other methods known in the art to generate the activity map and to perform the lane discrimination using the activity map; such methods are not described in further detail here.
The vehicle counting module 722 performs the operations shown in
At 81, an average value of n frames of images (where n is a sufficiently large number, for example 150) is calculated by the module 722 as the initial background of the road.
At 82, one or more virtual lines are arranged on the lanes of interest by the module 722. Preferably, the virtual lines are arranged at different positions along the middle line of a lane, with each virtual line being perpendicular to the middle line and its two endpoints located on the two boundaries of the lane, respectively. Of course, the length of a virtual line may be less than the width of the lane as long as it remains capable of accurately reflecting the passage of vehicles.
At 83, detection is performed on the pixels in each of the virtual lines; when a vehicle passes a virtual line, the pixels on the virtual line differ from the initial background pixels for several frames. By analyzing the pixel changes in the virtual lines, the two-dimensional image data are reduced to a one-dimensional time signal, thus enabling real-time processing by low-end embedded processors.
At 84, since vehicles may change lanes and noise may be present in some virtual lines, the vehicle counts from all the virtual lines that belong to the same lane are preferably averaged so as to obtain the average vehicle count of the lane.
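A minimal sketch of steps 81 to 83 for a single virtual line is given below, assuming grayscale frames; the function name, the difference threshold, and the "active ratio" occupancy heuristic are illustrative assumptions rather than parameters taken from the embodiment. Per step 84, the counts returned for the several virtual lines on one lane would then be averaged.

```python
import numpy as np

def count_vehicles(frames, line_pixels, bg_frames=150, diff_threshold=30,
                   active_ratio=0.5):
    """Count vehicles crossing one virtual line.
    `line_pixels` is a list of (row, col) coordinates of the virtual line."""
    rows, cols = zip(*line_pixels)
    # Step 81: average the first bg_frames frames as the initial road background.
    background = np.mean([f[rows, cols].astype(np.float32)
                          for f in frames[:bg_frames]], axis=0)
    count, occupied = 0, False
    # Step 83: reduce the 2-D image data to a 1-D occupancy signal over time.
    for frame in frames[bg_frames:]:
        diff = np.abs(frame[rows, cols].astype(np.float32) - background)
        now_occupied = np.mean(diff > diff_threshold) > active_ratio
        if now_occupied and not occupied:
            count += 1  # rising edge: a new vehicle has entered the line
        occupied = now_occupied
    return count
```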
A similar method may be used in night detection situations. At night, the pixel luminance at the head lights of a vehicle is significantly higher than that of other parts, so in one embodiment vehicles may be detected, and thereby counted, by determining whether or not pixels whose luminance is above a predetermined threshold are present in a virtual line. For example, where the detector pixel values range from 0 to 255, most vehicle head lights drive the pixel values to 255. Some head lights are less bright, but they typically still exceed 200, whereas the gray level of the road surface is typically lower than 200. Therefore, the predetermined threshold may be set in the pixel value range of 200 to 240. Since the initial background need not be calculated at night, the system efficiency is further improved.
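For illustration, the night-time check could be as simple as the following sketch; the threshold value of 220 is an assumed example within the 200 to 240 range mentioned above.

```python
def vehicle_present_at_night(frame, line_pixels, luminance_threshold=220):
    """A vehicle is assumed present on the virtual line if any pixel exceeds
    the luminance threshold (head lights saturate the sensor near 255)."""
    rows, cols = zip(*line_pixels)
    return bool((frame[rows, cols] > luminance_threshold).any())
```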
The average vehicle speed calculation module 923 adopts the same method as the module 722 to detect vehicles, and uses a statistical method to estimate the average speed in each of the lanes, respectively. Assume that the vehicle lengths have a Gaussian distribution with average value l, the frame rate is f frames per second, and the average number of frames corresponding to one vehicle passing one virtual line is n; the average vehicle speed can then be calculated as v = l·f/n. Similarly, as described above, the vehicle detection at night may be performed by detecting whether or not the pixel luminance on the virtual lines exceeds a threshold.
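As a quick worked example of v = l·f/n (the numeric values here are illustrative assumptions, not taken from the source): with an average vehicle length l = 4.5 m, a frame rate f = 25 frames per second, and an average of n = 10 frames per virtual-line crossing, the estimated average speed is 4.5 × 25 / 10 = 11.25 m/s, i.e. about 40.5 km/h.

```python
def average_speed(mean_vehicle_length_m, frame_rate_fps, mean_frames_per_vehicle):
    """v = l * f / n, e.g. average_speed(4.5, 25, 10) -> 11.25 m/s."""
    return mean_vehicle_length_m * frame_rate_fps / mean_frames_per_vehicle
```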
It is apparent that the modules of the present invention may be combined in various ways. For example, the embodiment shown in
Likewise, the vehicle counting module 722 of
Further, the present invention may also use the images to determine the prevailing conditions, and select different image processing algorithms according to those conditions. By using this method, different algorithms may be designed for different weather conditions such as rain and fog, and for different times of day such as daytime and dusk.
Although the present invention has been described above in connection with various embodiments, the above descriptions are only illustrative rather than restrictive. It is to be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made according to design requirements and other factors as long as they fall within the scope of the appended claims or equivalents thereof.