The present invention relates to object detection on a regular route of a vehicle. More particularly, the present invention relates to performing object detection while going through a path of travel. The present invention can be applied to railways for obstacle detection on train tracks with a look-ahead sensor system such as a lidar and/or camera system.
Automated object detection is an important component to modern vehicle safety and navigation. As a vehicle travels down a path, objects may be detected and appropriate responses taken. The responses may vary on the type of object detected including the size or material, of whether the object is fixed, mobile, hard or soft. Given the various responses that may be required, accurate object detection is highly desirable.
To take advantage of the high quality and low cost digital imaging systems available today, some object detection is performed using 2D image processing. As might be expected, image processing based on 2D images from video feeds do not have distance information. Without distance information, objects are usually detected through visual recognition.
Visual recognition algorithms may use machine learning to detect and identify objects on irregular or uneven surfaces such as train tracks. The identification is performed by trained classifiers, which is the logic that is produced from the artificial intelligence (AI) training process. These trained classifiers are difficult to certify because the internal logic is a “black box” and therefore difficult to validate or characterize.
False positives are another problem with trained classifiers. It is difficult to avoid false positives because the variety of shapes encountered on uneven surfaces, like train tracks, is undefined and essentially unlimited. Additionally, the system performing the identification is usually part of a vehicle that is moving down the path of travel at a reasonably high speed so that the contents of the images are moving. Attempting to identify a large number of objects using images taken while moving leads to many false positives, which reduce the usefulness of AI identification systems for object detection on irregular surfaces.
Many vehicles travel regular and known paths over irregular terrain. A good example of this is a train, which typically travels over tracks placed on crushed stone embankments. Buses may also travel well known paths. Since accurate object identification is so important to automated vehicles, it would be beneficial to take advantage of the fact that certain vehicles travel over know paths when performing object identification in order to increase the overall accuracy.
A system and method for performing object detection in a vehicle travelling over a known path is described. In accordance with one embodiment of the invention. the object detection is performed by generating a reference map during a first pass over the path. During a second pass the vehicle is registered at a location within the reference map and a 3D point cloud generated during the second pass is compared with the reference map. Any differences detected during the comparison are used to perform object detection.
The known path may be defined by a “tunnel of travel” in some embodiments. The tunnel of travel is defined as the space having a known height and width over the ground of the known path of travel. In one embodiment of the invention the reference map is created during a first pass through the tunnel of travel by scanning a path as least as wide and as high as the tunnel of travel and recording the detected surfaces into the reference map including the ground level surfaces. The stored surface data preferably includes the locations along the tunnel of travel as well as the time at which the data was last updated.
During exemplary operation, a vehicle travelling through the tunnel of travel will perform its own 3D scan. The scan will typically be focused, or directed, at a location in the tunnel of travel that is ahead of the vehicle's current location in the tunnel of travel. Since the tunnel of travel may not follow a straight line, and the vehicle knows the path of the tunnel of travel, in many cases the vehicle will scan at a location that is not the same as the current direction of travel of the vehicle. For example, if the tunnel of travel curves to the right ahead of the current location of the vehicle then the scan will be performed forward and to the right of the current direction of travel. Alternatively, if the tunnel of travel curves to the left ahead of the current location of the vehicle then the scan will be performed to the left of the current direction of travel. Clearly, the scan may also be directly in front of the vehicle if the tunnel of travel continues in the same direction as the current direction of travel. Curves can also include hard angle turns in some embodiments of the invention. How far ahead along the tunnel of travel the scan is performed may vary with the size and speed of the vehicle, as well as the terrain and environment including level of urbanization, incline and presence of obstructions.
In general, the various steps and operations described herein are performed by processing unit 102 which receives instructions stored in memory 100. Processing unit 102 controls the various other blocks in response to the instructions from memory unit 100 as well as in response to data received from other blocks shown
Memory 100 is typically a combination of semiconductor-based static and dynamic RAM working in conjunction flash-RAM and hard disk drive storage systems, or some subset thereof. Processing unit 102 is preferably a semiconductor-based microprocessor unit. Input 108 may be a keyboard, selection knob, joystick, or touch screen. Display 106 is typically a flat screen display such as an LCD or OLED display. In some embodiments of the invention input 108 and display 106 may be integrated into a single touch-screen display. Interconnect 104 may be a parallel computer bus, USB, SATA, PATA, ethernet or some combination thereof.
Sensor 110 may be various types of scanning devices, including radio frequency, infrared, optical or ultraviolet sensing semiconductor device or component. Sensor 110 may also be a sound sensing device such as a microphone. In various embodiments of the invention the sensors may be time-of-flight 3D scanners or more generally depth-resolved imaging systems such as Lidar. In still other embodiments of the invention, sensor 110 may be a stereoscopic camera system, such as a stereoscopic 3D camera system comprising a 2D camera sensors providing 2D images that can be used to perform geometric transformations to derive 3D information from a set of 2D images. Sensor 110 may generate light, radio frequency signal or sound to enable the detection of the reflection of these signals from object surfaces, or it may use signals naturally present in the environment such as light from outside sources. The described embodiments herein typically use lidar for sensor 110, but sensor 110 should not be limited to lidar.
Alternative embodiments of the invention may place one or more of the components (or portions therefore) shown in
Still referring to
Train 200 also communicates via a wireless interface 216 and wireless link 224 with reference map 222. Reference map 222 may be located in a “cloud” based database. Those skilled in the art will recognize that reference map may also be located in train 200 and that alternative methods for interfacing with reference map 222 may also be used. For example, a wire-based interface may also be used if train 200 is an electrical train that is powered by an overhead electrical line over which communication signals may also be transmitted.
In one embodiment of the invention reference map 222 contains data for all the surface points previously detected within virtual tunnel 220. These surface points may include all the surface points for the irregular surface formed by tracks 202, ties 204 and rocks 206, as well as any surfaces above or to the left or right of the tunnel of travel. This would include buildings, trees or any other objects or structures.
In some embodiments of the invention reference map 222 will also contain free space information. This free space information includes a database or list of previously unoccupied 3D points, or voxels, that may be determined by calculating line-of-sight vectors to surface points. A system and method for performing 3D imaging use free space information is described in co-filed and co-pending U.S. patent application Ser. No. ______ entitled “Single Frame Motion Detection and Three-Dimensional Imaging Using Free Space Information” assigned to the assignee of the present invention and incorporated herein by reference in its entirety.
In an exemplary embodiment of the invention, train 200 begins its scheduled route over train tracks 202 by registering with reference map 220. The registration process generally consists of logging the current location of train 200 and downloading the next section of travel tunnel 220 along which the train will travel on its scheduled route.
As train 200 travels down track 202 along a scheduled route lidar 212 transmits laser pulse 214 to the areas within and around tunnel of travel 220. Laser pulse 214 scans over the area in front of train 200 and the laser reflections are used to determine the surrounding surface areas including the shape of train tracks 202, railroad ties 204 and crushed rock 206. Then train 200 compares the detected surfaces to the surface points for the associated section of travel tunnel 202 with reference database 220.
In accordance with one embodiment of the invention the result of the comparison may yield a set of surface points that do not match the corresponding set of surface points from reference map 220. If new surface points are detected, then object detection is performed using these new surface points. These new surface points may also be thought of as the difference set. This object detection may take into account the shape, size and connectivity (or lack thereof) of these new surface points. Based on the nature of the object detected, various responses can be implemented including issuing warnings, sounding alarms or attempting to stop or slow the train.
In the case of bag 230, the object detection system will note the irregular shape as well as the overall size. The shape and size of the object will be compared against a database and object identification performed. In some embodiments of the invention, object identification may be performed after or even concurrently with object detection. The object identification may incorporate artificial intelligence (AI) techniques. Additionally, in some embodiments of the invention the object detection system will note the level of reflectivity of the surface points. Additionally, when using a multi-spectrum lidar, the multi-spectrum reflection will be used to identify the material, which in this case may be plastic or paper. In still other embodiments of the invention multiple scans will be performed at the same location and any change in the shape of the object will be used to determine whether the object is flexible or rigid. For example, in the example case of
In the case of stone 232, the object detection system will note the size, shape and smooth surface. The shape and size of the object will be compared against a database and object identification performed. n some embodiments of the invention, object identification may be performed after or even concurrently with object detection. The object identification may incorporate artificial intelligence (AI) techniques. Additionally, in some embodiments of the invention the object detection system will note the level of reflectivity of the surface points. Additionally, when using a multi-spectrum lidar, the multi-spectrum reflection will be used to identify the material, which in this case would be natural rock. And in other embodiments of the invention, multiple scans will be performed at the same location and any difference in the shape of the object or obstacle will be used to determine whether stone 232 is rigid or flexible. In this case the object detection system should detect stone 232 is a solid and heavy object and then an appropriate response will be taken. For example, an automatic emergency response triggered, such as issuing a warning alarm or message, reducing the velocity or starting an emergency braking procedure to stop the train. If there is any obstacle blocking equipment or impact preparation materials available to the train, such items would be deployed.
During typical operation train 300 will use lidar 310 to transmit and receive laser pulses 312 into the area defined by tunnel of travel 304. For train 300 in the upper left portion of
For both instances of train 300 shown in
The location, or locations, at which train 300 scans using lidar 310 may also depend on the speed of travel of train 300. For example, as the speed increases the scan may be performed further along the path of travel defined by tunnel of travel 304. Similarly, as the speed of train 300 decreases the scan may be performed closer along the path of travel defined by tunnel of travel 304 in order to maintain the most current scan possible given the speed train 300.
In some embodiments of the invention train 300 may store the scan data generated while passing through tunnel of travel 304 and then use that data to update reference map 308 at some point later in time. This could happen during the time train 300 is travelling, or it could be performed at a later time with possible comparison and combination with data from other passes through the tunnel of travel. The comparison and combination could include averaging or other transformation or combining of data sets. Additionally, if this is the first pass through tunnel of travel 304 using the scanning equipment, then the scan data may be used to create reference map 304.
In other embodiments of the invention train 300 will use multi-special lidar to perform surface scans. A system and method for performing material identification in accordance with some embodiments of the invention is described in international patent application PCT/EP2019/056842 filed on Mar. 19, 2019 entitled “METHOD AND SYSTEMS FOR IDENTIFYING MATERIAL COMPOSITION OF OBJECTS” incorporated herein by reference in its entirety. Another system for using multi-spectral lidar is described in co-pending U.S. patent application Ser. No. 16/675,016 filed on Nov. 5, 2019 and entitled “Adaptive Active Safety System Using Multi-spectral LIDAR” assigned to the assignee of the present invention and incorporated herein by reference in its entirety. Another description of a multi-spectrum lidar system that may be incorporated into some embodiments of the invention is described in co-pending U.S. patent application Ser. No. 16/735,452 filed on Jan. 6, 2020 and entitled “Multi-spectral LIDAR Object Tracking” assigned to the assignee of the present invention and incorporated herein by reference in its entirety.
In this embodiment laser pulse 312 is formed from two or more laser pulses of different frequency. The resulting change in relative intensity of the reflected pulses is used to assist in the identification of the material from which they reflected. This material identification can be combined with the shape and size of the new surface points detected to enhance the accuracy of the object detection as well as possible object identification. Once object detection or object identification have been performed then a more accurate response can also be taken.
At step 402 the scan location within the tunnel of travel is determined. This determination will typically take into account the velocity of the train, the curvature of the tunnel of travel (which can correspond to the curvature of the associated train tracks) as well as any objects obstructing the line-of-sight access to that location such as a hill or structure. In some embodiments of the invention the closest available line-of-sight location within the tunnel of travel will be used as the scan location if there is an obstacle between the train and the preferred location. For example, on a curved section of the track the scan may have to performed on a closer segment of the tunnel of travel than would otherwise be used.
At step 404 a scan of the tunnel of travel is performed at the determined location. At step 406 a spectral analysis is performed on the reflected laser pulses received by the lidar. This is particularly useful for embodiments of the invention where multi-spectrum lidar is used. This multi-spectrum analysis may be used to identify the material from which the reflections originate.
At step 408 the 3D point cloud data from the scan is compared with the surface data from the reference map at the corresponding location (or position). If the comparison results in a set of surface points that were previously not in the reference map an object set is created using these new points. Connected 3D points may be grouped into objects and the size and shape of these objects determined. The size and shape information can be combined with the identification of any materials as described in the previous step.
Additionally, surface points that were part of the reference map, but which were not detected by the scan, may be grouped together into gaps, voids or holes. These voids may also be used as part of any response performed later in this process.
In one embodiment of the invention, the reference map includes free space locations and the 3D point cloud from the scan is compared with the free space location in the reference map. The object set may then be created from points where free space locations have changed to occupied locations.
In some embodiments of the invention a rescan of the tunnel of travel at the same location may be performed at step 410. This rescan will provide additional data that will increase the reliability of the surface information and may also aid in object detection. At step 412 the changes between the first and second scan within tunnel is determined. If changes are detected, this is also used in performing object detection. In particular, where significant changes are detected this may indicate a lack of rigidity of those objects and this can be taken into account when performing object detection or identification. For example, an object might be identified as a paper or plastic bag for which no response is required.
At step 414 object detection and identification is performed. Object detection is preferably performed using some or all of the information gathered or generated in the previous steps. At step 416 the response to the object detection will be determined based on the nature of the particular object detected. Possible responses include sounding alarms, changing the velocity of the train, activating lights, notifying the conductor or sending messages for response by other systems.
In some embodiments of the invention, the data from the scans performed during each pass by train may be used to update the reference map. This will assist in maintaining an accurate reference map and will allow small changes in the irregular surface to be taken into account such as rocks moving or erosion.
In one embodiment of the invention, the reference map is initially generated during a first pass of the lidar through the tunnel of travel. The surfaces detected by the scans performed during this first pass can be stored and then assembled into a reference map that can be used during later passes. The reference map can be supplemented with the scans performed during later passes as well including passes in which the existing reference map is used to perform object identification. The reference map may also be supplemented with free space information.
It should be understood that while the invention is described in the context of a train travelling over train tracks the invention may be incorporated into many other environments including, for example buses or elevators. where regular travel over a known path is performed. And, as mentioned elsewhere in this application, object detection can also include detection of voids or gaps, which present their own dangers in a moving system such as a train. Also, while the invention is described in the context of using a lidar based surface detection system, the use of other imaging systems is also consistent with some embodiments of the invention.
Thus, a system and method for performing object detection and identification has been described. While the invention has been set forth in the context of various embodiments, it should be understood that these embodiments do not limit the scope or breadth of the invention. Rather, the scope and breadth of the invention (or inventions) described herein is set forth more particularly in the claims provided herein or ultimately set forth in any final issued patent or patents.