A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
The invention relates generally to digital maps, geographical positioning systems, and vehicle navigation, and particularly to a system and method for map matching with sensor detected objects.
Within the past several years, navigation systems, electronic maps (also referred to herein as digital maps), and geographical positioning devices, have been increasingly employed to provide various navigation functions. Examples of such navigation functions include determining an overall position and orientation of a vehicle; finding destinations and addresses; calculating optimal routes; and providing real-time driving guidance, including access to business listings or yellow pages.
Generally, a navigation system portrays a network of streets, rivers, buildings, and other geographical and man-made features, as a series of line segments including, within the context of a driving navigation system, a centerline running approximately along the center of each street. A moving vehicle can then be located on the map close to, or with regard to, that centerline.
Some earlier navigation systems, such as those described in U.S. Pat. No. 4,796,191, have relied primarily on relative-position determination sensors, together with a “dead-reckoning” feature, to estimate the current location and heading of the vehicle. However, this technique is prone to accumulating small amounts of positional error. The error can be partially corrected with a “map matching” algorithm, wherein the map matching algorithm compares the dead-reckoned position calculated by the vehicle's computer with a digital map of streets, to find the most appropriate point on the street network of the map, if such a point can indeed be found. The system then updates the vehicle's dead-reckoned position to match the presumably more accurate “updated position” on the map.
Other forms of navigation systems have employed beacons (for example radio beacons, sometimes also referred to as electronic signposts) to provide position updates and to reduce positional error. For several reasons, including high installation costs, electronic signposts were often spaced at very low densities. This means that errors would often accumulate to unacceptable levels before another beacon or electronic signpost could be encountered and used for position confirmation. Thus, even with the use of beacons, techniques such as map matching were still required to eliminate or at least significantly reduce the accumulated error.
The map matching technique has also proven useful in providing meaningful “real-world” information to the driver about his/her current location, orientation, vicinity, destination, route; or information about destinations to be encountered along a particular trip. The form of map matching disclosed in U.S. Pat. No. 4,796,191 might be considered “inferential”, i.e. the disclosed algorithm seeks to match the dead-reckoned (or otherwise estimated) track of the vehicle with a road network encoded in the map. The vehicle has no direct measurements of the road network; instead, the navigation system merely estimates the position and heading of the vehicle and then seeks to compare those estimates to the position and heading of known road segments. Generally, such map matching techniques are multidimensional, and take into account numerous parameters, the most significant being the distance between the road and estimated position, and the heading difference between the road and estimated vehicle heading. The map can also include absolute coordinates attached to each road segment. A typical dead reckoning system might initiate the process by having the driver identify the location of the vehicle on the map. This enables the dead-reckoned position to be provided in terms of absolute coordinates. Subsequent dead-reckoned determinations (i.e. incremental distance and heading measurements) can then be used to compute a new absolute set of coordinates, and to compare the new or current dead reckoned position with road segments identified in the map as being located in the vicinity of the computed dead reckoned position. The process can then be repeated as the vehicle moves. An estimate of the positional error of the current dead reckoned position can be computed along with the position itself. This error estimate in turn defines a spatial area within which the vehicle is likely to be, within a certain probability. If the determined position of the vehicle is within a calculated distance threshold of the road segment, and the estimated heading is within a calculated heading difference threshold of the heading computed from the road segment information, then it can be inferred with some probability that the vehicle must be on that section of the road. This allows the navigation system to make any necessary corrections to eliminate any accumulated error.
With the introduction of reasonably-priced Geographical Positioning System (GPS) satellite receiver hardware, a GPS receiver can also be added to the navigation system to receive a satellite signal and to use that signal to directly compute the absolute position of the vehicle. However, even with the benefits of GPS, map matching is typically used to eliminate errors within the received GPS signal and within the map, and to more accurately show the driver where he/she is on that map. Although on a global or macro-scale satellite technology is extremely accurate; on a local or micro-scale small positional errors still do exist. This is primarily because the GPS receiver may experience an intermittent or poor signal reception or a signal distortion; and because both the centerline representation of the streets and the measured position from the GPS receiver may only be accurate to within several meters. Higher performing systems use a combination of dead-reckoning and GPS to reduce position determination errors, but even with this combination, errors can still occur to a degree of several meters or more.
In some instances, inertial sensors can be added to provide a benefit over moderate distances, but over larger distances even those systems that include inertial sensors will accumulate error.
However, while vehicle navigation devices have gradually improved over time, becoming more accurate, feature-rich, cheaper, and popular; they still fall behind the increasing demands of the automobile industry; and in particular, it is expected that future applications will require higher positional accuracy, and even more detailed, accurate, and feature-rich maps. This is the area that embodiments of the present invention are designed to address.
Embodiments of the present invention address the above-described problems by providing a direct sensor and object matching technique. The direct sensor and object matching technique can be used to disambiguate objects that the driver passes, and make it precisely clear which one of the objects the retrieved information is referring to. The technique also makes it possible for the navigation system to refine (i.e. improve the accuracy of) its position estimate, without user attention.
In accordance with an embodiment that uses scene matching, a system is provided which (a) extracts one or more scenes from the sensor-gathered or raw data; (b) builds a corresponding scene from a map-provided or stored version of the raw data; and (c) compares the two scenes to help provide a more accurate estimate of the vehicle position.
In accordance with an embodiment that uses vehicle-object position matching, a system is provided which (a) extracts raw object data from the sensor-gathered or raw data; (b) compares the extracted data with a corresponding raw object data kept in map from a map-provided or stored version of the raw data; and (c) compares the two measures of object data to help provide a more accurate estimate of the vehicle position.
In accordance with an embodiment uses object characterization, a system is provided which (a) extracts raw object data from the sensor-gathered or raw data; (b) extracts characteristics from those raw objects; and (c) compares those characteristics with the characteristics that are stored in the map to help provide a more accurate estimate of the vehicle position.
In some embodiments, a camera or sensor in the car can be used to produce, dynamically in real time, images of the vicinity of the vehicle. Using direct sensor/object matching techniques map and object information can then be retrieved from a map database, and superimposed on those images for viewing by the driver, including accurately defining the orientation or the platform so that the alignment of the map data and the image data is accurate. Once alignment is achieved, the image can be further enhanced with information retrieved from the database about any in-image objects. The system reduces the need for other, more costly solutions, such as the use of high accuracy systems to directly measure orientation. In some embodiments, once the navigation system is sensor-matched to objects in the vicinity, these objects may be displayed accurately on a map display as icons that help the driver as he/she navigates the roads. For example, an image (or icon representation) of a stop sign, lamppost, or mailbox can be placed on the driver's display in an accurate position and orientation to the driver's actual perspective or point of view. These cue-objects are used to cue the driver to his/her exact position and orientation. In some embodiments, the cue-objects may even be used as markers for the purpose of the system giving clear and practical directions to the driver (for example, “At the stop sign, turn right onto California Street; Your destination is then four meters past the mailbox”).
In some embodiments, once the navigation system is sensor-matched to objects in its vicinity, additional details can be displayed, such as signage information that is collected in the map database. Such information can be used to improve the drivers ability to read the signs and understand his/her environment, and are of particular use when the sign is still too far away for the driver to read, or when the sign is obstructed due to weather or other traffic.
In some embodiments, a position and guidance information can be projected onto a driver's front window or windscreen using a heads-up display (HUD). This allows the precise position and orientation information provided by the system to be used to keep the projected display accurately aligned with the roads to be traveled.
Described herein is a system and method for map matching with sensor detected objects. A direct sensor and object matching technique can be used to disambiguate objects that the driver passes. The technique also makes it possible for the navigation system to refine (i.e. improve the accuracy of) its position estimate.
For future navigation-related applications, it is anticipated that map matching to the center of a road may be insufficient, even when combined with GPS or inertial sensors. A typical roadway with two lanes of travel in each direction, and a lane of parked cars along each side, may be on the order of 20 meters across. The road center line is an idealized simplification of the road, essentially with a zero width. Inference based map matching is generally unable to help locate which particular lane of the road the vehicle is located in, or even where the vehicle is along the road within a high accuracy (better than say, 5 meters). Today's consumer-level GPS technology may have different sources of error, but it yields roughly the same results as non-GPS technology with respect to overall positional accuracy.
Some systems have been proposed that require much higher levels of absolute accuracies within both the information stored in the map database and the information captured and used for the real time position determination of the vehicle. For example, considering that each typical road lane is about 3 meters wide, if the digital map or map database is constructed to have an absolute accuracy level of less than a meter, and if both the lane information is encoded and the real time vehicle position system are also provided at an accuracy level of less than a meter, then the device or vehicle can determine which lane it currently occupies, within a reasonable certainty. Such an approach has led to the introduction of differential signals, and technologies such as WAAS. Unfortunately, it is extremely expensive and time consuming to produce a map with absolute accuracies of a meter, and that also have a very high, say 95%, reliability rate for the positions of all of the features in that map. It is also extremely expensive to produce a robust real time car-based position determination system that can gather information at similar levels of absolute accuracy, robustness, and confidence.
Other systems propose retrieval of object information on the basis of segment matching. However, such systems only retrieve objects from their memory on the basis of their relationship to a particular road or block segment. At that point the information from all objects associated with that segment can be retrieved and made available to the driver. However, it is still up to the driver to differentiate between the information from various objects.
Still other systems propose collecting object locations on the basis of probe data and using these object locations within a map to improve position estimates. However, such systems do not provide any practical solutions as to how to actually make such a system work in the real world.
As the popularity of navigation systems has gained momentum, and the underlying technology has improved in terms of greater performance and reduced cost, the investment in the underlying map database has enriched the available content (both onboard and off-board), and more demanding end-user applications have started to emerge. For example, companies and government agencies are researching ways to use navigation devices for improved highway safety and vehicle control functions (for example, to be used in automated driving, or collision avoidance). To implement many of these advanced concepts, an even higher level of system performance will be required.
In accordance with an embodiment, the inventors anticipate that the next generation of navigation capabilities in vehicles will comprise electronic and other sensors, for detecting and measuring objects in the vicinity of the vehicle. Examples of these sensors include cameras (including video and still-picture cameras), radars operating at a variety of wavelengths and with a wide assortment of design parameters, laser scanners, and a variety of other receivers and sensors for use with technologies such as nearby radio frequency identification (RFID) and close-by or wireless communications devices.
It will also be increasingly beneficial for applications to know more about the objects than the sensors can directly measure or otherwise sense. For example, the application may need to know what is written on a particular street sign, or where that street sign is relative to other objects nearby. To support this, there will be a need to store more information about such objects in the underlying database, and then to use that information in a more intelligent manner.
One approach is to store object information as part of an electronic map, digital map, or digital map database, or linked to such a database, since the objects will often need to be referred to by spatial coordinates or in relationship to other objects that are also stored in such map databases such as roads, and road attributes. Examples of the types of applications that might use such added object information to enhance a driver's experience are described in U.S. Pat. Nos. 6,047,234; 6,671,615; and 6,836,724.
However, many of the above-described techniques store the object data as a general attribute associated with a street segment. Shortcomings of this particular approach include: a lack of high accuracy placement of the objects in the map database; lack of high accuracy position information of the object's location, relative to other objects in the database; lack of any means of utilizing in-vehicle or on-board sensor data to actively locate such objects. These techniques can only imprecisely match an object passed by a vehicle to those objects in the map database that are in the vicinity or along the road segment that the position determination function of the vehicle has identified, and without the aid of object-detecting sensors. Traditional consumer navigation techniques lack any means to utilize sensor location measurements in addition to map data to accurately and uniquely match the sensed object to the corresponding object in a database.
In some systems, position determination is accomplished for the most part with GPS, possibly with help from dead reckoning and inertial navigation sensors and inference-based map matching. Since the absolute position of both the vehicle's position determination and the positions of objects as stored in the map are subject to significant error (in many instances over 10 m), and since the object density, say on a typical major road segment or intersection, might include 10 or more objects within relatively close proximity, current systems would have difficulty resolving which object is precisely of interest to the driver or to the application. Generally, systems have not been designed with a concept of which object might be visible to an on-board sensor, or how to match that detected object to a database of objects to obtain more precise location or orientation information, or to obtain more information about the object and the vicinity.
Co-pending U.S. patent application Ser. No. 12/034,521, titled “SYSTEM AND METHOD FOR VEHICLE NAVIGATION AND PILOTING INCLUDING ABSOLUTE AND RELATIVE COORDINATES”, herein incorporated by reference, describes a technique for storing objects in a map database that are attributed with both an absolute position and a relative position (relative to other nearby objects also represented in this map). The systems and methods described therein support the future use of in-vehicle sensors, and allow for storing attributes in the map database (or dynamically receiving localized object information on an as-needed basis) that will aid in the unique matching of a sensed object with a map object. U.S. patent application Ser. No. 12/034,521 identifies the need for a robust object matching algorithm, and describes techniques for matching sensor detected and measured objects against their representations in the map. Embodiments of the present invention further address the problem of defining enhanced methods for performing this direct sensed-object map matching.
The road network, vehicle, and objects may be considered in terms of a coordinate system 118, including placement, orientation and movement in the x 120, y 122, and z 124 directions or axes. In accordance with an embodiment, a map database in the vehicle is used to store these objects, in addition to the traditional road network and road attributes. An object such as a stop sign, roadside sign, lamppost, traffic light, bridge, building, or even the a lane marking or a road curb, is a physical object that can be easily seen and identified by eye. In accordance with embodiments of the present invention, some or all of these objects can also be sensed 128 by a sensor such as a radar, laser, scanning laser, camera, RFID receiver or the like, that is mounted on or in the vehicle. These devices can sense an object, and, in many cases, can measure the relative distance and direction of the object relative to the location and orientation of the vehicle. In accordance with some embodiment the sensor can extract other information about the object, such as its size or dimensions, density, color, reflectivity, or other characteristics.
In some implementations the system and/or sensors can be embedded with or connected to software and a micro-processor in the vehicle to allow the vehicle to identify an object in the sensor output in real-time, as the vehicle moves.
The navigation system further comprises a navigation logic 148. In accordance with an embodiment, the navigation logic includes a number of additional components, such as those shown in
In accordance with some embodiments, objects in the vicinity of a vehicle are actually processed, analyzed, and characterized for use by the system and/or the driver. In accordance with alternative embodiments, information about the object characteristics does not need to be extracted or completely “understood” from the sensor data; instead in these embodiments only the raw data that is returned from a sensor is used for the object or scene matching. Several different embodiments using one or more of these techniques are described below.
In accordance with an embodiment that uses scene matching, a system is provided which (a) extracts one or more scenes from the sensor-gathered or raw data; (b) builds a corresponding scene from a map-provided or stored version of the raw data; and (c) compares the two scenes to help provide a more accurate estimate of the vehicle position.
Advantages of this embodiment include that the implementation is relatively easy to implement, and is objective in nature. Adding more object categories to the map database does not influence or change the underlying scene matching process. This allows a map customer to immediately benefit when new map content is made available. They do not have to change the behavior of their application platform. Generally, this embodiment may also require greater storage capacity and processing power to implement.
In accordance with an embodiment, and as further described in co-pending U.S. patent application Ser. No. 12/034,521, the vehicle's onboard navigation system may have, at some initial time, only an absolute measurement of position. Alternatively, after a period of time of applying the techniques described in U.S. patent application Ser. No. 12/034,521, the vehicle may have matched to several or to many objects, which have served to also improve the vehicles position and orientation estimate and define the vehicles position and orientation in the appropriate relative coordinate space, as well as possibly improve its estimate on an absolute coordinate basis. In this case the vehicle may have a more accurate position and orientation estimate at least in local relative coordinates. In either case an estimate of positional location accuracy, referred to herein as a contour of equal probability (CEP) can be derived.
In either case the navigation system can place its current estimated location on the map (using either absolute or relative coordinates). In the case of an unrefined absolute location the CEP may be moderately large (perhaps 10 meters). In the case of a relative location or an enhanced absolute location, the CEP will be proportionately smaller (perhaps 1 meter). The navigation system can also estimate a current heading, and hence define the position and heading of the scene that is built up by the sensor.
In accordance with some embodiments, the scene viewed by the navigation system can then be generated as a three dimensional return matrix of a radar, or as a two dimensional projection of radar data, referred to in some embodiments herein as a Vehicle Spatial Object Data (VSOD). In accordance with other embodiments, the scene can comprise an image taken from a camera, or a reflection matrix built by a laser scanner. The scene can also be a combination of a radar or laser scan matrix, colorized by an image collected with a visible-light camera.
In some embodiments, the scene being interpreted can be limited to a Region of Interest (ROI) that is defined as the region or limits of where matching objects are likely to be found. For example, using a laser scanner as a sensor, the scene can be limited to certain distances from the on board sensor, or to certain angles representing certain heights. In other embodiments, the ROI can be limited to distances between, say, 1 and 10 meters from the scanner, and angles between, say, −30 degrees and plus 30 degrees with respect to the horizontal that correspond respectively to ground level and to a height of 5 meters at the close-in boundary of the ROI. This ROI boundary might be defined and tuned to capture, for example, all of the objects along a sidewalk or along the side of the road. As the vehicle moves, the ROI allows the navigation system to focus on regions of most interest, which reduces the complexity of the scene it must analyze, and similarly reduces the computation needs to match that scene.
As further shown in
In accordance with an embodiment, each of a plurality of objects can also be stored in the map database 142 as raw sensor data (or a compressed version thereof). Information for an object 184 in the scene can be retrieved from the map database by the navigation system. The example shown in
In accordance with an embodiment, the map database contains objects defined in a 2-D and/or 3-D space. Objects, such as road signs, can be attributed to describe for example the type of sign and its 3-D coordinates in absolute and/or relative coordinates. The map data can also contain characteristics such as the color of the sign, type of sign pole, wording on sign, or its orientation. In addition, the map data for that object can also comprise a collection of raw sensor outputs from, e.g. a laser scanner, and/or a radar. An object data can also comprise a 2-D representation, such as an image, of the object. The precise location of individual objects as seen in the scene can also be contained as attributes in the map database as to their location within the scene. These attributes are collected and processed during the original mapping/data collection operation, and may be based on manual or automatic object recognition techniques. Some additional techniques that can be used during this step are disclosed in copending PCT Patent Applications No. PCT—6011206 and PCT—6011865, each of which applications are herein incorporated by reference.
If the system knows the type of sensor(s) in the vehicle, the location of the sensor on the vehicle (for example its height above ground, and its orientation with respect to center front and level of the vehicle), and the location and orientation estimates of the vehicle, then it can compute a scene of the objects contained in the map that serves to replicate the scene captured by the sensor in the vehicle. The scenes (including the objects) from the two sources can be placed in the same coordinate reference system for comparison or matching purposes. For example, in those embodiments that utilize VSOD, the data captured by the sensor of the vehicle can be placed in the coordinates of the map data, using the vehicle's estimate of location and orientation, in addition to the known relationship of the sensor position/orientation with respect to the vehicle. This is the vehicle scene. Simultaneously, Map Spatial Object Data (MSOD) can be constructed from the objects in the map and the position and orientation estimates from the vehicle. This is the map scene. The two data sources produce scenes that position both objects as best as they can, based on the information contained by (a) the map database, and (a) the vehicle and its sensors. If there are no additional errors, then these two scenes should match perfectly if they were superimposed.
Depending on which sensor(s) the vehicle employs, the scene can be produced as a matrix of radar returns, or laser reflections or color pixels. In accordance with an embodiment, features are included to make the data received from the two sources be as comparable as possible. Scaling or transformation can be included to perform this. In accordance with an embodiment, the navigation system can mathematically correlate the raw data in the two scenes. For example, if the scene is constructed as a 2D “image” (and here the term image is used loosely to also include such raw data as radar clusters and radio frequency signals), then the two scene versions (vehicle and map) can be correlated in two dimensions. If the scene is constructed as a 3D “image” then the two scene versions can be correlated in three dimensions. Considering again the example shown in
Typically, during implementation of the system, a design engineer will select the best range and increments to use in the correlation function. For example, the range of correlation in the z or vertical direction should have a range that encompasses the distance of the CEP in that dimension which should generally be small, since it is not likely that the estimated value of the vehicle above ground will change appreciably. The range of correlation in the y dimension (parallel to the road/vehicle heading) should have a range that encompasses the distance of the y component of the CEP. Similarly, the range of correlation in the x dimension (orthogonal to the direction of the direction of the road) should have a range that encompasses the distance of the x component of the CEP. Suitable exact ranges can be determined for different implementations. The increment distance used for correlation is generally related to (a) the resolution of the sensor and (b) the resolution of the data maintained in the map database.
In accordance with an embodiment, the scene can be a simple depiction of raw sensor resolution points, for example a binary data set placing a value of 1 in every resolution cell with a sensor return and a value of 0 everywhere else. In this instance, the correlation becomes a simple binary correlation: for example, for any lag in the 3D space, counting the number of cells that are 1 in both scenes and normalized by the average number of ones in both scenes. A search is made to find the peak of the correlation function, and the peak is tested against a threshold to determine if the two scenes are sufficiently similar to consider them a match. The x, y, z lags at the maximum of the correlation function then represent the difference between the two position estimates in coordinate space. In accordance with an embodiment the difference can be represented as an output of correlation by a vector in 2D, 3D, and 6 degrees of freedom respectively. This difference can be used by the navigation system to determine the error of the vehicle position, and to correct it as necessary.
It should be noted that a mismatch between map and sensor may be a result of an orientation error rather than a position error. While this is not expected to be a significant source of error, in accordance with some embodiments map scenes can be produced to bracket possible orientation errors. Similarly the system can be designed to adjust for scale errors which may have resulted from errors in determining the position.
As described above, an example of the scene correlation uses 0's and 1's to signify the presence or absence of sensor returns at specific x, y, z locations. Embodiments of the present invention can be further extended to use other values such as the return strength value from the sensor, or a color value, perhaps as developed by colorizing scanning laser data with color image data collected with a mounted camera on the vehicle and location-referenced to the vehicle and hence the scanner. Other manner of tests could be applied outside the correlation function to further test the reliability of any correlation, for example size, average radar crossection, reflectivity, average color, and detected attributes.
In accordance with an embodiment, the image received from the sensor can be processed, and local optimization or minimization techniques can be applied. An example of a local minimum search technique is described in Huttenlocher: Hausdorff-Based Image Comparison (http://www.cs.cornell.edu/vision/hausdorff/hausmatch.html), which is herein incorporated by reference. In this approach, the raw sensor points are processed by an edge detection means to produce lines or polygons, or, for a 3D set of data, a surface detection means can be used to detect an objects face. Such detection can be provided within the device itself (e.g. by using the laser scanner and/or radar output surface geometry data which define points on a surface). The same process can be applied to both the sensed data and the map data. In accordance with some embodiments, to reduce computation time the map data may be already stored in this manner. The Hausdorff distance is computed, and a local minimum search performed. The result is then compared with thresholds or correlated, to determine if a sufficiently high level of match has been obtained. This process is computationally efficient and exhibits a good degree of robustness with respect to errors in scale and orientation. The process can also tolerate a certain amount of scene error.
In accordance with an embodiment that uses vehicle-object position matching, a system is provided which (a) extracts raw object data from the sensor-gathered or raw data; (b) compares the extracted data with a corresponding raw object data kept in map from a map-provided or stored version of the raw data; and (c) compares the two measures of object data to help provide a more accurate estimate of the vehicle position.
Advantages of this embodiment include that the implementation is objective, and can also easily incorporate other object comparison techniques. This embodiment may also require lower processing power than the scene matching described above. However, the extraction is dependent on the categories that are stored in the map. If new categories are introduced, then the map customer must update their application platform accordingly. Generally, the map customer and map provider should agree beforehand on the stored categories that will be used. This embodiment may also require greater storage capacity.
As illustrated by the example shown in
In accordance with some embodiments, the Hausdorff technique can be used to determine which fraction of object points lie within a threshold distance of database points and tested against a threshold. Such embodiments can also be used to compute coordinate shifts in x and z and scale factors that relate to a shift (error) in the y direction.
It will be noted that the Hausdorff distance technique is only one of the many algorithms known to those familiar with the art of image and object matching. In accordance with other embodiments, different algorithms can be suitably applied to the matching problem at hand.
The above example described a simple case, wherein only a single object was present or considered in both the map and as sensed by the vehicle's sensor. In the real world, the density of objects may be such that multiple objects are present in relatively close proximity (say, 1 to 3 meters apart). In these situations, optimization and minimization techniques such as the Hausdorff technique are of particular use. In such situations, the detailed correlation function and/or the Hausdorff distance computation will have sufficient sensitivity to match all features of the objects (as received by the sensor). It is therefore unlikely that the set of objects would be matched incorrectly. For example, even though the spacing of multiple objects are about the same, the detailed correlation would clearly discern the peak of the correlation and not erroneously correlate, for example a mailbox with a lamppost, or a lamppost with a stop sign.
The approach described above is subject to certain errors. Generally, any error in position or orientation will be more complex than simply a shift in the x, y, z coordinates between the vehicle and map version of the scenes. Orientation errors can introduce perspective differences and location errors might produce scaling (size) errors, both of which would result in a lowering of the overall peak in the correlation function. For the case where the vehicle has a good (small) CEP and reasonable estimate of orientation, which will generally be the case as the vehicle makes one or more previous object matches, these errors should not significantly effect the matching performance. Furthermore, in accordance with some embodiments a set of scenes can be constructed to bracket these errors, and the correlation performed on each or the matching algorithm selected may be reasonably tolerant of such mismatches. Depending on the needs of any particular implementation, the design engineer can determine, based on various performance measures, the trade-off between added computation cost versus better correlation/matching performance. In any of the above descriptions, if the result of the correlation/matching does not exceed a minimum threshold then the map matching fails for this sensor scene. This can happen because, the position/orientation has too large an error and/or because the CEP is computed incorrectly too small. It can also happen if too many temporary objects are visible in the Vehicle Scene that were not present during the map acquisition. Such items as people walking, parked cars, construction equipment can dynamically alter the scene. Also, the number and distribution of objects collected versus the number and distribution of objects that make up the true scene and are detected by the sensor will effect correlation performance. Collecting too many objects is unnecessary, and will increase expense and processor load. In contrast, collecting too few of the objects present will leave the system with too much correlation noise to allow it to make reliable matches. The density and type of objects to be stored in the map is an engineering parameter which is dependant on sensor and performance levels desired. The matching function should take into account the fact that not all vehicle sensed objects may be in the map.
In accordance with an embodiment, one of the approaches that is used to ensure that the map stores an adequate number of objects, yet does not become too large or unwieldy a data set, is to run a self correlation simulation of the reality of objects captured, while populating the map with a sufficient subset of those objects that have been collected to achieve adequate correlations for the applications of interest. Such simulations can be made for each possible vehicle position & objects and/or noise simulation.
If the correlation/image process threshold is exceeded, then a maximum can be computed from the various correlations/image processes performed over the various map scenes constructed. With the correlation/image process, the known objects of the map are matched to specific scene objects in the Vehicle Scene. If the vehicle sensor is one that can measure relative position with its sensor, such as a radar or laser scanner, then a full six degrees of freedom for the vehicle can be determined to the accuracy (relative and absolute) of the objects in the database and the errors associated with the sensor. By testing individual object raw data clusters or extracted object polygons, matched to individual sensor cluster returns or extracted object polygons in the Vehicle Scene, the system can make many validity checks to verify that the scene correlation process has resulted in an accurate match. The results thus enable the higher accuracies that are needed by future applications. In accordance with another embodiment, the scene matching and estimation of the six degrees of freedom enable the road map to be superimposed with high accuracy over real time images (such as the real time images described in PCT Patent Application 6132522), or to adjust the depiction in a HUD display of a path intended to align with upcoming roads. In the case of these embodiments, the outcome will be particularly sensitive to the orientation components, which are generally not available using inference-based forms of map matching.
In accordance with some embodiments the object matching may be performed in a series of stages. Linear objects such as lane markings or curbs can be detected and compared to similar objects in the database. Such linear features have the characteristic of being able to help locate the vehicle in one direction (namely orthogonal to the lane marking i.e. orthogonal to the direction of travel). Such an object match may serve to accurately determine the vehicles location with respect to the y direction shown in
In accordance with an embodiment uses object characterization, a system is provided which (a) extracts raw object data from the sensor-gathered or raw data; (b) extracts characteristics from those raw objects; and (c) compares those characteristics with the characteristics that are stored in the map to help provide a more accurate estimate of the vehicle position.
Advantages of this embodiment include that the embodiment requires less processing power and storage demands. The introduction of new characteristics over time will require the map provider to redeliver their map data more frequently. Successful extraction depends on the categories stored in map. If new categories are introduced then the map customer would also have to change the nature of their application platform. Generally, the map customer and map provider should agree beforehand on the stored categories that will be used.
As with the embodiments described above, the vehicle may have at some initial time only an inaccurate absolute measurement of position. Or after a time of applying the co-pending invention or other forms of sensor improved position determination, it may have matched to several if not many objects or scenes of objects which have served to also define the vehicle's position/orientation in the appropriate relative coordinate space. This may have possibly also improved the vehicle's absolute coordinate estimate. In this case the result of the match may be a more accurate position and orientation estimate at least in relative coordinates and possibly absolute coordinates.
In either case the navigation system can place its current estimated location in the coordinate space of the map (using either absolute or relative coordinates) and an estimate of positional location accuracy can be derived and embodied in its CEP. In the case of an unrefined absolute location the CEP may be moderately large (say 10 meters) and in the case of the relative location the CEP will be proportionately smaller (say 1 meter). In either case the CEP can be computed with respect to the map coordinates, and a point-in-polygon or simple distance algorithm employed to determine which map objects are within that CEP and hence are potential matches to the sensor-detected object or objects. This may be performed in 2D or 3D space.
For example, if the vehicle is approaching a moderately busy intersection, and the sensor detects an object at a range and bearing that, when combined with the position estimate, puts the CEP of the detected object at the sidewalk corner, then if there is only one object within the CEP the matching may be already accomplished. For verification purposes, an object characterization match may be performed.
In accordance with various embodiments, each sensor may have unique object characterization capabilities. For example, a laser scanner might be able to measure the shape of the object to a certain resolution, its size, how flat it is, and its reflectivity. A camera might capture information related to shape, size and color. A camera might only provide a relatively inaccurate estimate of distance to the object, but by seeing the same object from multiple angles or by having multiple cameras, it might also capture sufficient information to compute accurate distance estimates to the object. A radar might possibly measure density, or at least provide a radar size or cross section, and depending on its resolution, might be able to identify shape.
In accordance with an embodiment, objects can also be fitted with radar reflection enhancers, including “corner reflectors” or the like. These small, inexpensive, devices can be mounted on an object so as to increase its detectability, or the range at which it can be detected. These devices can also serve to precisely locate a spatially extended object by creating a strong point-like object within the sensed object's larger signature. So, depending on the sensor there may be several characterizing features of the object which can be used to verify the object match.
One of skill in the art can construct additional ways to use the above mentioned characteristics to match the sensor data to the map data. In accordance with a particular embodiment, laser scanner information (distance and theta—the vertical angle with respect to the platform horizon) is measured by transmitting coherent light from a rotating laser, and receiving that light back from the first object it encounters, can be used to match to an object in the database according to the following algorithm:
Normalized Weighted Score=Sum of (Wi*Ci)/Sum of (Wi)< >Threshold
It will be recognized to one of skilled in the art that there are many such ways to utilize such characterization information to affect a match algorithm.
The above-described algorithm will provide exacting tests that should make matching-errors rare. In accordance with an embodiment, objects can be stored in the map database at a density such that many match tests could be rejected and the match frequency will still be sufficient to keep an accurate location and orientation in relative coordinate space.
In those cases in which more than one object is sensed and more than one object is in the CEP, then a more complex version of the above algorithm may be used. Each sensed object can be compared as discussed. In addition, pairs of sensed objects represent a measured relationship between them (e.g. a pair may be 2 m apart at a relative bearing difference of 4 deg). This added relationship can be used as a compared characteristic in the weighting algorithm described above to disambiguate the situation. Once an object or set of objects are matched their characteristics and attribution can be passed back to the requesting function.
In those cases in which more than one object is sensed but the objects are not resolved, then the sensed but unresolved objects may be considered as a single complex object. The collected objects in the map database can also be characterized as objects likely resolved or not resolved per different sensor or different sensors with different parameters.
Generally, sensors considered to support in-vehicle applications should have a resolution such that many sensor resolution cells will comprise the response from an object. In the embodiments described above specific characteristics of the object are extracted from this multitude of resolution cells. For example the position of the object is defined by an average or centoid measurement of the extended object or its location where it meets the ground in those cases that it does.
In accordance with another embodiment, the system can use an ID tag 270, such as an RFID tag. Such devices transmit an identification code that can be easily detected by a suitable receiver and decoded to yield its identifier or ID. The ID can be looked-up in, or compared with, a table of ID's 272 either within the map database or associated with the map database or other spatial representation. The ID can be associated with a specific object or with a type or class of object 274 (for example, a stop sign, mailbox, or street corner). Generally, the spacing of signs such as stop signs, and the accuracy of the vehicle's position estimation, are sufficient to avoid uncertainty or ambiguity as to which sensed object is associated with which RFID tag. In this way, the object identifier 276 or matching algorithm can include a rapid and certain means to unambiguously match the sensed object with the map appropriate map object.
In accordance with another embodiment the system can use a combination of RFID technology with, say, a reflector. If the RFID is collocated with the reflector then this can serve as a positive identification characteristic. Furthermore, the RFID can be controlled to broadcast a unique identification code or additional flag, only when the reflector (or other sensor) is illuminated by an in-vehicle sensor, say a scanning laser. This allows the device to act as a transponder and creates a highly precise time correlation between the reception of the signal and the reception of the RFID tag. This positive ID match improves (and may even render unnecessary) several of the above-described spatial matching techniques, since a positive ID match improves both the reliability and positional accuracy of any such match. This technique is particularly useful in situations of dense objects, or a dense field of RFID tags.
In accordance with another embodiment, bar codes, sema codes (a form of two-dimensional bar code), or similar codes and identification devices can be placed on objects at sufficient size to be read by optical and other sensing devices. Sensor returns, such as camera or video images, can be processed to detect and read such codes and compare them to stored map data. Precise and robust matches can also be performed in this way.
It will be evident that the scenes shown in the figures above represent just a few of many possible scenes that could be created. The x-z correlation is designed to find the best match in those two dimensions. However, if any of the other coordinates of the navigation system's Position and Orientation estimates are in error, then the scenes will not correlate as well as possible. In accordance with various embodiments, additional features and data can be used to reduce this error, and improve correlation.
For example, consider the vehicle's heading. The car will nominally be heading parallel to the road but may be changing lanes, and so the heading is not exactly that of the road. The vehicle's navigation system estimates heading based on the road and its internal sensors like GPS and INS sensors. But still there can be an error of several degrees in the true instantaneous heading of the vehicle versus the estimated heading of the vehicle. Because the sensor is fixed-mounted to the vehicle there should be very little error introduced when rotating from that of the vehicle's heading to that of the sensor's heading (pointing direction). Still, there is a combined estimate of heading error. The computation of the scene from the map data is sensitive to heading error under certain configurations of objects. For the current embodiment other scenes can be computed from the map objects at different headings bracketing the Estimated Heading. These different heading scenes can each be correlated with the Vehicle Scene, as done above, to find a maximum correlation. Again the choice or range of heading scenes and increment of heading scene (e.g. one scene for every degree of heading) is best left to the design engineer of the system to be implemented.
Consider the vehicle's pitch. For the most part the vehicle's pitch will be parallel to the surface of the road—that is to say it will be on the same slope that the road is on. The map database of objects can store the objects relative to the pitch of the road or can store pitch (slope) directly. There may be deviations in pitch, from the slope of the vehicle. For example, accelerations and decelerations can change the pitch of the car, as can bumps and potholes. Again, all these pitch changes can be measured but it should be assumed that the pitch error can be a few degrees. The computation of the scene from the map data is sensitive to pitch error under certain configurations of objects. For the current embodiment other scenes can be computed from the map objects at different pitches bracketing the Estimated Pitch. These different pitch scenes can each be correlated with the Vehicle Scene to find a maximum correlation. Again the choice or range of pitch scenes and increment of pitch scene (e.g. one scene for every degree of pitch) is best left to the design engineer of the system to be implemented. The maximum correlation will offer feedback to correct the vehicle's estimate of pitch.
Consider the vehicle's roll. For the most part the vehicle's roll will be parallel to the surface of the road—that is to say the vehicle is not tilting towards the driver side or towards the passenger side but is riding straight and level. However, on some roads there is a pronounced crown. Thus the road is not flat and level and a car will experience a roll of several degrees from horizontal if it is driving off the top of the crown, say on one of the outer lanes. The map may contain roll information about the road as an attribute. In addition, there may be deviations in the actual roll of the vehicle, as can be caused by bumps and potholes and the like. Again, all these roll changes can be measured but it should be assumed that the roll can be in error by a few degrees. The computation of the scene from the map data is sensitive to roll error under certain configurations of objects. For the current embodiment other scenes can be computed from the map objects at different rolls bracketing the Estimated Roll. These different roll scenes can each be correlated with the Vehicle Scene to find a maximum correlation. Again the choice or range of roll scenes and increment of roll scene (e.g. one scene for every degree of roll) is best left to the design engineer of the system to be implemented. The maximum correlation can offer feedback to correct the vehicle's estimate of roll.
Consider the vehicle's y position, that is to say the vehicle's position orthogonal to the direction of travel. This is mostly a measure of what lane the vehicle is in or the measure of displacement of the vehicle from the centerline of the road. It is also the basic measurement to determine what lane the vehicle is in. Traditional inferential map matching had no method to make this estimate. If the vehicle was judged to be matched to the road, it was placed on the road's centerline, or some computed distance from it, and no finer estimation could be made. This is totally inadequate for applications that require knowledge of what lane the car is in.
The vehicle's y position will vary depending upon which lane the vehicle is in. The vehicle's position determination will estimate the absolute position but may have significant error in this sensitive dimension. It should be assumed that the error in the y-dimension is estimated by the CEP and can amount to several meters. An error in y position results generally in a scale change of the scene. So for example, if the y position is closer to the sidewalk, objects on the sidewalk should appear bigger and further apart and conversely, if the y position is closer to the center line of the road, objects on the sidewalk should appear smaller and closer together. As described, the computation of the scene from the map data is sensitive to the y position of the vehicle if the scene is generated in relative coordinates as for example in the current embodiment. (If the scene is generated in absolute coordinates than sizes should be scale independent.) For the current embodiment other scenes can be computed from the map objects at different y's bracketing the estimated y position. Again, the choice of range of y-position scenes and increment of y-position scene (e.g. one scene for every meter of y-position) is best left to the design engineer of the system to be implemented. The maximum correlation can offer feedback to correct the vehicle's estimate of its y position, which in turn can improve the estimate of which lane it is in.
As mentioned above, these different scenes can each be correlated with the Vehicle Scene to find a maximum correlation. One way to simplify this process is to, from the sensor measurements, compute a measurement of the average building distance. If this is roughly constant for the scene, and buildings are captured in the map database, then a good estimate of the y position can be derived from that measurement.
A given object may be characterized by a point cluster or set of sensed point cells CI(x,y,z). These raw point cells may be stored in the map database for each sensor measured. For example, each laser scanner point that reflects from the object is characterized by a dl and a thetal. With the vehicle location and platform parameters, these can be translated into a set of points in relative coordinates (x,y,z) or in absolute coordinates (latitude, longitude, height) or other such convenient coordinate system. Other data may be stored for each xyz cell, such as color or intensity, depending upon the sensor involved. The database may store, for the same object, different cluster information for different sensors.
When the vehicle passes the object and the vehicles sensor(s) scans the object it too will get a set of points with the same parameters (perhaps at different resolutions).
Again a centroid calculation is made and the location of the CEP is found within the map. Again all objects are retrieved that fall within the CEP but in this case additional information is retrieved such as the raw sensor data (raw point cluster), at least for the sensors known to be active on the vehicle at that time.
The two sets of raw cluster data are normalized to a common resolution size (common in the art). Using the three dimensional cluster points from the sensed object and each retrieved object, a correlation function is applied. The start correlation point is where the centroid of the raw sensor is matched to the centroid of a candidate object. The correlation result can be weighted and factored into the algorithm as another characteristic.
The present invention may be conveniently implemented using a conventional general purpose or a specialized digital computer or microprocessor programmed according to the teachings of the present disclosure, as will be apparent to those skilled in the computer art. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art. The selection and programming of suitable sensors for use with the navigation system can also readily be prepared by those skilled in the art. The invention may also be implemented by the preparation of application specific integrated circuits, sensors, and electronics, or by interconnecting an appropriate network of conventional component circuits, as will be readily apparent to those skilled in the art.
In some embodiments, the present invention includes a computer program product which is a storage medium (media) having instructions stored thereon/in which can be used to program a computer to perform any of the processes of the present invention. The storage medium can include, but is not limited to, any type of disk including floppy disks, optical discs, DVD, CD ROMs, microdrive, and magneto optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data. Stored on any one of the computer readable medium (media), the present invention includes software for controlling both the hardware of the general purpose/specialized computer or microprocessor, and for enabling the computer or microprocessor to interact with a human user or other mechanism utilizing the results of the present invention. Such software may include, but is not limited to, device drivers, operating systems, and user applications. Ultimately, such computer readable media further includes software for performing the present invention, as described above. Included in the programming (software) of the general/specialized computer or microprocessor are software modules for.
The foregoing description of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations will be apparent to the practitioner skilled in the art. Particularly, while the invention has been primarily described in the context of position determination enhancement, this is just one of many applications of this combined map matching. For example, the location of a road intersection and its cross walks can be accurately determined as a distance from identified signs, so more accurate turn indications can be given or cross walk warnings given. For another example, the location of the vehicle lateral to the road (with respect to lanes) can be accurately determined to give guidance on which lane to be in, perhaps for an upcoming maneuver or because of traffic etc. By way of additional examples, the matching can be used to accurately register map features on a real-time image collected in the vehicle. In still another example, embodiments of the present invention can be used to provide icon or other visual/audible enhancements to enable the driver to know the exact location of signs and their contexts. It will also be evident that, while many of the embodiments describe the use of relative coordinates, embodiments of the system can also be used in environments that utilize absolute coordinates. The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications that are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalence.
This application claims the benefit of priority to U.S. Provisional Patent Application No. 61/026,063, titled “SYSTEM AND METHOD FOR MAP MATCHING WITH SENSOR DETECTED OBJECTS”; filed Feb. 4, 2008, and incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
61026063 | Feb 2008 | US |