Navigation systems provide information helpful for driving a vehicle using a satellite. A navigation system includes a GPS (global positioning system) module that receives a GPS signal from a GPS satellite and calculates a location of a vehicle based on the GPS signal. GPS receivers are integrated into navigation devices, vehicle telematics systems, and smart phones. Due to their inherent measurement error, map matching is used to pinpoint the correct location on the road network. Many GPS-related applications require map matched data, e.g., traffic monitoring, event detection and road quality assessment.
Map matching methods use inputs generated from positioning technologies (such as GPS or GPS integrated with dead reckoning) and supplement this with data from a road network map to provide an enhanced positioning output. The process of map matching takes a sequence of possibly noisy GPS coordinates from a vehicle trace and estimates the actual road positions, identifying the correct road segment on which the vehicle is travelling and to determine the vehicle location on that segment. This is an important first step used by many GPS applications.
Various GPS systems have different sampling rates. The difficulty of map matching can greatly differ depending on GPS accuracy and the sampling rate. For example, a navigation system may receive GPS information from the GPS receiver at a high sampling rate of once per 1 second or faster, or a slow sampling rate of once per 10 seconds or slower (e.g., 60 seconds), and screen update due to vehicle movement is performed only once per second despite operating at high speeds.
Examples are described in the following detailed description and in reference to the drawings, in which:
Various aspects of the present disclosure are generally directed to map matching. More specifically, various aspects of the present disclosure are directed to map matching that combines geometric methods with minimum weight methods. As described in greater detail below, this approach integrates map matching based on a Fréchet distance measure with global weight optimization. This approach eliminates the need for extensive tuning of the parameters and allows robust performance across datasets of different characteristics.
Aspects of the present disclosure described herein determine the minimum Fréchet distance for an input GPS trace, and choose the minimum weight path among all minimum distance paths. Among other things, this approach may perform consistently against varying sampling rates and accordingly prevent the difficulty of map matching that greatly differs depending on GPS accuracy and the sampling rate.
In one example in accordance with the present disclosure, a map matching method is provided. The method comprises receiving a plurality of global positioning system (GPS) data points in a dataset, receiving road map data related to a plurality of roads, determining a plurality of paths of minimum Fréchet distance for the GPS dataset, assigning a weight to each path of minimum Fréchet distance by applying a weight function, and outputting the path with the minimum weight.
In another example in accordance with the present disclosure, a map matching system is provided. The system comprises a processor, a memory coupled to the processor, and a map matching module stored in the memory and executed on the processor to receive a plurality of global positioning system (GPS) data points in a dataset, determine candidate road locations for each GPS data point, each candidate road location being a projection to a road within a radius, calculate shortest paths from each candidate road location associated with a first GPS data point to the candidate road locations associated with a second GPS data point, select a path with minimum weight according to a weight function, the weight function being based on the calculated shortest paths, and output the path with the minimum weight.
In a further example in accordance with the present disclosure, a non-transitory computer readable medium is provided. The non-transitory computer-readable medium comprises instructions which, when executed, nausea device to (i) receive a plurality of global positioning system (GPS) data points in a dataset, (ii) determine a plurality of paths of minimum Fréchet distance for the GPS dataset, (iii) assign a weight to each path of minimum Fréchet distance by applying a weight function, and (iv) output the path with the best weight.
The GPS 110 may collect location data (e.g., GPS trajectories) over time as the computing device 120 moves from one location to another. The GPS 110 may receive GPS information from a GPS satellite. In one implementation, a sequence of GPS datasets (e.g., samples) consisting of a set of GPS data points may be depicted as z0, z1, . . . , zn. The GPS data points may comprise a sequence of positions with latitude, longitude, instant speed, direction and timestamp information.
The GPS 110 may move with the user 190 in a vehicle. In one implementation, the GPS 110 may be a stand-alone device. In another implementation, the GPS 110 may exist in the computing device 120. For example, the GPS 110 may be implemented as a component of a web browser or a search engine, or may be implemented as an application in the computing device 120. In some implementations, the GPS 110 may receive location data upon detecting a satellite signal based on a pre-determined rate. For example, the GPS 110 may record data every 30 seconds, every 5 minutes, or the like.
The computing device 120 may be any device capable of processing information and communicating over the network. The computing device 120 may include, but not limited to, a portable handheld computing device (e.g., a personal digital assistant, a smart phone, a cellular phone), a laptop computer, a desktop computer, a media player, a digital camcorder, an audio recorder, a camera, or any other device capable of connecting to the network 130. The computing device 120 may have one or a plurality of users.
In one implementation, the computing device 120 may include, but not limited to, a processor, a memory, and one or more communication interfaces. In addition or alternatively, the computing device 120 may have an operating system, and a user interface (UI) module. In another implementation, a display device may be connected to the computing device 120. When executed on the processor, the operating system and UI module collectively facilitate presentation of a user interface on the display device. In a further implementation, the display device may be incorporated into the computing device 120.
The network 130 may represent any type of communications network, including, but not limited to, wire-based networks (e.g., cable), wireless networks (e.g., cellular, satellite), cellular telecommunications network(s), and IP-based telecommunications network(s) (e.g., Voice over Internet Protocol networks). The network 130 may also include traditional landline or a public switched telephone network (PSTN), or combinations of the foregoing (e.g., Unlicensed Mobile Access or UMA networks, circuit-switched telephone networks or IP-based packet-switch networks).
The server 140 may be an example server within the map matching framework 100. The server 140 may include, without limitation, a processor 150, a storage unit 160, and a memory 170. A map matching module 180 may be maintained in the memory 170 and executed on the processor 150. In one implementation, the map matching module 180 may include a road network database (not shown in
In another implementation, the server 140 may comprise at least one communication interface (not shown in
The processor 150 may be at least one of a central processing unit (CPU), a semiconductor-based microprocessor, a graphics processing unit (GPU), a field-programmable gate array (FPGA) configured to retrieve and execute instructions, other electronic circuitry suitable for the retrieval and execution instructions stored on memory 170, or a combination thereof. The processor 150 may fetch, decode, and execute instructions stored on the memory 170 to implement the functionalities described above.
The storage unit 160 may store the GPS data collected by the GPS 110 and sent to the server 140. For example, the GPS data may be stored in GPS logs. In another implementation, the server 140 may also include one or more known input device(s) (not shown in
The memory 170 may be an example non-transitory computer-readable medium. The non-transitory computer-readable medium may store machine-readable instructions, such as programming code, software, firmware, or the like. For example, the computer-readable medium may include one or more of a non-volatile memory, a volatile memory, and/or a storage device. Examples of non-volatile memory include, but are not limited to, electronically erasable programmable read only memory (EEPROM) and read only memory (ROM). Examples of volatile memory include, but are not limited to, static random access memory (SRAM) and dynamic random access memory (DRAM). Examples of storage devices include, but are not limited to, hard disk drives, compact disc drives, digital versatile disc drives, optical devices, and flash memory devices. In some implementations, the instructions may be part of an installation package that can be executed by the processing device. In this case, the computer-readable medium may be a portable medium, or flash drive or a memory maintained by a server from which the installation package can be downloaded and installed. In another implementation, the instructions may be part of an application or application already installed. Here, the computer-readable medium may include integrated memory such as a hard drive.
The memory 170 may include the map matching module 180, which may be executed on the processor 150. The map matching module 180 may receive a sequence of GPS data points associated with the user 190 and a map of the road network. The map matching module 180 may output a sequence of estimated locations of the user 190, such as specific road segments. In one implementation, the map matching module 180 may be connected to the computing device 120 through the network 130 and the GPS 110. The map matching module 180 may deliver the output of a sequence of estimated locations of the user 190 to the computing device 120 for such information to be presented to the user 190. In addition or alternatively, the map matching module 180 may deliver the output sequence of estimated locations to the computing device 120 to be stored in a database in the computing device 120.
The GPS receiving unit 210 may obtain data from a GPS device (as illustrated in
The map unit 220 may obtain data from the road network database in the map matching module 180. The map unit 220 may export a map of the road network. In one implementation, the map may be represented by a set of roads, and each road may be represented as a polyline, i.e., a sequence of line segments. The map unit 220 may determine the candidate projection points. For each GPS data point zi, the map unit 220 may determine a set of candidate locations {xi0, xi1, . . . }, which are the perpendicular projections on road segments {yi0, yi1, . . . }, within a radius or an error eclipse of zi.
The search radius parameter may be used to find candidates for each sample. In one implementation, the radius may be calculated based on a set of data points and their ground-truth road. A small radius may miss the ground truth location, and a large radius may significantly slow down the map-matching process due to the lame number of candidates growing quadratically. For example, a histogram of the distance between each sample and its ground-truth road may show a maximum error of 25.5 m. Accordingly, the candidate search radius may be set to 50 m, about twice the maximum error.
The distance calculator unit 230 may find the paths on the map that have a minimum Fréchet distance to the GPS dataset. The Fréchet distance between two curves may be described as the minimum required length of a tether between two points as they traverse the two curves, respectively, without backtracking. In one implementation; the Fréchet distance between two curves f, g:[0; 1]→R2 may be defined as:
δF(f,g):=infα,β:[0,1]→[0,1]maxtε[0,1]∥f(α(t))−f(β(t))∥ Equation (1)
where α and β are continuous and non-decreasing time-warping functions with α(0)=β(0)=0 and α(1)=1.
In one implementation, a conventional free-space diagram may be used for calculating the Fréchet distance of two curves. The free-space diagram between two curves for a given distance threshold ε may be a two-dimensional region in the parameter space that consists of all point pairs on the two curves at distance at most ε.
As discussed above, the roads may be represented as polylines; which may be used to approximate curved paths. The free space between two polylines may generalize to the free space between a planar graph G=(V, E) of the road network and a polyline Z=(z0, z1, . . . , zn) of the GPS dataset. A path may exist in G with Fréchet distance at most ε to Z if and only if a monotone path exists in the horizontal and in the vertical direction on the free space surface. In one implementation, a monotonic function may be described as one that either never increases or never decreases.
In one implementation, to determine whether a path on G with Fréchet distance at most Σ to Z exists, the free space surface may be constructed, and the free space may be identified by calculating all free space intervals in the free space diagram of two line segments. The sub free space that is monotone according to the topology of G (following the direction of edges in E) and monotone from G0 to Gn (following the direction of Z) may be calculated. It may be concluded that the path exists if and only if a free space interval on Gn exists.
Algorithm 1, set forth below, outlines an example algorithm for calculating the monotone free (i.e., white) space, which may be exactly the region reached by all monotone paths starting from free space intervals on G0, using the map matching module 180.
indicates data missing or illegible when filed
In one implementation, the minimum Fréchet distance to Z may be found by binary search. For example, multiple candidate values may be calculated in a binary search. In another implementation, the minimum Fréchet distance to Z may be found by a parametric search. In a further implementation, the binary search may be stopped when the remaining interval is shorter than a threshold, without finding the exact Fréchet distance. For example, the threshold may be set to 1 meter.
In one implementation, after the binary search identifies an approximate minimum Fréchet distance, there may still be multiple monotone paths on the free space surface with equal Fréchet distance to the polyline Z. Where multiple monotone paths are found on the free surface, the weight unit 240 may determine the most probable match for a sample by calculating a weight for each candidate path. In one implementation, the weight unit 240 may use dynamic programming to assign weights to these paths and return the minimum weight. For example, the weight unit 240 may use Viterbi dynamic programming to find the minimum weighted path among all paths with minimum Fréchet distance. The dynamic programming may eliminate the explicit enumeration of all candidate paths, which can be exponential in the length of the input trace. Using the dynamic programming method, the solution may be computed once and stored, and the next time the same solution is needed, it may simply be looked up. Accordingly, for each candidate path match of each GPS sample, the minimum weight path may be calculated only once and remembered to find the candidate path again.
In one implementation, the weight unit 240 may perform a weight function represented by:
arg min(x0,x1, . . . ,xn)Σi=0ntidi2+αLi Equation (2)
where ti is the time interval between zi and zi−1 (but t0=t1 or some constant, such as 1.0), di is the distance between xi and zi, α is a constant parameter, and Li is the shortest-path distance between xi and xi−1 in the map (but L0=0).
It should be noted that the summation of Li for a candidate sequence is the Length of the shortest path that links it, which does not change with the sampling interval. It should also be noted that the expected value of the summation of tidi2 does not depend on the sampling interval either. Therefore, the ratio between these two terms remains the same for differing sampling intervals, and a constant α performs consistently across all sampling intervals. It should further be noted that the time elapsed between consecutive samples is encountered in the weight function set forth above. Accordingly, the weight function is consistent for input traces with variable sampling rates or with occasionally missing GPS points.
In one implementation, the weight calculation may be based on two features: distance d and shortest-path L. Distance may measure the great circle distance between a sample zi and a candidate location xij. Shortest-path may be calculated based on the length of the shortest path from xi−1k to xij. It will be appreciated to those skilled in the art that in other implementations, additional features may be considered in the weight function. For example, the weight may include factors, such as the alignment between measured bearing and road direction, the direction indicating the angle difference between a line segment of a sample zi and the road segment, and the speed calculated by the length of the shortest path between two candidates divided by the sampling interval.
In one implementation, the weight unit 240 may tune the constant parameters in the weight function to maximize its accuracy for the dataset.
Algorithm 2, set forth below, outlines an example method for finding the minimum weight path on a free surface via dynamic programming.
The weight unit 240 may set the weight for each free space interval I0(u,v) on G0, which is calculated from the distance between z0 and the edge corresponding to I0(u,v), denoted as d0(u,v) in Line 2 of the algorithm as set forth above. The weight is set to negative infinity if I0(u,v) is an empty interval. The main forward loop from Lines 4 to 12 assigns the minimum weight to each free space interval via dynamic programming. Lines 5 to 7 assign the initial weight to a horizontal interval by comparing all the adjacent cells of Ii−1v, corresponding to adjacent edges of v in G. The weight calculated from the length of (u, v) may be added to the cell corresponding to (u; v) since the path has moved from vertex u to v. Line 8 updates the weights of horizontal intervals from other horizontal intervals using the initial weights and following the topology of G. Lines 9 to 11 update a vertical interval on G, from the parallel vertical interval and the bottom horizontal interval of the cell it belongs to. Finally, Line 13 reconstructs the paths backward, from the min-weight interval on Gn to the interval on G0, following the pointer stored at each interval. In one implementation, a monotone path goes through G0, . . . Gn. Moreover, the vertical interval that the path goes through may be Ii(u,v) at Gi, and accordingly, sample zi may be matched to road (u, v). The sequence of free space horizontal intervals (corresponding to vertices in G) that the monotone path goes through forms a path in G, which is the highest weight path with Fréchet distance no more than ε to trace Z.
The map matching module 180 may include a map parser (not shown in
As discussed above in more detail, Viterbi dynamic programming may be used to calculate the shortest path for every consecutive pair of match candidates. In one implementation, the speed optimizing approach may be used to improve the speed of the Viterbi dynamic programming and reduce the computation time associated with the programming. For example, in one implementation, there may be n candidates for sample zi and m candidates for sample zi+1. Accordingly, in this implementation, a total of n*m shortest paths may need to be calculated. In another implementation, the map matching module 180 may calculate all shortest paths from each candidate of zi to all m candidates of zi+1 at once. Therefore, the map matching module 180 runs the calculations only n times, instead of n*m times as a way of adapting the Dijkstra algorithm. Further, in one implementation, each calculation may be performed in parallel.
In another implementation, the shortest-path search range may be limited. For example, the range may be set to a threshold that is equal to the sampling interval of the GPS datasets multiplied by a predetermined speed (e.g., 50 m/s). In addition or alternatively, the candidate search may be limited by setting the radius to a predetermined threshold to reduce the size of candidate sets. For example, the radius may be set to 30 m if it is determined that the largest distance between a sample and its matched road is around 25 m.
Turning now to the operation of the system 100.
The process 300 may begin at block 305, where a system for determining a match between a GPS dataset and a road location receives GPS information. In particular, this process may involve receiving the GPS information from a GPS satellite at predetermined intervals. In one implementation, the set of sampling points may be sent by the GPS 110 and/or the computing device 120 over the network 130. As discussed above in detail with reference to
At block 310, the system performs a candidate computation, the results of which may be used to determine candidate sets. In particular, this process may involve exporting a road map such as OpenStreetMap (OSM). This process may further involve accessing a road network database to determine one or more corresponding road segments using the candidate points. As discussed above, the road may be represented as polylines. In one implementation, this process may verify the existence of the roads through a ground-truth process by relating the image data of the roads to actual roads. The ground-truth of the dataset may match each sample to a road ID, which typically represents a section of road between two adjacent intersections. More specifically, this process may further involve parsing the map by building a bounding rectangle that encloses all the received GPS data points, bypassing the roads outside the rectangle, and implementing a string scanner to parse each input line.
In another implementation, the process at block 310 may also involve limiting the candidate search radius to a predetermined threshold in order to reduce the size of candidate sets.
At block 315, the system identifies the paths on the map that has minimum Fréchet distance to the GPS dataset. As discussed above, a path may exist with minimum Fréchet distance if and only if a monotone path exists on the free space surface between the road network and the GPS dataset. On the basis of this, the map matching module 180 uses Algorithm 1, described above, to calculate the monotone free space. In addition, as discussed above in more detail in reference to
At block 320, the system calculates and assigns a weight for each path identified at block 315. In particular, this process may involve performing a weight function represented by Equation 1 as described above in reference to
Moreover, this process may involve dynamic programming analysis. Which is performed on the candidate paths to find the minimum weighted path among all paths with minimum Fréchet distance. In one implementation, the system may limit the path range by setting it to be the sampling interval multiplied by a predetermined speed in order to reduce the computation time.
The process at block 320 may also involve using Algorithm 2, described above, to determine which path has the minimum weight on a free surface, and therefore has the best weight and is the best match to the GPS dataset.
In response to the weight assignments, at block 325, the system outputs the candidate sequence with the best weight. In particular, this process may involve storing information related to the path with the minimum weight in a database. In one implementation, such information may be used to support external applications such as location management or driving directions.
In addition or alternatively, the process may include visualizing the results on an interface that can be tailored towards different end-user devices of the user 190. As discussed above, in one implementation, the output may be delivered to the computing device 120 to be displayed to the user 190. In another implementation, a display device attached the map matching module 180 may be utilized to display the visualization.
While the above disclosure has been shown and described with reference to the foregoing examples, it should be understood that other forms, details, and implementations may be made without departing from the spirit and scope of the disclosure that is defined in the following claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2013/032638 | 3/15/2013 | WO | 00 |