This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2014-089571, filed on Apr. 23, 2014, the entire content of which is incorporated herein by reference.
The embodiments discussed herein are related to a route extraction method, and a route graph generation method.
Hitherto, technology has existed that performs analysis related to routes of moving bodies as spatial information analysis. For example, technology exists that performs an OD (O=“origin”, D=“destination”) search that finds routes having specified localities as origins or destinations, from plural track data that are actual observation data. Technology also exists that analyzes OD frequencies indicating combinations of outset localities and destination localities (such as (Shinjuku Station to Shibuya Station), or (Shinagawa Station to Ikebukuro Station) for example) appearing a specific number of times (for example, 10 times) or more, from plural track data. Technology also exists that finds partial routes, such as a route passing through Shinagawa Station→(Yamanote Line Outer Circle)→Ikebukuro Station, from track data. Technology also exists that analyzes frequent partial routes representing partial routes appearing a specific number of times (for example, 10 times) or more in track data. In such route analysis, when the route to be analyzed is predetermined by a road network, map data, or the like, appropriate analysis results can be obtained by mapping the track data onto routes.
We consider here cases of route analysis in which people, acting as moving bodies, are the focus. Track data representing movement of people may, for example, be acquired as a series of position data, periodically position-measured as a latitude and longitude (a position-measurement point) by a GPS (global positioning system) sensor installed in a mobile phone, smart phone, or the like. WiFi or the like may also be employed for acquiring position data. Movement of people is not limited to movement along a road, train tracks, etc.; sometimes free movement occurs through open spaces in, for example, exhibition halls and the like. Namely, sometimes track data is also acquired for spaces in which routes are undetermined. In methods of performing route analysis on track data associated with predetermined routes such as road networks, or map data, route analysis cannot be performed on track data for spaces in which routes are undetermined.
Technology therefore exists in which a target-analysis range is apportioned to plural regions (a mesh) of a specific surface area, and position-measurement points indicating respective position data included in the track data is associated with the mesh. In such technology, the OD (origin, destination) in the route analysis appears as a combination of a mesh pair associated with position-measurement points indicating the OD, and the routes appear as track data passing through a mesh series.
According to an aspect of the embodiments, a route extraction method includes, by a processor, when extracting, for combination with an identified route out of plural routes, a route from the other routes in the plural routes, performing control that increases extraction probability according to distribution density of the other routes.
The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention.
Detailed explanation follows regarding examples of exemplary embodiments of technology disclosed herein with reference to the drawings.
In a first exemplary embodiment, a route graph displaying routes employed by route analysis is generated from track data indicating tracks moved along by moving bodies, without dividing a region into a mesh, and without employing road networks, map data, or the like. The track data is a sequence of position data indicating position-measured positions (position-measurement points) of moving bodies.
First, prior to explaining the first exemplary embodiment in detail, explanation follows regarding problems anticipated when generating a route graph from track data.
When a route graph is generated from track data, it is preferable to present collections of routes having high similarity to respective tracks represented in plural track data, and to generate a simple route graph. For example, generating a route graph by sequentially combining tracks having an inter-track distance from each other within a specific distance is conceivable.
For example, explanation follows of an example case where respective tracks t1 to t6 illustrated in
In state A of
However, in the route graph generated as illustrated in D of
In the first exemplary embodiment, the route graph is generated with additional consideration for the original track density, such that the overall characteristics of the original tracks are reflected in the route graph.
Explanation follows below regarding details of a route graph generation device according to the first exemplary embodiment.
As illustrated in
Through a network 28, the track data generation device 22 acquires position-measurement data indicating positions of moving bodies periodically position-measured by respective sensors 26 that position-measure positions of the moving bodies. The sensors 26 may be GPS or the like employed in a mobile phone carried by a person, a person being an example of a moving body, a terminal such as a smart phone, a car navigation system installed in a car, a car being an example of a moving body, or the like. The position-measurement data includes position data represented by a latitude and a longitude, time data indicating the position-measurement time, and a sensor ID that identifies the sensor 26. The track data generation device 22 extracts the plural acquired position-measurement data for each of the sensor IDs, and generates the track data by arranging the position data included in respective position-measurement data in a time sequence based on the time data. For example, if the position-measurement points indicated in position data included in the position-measurement data of a sensor having ID=k are pki (i=1, 2, . . . , n; where n is the total number of position data items included in the position-measurement data that include the sensor having ID=k), then the track data representing track tk may be represented by tk=<pk1, pk2, . . . , pkn>. The position data representing position-measurement point pi is pi=(xi, yi)εR2 (R2 is a 2-dimensional space having real numbers as elements).
The track data generation device 22 stores plural generated track data in the track data storage section 24. The track data storage section 24 may be a storage device provided to the track data generation device 22, or may be a storage device provided as an external device, separate from the track data generation device 22. The track data storage section 24 may be a portable storage medium such as a CD-ROM, DVD-ROM, or USB memory.
As illustrated in
One by one, the commonizable location identification section 11 selects track data representing a processing-target track from the track data included in the input track data collection. The commonizable location identification section 11 identifies portions of another track present within a specific distance ε of the processing-target track as commonizable locations of the processing-target track and the other track. The Frechet distance, for example, may be employed as the inter-track distance.
For each of the tracks tk, the commonizable location identification section 11 calculates awarded scores indicating densities of other tracks in the vicinities of the respective tracks. More specifically, the commonizable location identification section 11 sets a specific score value for the respective tracks tk, and apportions the set score values as assigned scores to the respective position-measurement points included in the given track tk. Then, the assigned score of the position-measurement points pki are apportioned to other tracks identified as commonizable locations αki for those position-measurement points pki, and the apportioned assigned scores are then summed for the respective tracks and set as the awarded score indicating the density for that track.
For example, if a score of 100 is the score value set for each track, since 5 position-measurement points are included in the track t3, the commonizable location identification section 11 apportions an assigned score of 20 each to the respective position-measurement points p3i of the track t3 as illustrated at A in
Similarly, the commonizable location identification section 11 also calculates awarded scores based on the other commonizable locations αk, and calculates a sum of the awarded scores for each of the tracks tk based on the commonizable locations αk as illustrated in
One by one, the representative route extraction section 12 selects track data as track data representing a processing-target track, and extracts a representative route corresponding to the processing-target track from the route connecting position-measurement points included in the commonizable locations of the processing-target track. More specifically, for the tracks tk (the track t3 in the example of
As illustrated at the bottom of
Out of the paths (routes) in the generated planar graph, the representative route extraction section 12 extracts a route having a high degree of matching with the processing-target track as the representative route based on node density and distance from the processing-target track. The degree of matching is defined so as to be higher for routes a shorter distance away from the track, and, in cases in which distances from the track are about the same, higher for routes passing through locations having higher node density. Routes common to other tracks included within the specific distance away from the processing-target track are thereby extracted as representative routes corresponding to the processing-target track.
Explanation follows with reference to
The representative route extraction section 12 searches for nodes of the planar graph included within the distance ε (the range within the dashed circle of a vicinity of the node p31 in
Similarly for the node p35 that is the termination point of the track t3, the representative route extraction section 12 searches for nodes of the planar graph included within the distance ε from the node p35 (the range within the dashed circle at a vicinity of the node p35 in
The representative route extraction section 12 searches for the path having the shortest distance (for example, Frechet distance) from the track t3 out of paths in the planar graph in which the derived start point (node p11) and the derived termination point (node p65) are taken as the start point and termination point of the path. For example, the nodes p21, P13, and p41 exist as candidates for the node following the node p11, and of these, the node p21, being the shortest distance away from the track t3, is selected as the node following the node p11. When plural nodes are present at the shortest distance away from the track t3, the node having the highest density of nodes within a vicinity thereof is selected out of these nodes. Nodes having high density may be selected similarly to in the method of selecting the start point and termination point of the representative route. A specific number of nodes, ranked according to shortest distance away from the track t3, may designed as candidates, and a node may be selected for inclusion in the representative route based on the density within a vicinity of each of these nodes. The representative route is selected by starting from the start point and repeating node selection in this manner until the termination point is reached. Below, the representative route corresponding to the track tk is denoted Πk.
The representative route extraction section 12 similarly extracts the representative routes Πk for the other tracks tk.
The combining section 13 ranks the respective representative routes corresponding to each of the tracks extracted by the representative route extraction section 12 according to the representative routes having the highest awarded score indicating the respective track densities calculated by the commonizable location identification section 11, and generates the route graph by combining with the route graph (temporary route graph) generated at that stage. The method of combination is to combine nodes of the processing-target representative route with nodes of the temporary route graph where the distance between nodes included in the representative route is no more than ε, similarly to in the method explained with reference to
For example, consider a case in which the awarded scores indicating the densities for the respective tracks tk as illustrated in
Similarly, as illustrated in E of
The combining section 13 outputs the generated route graph G=(V, E). Note that V is a collection of data representing nodes of the route graph, and E is a collection of data representing edges connecting nodes together.
The route graph generation device 10 may be implemented by, for example, a computer 40, as illustrated in
The storage section 46 may be implemented by a hard disk drive (HDD), a solid state drive (SSD), a flash memory, or the like. The storage section 46, serving as a recording medium, stores a route graph generation program 50 that causes the computer 40 to function as the route graph generation device 10. The CPU 42 reads the route graph generation program 50 from the storage section 46, expands the route graph generation program 50 into the memory 44, and sequentially executes the processes included in the route graph generation program 50.
The route graph generation program 50 includes a commonizable location identification process 51, a representative route extraction process 52, and a combining process 53. The CPU 42 operates as the commonizable location identification section 11 illustrated in
Note that the route graph generation device 10 may also be implemented by a semiconductor integrated circuit, for example, more specifically by an Application Specific Integrated Circuit (ASIC) or the like.
Next, explanation follows regarding operation of the first exemplary embodiment. The track data generation device 22 acquires position-measurement data position-measured by the plural respective sensors 26 through the network 28, generates track data from the position-measurement data, and stores the track data in the track data storage section 24. When track data collection stored in the track data storage section 24 is input to the route graph generation device 10, the route graph generation processing illustrated in
At step S11 of the route graph generation processing illustrated in
Next, at step S14 the commonizable location identification section 11 apportions the specific score value set for the respective tracks tk to each of the position-measurement points pki, included in that track tk, as assigned scores. Then, for each position-measurement point pki, the commonizable location identification section 11 apportions the assigned scores of the position-measurement points pki to other tracks identified as commonizable locations αk, for that position-measurement point pki, and sums the apportioned assigned score for each of the tracks. The commonizable location identification section 11 thereby calculates awarded scores based on commonizable locations αk, for example, as illustrated in
Next, at step S15 the commonizable location identification section 11 determines whether or not the variable k has reached the count n of the track data included in the track data collection. When k is less than n, processing transitions to step S16, the commonizable location identification section 11 increments k by 1, and processing returns to step S12. When k has reached n, processing transitions to step S17.
At step S17, the commonizable location identification section 11 for example, as illustrated in
Next, at step S18, the representative route extraction section 12 sets the variable k to 1, and at the next step S19, sets the track tk as the processing target. Next, at step S20, for the tracks tk, the representative route extraction section 12 removes the edges connecting together position-measurement points included in the portions of other tracks identified as the commonizable locations αk at step S13 above, and extracts the position-measurement points only. Then the representative route extraction section 12 generates a planar graph with the extracted position-measurement points as nodes.
Next, at step S21, out of paths in the planar graph generated at step S20 above, the representative route extraction section 12 extracts as a representative route Πk a path having a high degree of matching with the track tk, based on node density and distance away from the track tk.
Next, at step S22 the representative route extraction section 12 determines whether or not the variable k has reached the count n of track data included in the track data collection. When k is less than n, processing transitions to step S23, the representative route extraction section 12 increments k by 1, and processing returns to step S19. When k has reached n, processing transitions to step S24.
At step S24, from the representative routes corresponding to each of the tracks extracted at step S21 above, the combining section 13 selects the representative route, out of the unprocessed representative routes, that has the highest awarded score calculated at step S17 above indicating density of respective tracks. Then, the combining section 13 combines the selected representative route with the temporary route graph of that stage. The combining section 13 repeatedly performs the combination with the temporary route graph until there are no unprocessed representative routes present, and thereby generates the final route graph. The combining section 13 outputs the generated route graph G=(V, E), and the route graph generation processing ends.
As explained above, in the route graph generation device 10 according to the first exemplary embodiment, for each of the tracks represented by the track data, representative routes are extracted according to the degree they are commonizable with other tracks in the vicinity. The route graph is then generated by combining the representative routes corresponding to the respective tracks in sequence from the representative route having the highest density of other tracks in the track vicinity. Generation of a route graph that better represents the characteristics of the original tracks is thereby enabled, in contrast to cases in which the route graph is generated by combining together tracks simply is sequence by shortest distance apart from each other. Since the route graph is generated from the track data alone, difficulties caused by adjustments or the like of a mesh surface area do not arise, and an appropriate analysis result can be obtained when the route graph generated according to the first exemplary embodiment is employed in route analysis. Generating a route graph based on track data is also enabled for locations that do not correspond to a road network, map data, or the like, enabling appropriate analysis results to be obtained even for people moving freely.
Generating a planar graph having the position-measurement points included in the commonizable locations as nodes when the representative routes are extracted, enables efficient extraction of representative routes.
The density of other tracks in the track vicinity is calculated as an awarded score as described above, enabling densities to be derived by simple processing.
Next, explanation follows regarding a second exemplary embodiment. As illustrated in
The route collection extraction section 16 maps the respective tracks represented by the track data onto network data including plural nodes and edges connecting nodes together. The network data may, for example, be a planar graph having position-measurement points, represented by the position-measurement data included in the track data of the input track data collection, as nodes. When road network data is usable, the road network data may also be employed. The route collection extraction section 16 extracts as routes to be employed in route graph generation, paths included in portions of network data that are present within a specific distance ε from tracks mapped onto the network data. The routes extracted for the tracks tk are denoted routes cki (i=1, 2, . . . , n; where n is the total number of routes extracted for the track tk), and the routes cki are collectively denoted route collection Sk.
In order to simplify the explanation below, the network data is represented by a grid like that illustrated in
c11: 2—8, 8—9, 9—10, 10—16, 16—17, 17—23, 23—24
In order to simplify the explanation below, explanation follows in which the following route collections Sk are extracted for the respective tracks tk (k=1, 2, 3), as illustrated in
S1:
c11: 2—8, 8—14, 14—15, 15—16, 16—22, 22—23, 23—24
c12: 2—8, 8—9, 9—15, 15—16, 16—17, 17—23, 23—24
c13: 2—8, 8—9, 9—10, 10—16, 16—22, 22—23, 23—24
S2:
C21: 2—8, 8—14, 14—20, 20—21, 21—22, 22—23, 23—24
c22: 2—8, 8—14, 14—15, 15—21, 21—22, 22—23, 23—24
c23: 2—8, 8—14, 14—15, 15—16, 16—22, 22—23, 23—24
S3:
c3i: 7—8, 8—9, 9—10, 10—16, 16—17, 17—23, 23—24
c32: 13—14, 14—15, 15—16, 16—17, 17—23, 23—24
The optimization section 17 generates a route graph by optimizing a combination of edges to be included in the route graph, out of the edges included in the route collections extracted by the route collection extraction section 16, so that the degree of similarity with the original tracks becomes higher.
The optimization section 17 performs the optimization, for example, as follows. First, the optimization section 17 sets the following constraints: (1) a route is a collection of edges; (2) tracks match one of the routes; (3) the route graph includes all of the routes matching the tracks. Then, under these constraints, the optimization section 17 performs optimization such that the number of edges included in the route graph is minimized.
For example, the optimization section 17 takes each of the edges included in the route collections Sk extracted by the route collection extraction section 16 as x, takes the routes cki as y, and takes the routes matching the tracks tk as z, with each of these defined as follows. Note that routes matching the tracks are not only routes perfectly matching a track; routes matching a track with a degree of matching of a specific ratio or greater may also be included.
x2—8, . . . , x23—24ε{0, 1}
y11, y12, . . . , y32ε{0,1}
z1, z2, z3ε{0, 1}
The constraints (1) to (3) are defined as follows.
(1) A route is a collection of edges.
y11:=x2—8×8—14×14—15×15—16×16—22×22—23×23_×24
y12:=x2—8×8—9×9—15×15—16×16—17×17—23×23_×24
y13:=x2—8×8—9×9—10×10—16×16—22×22—23×23_×24
and so on to
y32:=x13—14×14—15×15—16×16—17×17—23×23_×24
(2) Tracks match one of the routes.
z1:=y11y12y13
z2:=y21 y22y23
z3:=y31y32
(3) The route graph includes all of the routes matching the tracks.
G=z1̂z2̂z3
The optimization section 17 defines minimizing the number of edges included in the route graph by the following objective function.
minimize: x2—8++x23—24
Under the constraints (1) to (3), the optimization section 17 reduces the objective function above to optimization of a 0-1 integer programming problem, and finds an optimized solution for x. Namely, an optimized solution is obtained wherein an x of 1 indicates that an edge is included, and an x of 0 indicates that an edge is not included in the route graph G. For example, an optimized solution such as that illustrated in the center of
Similarly to the route graph generation device 10 according to the first exemplary embodiment, the route graph generation device 210 may, for example, be implemented by a computer 40 illustrated in
The route graph generation program 250 includes a route collection extraction process 56, and an optimization process 57. The CPU 42 operates as the route collection extraction section 16 illustrated in
Note that the route graph generation device 210 may also be implemented by a semiconductor integrated circuit, for example, more specifically by an ASIC or the like.
Next, explanation follows regarding operation of the second exemplary embodiment. In the second exemplary embodiment, the route graph generation processing illustrated in
At step S31 of the route graph generation processing illustrated in
Next, at step S33, the route collection extraction section 16 maps the track tk onto the network data. Then, the route collection extraction section 16 extracts route collections Sk included in portions of the network data that are present within the specific distance ε from the track mapped onto the network data.
Next, at step S34 the route collection extraction section 16 determines whether or not the variable k has reached the count n of the track data items included in the track data collection. When k is less than n, processing transitions to step S35, the route collection extraction section 16 increments k by 1, and processing returns to step S32. When k has reached n, processing transitions to step S36.
Next, at step S36 the optimization section 17 sets the following constraints: (1) a route is a collection of edges; (2) tracks match one of the routes; (3) the route graph includes all of the routes matching the tracks. Then, under these constraints, the optimization section 17 finds an optimized solution indicating edges included in the route graph by, for example, solving optimization of a 0-1 integer programming problem like that above, such that the number of edges included in the route graph is minimized.
Next, at step S37 the optimization section 17 generates the route graph G based on the optimized solution obtained at step S36, and the network data employed at step S33. The optimization section 17 outputs the generated route graph G, and route graph G generation processing ends.
As explained above, in the route graph generation device 210 according to the second exemplary embodiment, the tracks are mapped onto the network data, and the route collection is extracted from the portions of the network data included in a range within a specific distance away from the tracks. Then, a route graph is generated by optimizing combinations of edges to be included in the route graph, out of the edges included in the route collection, such that the number of edges included in the route graph is minimized. Generation of a route graph that well represents the characteristics of the original tracks is therefore enabled.
In the second exemplary embodiment, although a route graph is generated from the track data and the network data alone, this network data may employ a planar graph generated from the track data. Accordingly, similarly to the first exemplary embodiment, difficulties caused by adjustments or the like of a mesh surface area do not arise, and an appropriate analysis result can be obtained when the route generated by the second exemplary embodiment is employed in route analysis. Generating a route graph based on track data is also enabled for locations that do not correspond to a road network, map data, or the like, enabling appropriate analysis results to be obtained even for people moving freely.
Explanation has been given above in which the route graph generation programs 50, 250, that are examples of route graph generation programs according to technology disclosed herein, are stored in advance (installed) in the storage section 46; however, there is no limitation thereto. The route graph generation program according to technology disclosed herein may also be provided in a format recorded on a storage medium such as a CD-ROM, DVD-ROM, or USB memory.
In route analysis employing a mesh, analysis results sometimes change according to the size of the set surface area of the mesh. For example, when the surface area of the mesh is small, substantially similar track data sometimes appear as different routes. When the surface area of the mesh is large, different track data intended to be handled as a different route sometimes appears as the same route. Minor variations in mesh surface area adjustments of this type sometimes have a detrimental impact on analysis results. At the partitioning regions of the mesh, even for position-measurement points representing substantially similar places, sometimes one position-measurement point will be associated with one mesh square, and another position-measurement point will be associated with another mesh square on the other side of a boundary, and sometimes appropriate analysis results cannot be obtained.
An advantageous effect of one aspect of technology disclosed herein is enabling generation of a route graph enabling appropriate analysis results to be obtained from route analysis.
All examples and conditional language provided herein are intended for the pedagogical purposes of aiding the reader in understanding the invention and the concepts contributed by the inventor to further the art, and are not to be construed as limitations to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although one or more embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
2014-089571 | Apr 2014 | JP | national |