The present invention relates to a location information analysis device and a location information analysis method for performing an analysis of user tendencies (for example, extracting a haunt area of a user) based on location information of a mobile station that is carried by the user.
Conventionally, in response to location information of an individual user, some technologies providing various services (provision of information etc.) have been proposed. Patent Document 1, for example, proposes a technology in which a server apparatus creates a list of destinations corresponding to the current position of a user (a list of positions that were selected as destinations by a large number of users who were in the position in the past), and provides the user with the created list.
Traditionally, although a lot of technologies that provide services to an individual user in response to the location information of the users have been proposed, a technology that macroscopically analyzes the location information of a large number of users and collects data on the users' tendencies has rarely proposed.
On the other hand, since consumer preference has been diversified and complicated recently, the needs of analysis data with regard to macroscopic user tendencies with regard to a large number of users without an area limitation have increased to a large extent.
However, to collect data with regard to macroscopic user tendencies in a conventional way, it has been required to do a series of extreme time-consuming work such as sending questionnaires to a lot of users, gathering the questionnaires from the users, and collecting the data by using a plenty of man-power. Moreover, it has been forced the users to bear the burden to answer and send back the questionnaires. That is, traditionally, it has been quite troublesome just gathering data with regard to macroscopic user tendencies, making it very difficult to collect the related data promptly.
In the light of the problem described above, an object of the present invention is to effectively analyze location information easily obtained of a large number of users and to quickly collect data with regard to macroscopic user tendencies.
A location information analysis device according to one aspect of the present invention includes: an input module that is adapted to input point data across a plurality of time points with regard to a plurality of users, the point data including location information indicating a position of a user, time information indicating time at which the location information is obtained, and user identifier information with regard to the user; a haunt area extraction module that extracts an area, as a haunt area where the plurality of users frequently haunt, the area in which the point data is concentrated at or more than a predetermined level, based on a distribution status of the input point data plotted on two dimensional map data; and a storage module that stores extracted haunt area information.
As “the point data” here, it is possible to adopt GPS positioning data gained through a GPS positioning system or OPS data. It should be noted that the OPS data does not include specific location information (latitude and longitude information). In this manner, for example, it is possible to obtain point data from the OPS data by assuming that the user is in the gravity position of the area and converting the area information of the user to the location information (latitude and longitude information) of the gravity position in the area.
As described above, with the point data which is obtained relatively easily, the input module inputs the point data across a plurality of time points with regard to a plurality of users. Then, the haunt area extraction module extracts an area, as a haunt area (an area where a plurality of users frequently haunt), the area in which the point data is concentrated at or more than a predetermined level, based on a distribution status of the input point data plotted on the two dimensional map data. In addition, the storage module stores the extracted haunt area information. In this way, with the location information analysis device according to one aspect of the present invention, it is possible to effectively analyze location information with regard to a large number of users that is obtained easily and to quickly collect haunt area information as data with regard to macroscopic user tendencies.
It should be noted that four embodiments can be exemplified for an extraction procedure of haunt areas depending on whether the point data is classified for each user or not, and whether an extraction procedure based on a point data density or an extraction procedure based on grouping of point data is executed. The four embodiments as exemplary configurations of the haunt area extraction module will be described below.
That is, the haunt area extraction module may include: an all-user density estimation module that estimates density of the input point data of all the users in each of a plurality of zones partitioned on the two dimensional map data in advance; and a first extraction module that extracts an area, as the haunt area, the area in which the estimated density of the point data of all the users is equal to or more than a predetermined level.
Alternatively, the haunt area extraction module may include: a grouping module that calculates a distance between the input point data of all the users plotted on the two dimensional map data, and makes a group of point data of which calculated distance is equal to or less than a predetermined reference distance; and a second extraction module that extracts an area, as the haunt area, including a plurality of pieces of grouped point data on the two dimensional map data.
Alternatively, the haunt area extraction module may include: a classification module that classifies the input point data of all the users for each user; a per-user density estimation module that estimates density of the point data for each user based on the classified point data for each user in each of a plurality of zones partitioned on the two dimensional map data in advance, a summation module that totals the estimated density of the point data for each user in each zone and obtains density of the point data of all the users in each zone, and a third extraction module that extracts an area, as the haunt area, the area in which the obtained density of the point data of all the users is equal to or more than a predetermined level.
Alternatively, the haunt area extraction module may include: a classification module that classifies the input point data of all the users for each user, a per-user grouping module that calculates a distance between the classified point data for each user plotted on the two dimensional map data, and makes a group of point data of which calculated distance is equal to or less than a predetermined reference distance; an overlaying module that overlays an area including a plurality of pieces of grouped point data for each user on the two dimensional map data on the two dimensional map data for all the users; and a fourth extraction module that extracts an area, as the haunt area, that is obtained through the overlaying.
In addition, the location information analysis device according to one aspect of the present invention may further include: a concatenation module that classifies the input point data of all the users for each user and concatenates the point data for each user with the extracted haunt area on the two dimensional map data; a translocation history derivation module that obtains translocation history information between haunt areas for each user with the data concatenating the point data for each user with the haunt area on the two dimensional map data obtained through the concatenation based on time sequential transition with regard to relative positions of the point data for each user for the haunt area; and a travel derivation module that integrates the translocation history information between the haunt areas for each of all the users and obtains travel information between the haunt areas with regard to all the users, based on the obtained translocation history information between the haunt areas of all the users, and the storage module may further store travel information between the haunt areas with regard to all the users. In this case, the concatenation module can classify the input point data of all the users for each user and concatenate the point data for each user and the haunt area on the two dimensional map data, and the translocation history derivation module can obtain the translocation history information between the haunt areas for each user with the data concatenating the point data for each user with the haunt area on the two dimensional map data obtained through the concatenation, based on the time sequential transition with regard to relative positions of the point data for each user for the haunt areas. Furthermore, the travel derivation module integrates the translocation history information between the haunt areas for each of all the users and obtains the travel information between the haunt areas of all the users, based on the obtained translocation history information between the haunt areas of all the users, and the storage module stores the travel information between the haunt areas with regard to all the users. In this way, it is possible to further quickly collect the travel information between the haunt areas with regard to all the users as data about macroscopic user tendencies.
Alternatively, the location information analysis device according to one aspect of the present invention may further include: a concatenation module that classifies the input point data of all the users for each user and concatenates the point data for each user with the extracted haunt area on the two dimensional map data; a staying time derivation module that calculates staying time for each user with regard to each haunt area with the data concatenating the point data for each user with the haunt area on the two dimensional map data obtained through the concatenation, based on the time information of the point data of the user located in each haunt area; and a staying time statistic derivation module that integrates staying time information obtained for each user with regard to each haunt area for all the users and calculates predetermined statistics for all the users with regard to the staying time for each haunt area based on the obtained staying time information for all the users, and the storage module may store the predetermined statistics for all the users with regard to the staying time for each haunt area. In this case, the concatenation module can classify the input point data of all the users for each user and concatenate the point data for each user with the extracted haunt area on the two dimensional map data, and the staying time derivation module can calculate staying time for each user with regard to each haunt area with the data concatenating the point data for each user with the haunt area on the two dimensional map data obtained through the concatenation, based on the time information of point data of the user located in each haunt area. Furthermore, the staying time statistic derivation module integrates the staying time information for each user with regard to each haunt area for all the users and calculates predetermined statistics for all the users with regard to the staying time for each haunt area, and the storage module stores the predetermined statistics for all the users with regard to the staying time for each haunt area. In this way, it is possible to further quickly collect the predetermined statistics with regard to the staying time with regard to all the users for each haunt area as data about macroscopic user tendencies.
In addition, the location information analysis device according to one aspect of the present invention may further include: an active time period derivation module that gains information with regard to active time periods of all users for each haunt area based on attribute data including address information of each user input from outside or stored in advance and the staying time for each user in each haunt area derived by the staying time derivation module, for a certain user, by defining a staying time in a haunt area corresponding to the address information of the user as a staying time at home and determining a time period excluding the staying time at home as an active time period of the user, thereby obtaining the active time period of each user, and by integrating information of the active time period of each user for each haunt area, and the storage module may store the information with regard to the active time periods of all the users for each haunt area. In this case, the active time period derivation module gains the active time period of all users for each haunt area based on attribute data including the address information of each user and the staying time for each user in each haunt area, for a certain user, by defining a staying time in a haunt area corresponding to the address information of the user as a staying time at home and determining a time period excluding the staying time at home as an active time period of the user, thereby obtaining the active time period of each user, and by integrating information of the active time period of each user for each haunt area. Furthermore, the storage module stores the information with regard to the active time periods of all users for each haunt area. In this way, it is possible to further quickly collect the information with regard to the active time periods of all the users for each haunt area as data about macroscopic user tendencies.
Alternatively, the location information analysis device according to one aspect of the present invention may further include: a read-out module that reads out the information stored in the storage module; and an output module that outputs the read-out information. In this case, it is possible to output and visualize a variety of information stored in the storage module.
It should be noted that the invention of the location information analysis devices can be interpreted as an invention of a location information analysis method and such method will be described as below. An invention with regard to a location information analysis method can also provide the same operations and advantageous effects as those described above.
A location information analysis method according to another aspect of the present invention is a location information analysis method performed by a location information analysis device and includes: an input step of inputting point data across a plurality of time points with regard to a plurality of users to the location information analysis device, the point data including location information indicating a position of a user, time information indicating time at which the location information is obtained, and user identifier information with regard to the user, and the point data; a haunt area extraction step of extracting an area, as a haunt area where the plurality of users frequently haunt, the area in which the point data is concentrated at or more than a predetermined level, based on a distribution status of the input point data plotted on two dimensional map data; and a storing step of storing the extracted haunt area information.
With the present invention, it is possible to effectively analyze location information of a large number of users that is obtained easily and to quickly collect data with regard to macroscopic user tendencies.
With reference to the attached figures, an embodiment of the present invention will be described. When appropriate, identical parts will be given identical reference numerals, omitting duplicated explanations.
[Configuration of a Communication System]
The exchanges 400 collect location information of the mobile station 100 via the BTSes 200 and the RNCs 300. The RNCs 300 can make determination of the position of a mobile station 100 by using a delay value in an RRC connection request signal when a connection for communication is established with the mobile station 100. The exchanges 400 can receive the location information of the mobile station 100 determined in this way when the mobile station 100 is establishing the connection for communication. The exchanges 400 store the received location information and output the collected location information to the management center 500 every predetermined timing or in accordance with a request from the management center 500. Here, generally, the number of RNCs 300 located all over Japan is about one thousand. The number of exchanges 400 located all over Japan is about three hundreds.
The various process nodes 700 collect the location information of the mobile station 100 through the RNCs 300 and the exchanges 400, recalculate the position depending on a situation, and output the collected location information to the management center 500 every predetermined timing or in accordance with a request from the management center 500.
The management center 500 is, as described above, configured with the social sensor units 501, the peta-mining units 502, the mobile demography units 503, and the visualization solution units 504. Each of the units executes statistic processing with the location information of the mobile station 100.
Each of the social sensor units 501 is a server apparatus that collects data including the location information of the mobile station 100 from each exchange 400 and various process node 700, or off-line. The social sensor unit 501 is configured to receive data which has been regularly output from the exchanges 400 and the various process nodes 700, or to receive data in the timing set in advance by the corresponding social sensor unit 501, from the exchanges 400 and the various process nodes 700.
Each of the peta-mining units 502 is a server apparatus that converts data received from the corresponding social sensor unit 501 to a predetermined data form. The peta-mining unit 502, for example, executes a sorting process by using a user ID as a key, or executes a sorting process for each area.
Each of the mobile demography units 503 is a server apparatus that executes a totalizing process, or counting process for each item, on the data processed by the peta-mining units 502. The mobile demography unit 503, for example, can count the number of users in a specific area or totalize the distribution of areas where users are present.
Each of the visualization solution units 504 is a server apparatus that processes the data which is totalized by the mobile demography unit 503 so as to make the data visible. The visualization solution unit 504, for example, can execute a mapping process that overlays the totalized data on a map. For example, the data processed in the visualization solution unit 504 will be provided to a company, a public agency, or an individual and used for the development of a store, an investigation of traffic, disaster control, and environmental measures. It should be noted that the information statistically processed in this way is modified so as to avoid identifying an individual so that privacy would not be violated.
Furthermore, each of the social sensor units 501, the peta-mining units 502, the mobile demography units 503, and the visualization solution units 504 is, as described above, configured with a server apparatus and equipped with a basic configuration (i.e., a CPU, a RAM, a ROM, an input device such as a key board or mouse, a communication device for communication with outside, a storage device for storing information, and an output device such as a display or printer) for a general information processing apparatus not illustrated in the figures.
[Configuration of a Location Information Analysis Device]
Next, a location information analysis device according to the present embodiment will be described.
In the present embodiment, the location information is processed with a form of point data that includes location information indicating the position of a user, time information indicating the time at which the location information is obtained, and user identifier information with regard to the user. The point data across a plurality of time points with regard to a large number of users is stored in a location information database 620. Examples of the “point data” here may include GPS positioning data obtained with a GPS positioning system or OPS data. It should be noted that the OPS data does not include specific location information (latitude and longitude information). For example, it is possible to convert the area information regarding the area where a specific user is present to the location information (latitude and longitude information) of the gravity position in the area based on a presumption that the user is in the gravity position of the area, and thus obtain the point data from the OPS data. Meanwhile, user attribute information (an address, a gender, and age etc.) is stored in an attribute information database 630.
A correspondence between the logical configuration illustrated in
It should be noted that the storage module 603, the read-out module 610, and the output module 611 included in the location information analysis device 600 may alternatively correspond to the mobile demography units 503 and the visualization solution units 504 illustrated in
Functions of each unit of the location information analysis device 600 illustrated in
The haunt area extraction module 602 extracts an area, as a haunt area, the area in which the point data is concentrated at or more than a predetermined level, on the basis of a distribution status of the input point data plotted on two dimensional map data. Here, the “haunt area” means an area where as a trend, not individuals but a large number of users frequently visit and stay. The extraction process performed by the haunt area extraction module 602 can adopt various embodiments. Various functional block configurations illustrated in
The haunt area extraction module 602, as illustrated in
The haunt area extraction module 602, as illustrated in
The haunt area extraction module 602, as illustrated in
The haunt area extraction module 602, as illustrated in
In the four embodiments described above, the embodiment that performs density estimation on the basis of the point data classified for each user illustrated in
Referring back to
The translocation history derivation module 605 obtains translocation history information between haunt areas for each user with the data concatenating the point data for each user with the haunt area on the two dimensional map data obtained through concatenation by the concatenation module 604, on the basis of the time sequential transition with regard to relative positions of the point data for each user for the haunt areas. The travel derivation module 606 integrates the translocation history information between the haunt areas for each of all the users and calculates a travel between the haunt areas (approximate flow) with regard to all the users, on the basis of the obtained translocation history information between the haunt areas of all the users.
The staying time derivation module 607 calculates staying time for each user with regard to each haunt area with the data concatenating the point data for each user with the haunt area on the two dimensional map data obtained through the concatenation by the concatenation module 604, on the basis of the time information of point data of the user located in each haunt area. The staying time statistic derivation module 608 integrates staying time information for each user with regard to each haunt area for all the users and calculates predetermined statistics (for example, an average staying time, the longest staying time, a median of the staying time) for all the users with regard to the staying time for each haunt area based on the obtained staying time information for all the users.
The active time period derivation module 609 gains an active time period of all users on the basis of attribute data including address information of each user stored in the attribute information database 630 and the staying time for each user in each haunt area derived by the staying time derivation module 607, for a certain user, by defining a staying time in a haunt area corresponding to the address information of the user as a staying time at home and determining a time period excluding the staying time at home as an active time period of the user, thereby obtaining the active time period for each user, and by integrating information of the active time period of each user for each haunt area.
The read-out module 610 reads out information stored in the storage module 603. The output module 611 outputs the information read out by the read-out module 610.
[Various Processes Executed by the Location Information Analysis Device]
Next, various processes executed by the location information analysis device 600 will be described. A first process that extracts haunt areas and derivates a travel (approximate flow) between haunt areas, a second process that extracts haunt areas, derivates statistics with regard to the staying time for each haunt area, and derivates active time period information of all users for each haunt area, and a third process that outputs stored information will be explained in the give order below.
(First Process)
Now, the first process that extracts haunt areas and derivates a travel (approximate flow) between haunt areas will be described.
As shown in
Next, the haunt area extraction module 602 extracts an area, as a haunt area, the area in which the point data is concentrated at or more than a predetermined level, on the basis of a distribution status of the input point data plotted on the two dimensional map data (step S2). The extraction process of step S2 has four embodiments described above. Each of the embodiments will be described below.
A first embodiment executes an extraction procedure based on a point data density without classifying point data for each user. In this embodiment, the haunt area extraction module 602 includes the previously stated configuration that is illustrated in
A second embodiment executes an extraction procedure based on grouping of point data without classifying point data for each user. In this embodiment, the haunt area extraction module 602 includes the previously stated configuration that is illustrated in
A third embodiment classifies point data for each user and executes an extraction procedure based on a point data density. In this embodiment, the haunt area extraction module 602 includes the previously stated configuration that is illustrated in
A fourth embodiment classifies point data for each user and executes an extraction procedure based on grouping of point data. In this embodiment, the haunt area extraction module 602 includes the previously stated configuration that is illustrated in
The haunt area information (a haunt area ID) obtained by the extraction process in step S2 described above, for example, as illustrated in
In the next step S3 illustrated in
Then, the concatenation module 604 classifies point data of all the users for each user and concatenates point data for each user with the haunt areas extracted by the haunt area extraction module 602 on the two dimensional map data (step S4).
The translocation history derivation module 605 obtains translocation history information between haunt areas for each user with the data concatenating the point data for each user with the haunt area on the two dimensional map data obtained through concatenation by the concatenation module 604, on the basis of the time sequential transition with regard to relative positions of the point data for each user for the haunt areas (step S5). For example, the haunt areas with regard to a certain user include an “area A” at 10:10, an “area B” at 10:20, 10:30, and 10:40, and an “area C” at 10:50, and the “area B” is assumed as a haunt area, the point meaning “coming from the area A” and the point meaning “going to the area C” are gained for the area B. That is, with regard to the “area B”, which is a haunt area, as the travel history information between haunt areas, the “area A” is gained as a From-area since the certain user came from the area A and the “area C” is gained as a To-area since the certain user went to the area C. Hear, in the location information analysis device 600, as illustrated in
Next, the travel derivation module 606 integrates the translocation history information between the haunt areas for each of all the users and obtains a travel (approximate flow) between the haunt areas with regard to all the users, on the basis of the obtained translocation history information between the haunt areas of all the users (step S6), and the storage module 603 stores the obtained travel information between the haunt are as with regard to all the users (approximate flow information) (step S7). For example, as illustrated in
With the first process described above, the haunt area information and the travel information between the haunt areas for all the users (approximate flow information) are gained and stored for the output process described later.
(Second Process)
Next, the second process that extracts a haunt area, derivates statistics with regard to the staying time for each haunt area, and derivates the active time period information of all users for each haunt area, will be described below.
As illustrated in
Then, the haunt area extraction module 602 extracts an area, as a haunt area, the area in which the point data is concentrated at or more than a predetermined level, on the basis of a distribution status of the input point data plotted on the two dimensional map data (step S2). Since the extraction process of step S2 is already explained in detail in the first process, detailed explanation will be omitted here.
Next, the storage module 603 stores the extracted haunt area information (step S3). The concatenation module 604 then classifies point data of all the users for each user and concatenates point data for each user and the haunt areas extracted by the haunt area extraction module 602 on the two dimensional map data (step S4).
Then, the staying time derivation module 607 calculates staying time for each user with regard to each haunt area with the data concatenating the point data for each user with the haunt area on the two dimensional map data obtained through the concatenation by the concatenation module 604, on the basis of the time information of point data of the user located in each haunt area (step S8). For example, assuming that the haunt areas with regard to a certain user include the “area A” at 10:10, the “area B” at 10:20, 10:30, and 10:40, and the “area C” at 10:50, it can be decided that the certain user was in the “area B” from 10:20 to 10:40. Therefore, 20 minute is gained as the staying time of the user with regard to the “area B” as the haunt area.
Next, The staying time statistic derivation module 608 integrates staying time information for each user with regard to each haunt area obtained by the staying time derivation module 607 for all the users and calculates predetermined statistics (for example, an average staying time, the longest staying time, a median of the staying time) for all the users with regard to the staying time for each haunt area based on the obtained staying time information for all the users. In this way, for example, as illustrated in
Then, in step S10, the active time period derivation module 609 obtains an active time period of each user on the basis of attribute data including address information of each user stored in the attribute information database 630 and the staying time for each user in each haunt area derived by the staying time derivation module 607, for a certain user, by defining a staying time in a haunt area corresponding to the address information of the user as a staying time at home and determining a time period excluding the staying time at home from a day time period (0 o'clock to 24 o'clock) as an active time period of the user, thereby obtaining the active time period for each user. In this way, for example, as illustrated in
With the second process described above, the haunt area information, the predetermined statistics for all the users with regard to the staying time for each haunt area, and the information with regard to active time periods of all users for each haunt area are gained and stored.
(Third Process)
Next, the third process that outputs the stored information will be described.
As illustrated in
In addition, the output module 611, as illustrated in
In addition, the output module 611, as illustrated in
According to the present embodiment described above, it is possible to effectively analyze location information (GPS positioning data or OPS data) of a large number of users that is obtained easily without user reactions, and quickly collect and output data with regard to macroscopic user tendencies (haunt area information, travel information between haunt areas (approximate flow information), predetermined statistics with regard to the staying time for each haunt area, information with regard to the active time periods of all users for each haunt area).
According to the present embodiment, the information for each user obtained in the process stage is not stored to be used for the output process, whereby a violation of privacy of individual users is prevented. It should be noted that information may be stored for each user. In this case, however, the information for each user is omitted from targets of the output process so as not to be output.
According to the present embodiment, data with regard to not individual user tendencies but macroscopic user tendencies is collected. Therefore, the location information to be a basis of the analyzing process is not necessarily obtained from the mobile stations of users periodically. Location information gained irregularly may be also applicable broadly.
10 . . . communication system, 100 . . . mobile station, 200 . . . BTS (Base Transceiver Station), 300 . . . RNC (Radio Network Controller), 400 . . . exchange, 500 . . . management center, 501 . . . social sensor unit, 502 . . . peta-mining unit, 503 . . . mobile demography unit, 504 . . . visualization solution unit, 600 . . . location information analysis device, 601 . . . input module, 602 . . . haunt area extraction module, 602A . . . all-user density estimation module, 602B . . . first extraction module, 602C . . . grouping module, 602D . . . second extraction module, 602E . . . classification module, 602F . . . per-user density estimation module, 602G . . . summation module, 602H . . . third extraction module, 602I . . . classification module, 602J . . . per-user grouping module, 602K . . . overlaying module, 602L . . . fourth extraction module, 603 . . . storage module, 604 . . . concatenation module, 605 . . . translocation history derivation module, 606 . . . travel derivation module, 607 . . . staying time derivation module, 608 . . . staying time statistic derivation module, 609 . . . active time period derivation module, 610 . . . read-out module, 611 . . . output module, 620 . . . location information database, 630 . . . attribute information database, 700 . . . various process node
Number | Date | Country | Kind |
---|---|---|---|
2009-089152 | Apr 2009 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP10/55234 | 3/25/2010 | WO | 00 | 2/16/2012 |