The invention relates to an inference-use knowledge generation apparatus and an inference-use knowledge generation method for generating inference-use knowledge that is to be used in an inference that is made by a calculating machine, and also relates to a computer-readable recording medium that includes a program recorded thereon for realizing this apparatus and method.
Conventionally, processing aimed at capturing movements of people and stuffs has been performed for store opening plans, crime investigations, evacuation plans and instructions at the time of a disaster, environment management, and the like. In order to execute such processing, geospatial information is required. Many web sites publish geospatial information that can be used by a calculating machine on the Internet (e.g., see Non-Patent Documents 1 to 3).
Also, conventionally, attempts have been made to execute an inference using a calculating machine (see Patent Documents 1 to 4). If an inference is made by a calculating machine, various situations can be deduced based on information obtained from facts. Thus, an inference made by a calculating machine is useful for the above-described store opening plans, crime investigations, evacuation at the time of a disaster, environment management, and the like, and the accuracy of a simulation is expected to be improved utilizing an inference. Also, in recent years, an inference by a calculating machine can be easily utilized due to an improvement in the processing capacity of calculating machines.
Patent Document 1: Japanese Patent Laid-Open Publication No. H9-213081
Patent Document 2: Japanese Patent Laid-Open Publication No. H10-333911
Patent Document 3: Japanese Patent Laid-Open Publication No. 2000-242499
Patent Document 4: Japanese Patent Laid-Open Publication No. 2015-502617
Non-Patent Document 1: “Open Street Map”, [online], Open Street Map contributors, Retrieved on Nov. 18, 2016, Internet <URL: http://www.openstreetmap.org/>
Non-Patent document 2: “GeoNLP”, [online], National Institute of Informatics, Retrieved on Nov. 18, 2016, Internet <URL: http://www.openstreetmap.org/>
Non-Patent Document 3: “Linked Open Addresses Japan”, [online], Open Addresses, Retrieved on Nov. 18, 2016, Internet <URL: http://uedayou.net/loa/>
Incidentally, in order to make an inference using a calculating machine, it is necessary to generate knowledge regarding stuffs that cannot be understood using data indicating just facts. That is, in order to make an inference using a calculating machine for the above-described store opening plans, crime investigations, evacuation at the time of a disaster, environment management, and the like, it is necessary to generate knowledge regarding stuffs in a space. However, if knowledge is generated on demand at the time of execution of an inference, the processing time increases and the processing cost significantly increases.
An example object of the invention is to provide an inference-use knowledge generation apparatus, an inference-use knowledge generation method, and a computer readable recording medium that solve the above-described problems, and can shorten the processing time and reduce the processing cost required when an inference about things in a space is made by a calculating machine.
In order to achieve the above-described object, an inference-use knowledge generation apparatus according to an example aspect of the invention is an apparatus for generating inference-use knowledge that is to be used in an inference that is made by a calculating machine, and the apparatus includes
a data extraction unit configured to extract, based on a set parameter, data corresponding to a designated position or region from a first data set including data regarding a stuff in a predetermined space, and
a knowledge generation unit configured to specify, from a second data set that includes a plurality of entities that form the space and have been grouped into groups of related entities, a group of entities described by words included in the extracted data, and to generate the inference-use knowledge that indicates a spatial relationship between the entities based on the specified group and a term expressing a preregistered spatial relationship.
Also, in order to achieve the above-described object, an inference-use knowledge generation method according to an example aspect of the invention is a method for generating inference-use knowledge that is to be used in an inference that is made by a calculating machine, and the method includes
(a) a step of extracting, based on a set parameter, data corresponding to a designated position or region from a first data set including data regarding a stuff in a predetermined space, and
(b) a step of specifying, from a second data set that includes a plurality of entities forming that form the space and have been grouped into groups of related entities, a group of entities described by words included in the extracted data, and generating the inference-use knowledge that indicates a spatial relationship between the entities based on the specified group and a term expressing a preregistered spatial relationship.
Furthermore, in order to achieve the above-described object, a computer-readable recording medium according to an example aspect of the invention is a computer-readable recording medium that includes a program recorded thereon for, with use of a computer, generating inference-use knowledge that is to be used in an inference that is made by a calculating machine, the program including instructions that cause the computer to carry out the steps of:
(a) a step of extracting, based on a set parameter, data corresponding to a designated position or region from a first data set including data regarding a stuff in a predetermined space, and
(b) a step of specifying, from a second data set that includes a plurality of entities that form the space and have been grouped into groups of related entities, a group of entities described by words included in the extracted data, and generating the inference-use knowledge that indicates a spatial relationship between the entities based on the specified group and a term expressing a preregistered spatial relationship.
As described above, according to the invention, it is possible to shorten the processing time and reduce the processing cost required when an inference about stuffs in a space is made by a calculating machine.
Hereinafter, an inference-use knowledge generation apparatus, an inference-use knowledge generation method, and a program in an example embodiment of the invention will be described with reference to
First, a schematic configuration of an inference-use knowledge generation apparatus in this example embodiment will be described with reference to
An inference-use knowledge generation apparatus 10 shown in
The data extraction unit 11 extracts, from a first data set including data regarding stuffs in a predetermined space, data corresponding to a designated position or region based on a set parameter.
Also, first, the knowledge generation unit 12 specifies, from a second data set that includes a plurality of entities that form a space and have been grouped into groups of related entities, a group of entities described by words included in the data extracted by the data extraction unit 11. Next, the knowledge generation unit 12 generates inference-use knowledge indicating a spatial relationship between entities based on the specified group and a term expressing a preregistered spatial relationship.
In this manner, if a data set regarding stuffs in a predetermined space and a data set including a plurality of entities forming a space are prepared, the inference-use knowledge generation apparatus 10 in this example embodiment can generate inference-use knowledge in advance. Thus, according to this example embodiment, it is possible to shorten the processing time and reduce the processing cost required to generate knowledge required when an inference about stuffs in a space is made by a calculating machine.
Next, a specific configuration of the inference-use knowledge generation apparatus in this example embodiment will be described with reference to
As shown in
Furthermore, in this example embodiment, the inference-use knowledge generation apparatus 10 is connected to a spatial data storage unit 21, an entity storage unit 22, a geographical case knowledge storage unit 23, an extraction parameter storage unit 24, and a spatial relationship term storage unit 25. In addition, the spatial data storage unit 21, the entity storage unit 22, the geographical case knowledge storage unit 23, the extraction parameter storage unit 24, and the spatial relationship term storage unit 25 are each constructed by a storage device of a computer that is external to the inference-use knowledge generation apparatus 10. Note that the storage units may be constructed by a storage device of a computer that is included in the inference-use knowledge generation apparatus 10.
The spatial data storage unit 21 stores a first data set including data (referred to as “spatial data” hereinafter) regarding stuffs in a predetermined space. A specific example of spatial data is electronic map data.
The entity storage unit 22 stores a second data set. As described above, the second data set is a collection of multiple groups of related entities. Specifically, for example, a group may be formed by two related entities (a pair of entities), and in this case, the second data set includes a plurality of pairs of entities.
Also, examples of a pair of entities include combinations of terms whose collocation frequency is greater than or equal to a certain level in past blog articles, past news articles, and the history of queries and the like used in past inferences. In a group of three or more entities, the group includes a combination of three or more terms whose collocation frequency is greater than or equal to a certain level, for example. Examples of terms include terms regarding a geographical space, such as stations, airports, prefectures, municipalities, buildings, stadiums, and landmarks.
The geographical case knowledge storage unit 23 stores case knowledge regarding a predetermined geographical space (e.g., municipalities, prefectures, and districts). Examples of case knowledge include “City A and City B have a contract on support for fire fighting” and “City A and City B have a contract to share supplies at the time of a disaster”.
The extraction parameter storage unit 24 stores parameters used in data extraction performed by the data extraction unit 11. Parameters are used to specify data to be extracted, and a specific example thereof is “<20 km from center of (input place name)” (indicating a range of less than 20 km from the center).
The spatial relationship term storage unit 25 stores spatial relationship terms. A spatial relationship term is a term indicating a spatial relationship using a predicate-argument structure. Specific examples of a spatial relationship term will be described later with reference to
The input acceptance unit 15 accepts a query input from the outside, specifically, accepts text data indicating a designated position or region and transmits the accepted query to the data extraction unit 11. In this example embodiment, the data extraction unit 11 first acquires a parameter from the extraction parameter storage unit 24. Next, the data extraction unit 11 compares the acquired query and parameter with spatial data stored in the spatial data storage unit 21, and extracts spatial data corresponding to the query and parameter.
For example, assume the query is “City A” and the parameter is “<20 km from center of (input place name)”. In this case, the data extraction unit 11 specifies the latitude and longitude of the center of City A, and extracts, as data, the names of places, the names of POIs (Points Of Interfaces), and the like located within a radius of 20 km from the specified latitude and longitude.
In this example embodiment, the knowledge generation unit 12 compares the spatial data extracted by the data extraction unit 11 with pairs of entities stored in the entity storage unit 22, and specifies a specific pair of entities described by words included in the extracted spatial data. For example, if the extracted data includes City A, and “City A, City A General Hospital” exists as a pair of entities, the knowledge generation unit 12 specifies this pair of entities.
Also, the knowledge generation unit 12 applies the specified pair of entities to a spatial relationship term stored in the spatial relationship term storage unit 25, and generates a predicate-argument structure in which the two entities forming the specified pair of entities are used as terms. This generated predicate-argument structure serves as inference-use knowledge. Also, in this example embodiment, the knowledge generation unit 12 outputs the generated inference-use knowledge to the inference-use knowledge storage unit 14 and causes the inference-use knowledge storage unit 14 to store the generated inference-use knowledge.
Herein, processing for creating inference-use knowledge using a spatial relationship term will be specifically described with reference to
As shown in the left end column in
Thus, in this example embodiment, the knowledge generation unit 12 first specifies the attribute of each of the entities forming the specified pair of entities, and extracts, from the spatial relationship terms stored in the spatial relationship term storage unit 25, a spatial relationship term corresponding to the entities having the specified attributes. The knowledge generation unit 12 then applies the specified pair of entities to the extracted spatial relationship term, and generates, as inference-use knowledge, a predicate-argument structure shown in the center column in
Also, the knowledge generation unit 12 includes a case knowledge extraction unit 13 in this example embodiment. The case knowledge extraction unit 13 extracts, from case knowledge stored in the geographical case knowledge storage unit 23, case knowledge at/in a designated position or region, and stores the extracted case knowledge in the inference-use knowledge storage unit 14 in association with the generated inference-use knowledge.
Next, operations of the inference-use knowledge generation apparatus according to an example embodiment of the invention will be described with reference to
As shown in
Next, the data extraction unit 11 compares the parameter accepted in step A1 and the parameter acquired from the extraction parameter storage unit 24 with spatial data stored in the spatial data storage unit 21, and extracts spatial data corresponding to the query and the parameters (step A2).
Next, the knowledge generation unit 12 compares the spatial data extracted in step A2 with the pairs of entities stored in the entity storage unit 22, and specifies a specific pair of entities described by the words included in the extracted spatial data (step A3).
Next, the knowledge generation unit 12 applies the pair of entities specified in step A3 to a spatial relationship term stored in the spatial relationship term storage unit 25, generates a predicate-argument structure in which the two entities forming this pair of entities are used as terms, and uses this generated predicate-argument structure as inference-use knowledge (step A4).
Next, in the knowledge generation unit 12, the case knowledge extraction unit 13 extracts, from the case knowledge stored in the geographical case knowledge storage unit 23, case knowledge in the query accepted in step A1 (step A5).
Then, the case knowledge extraction unit 13 stores, in the inference-use knowledge storage unit 14, the case knowledge extracted in step A5 in association with the inference-use knowledge generated in step A4 (step A6).
In this manner, when steps A1 to A6 are executed, inference-use knowledge is generated, and thus when an inference about stuffs in a space is made by the calculating machine, it is not necessary to derive a spatial relationship on demand when an inference is made, and the processing time can be shortened and the processing cost can be reduced. Also, in this example embodiment, the generated inference-use knowledge includes a predicate-argument structure, and thus can be directly applied to an inference.
Next, a specific example will be described. It is assumed that “Kawasaki City” is first input as a query, for example. Also, it is assumed that the spatial data storage unit 21 stores electronic map data, and the extraction parameter storage unit 24 stores “<20 km from center of (input place name)”.
In this case, the data extraction unit 11 extracts, from electronic map data, names of places or POIs located within a radius of 20 km from the center of Kawasaki City, such as Yokohama City, Sagamihara City, Ota Ward, Setagaya Ward, Shinagawa Ward, Komae City, Chofu City, Kawasaki Station, and Yokohama Station.
Also, it is assumed that the knowledge generation unit 12 specifies, as pairs of entities, (Kawasaki Station, Yokohama Station), (Kawasaki Station, Ota General Hospital), (Kawasaki City, Yokohama City), (Kawasaki City, Ota Ward), and the like, for example. In this case, the knowledge generation unit 12 creates, as inference-use knowledge, “timeDistance (Station L, Station M, drive, 6, hours)”, “nearest (Kawasaki City, Ota General Hospital, hospital)”, “adjoining (Kawasaki City, Yokohama City)”, “adjoining (Kawasaki City, Ota Ward)”, and the like using the spatial relationship terms shown in
Also, in this case, the case knowledge extraction unit 13 extracts, as case knowledge, “hasContract (Kawasaki City, Yokohama City, fire fighting support)”, “hasContract (Kawasaki City, Yokohama City, share supplies at time of disaster)”, and the like, and associates the case knowledge with the above-described inference-use knowledge. Also, the created inference-use knowledge and the extracted case knowledge are stored in the inference-use knowledge storage unit 14.
The fact that “Kawasaki City”, which is a query, has made an agreement about fire fighting support at the time of a fire and sharing of supplies at the time of a disaster with “Yokohama City” in advance is held as knowledge through the above-described processing. Thus, if Kawasaki City urgently seeks support of fire fighting, for example, the fact that Yokohama City is a neighboring city of Kawasaki City and has a fire fighting support contract with Kawasaki City is specified by referencing knowledge in an inference.
A program in this example embodiment may be a program for causing a computer to carry out steps A1 to A6 shown in
Also, the program in this example embodiment may be executed by a computer system constructed by a plurality of computers. In this case, each of the computers may function as the data extraction unit 11 or the knowledge generation unit 12, for example. Also, the inference-use knowledge storage unit 14 may be constructed on a computer other than the computer that executes the program in this example embodiment.
Here, a computer configured to realize the inference-use knowledge generation apparatus 10 by executing the program in this example embodiment will be described with reference to
As shown in
The CPU 111 loads the programs (code) stored in the storage device 113 in this example embodiment to the main memory 112, executes these programs in a predetermined order, and thereby implements various calculations. Typically, the main memory 112 is a volatile storage device such as a DRAM (Dynamic Random Access Memory). Also, a program in this example embodiment is provided in a state of being stored in a computer-readable recording medium 120. Note that the program in this example embodiment may be distributed on the Internet connected via the communication interface 117.
Also, specific examples of the storage device 113 include a semiconductor storage device such as a flash memory, as well as a hard disk drive. The input interface 114 mediates data transmission between the CPU 111 and input devices 118 such as a keyboard and a mouse. The display controller 115 is connected to a display device 119, and controls the display on the display device 119.
The data reader/writer 116 mediates data transmission between the CPU 111 and the recording medium 120, reads out a program from the recording medium 120, and writes the results of processing by the computer 110 to the recording medium 120. The communication interface 117 mediates data transmission between the CPU 111 and another computer.
Also, specific examples of the recording medium 120 include a general-purpose semiconductor storage device such as a CF (Compact Flash (registered trademark)) and an SD (Secure Digital), a magnetic recording medium such as a Flexible Disk, and an optical recording medium such as a CD-ROM (Compact Disk Read Only Memory).
Note that the inference-use knowledge generation apparatus 10 in this example embodiment can be realized by not only a computer on which programs are installed but also hardware corresponding to each unit. Furthermore, a portion of the inference-use knowledge generation apparatus 10 may be realized by a program and the remaining portion thereof may be realized by hardware.
Part or all of the above-described example embodiments can be expressed by Supplementary Notes 1 to 12 below, but are not limited thereto.
An inference-use knowledge generation apparatus for generating inference-use knowledge that is to be used in an inference that is made by a calculating machine, the apparatus including:
a data extraction unit configured to extract, based on a set parameter, data corresponding to a designated position or region from a first data set including data regarding a stuff in a predetermined space; and
a knowledge generation unit configured to specify, from a second data set that includes a plurality of entities that form the space and have been grouped into groups of related entities, a group of entities described by words included in the extracted data, and to generate the inference-use knowledge that indicates a spatial relationship between the entities based on the specified group and a term expressing a preregistered spatial relationship.
The inference-use knowledge generation apparatus according to Supplementary Note 1,
in which the plurality of entities are grouped into groups of two related entities in the second data set, and
the knowledge generation unit is configured to generate, as the inference-use knowledge, a predicate-argument structure in which the two entities forming the specified group are used as terms.
The inference-use knowledge generation apparatus according to Supplementary Note 1 or 2, further including
an inference-use knowledge storage unit configured to store the generated inference-use knowledge.
The inference-use knowledge generation apparatus according to Supplementary Note 3,
in which the knowledge generation unit is configured to extract, from case knowledge regarding the space, case knowledge at/in the designated position or region, and store the extracted case knowledge in the inference-use knowledge storage unit in association with the generated inference-use knowledge.
(Supplementary Note 5)
An inference-use knowledge generation method for generating inference-use knowledge that is to be used in an inference that is made by a calculating machine, the method including:
(a) a step of extracting, based on a set parameter, data corresponding to a designated position or region from a first data set including data regarding a stuff in a predetermined space, and
(b) a step of specifying, from a second data set that includes a plurality of entities that form the space and have been grouped into groups of related entities, a group of entities described by words included in the extracted data, and generating the inference-use knowledge that indicates a spatial relationship between the entities based on the specified group and a term expressing a preregistered spatial relationship.
The inference-use knowledge generation method according to Supplementary Note 5,
in which the plurality of entities are grouped into groups of two related entities in the second data set, and
in the (b) step, a predicate-argument structure in which the two entities forming the specified group are used as terms is generated as the inference-use knowledge.
The inference-use knowledge generation method according to Supplementary Note 5 or 6, further including
(c) a step of storing the generated inference-use knowledge.
The inference-use knowledge generation method according to Supplementary Note 7, further including:
(d) a step of extracting, from case knowledge regarding the space, case knowledge at/in the designated position or region,
in which in the (c) step, the extracted case knowledge is stored in association with the generated inference-use knowledge.
A non-transitory computer readable recording medium that includes a program recorded thereon for, with use of a computer, generating inference-use knowledge that is to be used in an inference that is made by a calculating machine, the program including instructions that cause the computer to carry out the steps of:
(a) a step of extracting, based on a set parameter, data corresponding to a designated position or region from a first data set including data regarding a stuff in a predetermined space, and
(b) a step of specifying, from a second data set that includes a plurality of entities that form the space and have been grouped into groups of related entities, a group of entities described by words included in the extracted data, and generating the inference-use knowledge that indicates a spatial relationship between the entities based on the specified group and a term expressing a preregistered spatial relationship.
The non-transitory computer readable recording medium according to Supplementary Note 9,
in which the plurality of entities are grouped into groups of two related entities in the second data set, and
in the (b) step, a predicate-argument structure in which the two entities forming the specified group are used as terms is generated as the inference-use knowledge.
The non-transitory computer readable recording medium according to Supplementary Note 9 or 10, the program including instructions that cause the computer to further carry out the step of:
(c) a step of storing the generated inference-use knowledge.
The non-transitory computer readable recording medium according to Supplementary Note 11, the program including instructions that cause the computer to further carry out the step of:
(d) a step of extracting, from case knowledge regarding the space, case knowledge at/in the designated position or region,
in which in the (c) step, the extracted case knowledge is stored in association with the generated inference-use knowledge.
Although the invention of this application has been described with reference to an example embodiment, the invention is not limited to the above-described example embodiment. Various modifications that can be understood by those skilled in the art can be made to the configuration and details of the invention within the scope of the invention.
This application is based upon and claims the benefit of priority from Japanese application No. 2017-023409, filed on Feb. 10, 2017, the disclosure of which is incorporated herein in its entirety by reference.
As described above, according to the invention, it is possible to shorten the processing time and reduce the processing cost required when an inference about stuffs in a space is made by a calculating machine. The invention is useful for a system in which an inference about stuffs in a space is made by a calculating machine, for example, a system aimed at capturing movements of people and stuffs, for store opening plans, crime investigations, evacuation plans and instructions at the time of a disaster, environment management, and the like.
Number | Date | Country | Kind |
---|---|---|---|
2017-023409 | Feb 2017 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2018/003337 | 2/1/2018 | WO | 00 |