The present invention relates to a technique for forecasting a risk that may occur.
Exemplary techniques for forecasting risks such as crimes or diseases have been disclosed in, for example, the following patent literatures and non-patent literature.
Patent Document 1 discloses a technique of causing a server to mathematically analyze past crime data, compute the quantitative probability (that is, a forecast) as to where, when and what type of crime will occur, project the forecast onto a target region called box, and propose a police resource deployment plan based on the mathematical analysis. Patent Documents 2 and 3 disclose other techniques of mathematically analyzing data of crimes that occurred in the past, and forecasting and providing a risk in a target region.
Patent Document 4 discloses a technique of providing information useful in replanning the layout of surveillance cameras by determining a surveillance camera having a low frequency of display, based on the frequencies of display and the degrees of increase in display frequency of the surveillance cameras.
Non-Patent Document 1 discloses a technique of analyzing the phenomenon of near repeat victimization for the occurrence of crimes by computing a statistic called a spatio-temporal K function from crime occurrence history data in a certain area. The near repeat victimization for the occurrence of crimes means that in a place near where a certain crime has occurred, another crime repeatedly occurs over a short period of time. Temporally and spatially analyzing the degree of accumulation of occurrences yields information as to the presence or absence of such near repeat victimization, and the spatio-temporal K function is used for this analysis.
In the techniques, as described above, for forecasting a risk that may occur, it is desirable that the forecasting result and an actual observation result (risk occurrence result) agree with each other at a high probability.
The present invention has been made in consideration of the above-described problem. It is one object of the present invention to provide a technique capable of accurately forecasting a risk that may occur.
The present invention provides an information processing apparatus including:
a data division unit that divides risk occurrence history data of a target region into training data used for computing a risk value for each of combinations of a distribution function spatially and temporally representing a risk distribution in the target region, a spatial parameter of the distribution function, and a temporal parameter of the distribution function, and evaluation-value computation data used for evaluating a combination of the distribution function, the spatial parameter, and the temporal parameter;
a selection unit that selects one combination from among combinations of the distribution function, the spatial parameter, and the temporal parameter, based on an evaluation value for each of the combinations computed based on a risk value for each of the combinations based on the training data and the evaluation-value computation data; and
an output unit that outputs a risk forecasting result of the target region, by using the one combination selected by the selection unit.
The present invention provides an information processing apparatus including:
a cell division unit that divides a target region into a plurality of cells;
a generation unit that generates a plurality of combinations of a distribution function spatially and temporally representing a risk distribution in the target region, a spatial parameter of the distribution function, and a temporal parameter of the distribution function;
a selection unit that computes an evaluation value for each of the combinations, by using risk occurrence history data for each of the cells from among risk occurrence history data of the target region, and selecting one combination from among the plurality of combinations, based on the evaluation value for the each combination; and
an output unit that outputs a risk forecasting result of the target region, by using the one combination selected by the selection unit.
The present invention provides a first risk forecasting method executed by a computer, the method including:
dividing risk occurrence history data of a target region into training data used for computing a risk value for each of combinations of a distribution function spatially and temporally representing a risk distribution in the target region, a spatial parameter of the distribution function, and a temporal parameter of the distribution function, and evaluation-value computation data used for evaluating a combination of the distribution function, the spatial parameter, and the temporal parameter;
selecting one combination from among combinations of the distribution function, the spatial parameter, and the temporal parameter, based on an evaluation value for each of the combinations computed based on a risk value for each of the combinations based on the training data and the evaluation-value computation data; and
outputting a risk forecasting result of the target region, by using the selected one combination.
The present invention provides a second risk forecasting method executed by a computer, the method including:
dividing a target region into a plurality of cells;
generating a plurality of combinations of a distribution function spatially and temporally representing a risk distribution in the target region, a spatial parameter of the distribution function, and a temporal parameter of the distribution function;
computing an evaluation value for each of the combinations, by using risk occurrence history data for each of the cells from among risk occurrence history data of the target region, and selecting one combination from among the plurality of combinations, based on the evaluation value for the each combination; and
outputting a risk forecasting result of the target region, by using the selected one combination.
The present invention provides a program for causing a computer to execute the first risk forecasting method.
The present invention provides a program for causing a computer to execute the second risk forecasting method.
The present invention provides a technique capable of accurately forecasting a risk that may occur.
The above and other objects, features, and advantages will be more apparent from the following description of preferred example embodiments and the following accompanying drawings.
Example embodiments of the present invention will be described below with reference to the drawings. It should be noted that in all the drawings, the same reference numerals denote the same components, and a description thereof will not be repeated as appropriate. Unless otherwise specified, in each block diagram, the blocks do not represent hardware-specific configurations, but represent function-specific configurations.
[Description of Overview]
An information processing apparatus according to the present invention uses history data of risks that occurred in the past (to be referred to as “risk occurrence history data” hereinafter) to determine an optimum combination among combinations of distribution functions spatially and temporally representing the distributions of the risks, and spatial parameters and temporal parameters used in the distribution functions. In this specification, the “optimum combination” means a combination exhibiting a risk forecasting accuracy rate higher than those of other combinations. The “risks” in this specification are not particularly limited, but they include, for example, crimes, diseases, infectious diseases (for example, influenza), disease injuries due, for example, to communicable diseases damaging livestock or crops, pests, and natural disasters such as earthquakes and typhoons. The case where “crimes” are handled among these “risks” will be mainly taken as an example in the following example embodiments.
{Functional Configuration}
The data division unit 110 divides risk occurrence history data of a target region into data (to be referred to as “training data” hereinafter) used to compute a risk value for each combination of a distribution function, a spatial parameter, and a temporal parameter, and data (to be referred to as evaluation-value computation data) hereinafter) used to evaluate this combination. The risk value means herein a value representing whether the probability that a risk will occur is high or low, and can take values in an arbitrary range. The selection unit 120 computes an evaluation value for each combination, based on the risk value for each combination based on the training data and the evaluation-value computation data. The selection unit 120 selects one optimum combination from the plurality of combinations of the distribution functions, the spatial parameters, and the temporal parameters, based on the computed evaluation value for each combination. The output unit 130 forecasts a risk in the target region, using the combination selected by the selection unit 120, and outputs the forecasting result.
{Hardware Configuration}
Each functional configuration unit of the information processing apparatus 10 may be implemented as hardware (for example, a hard-wired electronic circuit) for implementing this functional configuration unit, or may be implemented as a combination of hardware and software (for example, a combination of an electronic circuit and a program for controlling it). The case where each functional configuration unit of the information processing apparatus 10 is implemented as a combination of hardware and software will further be described below.
The bus 101 serves as a data transmission line for allowing the processor 102, the memory 103, the storage device 104, the input/output interface 105, and the network interface 106 to exchange data with each other. The method for connecting, for example, the processor 102, the memory 103, the storage device 104, the input/output interface 105, and the network interface 106 to each other, however, is not limited to bus connection.
The processor 102 serves as an arithmetic unit such as a central processing unit (CPU) or a graphics processing unit (GPU). The memory 103 serves as a main storage implemented using, for example, a random access memory (RAM) or a read only memory (ROM). The storage device 104 serves as an auxiliary storage implemented using, for example, a hard disk drive (HDD), a solid state drive (SSD), or a memory card.
The storage device 104 stores a program module for implementing each functional configuration unit (the data division unit 110, the selection unit 120, and the output unit 130) of the information processing apparatus 10. The processor 102 implements a function corresponding to each program module by reading the program module into the memory 103 and executing it.
The input/output interface 105 is used to connect the information processing apparatus 10 to peripheral equipment. An input device 30 and a display device 40, for example, are connected to the input/output interface 105. The input device 30 serves as a device for input, such as a keyboard or a mouse. The display device 40 serves as a device for display output, such as a liquid crystal display (LCD) or a cathode ray tube (CRT) display.
The network interface 106 is used to connect the information processing apparatus 10 to a communication network such as a local area network (LAN) or a wide area network (WAN). It should be noted that the method for connection to the communication network may be either wireless or wired connection. An external device 20 including a history data storage unit 210 that stores risk occurrence history data, for example, may be connected to the network interface 106. The history data storage unit 210 may even be provided in the information processing apparatus 10. The history data storage unit 210 accumulates data representing the occurrence history of risks (see, for example,
{Operation Example}
An operation example of the information processing apparatus 10 according to the first example embodiment will be described below with reference to
The data division unit 110, for example, receives, via the input device 30, input for specifying a target region by the operator of the information processing apparatus 10 (S102). Data of the target region specified in the process of step S102 is extracted from the risk occurrence history data stored in the history data storage unit 210 (S104). It should be noted that the data division unit 110 may further receive input for specifying a period and extract data in the specified period as a target. The data division unit 110 divides the extracted data into training data and evaluation-value computation data (S106). As an example, the data division unit 110 can divide the extracted data into training data and evaluation-value computation data in the following way. The data division unit 110 first sets a sample time instant in the specified period. The data division unit 110 then determines a point of time earlier than the sample time instant, based on a temporal parameter, and sets, as training data, data included in a period defined between the sample time instant and the earlier point of time. The data division unit 110 further sets, as evaluation-value computation data, data included in a predetermined evaluation period after the sample time instant. It should be noted that the data division unit 110 may set a plurality of sample time instants in the specified period and set training data and evaluation-value computation data for each of the plurality of sample time instants. Setting a plurality of sample time instants generates a plurality of combinations of training data and evaluation-value computation data. Computing evaluation values using the plurality of combinations improves the reliability of the evaluation values.
The selection unit 120 computes a risk value using the training data, for each combination of a distribution function, a spatial parameter, and a temporal parameter (S108). It should be noted that a plurality of combinations of distribution functions, spatial parameters, and temporal parameters may be stored in a predetermined storage (for example, the memory 103 or the storage device 104) in advance. The selection unit 120 may even generate a plurality of combinations of distribution functions, spatial parameters, and temporal parameters in accordance with a predetermined rule. The selection unit 120 computes an evaluation value for each combination, based on the risk value for each combination computed for each combination using the training data and the evaluation-value computation data (S110). As an example, the selection unit 120 can compute a numerical value indicating the degree of association of the risk value computed for each combination with an actual risk occurrence count, based on the risk value for each combination computed using the training data for each sample time instant, and the risk occurrence count in the evaluation period corresponding to each sample time instant (the number of pieces of evaluation-value computation data for each sample time instant). The selection unit 120 selects a combination exhibiting a highest evaluation value, based on the computed evaluation value for each combination (S112).
The output unit 130 computes a risk distribution at a future point of time using a combination of a distribution function, a spatial parameter, and a temporal parameter, selected for the target region, and outputs it to the display device 40 or the like as a forecasting result (S114). The output unit 130 outputs, for example, a map representing the forecasting result of the risk distribution to the display device 40 or the like. The output unit 130 may even output a map representing the forecasting result of the risk distribution to a printing device (not illustrated). In this case, the map representing the forecasting result of the risk distribution is output from the printing device (not illustrated).
It should be noted that the processes in step S112, in which an optimum combination of a distribution function and a set of parameters involved is selected, and the preceding steps, and the process in step S114 in which a risk is forecasted using the selected combination need not always be performed successively.
As described above, in this example embodiment, history data of risks that occurred in the past are used to evaluate, for each combination of a distribution function and parameters of the distribution function, whether the forecasting accuracy rate of the combination for the risks is high. A most highly evaluated combination (that is, a combination exhibiting a high accuracy rate in risk forecasting) is selected from the plurality of combinations. Forecasting risks in a target region using the thus selected combination makes it possible to accurately forecast a risk that may occur in the target region. With the increased accuracy in forecasting, furthermore, a person engaged in risk management can easily devise an effective measure.
{Functional Configuration}
The cell division unit 140 receives input of information for specifying a target region and divides the target region into a plurality of subareas (to be referred to as “cells” hereinafter). The generation unit 150 generates a plurality of combinations of distribution functions spatially and temporally representing the risk distributions in the target region, spatial parameters of the distribution functions, and temporal parameters of the distribution functions. The selection unit 160 computes an evaluation value for each combination of the distribution function, the spatial parameter, and the temporal parameter, generated by the generation unit 150, using risk occurrence history data for each cell among risk occurrence history data of the target region. The selection unit 160 selects one combination from the plurality of combinations of the distribution functions, the spatial parameters, and the temporal parameters, based on the computed evaluation value for each combination. More specifically, the selection unit 160 selects a combination exhibiting a highest evaluation value. The output unit 170 outputs a risk forecasting result of the target region, using the combination of the distribution function, the spatial parameter, and the temporal parameter, selected by the selection unit 160, similarly to the first example embodiment.
{Hardware Configuration}
The hardware configuration according to this example embodiment is similar to that (see, for example,
{Operation Example}
An operation example of the information processing apparatus 10 according to the second example embodiment will be described below with reference to
The information processing apparatus 10 first receives input of conditions for selecting an optimum combination of a distribution function and a set of parameters involved (S202). As an example, the information processing apparatus 10 receives input for specifying a target region and a training period (a period to which data used in evaluation for each combination belong). The information processing apparatus 10 may further include a reception unit (not illustrated) that receives a crime type (for example, the classification of a crime, the sex/age of a crime victim, or a combination of them) as one of the above-mentioned conditions. In addition, the information processing apparatus 10 acquires a risk distribution function. The risk distribution function has been stored in, for example, the memory 103, the storage device 104, or an external storage (not illustrated).
The risk distribution function can be defined herein using, for example, the following equation (1):
In the equation (1) above, R(g, k) denotes the “risk value of the cell g at the time instant tk”. In the equation (1) above, hs is the space bandwidth (spatial parameter) and ht is the time bandwidth (temporal parameter). In the equation (1) above, Ik is the number of pieces of crime occurrence history data used to compute the risk value. i is the label number assigned to each piece of crime occurrence history data used to compute the risk value. In the equation (1) above, Ks and Kt describe the shapes of kernel functions for determining the spatial and temporal spreads, respectively, in the distribution function. The kernel functions to be set for Ks and Kt can be selected from kernel functions having various shapes as illustrated in, for example,
The following equation (2) gives a specific example in which the above-mentioned equation (1) is combined with kernel functions. It should be noted that in the following equation (2), xg and yg are the position coordinates of the cell g (for example, the position coordinates of the central point of the cell) in a space defined by x- and y-axes orthogonal to each other; xi and yi are the position coordinates of a crime contained in the ith labeled crime occurrence history data in the space defined by the orthogonal x- and y-axes; and ti is the date and time of the occurrence of the crime contained in the ith labeled crime occurrence history data.
The above-mentioned equation (2) reveals that the smaller the distance between the position coordinates (xg, yg) of the cell g and the position coordinates (xi, yi) of the ith labeled crime occurrence history data, the higher the risk value of the cell g, while the larger this distance, the lower the risk value of the cell g. The above-mentioned equation (2) also reveals that the closer the time instant tk and the date and time ti of the occurrence of the ith labeled crime occurrence history data are to each other, the higher the risk value of the cell g, while the farther the time instant tk and the date and time ti of the occurrence of the ith labeled crime occurrence history data are from each other, the lower the risk value of the cell g. A risk distribution is obtained for the target region by computing risk values for all the cells using an equation as illustrated above.
The cell division unit 140 divides the specified target region into a plurality of cells (S204). The cell division unit 140 can freely set the shapes and sizes of the cells, based on a predetermined rule or input from the operator of the information processing apparatus 10. As an example, the cell division unit 140 can, upon defining as Δs the length of a short side of a quadrangle enclosing the target region, set, as a unit cell, a square having 1/100 of this Δs as the length of its one side. The cell division unit 140 divides the target region by determining the position, in the target region, of each unit cell without any overlap between the unit cells, and assigning a label g (information for distinguishing the cells from each other) to each unit cell.
The generation unit 150 generates a plurality of combinations of distribution functions and sets of parameters involved (S206). The generation unit 150 can generate a plurality of combinations of distribution functions and sets of parameters involved in, for example, the following way.
The generation unit 150 first sets a plurality of sample time instants tk (k=1, 2, 3, . . . , K) in the specified period. The number K of sample time instants may be automatically determined by the generation unit 150, or may be freely set by input of the operator. When, as a specific example, the period from January 1st, 2000 00:00 to Dec. 31, 2000 23:59 is specified, the generation unit 150 can set the sample time instant tk every four days, which is obtained by rounding off 1/100 of this period (366 days) to the nearest integer. In this case, the sample time instant tk is “t1=Jan. 1, 2000 00:00, t2=Jan. 5, 2000 00:00, . . . , tK=Dec. 30, 2000 00:00,” and the number K of sample time instants is 92.
The generation unit 150 then determines a period (evaluation period Δt) for computing the criminal event count for each sample time instant. For example, the generation unit 150 first determines crime occurrence data for the crime type and the target region specified in the process of step S202, based on the crime type and the location information of the crime occurrence history data stored in the history data storage unit 210. The generation unit 150 can then set, as the evaluation period Δt, the average of occurrence intervals computed based on the date and time of the occurrence of the determined crime occurrence data. More specifically, when a crime of the specified type occurs every three days on average in the target region, the generation unit 150 can set Δt to three days. It should be noted that the evaluation period may even take a value that varies for each sample time instant.
The generation unit 150 can set, for example, as the spatial parameter hs a constant multiple (for example, 1, 5, or 10 times) of the length Δs of one side of the unit cell set by the cell division unit 140, and as the temporal parameter ht a constant multiple (for example, 5, 10, or 100 times) of the evaluation period Δt. The generation unit 150 sets the spatial parameter hs and the temporal parameter ht for each of a plurality of distribution functions stored in, for example, the memory 103, the storage device 104, or another storage (not illustrated) in advance, and generates a table as illustrated in, for example,
The generation unit 150 can even generate a combination of a distribution function and a set of parameters involved, based on the technique disclosed in Non-Patent Document 1. Non-Patent Document 1 discloses a technique for analyzing the phenomenon of near repeat victimization for the occurrence of crimes by computing a statistic called a spatio-temporal K function from crime occurrence history data in a certain area. The near repeat victimization for the occurrence of crimes means that when a crime occurs in a certain place, another crime repeatedly occurs in a place near the former place over a short period of time. Temporally and spatially analyzing the degree of accumulation of crimes that have occurred yields information as to the presence or absence of such near repeat victimization. In Non-Patent Document 1, the spatio-temporal K function is used for this analysis. The value (to be referred to as “D0” hereinafter) obtained by computing the spatio-temporal K function in Non-Patent Document 1 from the crime occurrence history data represents the degree and range in which crimes that have occurred accumulate temporally and spatially. In other words, D0 represents the temporal and spatial distribution of crime occurrences. The generation unit 150 can use this D0 as a risk distribution function. It should be noted that in Non-Patent Document 1, D0 is computed by specifying the “distance zone and distance range from the occurrence place” as the spatial parameter, and the “time span and time range from the date and time of occurrence” as the temporal parameter. The generation unit 150 can generate a combination of a distribution function and a set of parameters involved by setting, for example, the “length Δs of one side of the unit cell” as the “distance zone,” the “length of a short side of the target region” as the “distance range,” the above-mentioned “evaluation period Δt” as the “time span,”, “one year” as the “time range,”, and the like and computing D0 by the method disclosed in Non-Patent Document 1.
The selection unit 160 selects one combination from a plurality of combinations of distribution functions, spatial parameters, and temporal parameters, stored in a table as illustrated in
[Math 3]
t
k
−h
t≤Date and Time of Occurrence<tk
and
√{square root over ((xg−xi)2+(yg−yi)2)}≤hs (3)
The above-mentioned sequence will be described below with reference to
The selection unit 160 computes risk values for all the cells, for the respective sample time instants t1, t2, . . . , tK, using the combination of the distribution function, the spatial parameter, and the temporal parameter selected in the process of step S208, and the Ik pieces of training data respectively extracted for the sample time instants t1, t2, . . . , tK. The case where the combination in the first row of the table depicted in
[Math 4]
NR
g
k
=R(g,k)×Δs2Δt (4)
The selection unit 160 extracts crime occurrence history data (to be also referred to as “evaluation-value computation data” hereinafter) corresponding to crimes that have occurred in the evaluation period Δt for each of the sample time instants t1, t2, . . . , tK, from the history data storage unit 210 as evaluation-value computation data, and determines the number of pieces of evaluation-value computation data (S210). More specifically, the selection unit 160 extracts, as evaluation-value computation data, crime occurrence history data satisfying “tk≤Date and Time of Occurrence<tk+Δt” from the crime occurrence history data of the target region stored in the history data storage unit 210. The selection unit 160 computes the criminal event count for each cell for the sample time instant tk by computing the total number of pieces of evaluation-value computation data for each cell, based on the location information of the extracted evaluation-value computation data. The criminal event count for the cell g for the sample time instant tk is mathematically given by the following expression:
[Math 5]
Nevalgk (5)
The selection unit 160 computes an evaluation value for each combination, based on the risk value for each cell in each combination, computed as the above-mentioned equation (4), and the criminal event count for each cell for the sample time instant tk computed as the above-mentioned expression (5) (S212).
As an example, the selection unit 160 can compute a coefficient of correlation CORR(hs, ht) using the following equation (6):
where the pairs of marks < > denote the expected values for all the sample time instants tk and in all the cells distinguished by the labels g. The portions expressed using the pairs of marks < > can be substituted as, for example, the following equation (7):
The coefficient of correlation CORR(hs, ht) represents the strength of association between the risk value computed using the combination of the distribution function, the spatial parameter, and the temporal parameter, and the criminal event count. The closer the absolute value of the coefficient of correlation CORR(hs, ht) comes to one, the higher the strength of association between these numerical values. When, for example, the coefficient of correlation CORR(hs, ht) takes a positive value close to one, a crime can be estimated to occur at a higher probability in a cell exhibiting a higher risk value computed by the selected combination of the distribution function, the spatial parameter, and the temporal parameter.
As another example, the selection unit 160 may compute a sum of risk value relative rank as an index different from the coefficient of correlation. The selection unit 160 can compute the sum of risk value relative rank in, for example, the following way. The selection unit 160 first ranks each cell, based on the risk value for each cell computed using the combination of the distribution function, the spatial parameter, and the temporal parameter, and training data satisfying the conditions presented in the above-mentioned set of inequalities (3) for a certain sample time instant. The selection unit 160, for example, ranks the cells in ascending order as first, second, . . . from cells exhibiting higher computed risk values. The selection unit 160 determines a cell corresponding to each piece of evaluation-value computation data (that is, a cell in which a crime indicated by this piece of evaluation-value computation data has occurred), based on the location information of this piece of evaluation-value computation data, and adds a value that depends on the rank of the determined cell to the sum of risk value relative rank. The selection unit 160 computes the sum of risk value relative rank by repeating the above-mentioned processes for all the sample time instants (t1, t2, t3, . . . , tK). The sum of risk value relative rank may be given by, for example, the following expression (8):
The case where a result as illustrated in
Referring back to
The selection unit 160 determines whether evaluation values have been computed for all combinations (S216). The selection unit 160 can determine whether evaluation values have been computed for all combinations in accordance with, for example, whether the Evaluation Value column of the table illustrated in
It should be noted that, although not explicitly illustrated in the table of
The output unit 170 receives input of conditions regarding forecasting (for example, a crime type, a target region, a date and time of forecasting, and a forecasting period). Upon the input of the conditions regarding forecasting, the output unit 170 computes a risk distribution at a future point of time using an optimum combination of a distribution function, a spatial parameter, and a temporal parameter, selected for the conditions, and outputs it to the display device 40 or the like as a forecasting result (S220). The output unit 170 outputs, for example, a map representing the forecasting result of the risk distribution to the display device 40 or the like. The output unit 170 may even output a map representing the forecasting result of the risk distribution to a printing device (not illustrated). In this case, the map representing the forecasting result of the risk distribution is output from the printing device (not illustrated).
The details of the process in step S220 will be described below. The output unit 170 first looks up a table as illustrated in
[Math 9]
t
p
−h
t≤Date and Time of Ocurrence<tp
and
√{square root over ((xg−xi)2+(yg−yi)2)}≤hs (9)
The output unit 170 computes a risk value for each cell on the date and time tp, using the Ip pieces of data and the combination of the distribution function, the spatial parameter, and the temporal parameter, read for the input conditions. When, for example, the combination in the first row of
The output unit 170 outputs, as the forecasting result of the criminal event count, the following product of the risk value R(g, p) multiplied by the cell area Δs2 of the target region and the forecasting period Δt′:
[Math 11]
NR
g
p
=R(g,p)×Δs2Δt (11)
It should be noted that the processes in step S218, in which an optimum combination of a distribution function and a set of parameters involved is selected, and the preceding steps, and the process in step S220 in which a risk is forecasted using the selected combination need not always be performed successively.
As described above, in this example embodiment, a risk forecasting result is output using an optimum combination conforming to the input conditions (for example, the type of risk and the target region). Even in this example embodiment, an effect similar to that of the first example embodiment can be produced.
This example embodiment has a configuration similar to that of the second example embodiment, except in the following respects.
{Functional Configuration}
The acquisition unit 180 acquires a cell coverage ratio. The cell coverage ratio means a value representing the ratio of cells to which personnel or moving bodies can be sent to a plurality of cells divided by the cell division unit 140. The “moving bodies” include herein manned moving bodies that move while carrying personnel, such as patrol vehicles, and unmanned moving bodies such as drones.
{Hardware Configuration}
The hardware configuration according to this example embodiment is similar to that (see, for example,
{Operation Example}
An operation example of the information processing apparatus 10 according to the third example embodiment will be described below with reference to
The acquisition unit 180 acquires a cell coverage ratio (S312). The acquisition unit 180 can, for example, display, on the display device 40, a screen for allowing an operator to input a cell coverage ratio, and acquire the cell coverage ratio based on information input by the operator. The acquisition unit 180 passes the acquired cell coverage ratio to the selection unit 160.
The selection unit 160 determines cells (to be referred to as “high-risk cells” hereinafter), to which personnel or moving bodies are to be sent, of all the cells in the target region, based on the cell coverage ratio acquired by the acquisition unit 180, and the risk value of each cell for each sample time instant computed using the combination selected in the process of step S308 (S314). Generally, cells exhibiting relatively high risk values are preferentially determined as the high-risk cells to be patrolled by sending personnel or moving bodies. When the cell coverage ratio is %, the selection unit 160 sorts the cells of the target region in descending order of risk value for each of the sample time instants t1, t2, t3, . . . , tK, determines cells (high-risk cells) corresponding to the top % for each sample time instant, and defines a set of these cells as Gk(β). G1(β), for example, is a set of high-risk cells for the sample time instant ti. Assume, as a specific example, that a certain target region is divided into 10,000 cells, and the cell coverage ratio acquired by the acquisition unit 180 is 1%. In this case, the selection unit 160 determines 100 cells as the high-risk cells in descending order of risk value R(g, k) for each of the sample time instants t1, t2, t3, . . . , tK, and generates a set Gk(β) of high-risk cells using the labels g of the determined cells. For the set Gk(β) of high-risk cells, therefore, Gk(β) for one sample time instant tk includes 100 cells. Gk(β) for all the sample time instants (t1, t2, t3, . . . , tK) includes ((Number K of Sample Time Instants)×100) cells.
The selection unit 160 computes an evaluation value for each combination, based on the criminal event count of all the cells and the criminal event count of the set Gk(β) of high-risk cells determined in the process of step S314, for each of the sample time instants t1, t2, t3, . . . , tK (S316). More specifically, the selection unit 160 computes an index (to be referred to as a “patrol coverage ratio” hereinafter) given by the following expression (12), as an evaluation value for each combination. Expression (12) exemplifies the case where the cell coverage ratio β is 1%. When, for example, the cell coverage ratio β is 10%, expression (12) takes a different value.
where the numerator of the division is the sum, for all the sample time instants (t1, t2, t3, . . . , tK), of the total criminal event counts of the high-risk cells for the sample time instants tk, determined in the process of step S314; and the denominator of the division is the sum, for all the sample time instants (t1, t2, t3, . . . , tK), of the total criminal event counts of all the cells for the sample time instants tk. In other words, the selection unit 160 can compute a patrol coverage ratio for each combination by dividing the sum, for all the sample time instants (t1, t2, t3, . . . , tK), of the numbers of criminal events that have occurred in the high-risk cells (that is, the cells to be patrolled) among all crimes that have occurred in evaluation periods Δt of certain sample time instants tk by the sum, for all the sample time instants (t1, t2, t3, . . . , tK), of the numbers of criminal events in all the cells that have occurred in the evaluation periods Δt of the certain sample time instants tk.
The selection unit 160 stores the patrol coverage ratio for each combination, computed using, for example, the above-mentioned expression (12), in a table (see, for example,
The selection unit 160 determines whether evaluation values have been computed for all combinations (S320). The selection unit 160 can determine whether evaluation values have been computed for all combinations in accordance with, for example, whether the Evaluation Value column of the table illustrated in
The output unit 170 receives input of conditions regarding forecasting (for example, a crime type, a target region, a date and time of forecasting, a forecasting period, and a cell coverage ratio). Upon the input of the conditions regarding forecasting, the output unit 170 computes a risk distribution at a future point of time using a combination of a distribution function, a spatial parameter, and a temporal parameter, selected for the conditions, and outputs it to the display device 40 or the like as a forecasting result (S324). The output unit 170 outputs, for example, a map representing the forecasting result of the risk distribution to the display device 40 or the like. The output unit 170 may even output a map representing the forecasting result of the risk distribution to a printing device (not illustrated). In this case, the map representing the forecasting result of the risk distribution is output from the printing device (not illustrated).
The details of the process in step S324 will be described below. The output unit 170 first looks up a table as illustrated in
The subsequent processes are similar to those in the second example embodiment. More specifically, the output unit 170 extracts data satisfying the conditions presented in set of inequalities (9), with regard to the input date and time tP, from the crime occurrence history data stored in the history data storage unit 210. Upon defining the number of extracted pieces of data as Ip, the output unit 170 assigns a label i (i=1, 2, 3, . . . , Ip) to each of the Ip pieces of data. The output unit 170 computes a risk value R(g, p) for each cell on the date and time tp, using the Ip pieces of data and the combination of the distribution function, the spatial parameter, and the temporal parameter, read for the input conditions. When, for example, the combination in the first row of
As described above, according to this example embodiment, an effect similar to those of the above-described example embodiments can be produced. In this example embodiment, furthermore, high-risk cells (cells to be patrolled) are determined based on the cell coverage ratio (the ratio of cells that can be patrolled) and the risk value of each cell computed using the combination of the distribution function and the set of parameters involved. The ratio of the criminal event count of the high-risk cells to the criminal event count of all the cells is used as an evaluation value for each combination. An “optimum combination” is selected based on the thus computed evaluation values, and stored in a predetermined storage unit together with the “cell coverage ratio.” With this operation, when cells to which personnel or equipment and materials can be sent in the target region are limited, accurate forecasting can be performed by selecting an optimum combination of a distribution function, a spatial parameter, and a temporal parameter that depends on the ratio of the cells (cell coverage ratio).
Example embodiments of the present invention have been described above with reference to the drawings, but they are merely illustrative examples of the present invention, and can adopt various configurations other than the foregoing.
For example, in each of the above-described example embodiments, information for determining cell types may further be acquired, and an optimum combination of a distribution function, a spatial parameter, and a temporal parameter may be selected for each of the acquired cell types. Examples of the cell types include herein land use types in National Land Numerical Information provided by the Ministry of Land, Infrastructure, Transport and Tourism. When the cell coverage ratio according to the third example embodiment is used, the following processing, for example, can further be performed. The cell division unit 140 first determines whether the land use type corresponding to each cell applies to a type to be patrolled (for example, “building land”) by referring to the National Land Numerical Information, and assigns a predetermined flag to cells of the type to be patrolled. The selection unit 160 determines high-risk cells of the cells assigned with the predetermined flag, and computes a patrol coverage ratio based on the criminal event count in the high-risk cells. This makes it possible to select an optimum combination that maximizes the forecasting accuracy rate in cells of a desired type, such as “building land.” This implementation is useful in the cases where specific locations are to be monitored, such as a case where a plan for patrolling a residential area is devised. It should be noted that even in the second example embodiment, the selection unit 160 can compute a coefficient of correlation or a sum of risk value relative rank for cells corresponding to a desired cell type.
In the second and third example embodiments, an example in which a table (see, for example,
In the plurality of flowcharts referred to in the above description, a plurality of steps (processes) have been set forth in order, but the order of execution of the steps executed in each example embodiment is not limited to the order set forth. In each example embodiment, the order of the steps illustrated in the drawing can be changed unless any technical difficulties are encountered. The above-described example embodiments can be combined with each other unless any technical contradiction arises between them.
Part or all of the above-described example embodiments may be described as in the following supplementary notes, but they are not limited thereto.
1.
An information processing apparatus including:
a data division unit that divides risk occurrence history data of a target region into training data used for computing a risk value for each of combinations of a distribution function spatially and temporally representing a risk distribution in the target region, a spatial parameter of the distribution function, and a temporal parameter of the distribution function, and evaluation-value computation data used for evaluating a combination of the distribution function, the spatial parameter, and the temporal parameter;
a selection unit that selects one combination from among combinations of the distribution function, the spatial parameter, and the temporal parameter, based on an evaluation value for each of the combinations computed based on a risk value for each of the combinations based on the training data and the evaluation-value computation data; and
an output unit that outputs a risk forecasting result of the target region, by using the one combination selected by the selection unit.
2.
An information processing apparatus including:
a cell division unit that divides a target region into a plurality of cells;
a generation unit that generates a plurality of combinations of a distribution function spatially and temporally representing a risk distribution in the target region, a spatial parameter of the distribution function, and a temporal parameter of the distribution function;
a selection unit that computes an evaluation value for each of the combinations, by using risk occurrence history data for each of the cells from among risk occurrence history data of the target region, and selecting one combination from among the plurality of combinations, based on the evaluation value for the each combination; and
an output unit that outputs a risk forecasting result of the target region, by using the one combination selected by the selection unit.
3.
The information processing apparatus according to 2, further including:
an acquisition unit that acquires a cell coverage ratio representing a ratio of cells to which personnel or a moving body can be sent, to the plurality of cells,
in which the selection unit computes the evaluation value, based on the cell coverage ratio.
4.
The information processing apparatus according to 3, in which the output unit determines a combination to be used for generating the risk forecasting result, based on a second cell coverage ratio input independently of the cell coverage ratio and set as a forecasting condition.
5.
The information processing apparatus according to 2, in which the selection unit computes, as the evaluation value, a coefficient of correlation computed, based on a risk value for each combination of the distribution function, the spatial parameter, and the temporal parameter, and a risk occurrence count based on the risk occurrence history data.
6.
The information processing apparatus according to 2, in which the selection unit computes, as the evaluation value, a sum of risk value relative rank computed, based on a risk value for each combination of the distribution function, the spatial parameter, and the temporal parameter, and a risk occurrence count based on the risk occurrence history data.
7.
The information processing apparatus according to any one of 2 to 6, in which the generation unit sets a plurality of sample time instants in a specified period, and computes the evaluation value for each of the combinations, by using risk values computed based on the combinations and data in a predetermined time before the sample time instants among the risk occurrence history data, and data within a predetermined time after the sample time instants among the risk occurrence history data.
8.
The information processing apparatus according to any one of 1 to 7, further including:
a reception unit that receives input for specifying a type of risk,
in which the selection unit selects data related to the type of risk specified by the input for specifying, from among the risk occurrence history data of the target region.
9.
A risk forecasting method executed by a computer, the method including:
dividing risk occurrence history data of a target region into training data used for computing a risk value for each of combinations of a distribution function spatially and temporally representing a risk distribution in the target region, a spatial parameter of the distribution function, and a temporal parameter of the distribution function, and evaluation-value computation data used for evaluating a combination of the distribution function, the spatial parameter, and the temporal parameter;
selecting one combination from among combinations of the distribution function, the spatial parameter, and the temporal parameter, based on an evaluation value for each of the combinations computed based on a risk value for each of the combinations based on the training data and the evaluation-value computation data; and
outputting a risk forecasting result of the target region, by using the selected one combination.
10.
A risk forecasting method executed by a computer, the method including:
dividing a target region into a plurality of cells;
generating a plurality of combinations of a distribution function spatially and temporally representing a risk distribution in the target region, a spatial parameter of the distribution function, and a temporal parameter of the distribution function;
computing an evaluation value for each of the combinations, by using risk occurrence history data for each of the cells from among risk occurrence history data of the target region, and selecting one combination from among the plurality of combinations, based on the evaluation value for the each combination; and
outputting a risk forecasting result of the target region, by using the selected one combination.
11.
The risk forecasting method executed by a computer according to 10, the method further including:
acquiring a cell coverage ratio representing a ratio of cells to which personnel or a moving body can be sent, to the plurality of cells; and
computing the evaluation value, based on the cell coverage ratio.
12.
The risk forecasting method executed by a computer according to 11, the method further including:
determining a combination used for generating the risk forecasting result, based on a second cell coverage ratio input independently of the cell coverage ratio and set as a forecasting condition.
13.
The risk forecasting method executed by a computer according to 10, the method further including:
computing, as the evaluation value, a coefficient of correlation computed based on a risk value for each of combinations of the distribution function, the spatial parameter, and the temporal parameter, and a risk occurrence count based on the risk occurrence history data.
14.
The risk forecasting method executed by a computer according to 10, the method further including:
computing, as the evaluation value, a sum of risk value relative rank computed, based on a risk value for each of combinations of the distribution function, the spatial parameter, and the temporal parameter, and a risk occurrence count based on the risk occurrence history data.
15.
The risk forecasting method executed by a computer according to any one of 10 to 14, the method further including:
setting a plurality of sample time instants in a specified period, and computing the evaluation value for each of the combinations, by using risk values computed based on the combinations and data in a predetermined time before the sample time instants among the risk occurrence history data, and data within a predetermined time after the sample time instants among the risk occurrence history data.
16.
The risk forecasting method executed by a computer according to any one of 9 to 15, the method further including:
receiving input for specifying a type of risk; and
selecting data related to the type of risk specified by the input for specifying, from among the risk occurrence history data of the target region.
17.
A program for causing a computer to execute the risk forecasting method according to any one of supplementary notes 9 to 16.
This application is based upon and claims the benefit of priority from Japanese Patent Application No. 2017-202195 filed on Oct. 18, 2017, the disclosure of which is incorporated herein in its entirety by reference.
Number | Date | Country | Kind |
---|---|---|---|
2017-202195 | Oct 2017 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2018/038052 | 10/12/2018 | WO | 00 |