1. Field of the Invention
The present invention relates to a computer system for supporting modeling to predict a fluctuating phenomenon such as monthly sales of a store.
2. Description of the Related Art
In late years, sensor networks have been widely deployed, which is facilitating the collection of data which represents fluctuations of features of objects in various industrial fields (for example, sales of commodities, performance of machines, vital signs of organs, and the like). Such data can be useful information at various sites including retailers and maintenance facilities. Thus, applications of a statistical model (a mathematical expression) to such data have been attempted to understand the essence of the phenomena represented by the data, to predict future phenomena, and to early find the changes of characteristics.
Such attempts include regression analysis of data representing past phenomena for generating a model represented by a regression equation. The model enables the analysis of past phenomena or the prediction of future phenomena. In a regression equation, an object phenomenon is represented by a object variable (explained variable), and the factor affecting the phenomenon is represented by an explanatory variable. The following equation (1) is an example of the regression equation, and is the one for linear multiple regression. In the following equation (1), Y is a object variable, X1 and X2 are explanatory variables, and “a”, “b”, and “c” are coefficients of the regression. In particular, “a” is constant term and “b”, “c” are called partial regression coefficients. The regression analysis estimates numeric values of these parameters.
Y=a+b×X1+c×X2 (1)
As an example, when sales at a store is predicted, in above formula (1), the object variable Y may represent forecasted sales at the store, the explanatory variable X1 represents the diversity of the goods displayed in the store, and the explanatory variable X2 may represent the average price of commercial products. In the case, the coefficients a, b, and c can be obtained using the data of past sales, diversity of products, and average price at a plurality of stores (for example, a plurality of chain stores). Then, for example, the owner of the store can compare the contributions of the diversity of the goods displayed in the store and the average price to the sales individually, and also predict the future sales resulting from the diversity of the goods displayed in the store and the average price, using the above formula (1).
Thus, in forming a regression equation of model for analyzing or predicting a phenomenon, the assignment of an explanatory variable which functions as a factor for explaining the phenomenon is the key. The prediction accuracy depends on the assignment for an explanatory variable. So far, however, an appropriate explanatory variable has been inevitably determined by the experience, intuition, and try-and-error of snalysts on each different field.
Then, a prediction apparatus has been disclosed in Japanese Patent Application Laid-Open No. 9-95917 for example, the apparatus being configured to update a prediction model with a large error which is obtained by a calculation using a predictive value for the prediction model and an actual value so that the most appropriate model can be determined. Also, a method for selecting a prediction model to be proposed using prediction data which is obtained by applying time-series achievement data to a plurality of prediction models is disclosed in Japanese Patent Application Laid-Open No. 2001-22729 for example.
The above patent documents provide a prediction apparatus and a method for improving a prediction model for a certain phenomenon. As a result, an improvement of a prediction model using a number of stored past models cannot be achieved with the above apparatus and method. In addition, in the above apparatus and method, regression analysis is used to construct a prediction; thereby it is difficult to improve a prediction model by extracting a main structure of a model from observed variables and using the extracted structure.
The present invention was made in view of the above problems, and one object of the present invention is to provide a modeling support system, a modeling support method, and a modeling support program for improving a prediction model by extracting a model structure from stored past models and using the extracted structure.
The present invention discloses a modeling support system in which a model is stored in a model recorder as a reference model, the model being represented by a union of a plurality of observed variables with data and a plurality of latent variables without data, and a plurality of paths indicating the associations between the variables. The reference model may be the one generated by an information processor in the past or the one generated by other system in the past. When an information processor generates a model by covariance structure analysis of a phenomenon, a model controller acquires the object model that is being generated and represented by the observed variables and latent variables and the paths. Then, receiving a request from the information processor for supporting the generation of the object model, a similar structure extractor of a model extractor compares the object model with the stored reference models, and extracts the entire structure or a partial structure of a reference model having a similar structure to the entire structure or a partial structure of the object model as a similar structure, and the extracted similar structure is notified to the information processor by the model controller. The convariance structure analysis (CSA) is one of the statistical method which investigates causal relationship, such as various kinds of social phenomenon, natural phenomena, etc. It is the statistical approach by drawing the latent variable which does not observe directly from the variable (observed variable) observed directly, and setting up a hypothesis (mathematical model) about the causal relationship of the latent variable and observed variable. Since not only covariance structure but the model which analyzes the average structure of a latent variable was developed. It is called a structural equation model (SEM) in many cases. However SEM may mean the partial model of covariance structural analysis.
In the present invention, when the information processor requests the system according to the present invention to support modeling during an object model is being generated, the system compares the object model with reference models stored in a model recorder therein, as the result of that a structure similar to the entire structure or a partial structure of the object model is extracted from a reference model, thereby the object model (prediction model) can be improved by extracting a model structure from the stored reference models and using the extracted structure.
A system disclosed herein compares an object model which is being generated with stored reference models, so that a similar structure similar to the entire structure or a partial structure of the object model is be extracted from the reference models, thereby the object model can be improved by the extraction of a model structure from the stored reference models and the use of the extracted structure.
The information processor 15a includes a model generator/updater 151 for generating and updating models at each field, a model validator 152 for validation and analysis of models by applying real data at each field to the generated models at each field, and a local model controller 153 for controlling the generated models at each field. The information processor 15a further includes a real database 154 for storing real data on each field, a model database 155 for storing elements of statistical models on each field, rating scales of test value and the like, model generating methods, and model interpretation information, and an interface (IF) 156 for data input/output between the modeling support system 1.
The general operation of the information processor 15a will be explained below by the following example in which a model B is generated for abstracting preferences of students from questionnaire results at B University. In the case, the model generator/updater 151 of the information processor 15a generates a model B such as that shown in
In the present embodiment, a covariance structure model is represented with a union of a plurality of observed variables and a plurality of latent variables, and a plurality of paths representing the associations between the variables.
In
In the present embodiment, the model B shown in
Among the elements (observed variables, latent variables, and paths) of the structural equation for model B shown in
Specifically, in
The (i, j)th element in the structural matrix of
In covariance structure analysis, an appropriate estimate for a coefficient value of a path is determined to satisfy standards for likelihood and the like, by comparing the variance and covariance of variables which are calculated from the model of the structural equation with the variance and covariance which are actually measured. In the determination, an estimate value which is statistically more significant is set to be a higher test value for a path.
A modeling operation with the information processor 15a by an operator determines the observed variables and the latent variables included in the structural equation model for the questionnaire results, and an application of the model to the questionnaire results determines analytical results (estimation results). The model components and the analytical results (estimation results) are shown by the data representation of the elements shown in
Upon receiving a request for modeling support from the operator who accesses the modeling support system 1 from the information processor 15a via a network while the model B shown in
The modeling support system 1 supports the generation and updating of a model which is used in each of the information processors 15a to 15c, as shown in
The modeling support system 1 includes an interface 2 connected to an interface 156 via a network, a local model manager 3, a model controller 4, a model recorder 5, a distance calculator 6, and a model extractor 7.
The local model manager 3 requests the model controller 4 for supporting the model generation/improvement performed by each of the information processors 15a to 15c, on the modeling support system 1 side in response to the information processors 15a to 15c. The local model manager 3 includes a local model information obtainer 31 for obtaining model information of each industrial field, and a local model proposer 32 for generating an alternative model better than an object model using the stored reference models. The local model information obtainer 31 obtains the configuration of each component, the rating scales such as test values, a model generating method, and interpretation information of a model generated by each of the information processors 15a to 15c.
The model controller 4 acquires the object models generated by the information processors 15a to 15c via the local model manager 3, and also controls the notification of support information to each of the information processors 15a to 15c. The model controller 4 further controls the operations of the local model manager 3, the model recorder 5, and the model extractor 7.
The model recorder 5 stores model components of each of the information processors 15a to 15c, and the configuration of each component, the rating scales such as test values, a model generating method, and interpretation information of the model obtained at the local model information obtainer 31 and the models obtained in other ways. Specifically, the model recorder 5 is provided with a model component database 51 for controlling model components, and a model instance database 52 for controlling model instances such as configuration variables of models. The model component database 51 stores the contents shown in
The model component database 51 includes: an observed variable database 53 for storing information of observed variables as components (for example, survey questions); a latent variable database 54 for storing latent variables as components (variable key words (for example, ordinary profit, being interested in human beings, ages, Was the lesson interesting?) used in the observed variable, for example, sympathetic ability, creative ability, emergent ability, and sensitive ability, and information type used in each observed variable (e.g., natural numbers, integers, discrete, qualitative variable)); a path database 55 for storing the path which has a proven record among paths between variables as components; and a model-related external information database 56 for storing external information related to the stored models (for example, [information of distance between business types: size of sales: retail dealer<local supermarket<department store<large supermarket], [business geography information: IY Supermarket: Chiba Prefecture, UN Supermarket: Aichi Prefecture]). Because a path can be set between any variables in principle, the path database 55 records paths therein only as examples for reference purpose, which means other paths can be set between starting points and end points of variables that are not stored in the path database 55.
The model instance database 52 stores, as shown in
The distance calculator 6 calculates distances between entire models, partial models, and variables individually. Specifically, a distance between models is calculated on the basis of matching between a plurality of models. Then, the models that includes extremely similar elements, or the models that have statistical connections extremely similar to each other have a high matching score. As for distance, there are seven types of distance measures: distance between external attributes; distance between variable attributes; distance between path structures; distance between path coefficients; distance between path coefficient signs; distance between path coefficient significances; and distance between model performances. Among these, the distance measures except the distance between model performances can be applied to a partial structure of a model as well as the entire structure of the model.
The seven types of distances will be explained below.
The distance between external attributes is based on the matching of model titles, target businesses, target samples, times of execution, and the like. The exact match of a distance between external attributes occurs when models have exactly matching target attributes. However, the models having the same samples each other means that they are the same models. Therefore, generally, there can be no case with a distance between external attributes of zero.
The distance between variable attributes is based on the matching of attributes of identifying information such as a group of identifying information of observed variables and identifying information of latent variables which are included in a structural equation. The exact match of a distance between variable attributes occurs when two models have exactly matching groups of observed variable IDs and groups of latent variable IDs. This may actually occur in comparing a plurality of models. The distance between variable attributes includes the two types of distances: one between observed variable and the other between latent variables.
The distance between path structures is based on the matching of paths which are set between observed variables and latent variables which are included in a structural equation. The exact match of a distance between path structures occurs when the elements (latent variables or observed variables) at the origins of the paths exactly match each other. Therefore, the models which do not exactly match each other but are similar to each other are influenced by the similarity distance. Any mismatched path is calculated as a value for a mismatched path, and extends the distance between the models.
The distance between path coefficients is based on the matching of coefficient values which are estimated for the paths set between observed variables and latent variables included in a structural equation. The exact match of a distance between path coefficients occurs when the elements (latent variables or observed variables) at the origins of the paths exactly match each other, as in the case of the distance between path structures. Therefore, the models which do not exactly match each other but are similar to each other are influenced by the similarity distance. Any mismatched path is calculated as a value for a mismatched path, and extends the distance between the models.
The distance between path coefficient sign is based on the matching of coefficient signs which are estimated for the paths set between observed variables and latent variables included in a structural equation. The exact match of a distance between path coefficient signs occurs when the elements (latent variables or observed variables) at the origins of the paths exactly match each other, as in the case of the distance between path structures. Therefore, the models that do not exactly match each other but are similar to each other are influenced by the similarity distance. Any mismatched path is calculated as a value for a mismatched path, and extends the distance between the models.
The distance between path coefficient significances is based on the matching of significant points of coefficient test value which are estimated for the paths set between observed variables and latent variables included in a structural equation. The exact match of a distance between coefficient significances occurs when the elements (latent variables or observed variables) at the origins of the paths exactly match each other, as in the case of the distance between path structures. Therefore, the models that do not exactly match each other but are similar to each other are influenced by the similarity distance. Any mismatched path is counted as a number of mismatched path, and extends the distance between the models.
The distance between model performances is based on the matching of performance measures such as goodness-of-fit test value which is a measure indicating the score of a whole structural equation. The exact match of a distance between model performances occurs when the values of performance measures of models individually match each other.
Now, the calculation of distance between the model B and the model A which is already generated as a result of questionnaire administered to the students at A University of Arts in Hyogo Prefecture shown in
The model A is described by the relations as those shown in
As for the distance between external attributes, as shown in
In
The mean value of the above five types of distances is calculated to obtain a distance between external attributes: (1+1+0.5+0.93+0.75)/5=0.837.
As for the distance between elements, the elements of the models are arranged so that the key words of two models correspond to each other, and the distances between the models are calculated, which are regarded as the distances between elements. One example of distances between elements as the result of rearrangement of the model A and the model B are shown in
As for the distance between path structures, the two models are arranged so that the paths of models correspond to each other, and the distances between the models are calculated. Typically, the distance is 1 when the end point name and the starting point name are matched with each other. One example of the result of arrangement of the model A and the model B is shown in
As for the distance between path coefficients, the two models are arranged so that the paths at the same or similar positions correspond to each other, and the correlation coefficients of the models (the matching of directions in which two vectors are directed when the individual combination of the values are considered to be two vectors) are calculated, which are regarded as the distances between path coefficients. One example of the distances between path coefficients as the result of rearrangement of the model A and the model B are shown in
The distance between path sign is an index for measuring the matching of positive/negative signs of path coefficients between two models. As for the distance between path signs, as shown in
As for the distance between path coefficient significances, a t-test is performed to each path coefficient to calculate test values based on confidence intervals. As shown in
The distance between model performances indicates the deviation from the perfect fit of model, and can be evaluated by an index which is called goodness-of-fit Chi-squared, for example. With respect to the p-values for significance of the goodness-of-fit test, the value for 5% significance level is given by 1 point, and that for the 1% significance level is given by 3 points, as in the case of the distance between path coefficient significances. Then, the two models are arranged so that the paths correspond to each other, and the indexes d for the test values of the two models are calculated to obtain the mean value of the indexes, which is set to be the distance between model performances. The distance is set to have a value within a range of 0 to 1. The measure is applied to the entire structural equation, and an application of the measure to a partial structure reduces the statistical accuracy. Therefore, the index should be used to a model as a whole only. The model A is already completed as a model, while the model B is still being generated, and so an evaluation of the model B cannot be performed yet. Therefore, the distances between model performances of the models A and B cannot be measured yet.
Upon receiving a request for a calculation from the model extractor 7, the distance calculator 6 configured as described above calculates the distance of the entire structure or a partial structure of a plurality of models in response to the request.
In a distance calculation of models by the distance calculator 6, for example, as shown in
The model extractor 7 extracts a model or at least a part of a model based on various characteristics of the model. The model extractor 7 includes a model structural characteristics extraction and utilization promoter 71, a model stable/partially independent structure extractor 72, a similar model extractor 73, a latent variable extractor 74, and a focused model performance monitor 75.
The model structural characteristics extraction and utilization promoter 71 recommends a similar model for example which is extracted based on the structural characteristics of the model. The model structural characteristics extraction and utilization promoter 71 is connected with a model structure database 76 and a model performance database 77. The model structure database 76 stores structural characteristics of extracted models (for example, similar structures, partially stable structures, and partially independent structures). The model performance database 77 stores performances of extracted models that change in time series.
The model stable/partially independent structure extractor 72 extracts a partially stable structure or partially independent structure included in a model. The similar model extractor 73 extracts a model similar to a certain model from other models. Specifically, the similar model extractor 73 requests the distance calculator 6 to calculate a distance between an object model (for example, model B) and a reference model which is similar to the object model, so that, according to the calculation result, the entire structure or a partial structure of the similar reference model is extracted as a similar structure. Also, the similar model extractor 73 requests the distance calculator 6 to calculate a distance between a certain reference model and another reference model to extract a common structure or an aggregate structure. Moreover, the similar model extractor 73 compares the generation methods of reference models and that of an object model, and extracts a reference model generated by a method similar to that of the object model as a model having a similar generation method. The similar model extractor 73 includes a similar structure extractor 78, a common structure/aggregate structure extractor 79, and a similar generation method extractor 80.
The similar structure extractor 78 extracts a model having a similar structure to that of the model obtained from the information processors 15a to 15c (one example of an object model) out of the models stored in the model recorder 5 (one example of a reference model), based on the results of the distance calculation from the distance calculator 6 as described above.
The common structure/aggregate structure extractor 79 extracts the models having common parts, or the models which are overlapped with each other for aggregation (complementation) from a group of a plurality of models stored in the model recorder 5. The similar generation method extractor 80 extracts models which are generated by similar methods.
The latent variable extractor 74 extracts latent variables on reference to a reference model similar to an object model. The focused model performance monitor 75 monitors the performance of the model which is closely watched for some reason (for example, for the reason that the model is similar to a certain model). In the case of a virtual model, the focused model performance monitor 75 monitors the performance of the model which is analogically assembled with the performance elements of a plurality of real models.
Next, the operation of the modeling support system 1 will be explained below with reference to the flowcharts of the process procedure shown in
At Step S1 of
Upon receiving the request for extraction of a partial structure, the process goes from Step S3 to Step S6. At Step S6, the partial structure extraction process shown in
Upon receiving the request for extraction of a similar structure, the process goes from Step S7 to Step S10. At Step S10, the similar structure extraction process shown in
Upon receiving the request for extraction of a latent variable, the process goes from Step S5 to Step S13. At Step S13, the latent variable extraction process shown in
The process for partial structure extraction shown in
At Step S22, a statistical significance of a path is added to the path identifying conditions. In the case of a request for independent structure extraction, the process goes from Step S21 to Step S23. At Step S23, the destinations and the number of paths of each variable in the model are examined. At Step S24, groups of unidirectional or bidirectional first order to nth order adjacent variables are searched for each variable, and the maldistribution of the variables is calculated to extract a common structure. At Step S25, the variable which is exclusively connected by a path from the common structure, the variable may be unidirectional, is searched out, and the variable is determined to be an associated element of the common structure and is extracted as a partially independent structure or a partially stable structure. At Step S26, it is determined if the request for partial structure extraction is completed or not, and upon an end request, the process goes back to the main routine, otherwise returns to Step S21.
An actual partial structure extraction process will be explained below by the following example of a model C, shown in
Not shown in
In the case of the model A shown in
In the present embodiment, an independent structure having little associations with other partial structures are extracted from the entire complicated model structure, which facilitates the comparison between the partial structures. Also, when a stable structure is extracted, a model with higher reliability can be constructed after the comparison using highly reliable independent structures.
In similar structure extraction process shown in
Upon a request for an extraction of a similar structure, at Step S31 of
At Step S33, it is determined if the searching of all of the partial structures of the reference model is completed or not. When completed, the process goes back to Step S31. When not yet, the process goes to Step S34. At Step S34, common variables are extracted based on the calculation results of the partial structures of the object model and the partial structures of the reference model. At Step S35, for the comparison between the partial structure of the object model and the partial structure of the reference model, each structural matrices are arranged so that the equal or similar variables (the observed variables and the latent variables) of the two structures are in a row. In the arrangement, the similarity is determined based on the calculated distance of the variable name, and as for the latent variable to which a name is not assigned yet in either structure, the similarity is determined based on the calculation results of distance of path associations, so that the latent variable is positioned in association with a latent variable of the other structure.
At Step S36, after the comparison of each variable of the structural matrices, the same structure is extracted. At Step S37, the adjacent variables (the first to nth order adjacency) in the common structure of the object model are collected. At Step S38, the adjacent variables (the first to nth order adjacency) in the common structure of the reference model are collected. At Step S39, the adjacent variables of the object model and the reference model are compared with each other to extract adjacent variables having low adjacency in the object model and high adjacency (and stability) in the reference model. At Step S40, from the extracted group, a similar structure and adjacent variables exhibiting high commonality with the object structure are selected and extracted as a similar structure. At Step S41, it is determined if the request for partial structure extraction is completed or not, and upon an end request, the process goes back to the main routine, otherwise returns to Step S31.
When one of the information processors 15a to 15c which is currently generating the model B shown in
After the similar structure is extracted, as shown in
In the present embodiment, an object model generated by a information processor is compared with a reference model which is obtained from the stored model, so that a similar structure which is similar to a partial structure of the object model is extracted from the reference model, thereby a prediction model can be improved by extracting a model structure from stored models and using the extracted structure.
In the common aggregate structure extraction process shown in
At Step S53, it is determined if the searching of all of the partial structures of the object model are completed or not. When completed, the process goes back to Step S51. When not yet, the process goes to Step S54. At Step S54, common variables in the partial structure of the object model are extracted. At Step S55, for the comparison between the plurality partial structures of the object model, each structural matrices are arranged so that the equal or similar variables (the observed variables and the latent variables) of the two structures are in a row. In the arrangement, the similarity is determined based on the distance of the variable name, and the latent variable to which a name is not assigned yet in any of the structures is positioned in association with a latent variable of the other structure depending on the similarity of path associations.
At Step S56, after the comparison between each variable of the structural matrices, the common structure is extracted. At Step S57, the adjacent variables (the first to nth order adjacency) in the common structure of the object model are collected. At Step S58, it is determined if the request is made for a common structure or an aggregate structure. In the case of the request for an aggregate structure, the process goes to Step 59 where the adjacent variables (the first to nth order adjacency) in the common structure of the plurality structures of object model are collected. At Step S60, the adjacent variables of the plurality of object structures are compared with each other to extract structures exhibiting high commonality to be added to a common structure, which is extracted as an aggregate structure. Then, the process goes to Step S 61. In the case of the request for a common structure, the process goes from Step S58 to Step S61. At Step S61, it is determined if the request for common/aggregate structure extraction is completed or not, and upon an end request, the process goes back to the main routine, otherwise returns to Step S51.
Next, an example of the extraction of a common structure between a model D for the Kobe Line and Gulf line of the Hanshin Expressway between Kobe and Osaka and a model E for the Meishin Expressway and The Second Keihan Highway between Osaka and Kyoto as shown in
In the present embodiment, a common structure and an aggregate structure including the surrounding part can be extracted from a plurality of models, thereby a model having highly reliable components can be generated, and the use of the components allows a construction of a highly reliable model.
In the similar generation method extraction process shown in
When the start point models are the same, the process goes to Step S77. At Step S77, it is determined if the used evaluation guideline is similar or the same or not. The guideline is generally chi-squared goodness-of-fit test, but other guidelines such as GFI and AGFI are sometimes used. When the same guideline is used, the process goes to Step S78. At Step S78, it is determined if the execution procedure is the same or not. The execution procedure is the steps for changing the paths, and usually an analyst heuristically (randomly) executes the stops, but when the steps are executed under a certain guideline, the procedure is also one reason for measuring the similarity between a certain model and another model. When the execution means is the same, the process goes to Step S79, where the structure of the reference model is recorded as the structure having a similar generation method. At Step S80, it is determined if an end request is made or not. If not, the process goes back to Step S71, and if so, the process returns to the main routine.
On the other hand, when it is determined that they are not the same at Step S75 to Step S78, the process goes to Step S80. After the searching of the structures of all of the reference models is completed, the process goes from Step S71 to Step S81. At Step S81, the ranking of similarity of each generation method is calculated as the total value as the evaluation result at S75 to S78, and a group of reference models at top rankings is extracted. After the step is completed, the process goes to Step S80.
The above result (e.g., a group of reference models of top three rankings) can be used to support a generation of a model. For example, the local model proposer 32 recommends the group to the information processors 15a to 15c via the model structural characteristics extraction and utilization promoter 71.
In the present embodiment, a known reference model which is generated by a similar method can be extracted, thereby a model can be constructed with reference to the model, which facilitates a generation of a more highly reliable model.
In the latent variable extraction process shown in
If there is not a latent variable with a name, the process goes from Step S92 to Step S94. At Step S94, the object model is compared with the reference model, so that it is determined if there is a latent variable which has high similarity to observed variables and latent variables and is present only in the reference model or not. If there is a latent variable that has high similarity and is present only in the reference model, the process goes to Step S95. At Step S95, the latent variable in the reference model and the paths connected to the existing variables are set to be the recommended objects to the object model. If there is not a latent variable that has high similarity and is present only in the reference model, the process goes to Step S96. At Step S96, when there is any recommended object, the recommended object is transmitted to the model structural characteristics extraction and utilization promoter 71. Then, the model structural characteristics extraction and utilization promoter 71 outputs the recommended object to the local model proposer 32, which in turn recommends the object to the information processors 15a to 15c.
In the present embodiment, as shown in
Even if there is not a latent variable with a name at the same position, when there is a latent variable which has high similarity to the observed variables and the latent variables in an object model and is present only in a reference model, the latent variables having high similarity in the reference model are recommended to the positions shown by the dotted lines of
With respect to the above embodiment, the following appendixes are further disclosed:
A modeling support system accessible to an information processor which generates a model having a structure describing a phenomenon to be analyzed by covariance structure analysis and analyzes the phenomenon, comprising:
a model recorder for storing a model represented with a union of a plurality of observed variables and a plurality of latent variables, and a plurality of paths representing associations between the variables, as a reference model;
a model controller for acquiring an object model which is being generated and is represented by a union of the plurality of observed variables and the plurality of latent variables and the plurality of paths describing the phenomenon to be analyzed, from the information processor; and
a model extractor having a similar structure extractor for comparing the object model with the reference model stored in the model recorder and extracting the entire structure or a partial structure of the reference model which is similar to the entire structure or a partial structure of the object model as a similar structure, when receiving a request from the information processor for supporting the generation of the object model, and
the model controller notifies the similar structure extracted by the similar structure extractor to the information processor.
The modeling support system according to Appendix 1, wherein
when the partial structure of the object model for which the similar structure is extracted includes comparing the object model with the reference model from the reference model,
the model extractor further comprises a latent variable extractor for, when the reference model from which the similar structure is extracted includes a latent variable as an element, confirming that the object model does not include a corresponding latent variable, and extracting the latent variable, and
the model controller notifies the extracted latent variable to the information processor.
The modeling support system according to Appendix 1 or 2, wherein
the model extractor further comprises a stable structure extractor for extracting a structure including a union of the observed variables and the latent variables having significant paths therebetween from the independent structure as a stable structure.
The modeling support system according to Appendix 3, wherein:
the model extractor further comprises a stable structure extractor for extracting a stable structure including the latent variables in which the observed variables and the latent variables have significant associations with each other, from the independent structure.
The modeling support system according to any one of Appendixes 1 to 4, wherein
the model extractor further comprises a common structure extractor for extracting partial structures from a plurality of the reference models, and extracting a common structure which is common to the extracted partial structures.
The modeling support system according to Appendix 5, wherein
the model extractor further comprises an aggregate structure extractor for extracting partial structures from a plurality of the reference models, and extracting an aggregate structure which is formed by aggregating the extracted partial structures.
The modeling support system according to any one of Appendixes 1 to 6, wherein
the model extractor further comprises a similar generation method extractor for extracting a reference model which is generated by a model generating method similar to that of the object model.
The modeling support system according to any one of Appendixes 1 to 7, wherein
the model extractor further comprises a model performance monitor for monitoring the performance of a predetermined reference model to real data in time series.
The modeling support system according to any one of Appendixes 1 to 8, wherein
the model extractor further comprises a model structure database for storing structural characteristics of the model having the similar structure.
A modeling support method executed by a computer accessible to an information processor which generates a model having a structure describing a phenomenon to be analyzed by covariance structure analysis and analyzes the phenomenon, comprising:
a model recording step by a model controller of the computer for acquiring a model represented with a union of a plurality of observed variables and a plurality of latent variables, and a plurality of paths representing associations between the variables, as a reference model;
a model controlling step by a model controller of the computer for acquiring an object model which is being generated and is represented by a union of the plurality of observed variables and the plurality of latent variables and the plurality of paths of the object model to be analyzed, from the information processor;
a similar structure extracting step for comparing the object model with the reference model stored in the model recorder, and extracting the entire structure or a partial structure of the reference model which is similar to the entire structure or a partial structure of the elements included in the object model as a similar structure, upon a request from the information processor for supporting the generation of the object model, and
a notifying step by the model controller for notifying the similar structure extracted by the similar structure extractor to the information processor.
A modeling support program executed by a computer accessible to an information processor which generates a model having a structure describing a phenomenon to be analyzed by covariance structure analysis and analyzes the phenomenon, wherein it implements:
a model recording function for storing a model represented with a union of a plurality of observed variables with data and a plurality of latent variables without data, and a plurality of paths representing associations between the variables, as a reference model;
a model controlling function by a model controller of the computer for acquiring an object model which is being generated and is represented with a union of the plurality of observed variables and the plurality of latent variables, and the plurality of paths of the object model to be analyzed;
a similar structure extracting function for comparing the object model with the reference model stored in the model recorder, and extracting the entire structure or a partial structure of the reference model which is similar to the entire structure or a partial structure of the elements included in the object model as a similar structure, upon a request from the information processor for supporting the generation of the object model, and
a notifying function by the model controller for notifying the similar structure extracted by the similar structure extractor to the information processor.
The modeling support system disclosed above is useful for modeling with higher prediction accuracy executed by an information processor which generates a model by analyzing data describing a certain phenomenon, and predicting a future phenomenon using the model
Number | Date | Country | Kind |
---|---|---|---|
JP2008-118238 | Apr 2008 | JP | national |