The present invention relates to an analysis of data.
A technique for recognizing a relationship between a plurality of pieces of data has been developed. For example, Patent Document 1 discloses a technique for representing relevance between pieces of data by using a cause-and-effect diagram, a pie graph, and the like. Herein, the cause-and-effect diagram is for visually displaying an effect (result) and factors thereof, with a trunk extending from the effect and the grouped factors being indicated on branches extending from the trunk.
[Patent Document 1] Japanese Patent Application Publication No. 2019-36061
[Patent Document 2] U.S. Patent Application Publication No. 2014/0222741A1 Specification
The magnitude of the influence that a factor has on an effect may differ from factor to factor. However, the cause-and-effect diagram in Patent Document 1 does not represent the magnitude of influence of each factor.
The present invention has been made in view of the above-described problem, and provides a technique capable of easily recognizing both an effect and a factor thereof and influence that each factor gives on the effect.
An analysis apparatus according to the present invention includes: 1) an acquisition unit that acquires relationship information indicating a degree of influence of each of a plurality of explanatory variables on an objective variable; and 2) a generation unit that generates, by using the relationship information, a cause-and-effect diagram representing a relationship between the objective variable and the explanatory variables.
The generation unit determines a display aspect for a display relating to each explanatory variable or presence or absence of the display in the cause-and-effect diagram, based on the degree of influence of the explanatory variable.
A control method according to the present invention is executed by a computer. The control method includes: 1) an acquisition step of acquiring relationship information indicating a degree of influence of each of a plurality of explanatory variables on an objective variable; and 2) a generation step of generating, by using the relationship information, a cause-and-effect diagram representing a relationship between the objective variable and the explanatory variables.
In the generation step, a display aspect for a display relating to each explanatory variable or presence or absence of the display in the cause-and-effect diagram is determined based on the degree of influence of the explanatory variable.
A program according to the present invention causes a computer to execute the control method according to the present invention.
A technique capable of easily recognizing both an effect and a factor thereof and influence that each factor gives on the effect is provided.
Hereinafter, example embodiments of the present invention will be described by using the drawings. Note that, a similar component is assigned with a similar reference sign throughout all the drawings, and description therefor will be omitted as appropriate. Further, in each block diagram, each block represents not a configuration on a hardware basis but a configuration on a function basis, except as particularly described.
The analysis apparatus 2000 generates a cause-and-effect diagram 10 representing a relationship between an objective variable and a plurality of explanatory variables. Herein, a cause-and-effect diagram is a diagram visually representing a relationship between an effect and a plurality of factors thereof. In the cause-and-effect diagram 10, an effect is associated with the objective variable, and a factor is associated with the explanatory variable. Herein, the effect may be the objective variable itself, or may be an item relevant to the objective variable. The item relevant to the objective variable is, for example, an item focusing on an object or an event common with the objective variable. For example, there is a case in which the objective variable represents “presence or absence of a defect in a product” and the effect represents “a defect in a product”. Similarly, the factor may be the explanatory variable itself, or may be an item relevant to the explanatory variable.
The cause-and-effect diagram 10 includes one trunk 11 being linked with a display (an effect display 12) representing the effect, and the trunk 11 is linked with one or more branches 13. Each branch 13 is linked with a display (a group display 14) representing a group of the factor. Further, one or more subbranches 15 are linked with one branch 13. Then, a display (a factor display 16) representing one factor is linked with each subbranch 15.
In an example in
The analysis apparatus 2000 determines a display aspect for each factor display 16 or presence or absence of each factor display 16, based on the magnitude of influence (hereinafter, a degree of influence) that the explanatory variable associated with the factor display 16 has on the objective variable. To do so, the analysis apparatus 2000 acquires relationship information 50 representing a relationship between the objective variable and the explanatory variables. The relationship information 50 indicates a degree of influence of each explanatory variable.
For example, the analysis apparatus 2000 makes a display aspect different between the factor display 16 of an explanatory variable having a relatively high degree of influence and the factor display 16 of an explanatory variable having a relatively low degree of influence. More specifically, the factor display 16 of an explanatory variable having a relatively high degree of influence is more emphasized. In an example in
As one method of visually representing a relationship between an objective variable and an explanatory variable, a method of using a cause-and-effect diagram is conceivable. However, in an existing cause-and-effect diagram, magnitude of influence that each explanatory variable has on an objective variable cannot be recognized.
In view of the above, the analysis apparatus 2000 determines a display aspect for the factor display 16, based on a degree of influence. By doing so, difference in a degree of influence of each explanatory variable on an objective variable can be easily recognized in a cause-and-effect diagram visually representing a relationship between an objective variable and an explanatory variable. In other words, by browsing the cause-and-effect diagram 10, both a relationship between an objective variable and an explanatory variable (a relationship between an effect and a factor) and a degree of influence of the explanatory variable can be easily recognized.
Hereinafter, the present example embodiment will be described in further detail.
Each of function configuration units of the analysis apparatus 2000 may be achieved by hardware (example: a hard-wired electronic circuit, or the like) achieving each of the function configuration units, or may be achieved by a combination of hardware and software (example: a combination of an electronic circuit and a program controlling the electronic circuit, or the like). Hereinafter, a case will be further described in which each of the function configuration units of the analysis apparatus 2000 is achieved by a combination of hardware and software.
The computer 1000 may be a dedicated computer designed for achieving the analysis apparatus 2000, or may be a general-purpose computer. In a latter case, for example, each of functions of the analysis apparatus 2000 is achieved with the computer 1000 by installing a predetermined application on the computer 1000. The above-described application is configured by a program for achieving the function configuration units of the analysis apparatus 2000.
The computer 1000 includes a bus 1020, a processor 1040, a memory 1060, a storage device 1080, an input/output interface 1100, and a network interface 1120. The bus 1020 is a data transmission path through which the processor 1040, the memory 1060, the storage device 1080, the input/output interface 1100, and the network interface 1120 transmit and receive data to and from one another. However, a method of connecting the processor 1040 and the like with one another is not limited to bus connection.
The processor 1040 is any of various processors such as a central processing unit (CPU), a graphics processing unit (GPU), and a field-programmable gate array (FPGA). The memory 1060 is a main storage apparatus achieved by using a random access memory (RAM) or the like. The storage device 1080 is an auxiliary storage apparatus achieved by using a hard disk, a solid state drive (SSD), a memory card, a read only memory (ROM), or the like.
The input/output interface 1100 is an interface for connecting the computer 1000 to an input/output device. For example, an input apparatus such as a keyboard or an output apparatus such as a display apparatus is connected to the input/output interface 1100.
The network interface 1120 is an interface for connecting the computer 1000 to a communication network. The communication network is, for example, a local area network (LAN) or a wide area network (WAN).
The storage device 1080 stores a program (the program achieving the application described above) for achieving the function configuration units of the analysis apparatus 2000. The processor 1040 achieves the function configuration units of the analysis apparatus 2000 by reading the program into the memory 1060 and executing the program.
One example of a usage environment of the analysis apparatus 2000 will be described for easy understanding of the analysis apparatus 2000.
In
For example, the analysis apparatus 2000 provides, to the user terminal 60, screen data (for example, a web page) including an image of the cause-and-effect diagram 10. In this case, the user terminal 60 displays the received web page with a browser. By doing so, a user can browse the cause-and-effect diagram 10.
The usage environment of the analysis apparatus 2000 is not limited to the illustration in
The relationship information 50 indicates a relationship between an objective variable and a plurality of explanatory variables.
A relationship indicated by the relationship information 50 is a relationship estimated, for example, by analyzing data (hereinafter, source data) in which a value of an objective variable is associated with a value of each explanatory variable. The estimated relationship is represented by, for example, a linear model (a linear regression model or a linear identification model) for estimating a value of an objective variable from a value of each explanatory variable. In this case, the relationship information 50 can be regarded also as information representing a linear model.
For example, a degree of influence of an explanatory variable is represented by a coefficient associated with the explanatory variable (a coefficient by which a value of the explanatory variable is multiplied) in a linear model. This is because an explanatory variable having a larger associated coefficient in a linear model gives more influence on a value of an objective variable acquired by using the model. In view of this, for example, the relationship information 50 indicates a coefficient associated with an explanatory variable in a learned linear model, as a degree of influence of the explanatory variable.
For example, it is assumed that presence or absence of a defect in a product is handled as an objective variable and each index (for example, a component of a material, and an environment such as a temperature) representing a manufacturing condition for a product is handled as an explanatory variable. In this case, source data indicate a value (a content of each component, a temperature, and the like) of each index representing a manufacturing condition, and presence or absence of a defect in a product manufactured under the manufacturing condition. When a linear model is learned by using the source data, a learned linear model representing a relationship between presence or absence of a defect and a manufacturing condition can be acquired.
For example, the analysis apparatus 2000 handles information representing the linear model as the relationship information 50. In this case, the relationship information 50 indicates, in the objective variable 52, identification information of an objective variable in the generated linear model, indicates, in the identification information 56, identification information of each explanatory variable in the generated linear model, and indicates, in the degree of influence 58, a coefficient associated with each explanatory variable.
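As a minimal sketch, the relationship information 50 derived from a learned linear model could be held in a structure such as the following. The class names, the function `from_linear_model`, and the coefficient values are hypothetical and are introduced only for illustration; the present description does not prescribe a data format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExplanatoryVariable:
    identification: str  # corresponds to the identification information 56
    influence: float     # corresponds to the degree of influence 58 (a coefficient)


@dataclass(frozen=True)
class RelationshipInfo:
    objective_variable: str       # corresponds to the objective variable 52
    explanatory_variables: tuple  # corresponds to the explanatory variables 54


def from_linear_model(objective, names, coefficients):
    """Record each coefficient of a learned linear model as a degree of influence."""
    if len(names) != len(coefficients):
        raise ValueError("one coefficient is required per explanatory variable")
    variables = tuple(
        ExplanatoryVariable(name, float(coef))
        for name, coef in zip(names, coefficients)
    )
    return RelationshipInfo(objective, variables)


# Hypothetical coefficients; real values would come from the learned model.
info = from_linear_model(
    "presence or absence of a defect",
    ["component 1", "component 2", "temperature"],
    [0.8, -0.3, 0.1],
)
```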
A relationship between an objective variable and an explanatory variable may be represented by a plurality of linear models. Examples of a method of representing a relationship between an objective variable and an explanatory variable by a plurality of linear models include a method of generating an estimation model by using heterogeneous mixture learning (see Patent Document 2). The heterogeneous mixture learning generates an estimation model being defined by a set of a tree configured by a node representing a conditional branch and a plurality of linear models. One linear model is allocated to each leaf of the tree. Upon use of the estimation model, first, the tree is traced from a root to a leaf by using data (a combination of values of each explanatory variable) to be estimated. Then, the data to be estimated are input to a linear model associated with the reached leaf, and thereby a value of an objective variable is acquired.
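The use of such an estimation model (tracing the tree to a leaf, then applying the linear model of that leaf) can be sketched as follows. This is an illustration only: the classes `Node` and `Leaf`, the function `estimate`, and the assumption that each conditional branch compares a single explanatory variable with a threshold value are hypothetical, and the learning procedure of Patent Document 2 is not shown.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Leaf:
    coefficients: dict      # explanatory variable name -> coefficient
    intercept: float = 0.0


@dataclass
class Node:
    variable: str                   # explanatory variable tested at this branch
    threshold: float
    below: Union["Node", Leaf]      # followed when the value is less than the threshold
    otherwise: Union["Node", Leaf]  # followed when the value is the threshold or more


def estimate(tree, sample):
    """Trace the tree from the root to a leaf, then apply that leaf's linear model."""
    node = tree
    while isinstance(node, Node):
        if sample[node.variable] < node.threshold:
            node = node.below
        else:
            node = node.otherwise
    return node.intercept + sum(
        coef * sample[name] for name, coef in node.coefficients.items()
    )


# Hypothetical model with one conditional branch and two linear models.
tree = Node(
    "temperature",
    100.0,
    Leaf({"component 1": 2.0}, intercept=1.0),
    Leaf({"component 1": -1.0, "temperature": 0.5}),
)
low = estimate(tree, {"temperature": 80.0, "component 1": 3.0})
high = estimate(tree, {"temperature": 120.0, "component 1": 3.0})
```

In this sketch, each leaf carries its own set of coefficients, which is why the relationship information 50 may hold one set of degrees of influence per linear model.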
When a relationship between an objective variable and an explanatory variable is represented by a plurality of linear models, for example, the relationship information 50 indicates the explanatory variable 54 (a combination of the identification information 56 and the degree of influence 58) for each of the plurality of linear models.
The acquisition unit 2020 acquires the relationship information 50 for use in generation of the cause-and-effect diagram 10 (S102). For example, the acquisition unit 2020 acquires, from among a plurality of pieces of relationship information 50 stored in advance in a storage apparatus, the relationship information 50 meeting a condition specified by a user.
For example, the relationship information 50 is determined by a condition relating to the source data used for estimating the relationship between an objective variable and an explanatory variable indicated by the relationship information 50. For example, when source data are data relating to manufacture of a product, the source data can be determined by conditions such as a name of the product, a location of manufacture, and a date and time of manufacture.
In view of the above, a user gives, to the analysis apparatus 2000, a condition relating to source data from which the cause-and-effect diagram 10 is desired to be generated. The analysis apparatus 2000 searches the above-described storage apparatus under the given condition, and thereby acquires the relationship information 50 meeting the condition.
Note that, when there are a plurality of pieces of relationship information 50 meeting a condition specified by a user, the analysis apparatus 2000 may generate the cause-and-effect diagram 10 for each of all the pieces of relationship information 50, or may generate the cause-and-effect diagram 10 for only some pieces of relationship information 50. In the latter case, the acquisition unit 2020 may provide, to a user, information relating to pieces of relationship information 50 meeting a specified condition and allow the user to select one or more pieces of relationship information 50. In this case, the analysis apparatus 2000 generates the cause-and-effect diagram 10 for only the relationship information 50 selected by the user.
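The search under a user-specified condition can be sketched as follows. The record layout (an `id` and a `source` dictionary of attributes) and the function name `find_relationship_info` are assumptions made for illustration; an actual storage apparatus may be queried quite differently.

```python
def find_relationship_info(stored, condition):
    """Return every stored record whose source-data attributes match the condition."""
    return [
        record
        for record in stored
        if all(record["source"].get(key) == value for key, value in condition.items())
    ]


# Hypothetical stored records, keyed by attributes of their source data.
stored = [
    {"id": "r1", "source": {"product": "A", "location": "plant 1"}},
    {"id": "r2", "source": {"product": "A", "location": "plant 2"}},
    {"id": "r3", "source": {"product": "B", "location": "plant 1"}},
]
matches = find_relationship_info(stored, {"product": "A"})
```

Because the function returns a list, a condition matching a plurality of records yields all of them, and the selection among them can be left to the user.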
Besides the above, for example, the acquisition unit 2020 may receive the relationship information 50 transmitted from another apparatus (for example, the user terminal 60). For example, in this case, the user terminal 60 transmits the relationship information 50 to the analysis apparatus 2000.
Besides the above, for example, the acquisition unit 2020 may acquire information necessary for generation of the relationship information 50 and generate the relationship information 50 by using the acquired information. For example, a user provides, to the acquisition unit 2020, information indicating source data, identification information of an objective variable, a type of a model, and the like. The acquisition unit 2020 generates an estimation model by using the provided information, and generates the relationship information 50 represented by the generated estimation model.
Note that, processing of generating the relationship information 50 by using information provided from a user may be performed by an apparatus other than the analysis apparatus 2000. In this case, the acquisition unit 2020 acquires the relationship information 50 from an apparatus generating the relationship information 50.
The generation unit 2040 generates the cause-and-effect diagram 10 by using the relationship information 50. Herein, for generation of the cause-and-effect diagram 10, information (hereinafter, group definition information) for defining a group of explanatory variables is necessary in addition to identification information of an objective variable and an explanatory variable. The group definition information indicates, for example, identification information (a name or the like) of a group and identification information of each explanatory variable included in the group. The group definition information may be included in the relationship information 50, or may be prepared separately from the relationship information 50. Note that, when an effect is not an objective variable itself (for example, when an objective variable is “presence or absence of a defect”, whereas an effect is a “defect”), identification information (a name or the like) of the effect is also prepared as well as identification information of a group. The same applies when a factor is not an explanatory variable itself.
For example, the generation unit 2040 generates the effect display 12, the group display 14, and the factor display 16 by using identification information of an objective variable, identification information of a group indicated by group definition information, and identification information of an explanatory variable, respectively. Further, the generation unit 2040 determines a positional relationship between the group display 14 and the factor display 16 by using the group definition information. Then, the generation unit 2040 generates the cause-and-effect diagram 10 by connecting each of the generated displays with the trunk 11, the branch 13, and the subbranch 15, based on the determined positional relationship.
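As a rough sketch of this step, the displays can first be arranged into a nested structure (effect, then groups, then factors) before any drawing is performed. The function `build_diagram` and the dictionary layout below are hypothetical; rendering of the trunk 11, the branch 13, and the subbranch 15 is outside the scope of the sketch.

```python
def build_diagram(effect, group_definitions, influences):
    """Arrange the effect, groups, and factors as trunk, branches, and subbranches."""
    branches = []
    for group_name, members in group_definitions.items():
        # Each member with a known degree of influence becomes a factor
        # display on a subbranch of this group's branch.
        subbranches = [{"factor": m} for m in members if m in influences]
        branches.append({"group": group_name, "subbranches": subbranches})
    return {"effect": effect, "branches": branches}


# Hypothetical group definition information and degrees of influence.
group_definitions = {
    "material": ["component 1", "component 2"],
    "environment": ["temperature"],
}
influences = {"component 1": 0.8, "component 2": -0.3, "temperature": 0.1}
diagram = build_diagram("defect", group_definitions, influences)
```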
However, the generation unit 2040 generates the factor display 16, based on a degree of influence of an explanatory variable indicated by the relationship information 50. Hereinafter, a method of generating the factor display 16, based on a degree of influence, will be specifically exemplified.
<<Case in which Display Aspect is Determined According to Degree of Influence>>
For example, the generation unit 2040 determines a display aspect for the factor display 16 of an explanatory variable by comparing a degree of influence of the explanatory variable with a predetermined threshold value. For example, the generation unit 2040 makes a display aspect for the factor display 16 of an explanatory variable having a degree of influence equal to or more than a threshold value different from a display aspect for the factor display 16 of an explanatory variable having a degree of influence less than a threshold value. Examples of a method of making a display aspect for the factor display 16 different include a method of making a background (presence or absence of filling, a color, a pattern, and the like) of the factor display 16 different, a method of making a frame of the factor display 16 different (presence or absence of a frame, a color, a shape, a thickness, and the like), and a method of making a size of the factor display 16 different.
Information determining a display aspect for each of a case in which a degree of influence is equal to or more than a threshold value and a case in which a degree of influence is less than a threshold value is stored in advance in a storage apparatus accessible from the generation unit 2040. However, the information may be modifiable by a user.
Herein, it is preferred that the higher a degree of influence of an explanatory variable, the more highlighted (emphasized) the factor display 16 of the explanatory variable. In view of this, for example, it is preferred that a display aspect for the factor display 16 is determined by a criterion such as “a background of the factor display 16 of an explanatory variable having a degree of influence equal to or more than a threshold value is highlighted more than a background of the factor display 16 other than the above”, “a frame of the factor display 16 of an explanatory variable having a degree of influence equal to or more than a threshold value is highlighted more than a frame of the factor display 16 other than the above”, or “a size of the factor display 16 of an explanatory variable having a degree of influence equal to or more than a threshold value is made larger than a size of the factor display 16 other than the above”.
There may be a plurality of threshold values for a degree of influence. In other words, a plurality of numerical ranges may be determined for a degree of influence, and a display aspect for the factor display 16 may be made different for each numerical range. For example, three numerical ranges R1 to R3, being “less than Th1”, “equal to or more than Th1 and less than Th2”, and “equal to or more than Th2”, respectively, are provided (where Th1 and Th2 are real numbers satisfying Th1<Th2). In this case, the generation unit 2040 makes the display aspects for the factor display 16 different from one another among an explanatory variable having a degree of influence belonging to the numerical range R1, an explanatory variable having a degree of influence belonging to the numerical range R2, and an explanatory variable having a degree of influence belonging to the numerical range R3. For example, when the factor display 16 is to be emphasized more as a degree of influence is higher, the degree of highlighting is highest for the numerical range R3, next highest for the numerical range R2, and lowest for the numerical range R1.
Information determining association between a numerical range and a display aspect is stored in advance in a storage apparatus accessible from the generation unit 2040. However, the information may be modifiable by a user.
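The association between numerical ranges and display aspects can be sketched as a lookup over sorted threshold values. The function `display_aspect`, the threshold values, and the aspect labels are hypothetical; a case with a single threshold value is the same lookup with one threshold and two aspects.

```python
from bisect import bisect_right


def display_aspect(influence, thresholds, aspects):
    """Pick the aspect of the numerical range that the degree of influence falls in.

    thresholds must be sorted in ascending order (Th1 < Th2 < ...), and
    aspects must hold one entry per range, that is, len(thresholds) + 1 entries.
    """
    if len(aspects) != len(thresholds) + 1:
        raise ValueError("one display aspect is required per numerical range")
    # bisect_right counts the thresholds that are <= influence, so a value
    # equal to Th1 falls in "equal to or more than Th1 and less than Th2".
    return aspects[bisect_right(thresholds, influence)]


# Hypothetical settings; such information would be stored in advance
# and may be modifiable by a user.
THRESHOLDS = [0.2, 0.5]                                  # Th1, Th2
ASPECTS = ["plain", "thin frame", "filled background"]   # R1, R2, R3
```

For example, `display_aspect(0.2, THRESHOLDS, ASPECTS)` selects the aspect for the numerical range R2, since the value is equal to Th1.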
The generation unit 2040 may determine a display aspect for the factor display 16, based on a rank in order of a degree of influence. For example, the generation unit 2040 determines each explanatory variable that falls within a predetermined rank in descending order of a degree of influence from among all explanatory variables, and makes a display aspect for the factor display 16 of the explanatory variable different from a display aspect for the factor display 16 of other explanatory variables. For example, when a predetermined rank in order is 3, a display aspect for the factor display 16 of each of explanatory variables having largest to third largest degrees of influence is made different from the factor display 16 other than the above.
The generation unit 2040 may perform ranking of a degree of influence in a group unit of explanatory variables. In other words, the generation unit 2040 determines, for each group, each explanatory variable that falls within a predetermined rank in descending order of a degree of influence in the group, and makes a display aspect for the factor display 16 of the determined explanatory variable different from a display aspect for the factor display 16 of other explanatory variables. For example, when a predetermined rank in order is 2, the generation unit 2040 determines, for each group, an explanatory variable having the largest degree of influence and an explanatory variable having a next largest degree of influence in the group. Then, the generation unit 2040 makes a display aspect for the factor display 16 of the determined explanatory variables different from a display aspect for the factor display 16 of other explanatory variables.
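Both ranking methods, over all explanatory variables and in a group unit, can be sketched as follows. The function names and sample values are hypothetical, and how ties between equal degrees of influence are to be broken is not specified in the present description.

```python
def top_ranked(influences, rank):
    """Return the variables within the given rank, in descending order of influence."""
    ordered = sorted(influences, key=influences.get, reverse=True)
    return set(ordered[:rank])


def top_ranked_per_group(influences, groups, rank):
    """Apply the same ranking separately within each group of explanatory variables."""
    selected = set()
    for members in groups.values():
        selected |= top_ranked({m: influences[m] for m in members}, rank)
    return selected


# Hypothetical degrees of influence and group definition information.
influences = {"component 1": 0.8, "component 2": 0.3, "temperature": 0.6, "humidity": 0.1}
groups = {
    "material": ["component 1", "component 2"],
    "environment": ["temperature", "humidity"],
}
emphasized_overall = top_ranked(influences, 3)
emphasized_per_group = top_ranked_per_group(influences, groups, 1)
```

The factor display 16 of each variable in the returned set would then be given a display aspect different from that of the remaining variables.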
Note that, a degree of influence may be a minus value like a degree of influence of the component 2 in the relationship information 50 in
Further, the generation unit 2040 may reflect, on a display aspect for the factor display 16, a sign of an associated explanatory variable. For example, the generation unit 2040 adds an upward arrow to the factor display 16 when a sign of a value of an associated explanatory variable is positive, and adds a downward arrow to the factor display 16 when a sign of a value of an associated explanatory variable is negative. Besides the above, for example, the generation unit 2040 may use, for the factor display 16, different colors, shapes, and the like between when a sign of a value of an associated explanatory variable is positive and when a sign of a value of an associated explanatory variable is negative.
<<Case in which Presence or Absence of Factor Display 16 is Determined According to Degree of Influence>>
For example, the generation unit 2040 determines whether to include the factor display 16 of an explanatory variable in the cause-and-effect diagram 10 by comparing a degree of influence of the explanatory variable with a threshold value. More specifically, the generation unit 2040 includes, in the cause-and-effect diagram 10, the factor display 16 of an explanatory variable having a degree of influence equal to or more than the threshold value, and does not include the factor display 16 of an explanatory variable having a degree of influence less than the threshold value.
Besides the above, for example, the generation unit 2040 may determine presence or absence of the factor display 16 according to a rank in order of a degree of influence. For example, the generation unit 2040 determines each explanatory variable that falls within a predetermined rank in descending order of a degree of influence from among all explanatory variables, and allows the cause-and-effect diagram 10 to include only the factor display 16 of the determined explanatory variable. Besides the above, for example, the generation unit 2040 determines, for each group, each explanatory variable that falls within a predetermined rank in descending order of a degree of influence in the group, and allows the cause-and-effect diagram to include only the factor display 16 of the determined explanatory variable.
Herein, when presence or absence of the factor display 16 is determined by the above-described method, there may be the group display 14 including no factor display 16. In view of this, the generation unit 2040 may determine a display aspect for the group display 14 according to whether the factor display 16 is included. For example, the group display 14 including the factor display 16 is more emphasized than the group display 14 including no factor display 16. Examples of a method of making a display aspect for the group display 14 different include a method of making a background, a frame, and the like different, similarly to a method of making a display aspect for the factor display 16 different. Further, the generation unit 2040 may not display the group display 14 including no factor display 16. In other words, the generation unit 2040 may determine presence or absence of the group display 14 in such a way that only the group display 14 including at least one factor display 16 is displayed.
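The threshold-based determination of presence or absence, together with the handling of a group left with no factor display 16, can be sketched as follows. The function `filter_diagram` and its `drop_empty_groups` parameter are hypothetical names introduced for illustration.

```python
def filter_diagram(groups, influences, threshold, drop_empty_groups=True):
    """Keep only factors whose degree of influence is the threshold value or more."""
    filtered = {}
    for group, members in groups.items():
        kept = [m for m in members if influences[m] >= threshold]
        # A group left with no factor is either omitted, or kept as an
        # empty group so that it can be displayed with less emphasis.
        if kept or not drop_empty_groups:
            filtered[group] = kept
    return filtered


# Hypothetical group definition information and degrees of influence.
groups = {
    "material": ["component 1", "component 2"],
    "environment": ["temperature"],
}
influences = {"component 1": 0.8, "component 2": 0.3, "temperature": 0.1}
shown = filter_diagram(groups, influences, threshold=0.2)
shown_with_empty = filter_diagram(groups, influences, threshold=0.2, drop_empty_groups=False)
```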
<<Case in which Relationship Information 50 Includes Information on Plurality of Linear Models>>
As illustrated by using
Besides the above, for example, the generation unit 2040 may compute, for each explanatory variable, a statistical value (a sum value, a mean value, a maximum value, a minimum value, or the like) of a degree of influence of the explanatory variable, and generate one cause-and-effect diagram 10 by handling the statistical value as a degree of influence of the explanatory variable. For example, when a sum value of degrees of influence indicated by the relationship information 50 is used for generation of the cause-and-effect diagram 10, the generation unit 2040 uses an equation (1) below.
$$E_i = \sum_{j=1}^{n} e[i][j] \tag{1}$$
Herein, e[i][j] is a degree of influence of an explanatory variable i indicated by the relationship information 50 for a j-th linear model. n is a total number of linear models indicated by the relationship information 50. E_i is a value handled as a degree of influence of the explanatory variable i when the cause-and-effect diagram 10 is generated.
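The computation of a statistical value over a plurality of linear models can be sketched as follows. The function `aggregate_influence` and the sample values are hypothetical. Handling an explanatory variable that is absent from a linear model as having a degree of influence of 0 is an assumption of this sketch and is not stated in the present description.

```python
from statistics import mean

# Statistical values named in the description: sum, mean, maximum, minimum.
STATISTICS = {"sum": sum, "mean": mean, "max": max, "min": min}


def aggregate_influence(per_model, statistic="sum"):
    """Combine, for each explanatory variable i, its influences e[i][j] over all models j."""
    combine = STATISTICS[statistic]
    names = []
    for model in per_model:
        for name in model:
            if name not in names:
                names.append(name)
    # Assumption: a variable missing from a model contributes 0.0.
    return {
        name: combine([model.get(name, 0.0) for model in per_model])
        for name in names
    }


# Hypothetical degrees of influence for n = 2 linear models.
per_model = [
    {"component 1": 1.0, "component 2": -2.0},
    {"component 1": 3.0, "component 2": 0.5},
]
summed = aggregate_influence(per_model)          # sum value for each variable
averaged = aggregate_influence(per_model, "mean")
```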
The analysis apparatus 2000 outputs the cause-and-effect diagram 10 generated by the generation unit 2040. There are various specific methods of outputting the cause-and-effect diagram 10. For example, the analysis apparatus 2000 stores image data representing the cause-and-effect diagram 10 in a storage apparatus, displays the image data on a display apparatus connected to the analysis apparatus 2000, or transmits the image data to another apparatus (for example, the user terminal 60).
The analysis apparatus 2000 according to the example embodiment 2 outputs, in response to an input being performed of specifying a factor display 16 on an output cause-and-effect diagram 10, a graph 30 for an explanatory variable (hereinafter, also referred to as a specified explanatory variable) associated with the specified factor display 16. In
Herein, source data used for estimating a relationship between an objective variable and an explanatory variable represented by relationship information 50 include a plurality of sets of a value of an objective variable and a value of an explanatory variable. For example, the source data indicate time-series data of a set of a value of an objective variable and a value of an explanatory variable. For example, as the source data relating to manufacture of a product, data indicating sets of presence or absence of a defect in a product and a manufacturing condition at each different point in time of manufacture can be handled.
The graph 30 is for graphically representing a plurality of values indicated by source data for a specified explanatory variable. For example, the graph 30 is a graph representing the plurality of pieces of data in time-series order, or a graph representing a result of statistically processing the plurality of pieces of data.
A user can easily recognize difference in a degree of influence that each explanatory variable gives on an objective variable, by browsing the cause-and-effect diagram 10 generated by the analysis apparatus 2000. For example, an explanatory variable having a high degree of influence can be easily recognized by emphasizing the factor display 16 of the explanatory variable having a high degree of influence.
Then, it can be said that there is a high probability that a user having recognized difference in a degree of influence of an explanatory variable in this way wants to browse more information relevant to each explanatory variable. For example, when the factor display 16 of an explanatory variable having a high degree of influence is emphasized, there is a high probability that a user wants to browse more detailed information on an explanatory variable having a high degree of influence.
In this regard, the analysis apparatus 2000 according to the present example embodiment generates, when an input of specifying the factor display 16 is performed on the cause-and-effect diagram 10, the graph 30 for the specified explanatory variable. Thus, a user can easily acquire more detailed information on the specified explanatory variable.
Hereinafter, the analysis apparatus 2000 according to the present example embodiment will be described in further detail.
A function configuration of the analysis apparatus 2000 according to the example embodiment 2 is, for example, represented by
A hardware configuration of the analysis apparatus 2000 according to the example embodiment 2 is, for example, represented by
The acquisition unit 2020 acquires identification information of a specified explanatory variable in the cause-and-effect diagram 10 (S202). Herein, any of various existing techniques can be used to acquire identification information of a part specified in a diagram that has been output, for example, by being displayed on a display apparatus.
The generation unit 2040 acquires, for a specified explanatory variable, a plurality of pieces of data indicated by source data. The source data are stored in advance in a storage apparatus accessible from the generation unit 2040. The generation unit 2040 acquires a plurality of pieces of data for a specified explanatory variable by accessing the storage apparatus.
For example, as described above, the source data indicate time-series data including a set of a value of an objective variable and a value of an explanatory variable. In this case, the generation unit 2040 acquires time-series data representing a temporal change of a value of a specified explanatory variable. However, the source data only need to include a plurality of sets of a value of an objective variable and a value of an explanatory variable, and the sets may not be data representing a temporal change.
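The acquisition of data for a specified explanatory variable can be illustrated with a minimal sketch. The record layout, the variable names (`defect`, `temperature`, `pressure`), and the function `extract_series` are hypothetical and are not taken from the example embodiment; an in-memory list stands in for the storage apparatus.

```python
from datetime import datetime

# Hypothetical source data: each record pairs the objective variable
# ("defect") with explanatory variables observed at one point in time.
SOURCE_DATA = [
    {"time": datetime(2020, 1, 2, 9), "defect": 0, "temperature": 199.8, "pressure": 1.1},
    {"time": datetime(2020, 1, 1, 9), "defect": 0, "temperature": 201.5, "pressure": 1.2},
    {"time": datetime(2020, 1, 1, 10), "defect": 1, "temperature": 215.0, "pressure": 1.4},
]


def extract_series(source_data, variable):
    """Return (time, value) pairs for one explanatory variable in time order."""
    rows = [(r["time"], r[variable]) for r in source_data if variable in r]
    return sorted(rows, key=lambda row: row[0])


series = extract_series(SOURCE_DATA, "temperature")
```

When the sets do not represent a temporal change, the same sketch would apply with the sort step omitted.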
The generation unit 2040 generates the graph 30 by using a plurality of pieces of data acquired for a specified explanatory variable (S206). The graph 30 can be any type that can be generated by using a plurality of pieces of data. When acquired data are time-series data as described above, for example, the graph 30 is a line graph or the like representing a temporal change of a value of a specified explanatory variable. Besides the above, for example, the graph 30 is a histogram or the like representing a result of statistically processing a plurality of values of an explanatory variable.
Herein, the generation unit 2040 may generate a plurality of graphs 30 for a specified explanatory variable. For example, the generation unit 2040 generates a graph representing a time-series change of acquired data and a graph representing a result of statistically processing the data. Besides the above, for example, the generation unit 2040 statistically processes acquired data by using each of a plurality of methods, and generates the graph 30 for each of results.
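As one possible illustration of generating a plurality of graphs 30 from the same data, the following sketch prepares the points of a line graph and the bin counts of a histogram. The function names, the equal-width binning rule, and the sample values are assumptions made for illustration; rendering the graphs is left to any plotting facility.

```python
def line_graph_points(series):
    """Points for a line graph: (time, value) pairs in time-series order."""
    return sorted(series)


def histogram(values, bin_count):
    """Count values in equal-width bins covering [min, max]."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bin_count or 1.0  # fall back when all values are equal
    counts = [0] * bin_count
    for v in values:
        # The maximum value is placed in the last bin.
        index = min(int((v - lo) / width), bin_count - 1)
        counts[index] += 1
    return counts


# Hypothetical data: (time index, value) pairs for one explanatory variable.
series = [(3, 5.0), (1, 1.0), (2, 2.0), (4, 2.0), (5, 3.0)]
graphs = {
    "line": line_graph_points(series),
    "histogram": histogram([v for _, v in series], bin_count=2),
}
```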
Herein, the type of graph generated as the graph 30 may be fixed in advance, or may be specifiable by a user.
The generation unit 2040 outputs the generated graph 30. A method of outputting the graph 30 is similar to a method of outputting the cause-and-effect diagram 10. Further, the generation unit 2040 may output the cause-and-effect diagram 10 together with the graph 30. For example, the generation unit 2040 generates and outputs screen data (for example, a web page) including both the cause-and-effect diagram 10 and the graph 30.
The graph 30 is not limited to a line graph or a histogram. For example, a box plot may be generated by aggregating the data for each particular interval, or a bar graph representing the number of samples in each of the same intervals may be displayed. Note that the interval for aggregation may be set in advance, may be selected by a user from among a plurality of types of intervals prepared in advance (monthly, weekly, daily, and hourly), or may be specified by a user as any value.
Further, an entire period included in the graph 30 (that is, the period of time over which data are displayed) may be from a first point in time to a last point in time included in source data, or any period may be specified by a user. In the latter case, for example, a calendar is displayed on a screen and an input of specifying both a first point in time and a last point in time can be performed on the calendar.
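The aggregation for each interval can be sketched as follows. The interval keys, the function names, and the use of the inclusive quartile method are assumptions made for illustration; the description does not prescribe how the box plot statistics are computed.

```python
from collections import defaultdict
from datetime import datetime
from statistics import quantiles

# Hypothetical interval keys mirroring the choices named in the text.
INTERVAL_KEYS = {
    "monthly": lambda t: (t.year, t.month),
    "weekly": lambda t: tuple(t.isocalendar())[:2],  # (ISO year, ISO week)
    "daily": lambda t: (t.year, t.month, t.day),
    "hourly": lambda t: (t.year, t.month, t.day, t.hour),
}


def group_by_interval(series, interval):
    """Collect the values of (time, value) pairs into one list per interval."""
    key = INTERVAL_KEYS[interval]
    groups = defaultdict(list)
    for t, v in series:
        groups[key(t)].append(v)
    return dict(groups)


def box_plot_stats(values):
    """Five-number summary for a box plot; needs at least two samples."""
    q1, median, q3 = quantiles(values, n=4, method="inclusive")
    return {"min": min(values), "q1": q1, "median": median, "q3": q3, "max": max(values)}


series = [
    (datetime(2020, 1, 1, 9), 1.0),
    (datetime(2020, 1, 1, 10), 3.0),
    (datetime(2020, 1, 2, 9), 5.0),
]
daily = group_by_interval(series, "daily")
sample_counts = {day: len(values) for day, values in daily.items()}  # bar graph data
```

The dictionary `sample_counts` corresponds to the bar graph of the number of samples, and `box_plot_stats` can be applied to each interval that holds two or more samples.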
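The selection of the period to be displayed can be sketched as follows. The function `select_period` and its arguments are hypothetical; passing `None` stands for the case where the first or last point in time of the source data is used.

```python
from datetime import datetime


def select_period(series, start=None, end=None):
    """Keep (time, value) pairs with start <= time <= end.

    None for start or end means the corresponding bound of the data itself.
    """
    if not series:
        return []
    times = [t for t, _ in series]
    start = min(times) if start is None else start
    end = max(times) if end is None else end
    return [(t, v) for t, v in series if start <= t <= end]


series = [
    (datetime(2020, 1, 1), 1.0),
    (datetime(2020, 1, 15), 2.0),
    (datetime(2020, 2, 1), 3.0),
]
whole = select_period(series)
january = select_period(series, datetime(2020, 1, 1), datetime(2020, 1, 31))
```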
The generation unit 2040 may acquire, from source data, data on an objective variable in addition to data on a specified explanatory variable, and generate the graph 30 by using both. Doing so allows the relationship between the specified explanatory variable and the objective variable to be recognized directly.
Further, instead of data on an objective variable or together therewith, data on an item relevant to the objective variable may be added to the graph 30. For example, when an objective variable is presence or absence of a defect, the graph 30 may include data on a defect rate.
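As a minimal sketch of including an item relevant to the objective variable, the following code computes, for each day, the mean value of a specified explanatory variable together with a defect rate. The record layout, the daily bucketing, and the function `daily_overlay` are assumptions made for illustration.

```python
from collections import defaultdict
from datetime import date

# Hypothetical records: (day, value of the explanatory variable, defect flag).
records = [
    (date(2020, 1, 1), 200.0, 0),
    (date(2020, 1, 1), 210.0, 1),
    (date(2020, 1, 2), 190.0, 0),
]


def daily_overlay(records):
    """Per-day mean of the explanatory variable alongside the defect rate."""
    buckets = defaultdict(list)
    for day, value, defect in records:
        buckets[day].append((value, defect))
    rows = []
    for day in sorted(buckets):
        values = [v for v, _ in buckets[day]]
        defects = [d for _, d in buckets[day]]
        rows.append({
            "day": day,
            "mean_value": sum(values) / len(values),
            "defect_rate": sum(defects) / len(defects),
        })
    return rows


rows = daily_overlay(records)
```

Each row supplies one point for the explanatory variable and one point for the defect rate, so both can be drawn on a shared time axis.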
A user may be able to specify a plurality of factor displays 16 included in the cause-and-effect diagram 10. In this case, the generation unit 2040 generates the graph 30 for each of a plurality of specified explanatory variables. At this time, the generation unit 2040 may generate the graph 30 individually for each piece of data acquired for each of the plurality of specified explanatory variables, or may generate one graph 30 for these pieces of data.
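The two options for a plurality of specified explanatory variables can be sketched as follows. The function `build_graphs`, the `combined` flag, and the dictionary-based graph description are hypothetical and only illustrate the choice between individual graphs and one shared graph.

```python
def build_graphs(series_by_variable, combined=False):
    """Return one graph description per variable, or a single combined one."""
    if combined:
        return [{
            "title": ", ".join(series_by_variable),
            "series": dict(series_by_variable),
        }]
    return [
        {"title": name, "series": {name: points}}
        for name, points in series_by_variable.items()
    ]


# Hypothetical data for two specified explanatory variables.
specified = {
    "temperature": [(1, 201.5), (2, 215.0)],
    "pressure": [(1, 1.2), (2, 1.4)],
}
individual = build_graphs(specified)
single = build_graphs(specified, combined=True)
```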
While the example embodiments of the present invention have been described with reference to the drawings, the above-described example embodiments are examples of the present invention, and a combination of the above-described example embodiments or various configurations other than the above may be employed.
A part or all of the above-described example embodiments can be described as, but not limited to, the following supplementary notes.
Hereinafter, examples of a reference form will be added.
1. An analysis apparatus including:
an acquisition unit that acquires relationship information indicating a degree of influence of each of a plurality of explanatory variables on an objective variable; and
a generation unit that generates, by using the relationship information, a cause-and-effect diagram representing a relationship between the objective variable and the explanatory variables, in which
the generation unit determines a display aspect for a display relating to each explanatory variable or presence or absence of the display in the cause-and-effect diagram, based on the degree of influence of the explanatory variable.
2. The analysis apparatus according to supplementary note 1, in which
a relationship between the objective variable and a plurality of the explanatory variables is represented by one or more linear models, and
the degree of influence of the explanatory variable indicated by the relationship information is represented by a coefficient by which the explanatory variable is multiplied in the linear model.
3. The analysis apparatus according to supplementary note 1 or 2, in which
the relationship information indicates, for each of the explanatory variables, a plurality of degrees of influence of the explanatory variable, and
the generation unit computes, for each of the explanatory variables, a statistical value of the plurality of degrees of influence indicated by the relationship information for the explanatory variable, and generates the cause-and-effect diagram by handling the computed statistical value as the degree of influence of the explanatory variable.
4. The analysis apparatus according to any one of supplementary notes 1 to 3, in which
the generation unit
makes a display aspect different between a display relating to an explanatory variable having the degree of influence equal to or more than a threshold value and a display relating to an explanatory variable other than the explanatory variable having the degree of influence equal to or more than the threshold value,
makes a display aspect different between a display relating to an explanatory variable having the degree of influence within a predetermined rank in order and a display relating to an explanatory variable other than the explanatory variable having the degree of influence within the predetermined rank in order, or,
for each group of the explanatory variables, makes a display aspect different between a display relating to an explanatory variable having the degree of influence within a predetermined rank in order among explanatory variables belonging to the group and a display relating to an explanatory variable other than the explanatory variable having the degree of influence within the predetermined rank in order among the explanatory variables belonging to the group.
5. The analysis apparatus according to any one of supplementary notes 1 to 3, in which
the generation unit
allows the cause-and-effect diagram to include only a display relating to an explanatory variable having the degree of influence equal to or more than a threshold value,
allows the cause-and-effect diagram to include only a display relating to an explanatory variable having the degree of influence within a predetermined rank in order, or,
for each group of the explanatory variables, allows the cause-and-effect diagram to include only a display relating to an explanatory variable having the degree of influence within a predetermined rank in order among explanatory variables belonging to the group.
6. The analysis apparatus according to any one of supplementary notes 1 to 5, in which
the generation unit
outputs the cause-and-effect diagram,
acquires, when the explanatory variable is specified in the output cause-and-effect diagram, data indicating a plurality of values of the specified explanatory variable, and
generates a graph by using the data.
7. The analysis apparatus according to supplementary note 6, in which
the generation unit
acquires time-series data on the specified explanatory variable, and
generates, as the graph, a first graph representing a temporal change of a value of the explanatory variable or a second graph representing a result of statistically processing the time-series data.
8. The analysis apparatus according to supplementary note 7, in which
the generation unit generates screen data including both the first graph and the second graph.
9. The analysis apparatus according to any one of supplementary notes 6 to 8, in which
the generation unit allows the graph to include data on the objective variable.
10. A control method executed by a computer, including:
an acquisition step of acquiring relationship information indicating a degree of influence of each of a plurality of explanatory variables on an objective variable; and
a generation step of generating, by using the relationship information, a cause-and-effect diagram representing a relationship between the objective variable and the explanatory variables, in which,
in the generation step, a display aspect for a display relating to each explanatory variable or presence or absence of the display in the cause-and-effect diagram is determined based on the degree of influence of the explanatory variable.
11. The control method according to supplementary note 10, in which
a relationship between the objective variable and a plurality of the explanatory variables is represented by one or more linear models, and
the degree of influence of the explanatory variable indicated by the relationship information is represented by a coefficient by which the explanatory variable is multiplied in the linear model.
12. The control method according to supplementary note 10 or 11, in which
the relationship information indicates, for each of the explanatory variables, a plurality of degrees of influence of the explanatory variable,
the control method further including, in the generation step, computing, for each of the explanatory variables, a statistical value of the plurality of degrees of influence indicated by the relationship information for the explanatory variable, and generating the cause-and-effect diagram by handling the computed statistical value as the degree of influence of the explanatory variable.
13. The control method according to any one of supplementary notes 10 to 12, further including, in the generation step,
making a display aspect different between a display relating to an explanatory variable having the degree of influence equal to or more than a threshold value and a display relating to an explanatory variable other than the explanatory variable having the degree of influence equal to or more than the threshold value,
making a display aspect different between a display relating to an explanatory variable having the degree of influence within a predetermined rank in order and a display relating to an explanatory variable other than the explanatory variable having the degree of influence within the predetermined rank in order, or,
for each group of the explanatory variables, making a display aspect different between a display relating to an explanatory variable having the degree of influence within a predetermined rank in order among explanatory variables belonging to the group and a display relating to an explanatory variable other than the explanatory variable having the degree of influence within the predetermined rank in order among the explanatory variables belonging to the group.
14. The control method according to any one of supplementary notes 10 to 12, further including, in the generation step,
allowing the cause-and-effect diagram to include only a display relating to an explanatory variable having the degree of influence equal to or more than a threshold value,
allowing the cause-and-effect diagram to include only a display relating to an explanatory variable having the degree of influence within a predetermined rank in order, or,
for each group of the explanatory variables, allowing the cause-and-effect diagram to include only a display relating to an explanatory variable having the degree of influence within a predetermined rank in order among explanatory variables belonging to the group.
15. The control method according to any one of supplementary notes 10 to 14, further including:
in the generation step,
outputting the cause-and-effect diagram;
acquiring, when the explanatory variable is specified in the output cause-and-effect diagram, data indicating a plurality of values of the specified explanatory variable; and
generating a graph by using the data.
16. The control method according to supplementary note 15, further including:
in the generation step,
acquiring time-series data on the specified explanatory variable; and
generating, as the graph, a first graph representing a temporal change of a value of the explanatory variable or a second graph representing a result of statistically processing the time-series data.
17. The control method according to supplementary note 16, further including,
in the generation step, generating screen data including both the first graph and the second graph.
18. The control method according to any one of supplementary notes 15 to 17, further including,
in the generation step, allowing the graph to include data on the objective variable.
19. A program causing a computer to execute the control method according to any one of supplementary notes 10 to 18.
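The determinations described in supplementary notes 2 to 5 can be illustrated with a minimal sketch. The coefficients, the group assignment, and the function names are hypothetical. Taking the mean of absolute coefficient values is only one possible statistical value and is an assumption made here for illustration.

```python
from statistics import mean

# Hypothetical relationship information: the coefficient of each explanatory
# variable in each of two linear models, and a hypothetical grouping.
COEFFICIENTS = {
    "temperature": [0.8, 0.6],
    "pressure": [0.1, 0.3],
    "humidity": [-0.5, -0.7],
    "operator": [0.05, 0.05],
}
GROUPS = {
    "temperature": "machine",
    "pressure": "machine",
    "humidity": "environment",
    "operator": "people",
}


def degree_of_influence(coefficients):
    """Mean absolute coefficient per variable (an assumed statistical value)."""
    return {name: mean(abs(c) for c in values) for name, values in coefficients.items()}


def select_by_threshold(influence, threshold):
    """Variables whose degree of influence is equal to or more than the threshold."""
    return {name for name, degree in influence.items() if degree >= threshold}


def select_by_rank(influence, top_k):
    """Variables whose degree of influence is within the top_k ranks."""
    ranked = sorted(influence, key=influence.get, reverse=True)
    return set(ranked[:top_k])


def select_by_group_rank(influence, groups, top_k):
    """Variables within the top_k ranks among the variables of their own group."""
    selected = set()
    for group in set(groups.values()):
        members = {n: d for n, d in influence.items() if groups[n] == group}
        selected |= select_by_rank(members, top_k)
    return selected


influence = degree_of_influence(COEFFICIENTS)
by_threshold = select_by_threshold(influence, 0.5)
by_rank = select_by_rank(influence, 1)
by_group_rank = select_by_group_rank(influence, GROUPS, 1)
```

The selected variables may either be given a different display aspect, such as emphasis, or be the only variables included in the cause-and-effect diagram.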
This application is based upon and claims the benefit of priority from Japanese patent application No. 2019-193810, filed on Oct. 24, 2019, the disclosure of which is incorporated herein in its entirety by reference.
Number | Date | Country | Kind |
---|---|---|---|
2019-193810 | Oct 2019 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2020/039819 | 10/23/2020 | WO |