The present invention generally relates to a technique for supporting material creation.
Materials informatics (MI) is known as one of material creation support methods. MI is a method for supporting material creation by means of an informatics technique and it is generally a method for aggregating data about material properties and searching for materials with new material properties by using machine learning (for example, PTL 1).
PTL 1: Japanese Patent Application Laid-Open (Kokai) Publication No. 2021-043959
For example, regarding organic materials, simulation of molecular properties from chemical compositions (such as first-principles calculation) is easy. Therefore, a lot of highly-accurate simulation data about properties can be acquired and it is thereby easy to apply MI.
However, the application of MI to some types of materials is difficult. For example, regarding inorganic materials (ceramics), their complicated high-order structures in which a large number of crystalline particles gather (such as their particle size, particle size distribution, and particle boundary phase) affect their properties and it is thereby difficult to simulate the properties. Therefore, it is difficult to acquire highly-accurate simulation data about the properties and it is thereby difficult to apply MI.
A material creation support system has first to third models and a processing unit. The first model is a model to which a manufacturing method recipe dataset indicating a manufacturing method recipe is input and from which a material properties dataset indicating a material property/properties is output. The second model is a model to which a material characteristics dataset indicating a material characteristic(s) is input and from which a material properties dataset is output. The third model is a model to which a manufacturing method recipe dataset is input and from which a material characteristics dataset is output. The processing unit performs either inference using the first model or inference using the second model and the third model according to accuracy of at least one model of the first to third models and, upon the inference, generates or updates recipe properties data that is data indicating the association between a manufacturing method recipe for a material and a material property/properties.
The present invention can contribute to a proposal of a manufacturing method recipe which supports developing a material having desired material properties.
In the description indicated below, an “interface apparatus” may be one or more interface devices. The one or more interface devices may be at least one of the following:
Furthermore, in the description indicated below, a “memory” is one or more memory devices, which are one example of one or more storage devices, and may typically be a main storage device. At least one memory device in the memory may be a volatile memory device or a nonvolatile memory device.
Furthermore, in the description indicated below, a “persistent storage apparatus” may be one or more persistent storage devices which are one example of one or more storage devices. The persistent storage device may be typically a nonvolatile storage device (such as an auxiliary storage device) and may be specifically, for example, an HDD (Hard Disk Drive), an SSD (Solid State Drive), an NVME (Non-Volatile Memory Express) drive, or an SCM (Storage Class Memory).
Furthermore, in the description indicated below, a “storage device” may be a memory and at least a memory for the persistent storage apparatus.
Furthermore, in the description indicated below, a “processor” may be one or more processor devices. At least one processor device may be typically a microprocessor device like a CPU (Central Processing Unit), but may be a processor device of a different type like a GPU (Graphics Processing Unit). At least one processor device may be of a single-core type or a multi-core type. At least one processor device may be a processor core. At least one processor device may be a processor device in a broad sense such as a circuit which is an aggregate of gate arrays by means of hardware description languages for performing a part or whole of processing (such as an FPGA [Field-Programmable Gate Array], CPLD [Complex Programmable Logic Device], or ASIC [Application Specific Integrated Circuit]).
Furthermore, in the description indicated below, a function may be sometimes described by an expression like “yyy unit”; however, the function may be implemented by execution of one or more computer programs by a processor, or may be implemented by one or more hardware circuits (such as FPGA or ASIC), or may be implemented by a combination of the above. If the function is implemented by the execution of a program by the processor, specified processing is performed by using, for example, storage devices and/or interface devices as appropriate and, therefore, the function may be considered as at least part of the processor. The processing explained by referring to the function as a subject may be the processing executed by the processor or an apparatus which has that processor. The program may be installed from a program source. The program source may be, for example, a program distribution computer or a computer-readable recording medium (such as a non-transitory recording medium). An explanation of each function is one example and a plurality of functions may be gathered as one function or one function may be divided into a plurality of functions.
An embodiment of the present invention will be described below with reference to the drawings. Incidentally, in the following description, “MI” stands for “materials informatics” and “DB” stands for “database.”
A material creation support system 10 for supporting material creation is constructed. The material creation support system 10 may be a physical computer system, may be a logical computer system based on a physical computer system, or may be a combination of at least part of a physical computer system and at least part of a logical computer system. The physical computer system may be configured of one or a plurality of physical computers and may include an interface apparatus, a storage apparatus, and a processor connected to them. The logical computer system may include a virtual machine or may include a system as a cloud computing service.
The material creation support system 10 has a data presentation unit 50, a data acquisition unit 60, and a data conversion unit 70. These functions 50, 60, and 70 may be implemented by execution of computer programs by the processor.
The data presentation unit 50: acquires data from a data mart 107 periodically or every time it receives a query; and presents target data. Data in the data mart 107 will be hereinafter referred to as “organized data.” The target data is at least either of the acquired organized data or data based on the inference result obtained by inputting the relevant organized data to a machine learning model. The data mart 107 is an example of a first data source. The relevant data should preferably include data about ceramic materials.
The data acquisition unit 60 stores experimental data which is data obtained by experiments based on the target data in an electronic laboratory notebook 131, a manufacturing DB 121, and an evaluation DB 122. The electronic laboratory notebook 131, the manufacturing DB 121, and the evaluation DB 122 are examples of a second data source. Moreover, in this embodiment, “experimental data” may be data about the experiments such as a summary, details, results, or evaluations of the experiments. The data conversion unit 70 acquires experimental data from the electronic laboratory notebook 131, the manufacturing DB 121, and the evaluation DB 122 and converts the experimental data into data to be included in the existing or new data mart 107.
After the organized data based on the experimental data is stored in the existing data mart 107 or after a new data mart 107 including the organized data based on the experimental data is created, the target data is based on the updated or newly created data mart 107, that is, the organized data of the experimental data of the experiment(s) performed based on the target data presented in the past. Therefore, regardless of whether or not the type of material to be created is a material type which makes it possible to obtain highly-accurate simulation data about material properties, the possibility of presenting useful data for the material creation can be enhanced. As a result, material types which can support the material creation by means of the informatics technique are expanded. Specifically speaking, for example, even if the type of material to be created is an inorganic material (ceramics), data useful for the material creation can be presented.
This embodiment will be described below in detail.
Regarding the material creation support system 10, the interface apparatus may be communicably connected to at least one of the experimental system 110, the data acquisition unit 60, and the researcher terminal 11A. The storage apparatus may store at least some data of the electronic laboratory notebook 131, the manufacturing DB 121, the evaluation DB 122, various types of data 150, a data lake 102, and a data mart 107. The processor may implement the aforementioned functions 50, 60, and 70 by executing computer programs.
The data presentation unit 50 and the data conversion unit 70 are provided in an MI platform system 100. The data acquisition unit 60 is provided outside (or inside) the MI platform system 100. The MI platform system 100 may be a physical computer system, may be a logical computer system, or may be a combination of at least part of the physical computer system and at least part of the logical computer system. In this embodiment, as illustrated in
The AI unit 108 performs learning of a machine learning model(s) and inference using the machine learning model(s). For example, in response to an instruction from the IF unit 109, the AI unit 108 outputs to the IF unit 109 an inference result obtained by inputting the organized data acquired from the data mart 107 to the machine learning model.
The IF unit 109 receives a query from the researcher terminal 11A and presents the target data to the researcher terminal 11A in response to the query. The researcher terminal 11A is an information processing terminal (such as a personal computer or a smartphone) of a materials researcher 5A (an example of a user). The researcher terminal 11A is an example of a transmission source of a query of the target data and also an example of a presentation destination of the target data. When the IF unit 109 receives a query from the researcher terminal 11A (or periodically), it presents a target indicated by the target data (the organized data acquired from the data mart 107 and/or the data based on the inference result obtained by issuing an instruction to the AI unit 108). The presented target may be, for example, a manufacturing method recipe or a material property/properties. In this embodiment, the “manufacturing method recipe” means a material creation method and may typically include a material composition and/or a synthesis process. The “material composition” may be, for example, a compounded composition and the “synthesis process” may be raw material types (such as particle size differences), process conditions, etc. The manufacturing method recipe may include a material development strategy.
The materials researcher 5A creates a new material based on the presented target data or performs an experiment based on the target data. The presentation destination (transmission destination) of the target data may be a system like an experimental system 110 instead of or in addition to the researcher terminal 11A.
The materials researcher 5A performs an experiment using the experimental system 110 based on the presented target. For example, the experiment using the experimental system 110 may be a combinatorial experiment. The experimental system 110 may include one or a plurality of apparatuses such as a ceramic manufacturing apparatus 111 (such as an atmospheric firing furnace) and a ceramic evaluation apparatus 112 (such as an apparatus for evaluating a thermal expansion rate). Experimental data is output from the experimental apparatus(es) such as the manufacturing apparatus 111 and the evaluation apparatus 112. The output experimental data is transmitted to and stored in a data server 120. Moreover, the combinatorial experiment(s) is an example of an experiment(s) and at least one of a high-throughput experiment(s), an automatic experiment(s) using robots, and an experiment(s) mainly performed by manual work may be adopted instead of or in addition to the combinatorial experiment(s).
The data acquisition unit 60 includes the data server 120 and a laboratory notebook unit 130.
The data server 120 manages a DB in which experimental data from the experimental system 110 is stored. The DB managed by the data server 120 may exist for each experimental apparatus type. For example, the manufacturing DB 121 in which experimental data from the manufacturing apparatus 111 is stored and the evaluation DB 122 in which experimental data from the evaluation apparatus 112 is stored may be managed. The experimental data from the experimental system 110 is formed as structured data by the data server 120 and stored in the DB.
The laboratory notebook unit 130 manages experimental data acquired from a DB managed by the data server 120 as an electronic laboratory notebook 131. The electronic laboratory notebook 131 is structured data which is, for example, a DB.
The data conversion unit 70 has a collection unit 101, a feature calculation unit 103, an image analysis unit 104, a natural language analysis unit 105, and an organization unit 106.
The collection unit 101 collects the experimental data from the electronic laboratory notebook 131 and DBs of the data server 120, formats the collected experimental data, and stores the formatted experimental data in the data lake 102. The word “formatting” herein used means organizing the configuration of the experimental data into a specified configuration.
Moreover, the collection unit 101 may collect various types of data 150. The various types of data 150 may include the experimental data. Specifically speaking, for example, the various types of data 150 may include at least one of numerical value data from past experiments, materials databases available with or without charge, language data indicating past experiments (such as experiment content and experiment results), materials data obtained by calculations such as simulations, and language data from patent documents and articles. There may be one or more data sources for the various types of data 150. For example, the data sources for the numerical value data from the past experiments may be different from the data sources for the language data from patent documents and articles. The various types of data 150 may include sensor data which is data including measured values by sensors. At least part of the various types of data 150 may be stored in the data lake 102.
The feature calculation unit 103 calculates a feature(s) of one or more specified types of data in the formatted experimental data. Accordingly, each of the one or more types of data is quantified and, as a result, processing by the AI unit 108 becomes possible. For example, one or more types of data may be image data and text data. The image data is analyzed by the image analysis unit 104 and the feature calculation unit 103 calculates the feature(s) of the image data based on the analysis result. Moreover, the text data is text mined by the natural language analysis unit 105, and the feature calculation unit 103 calculates the feature(s) of the text data based on the text mining result. The aforementioned neural network may be used for calculating the feature(s). A duplicate of the experimental data may be generated in the data lake 102 and each of the one or more specified types of data in the duplicate may be converted into a calculated feature(s) (numerical value(s)).
The organization unit 106 organizes the data lake 102 into one or a plurality of data marts 107 as a set of data conforming to a specified condition(s). For example, a data mart 107 including organized data which can be presented as at least part of the target data not via the AI unit 108, and a data mart 107 including organized data which can be presented as at least part of the target data via the AI unit 108 may be generated. Thus, the organized data based on the experimental data collected and formatted in the MI platform system 100 is prepared and the target data based on the organized data is presented to the materials researcher 5A by the MI platform system 100.
Moreover, the presentation of the data to the materials researcher 5A may also be performed not via the MI platform system 100. For example, another materials researcher 5B may acquire experimental data from the electronic laboratory notebook 131 by using a researcher terminal 11B, organize the experimental data, and input such data to the AI unit 140. The AI unit 140 may be implemented outside (or inside) the researcher terminal 11B. Similarly to the AI unit 108, the AI unit 140 may also perform inference by inputting the input organized data to a machine learning model. The data based on this inference result may be presented to the materials researcher 5A. The materials researcher 5A may perform experiments based on this presented data. This experimental data may also be stored in the data server 120 from the experimental system 110 and collected in the MI platform system 100.
Specified functions implemented inside or outside the researcher terminal 11A may facilitate experiments based on the presented target data. For example, the relevant functions may determine what experiments will be recommended to the materials researcher 5A, and may present the determined items to the materials researcher 5A. Alternatively, the target data may include data indicating what experiments will be recommended.
According to the above-described embodiment, it is expected that a cycle of presenting the target data, experimenting, collecting the experimental data, converting the experimental data into organized data, and presenting the target data based on the organized data can be performed at high efficiency and high speed. In this way, MI mainly based on the experimental data is implemented. Therefore, creation of a material of a material type for which it is difficult to acquire highly-accurate simulation data about material properties, for example, a material with a complicated high-order structure of ceramics can be supported.
Moreover, according to the above-described embodiment, various types of data inside or outside an organization (such as an enterprise) which provides the MI platform system 100 can be aggregated, structured, and managed in the MI platform system 100 and can be reflected in the data mart 107. Therefore, the possibility that the presented target data may be useful for the material creation will be enhanced.
The following expressions are possible based on the above-described explanation. The following expressions may include a supplementary explanation of the above-described explanation or an explanation of variations.
A material creation support system including:
a data presentation unit (50) that: acquires data from a first data source periodically or every time it receives a query; and presents target data which is at least either of the acquired data or data based on an inference result obtained by inputting the acquired data to a machine learning model;
a data acquisition unit (60) that stores experimental data, which is data obtained by an experiment based on the target data, in a second data source; and
a data conversion unit (70) that acquires the experimental data from the second data source and converts the experimental data into data to be included in the existing first data source or a new first data source.
The material creation support system according to Expression 1,
wherein at least one of the data acquired from the first data source, the presented target data, the experimental data, and the data included in the existing first data source or a new first data source includes data about ceramic materials.
A material creation support method including:
(A) acquiring data from a first data source periodically or every time a query is received; and presenting target data which is at least either of the acquired data or data based on an inference result obtained by inputting the acquired data to a machine learning model;
(B) storing the experimental data which is data obtained by an experiment based on the target data in a second data source; and
(C) acquiring the experimental data from the second data source and converting the experimental data into data to be included in the existing first data source or a new first data source.
The material creation support method according to Expression 3,
wherein the machine learning model acquires updated data included in the existing first data source or a new first data source by acquiring the experimental data and learns parameters.
Specifically speaking, for example, the details are as follows. The data in the existing data mart 107 is updated based on the experimental data acquired by (C) (such as the experimental data acquired by the data conversion unit 70) or a new data mart 107 including data based on the acquired experimental data is generated. Therefore, the existing or new data mart 107 includes update data which is data based on the acquired experimental data. The AI unit 108 may acquire the update data from the data mart 107 and learn (for example, update) the parameters of the machine learning model used by the AI unit 108.
A material creation support method including:
collecting existing or new experimental data and converting them into data to be included in a first data source; and
acquiring the data from the first data source periodically or every time a query is received; and presenting target data which is at least either of the acquired data or data based on an inference result obtained by inputting the acquired data to a machine learning model.
For example, this material creation support method may be performed by the MI platform system 100. In other words, the material creation support system 10 may have the data presentation unit 50 and the data conversion unit 70, but may not have the data acquisition unit 60.
The material creation support method according to any one of Expressions 3 to 5,
wherein at least one of the data acquired from the first data source, the presented target data, the experimental data, the data included in the existing first data source or a new first data source, and the data based on the inference result includes data about ceramic materials.
A method for generating a material creation support model, wherein the method includes:
(A) acquiring data from a first data source periodically or every time a query is received; and presenting target data which is at least either of the acquired data or data based on an inference result obtained by inputting the acquired data to a machine learning model;
(B) storing experimental data, which is data obtained by an experiment based on the target data, in a second data source;
(C) acquiring the experimental data from the second data source and converting the experimental data into data to be included in the existing first data source or a new first data source; and
(D) executing learning processing of a machine learning model for outputting a recommended inference result indicating a manufacturing method recipe or a material composition by using learning data included in the first data source of (C).
The method for generating the material creation support model according to Expression 7,
wherein at least one of the data acquired from the first data source, the presented target data, the experimental data, the data included in the existing first data source or a new first data source, the learning data, and the data indicating the recommended inference result includes data about ceramic materials.
A program for causing a computer to execute the material creation support method stated in any one of Expressions 3 to 6.
A program for causing a computer to execute the method for generating the material creation support model according to Expression 7 or 8.
Now, the AI unit 108 according to this embodiment will be described below in detail. In the following description, it is assumed that a target material to be created is ceramics. Moreover, in the following description, definitions of terms are as follows.
The AI unit 108 may receive a dataset from the IF unit 109 (that is, a dataset based on the researcher input information may be generated by the IF unit 109) or may receive the researcher input information from the IF unit 109 and generate a dataset based on that researcher input information. Moreover, the AI unit 108 may output a dataset to the IF unit 109 (that is, the researcher output information based on the dataset may be generated by the IF unit 109) or may output the researcher output information based on the dataset to the IF unit 109. Furthermore, the IF unit 109 may be included in the AI unit 108.
The AI unit 108 inputs a manufacturing method recipe dataset 311 and outputs (predicts) a material properties dataset 313 upon inference using a machine learning model. Furthermore, in this embodiment, when the researcher input information is information indicating a material property/properties, the AI unit 108 identifies a manufacturing method recipe dataset 311 of a manufacturing method recipe expected to create a material having the relevant material property/properties and outputs the manufacturing method recipe dataset 311 (or researcher output information based on the manufacturing method recipe dataset 311) to the IF unit 109. Specifically speaking, when the researcher 5A inputs information indicating a manufacturing method recipe to the MI platform system 100, the researcher 5A can receive provision of information indicating the predicted material properties of the material assumed to have been created by the relevant manufacturing method recipe from the MI platform system 100. Moreover, when the researcher 5A inputs information indicating material properties to the MI platform system 100, the researcher 5A can receive provision of information indicating the manufacturing method recipe of the material expected to create a material with the relevant material properties from the MI platform system 100.
The AI unit 108 has a learning unit 351 for performing learning of a machine learning model(s) and an inference unit 352 for performing inference using the learned machine learning model(s) (that is, the aforementioned inference). Moreover, there are first to third models 301 to 303 as machine learning models. The first to third models 301 to 303 are stored in the storage apparatus 202.
The first model 301 is a model to which a manufacturing method recipe dataset 311 is input and from which a material properties dataset 313 is output. Ceramics have complicated high-order structures in which a large number of crystal particles gather (such as particle size, particle size distribution, and particle boundary phase); however, material characteristics including such high-order structures affect material properties of ceramics, so that it is generally difficult to predict the material properties from a manufacturing method recipe with high the accuracy.
Therefore, in this embodiment, based on the fact that the material characteristics affect the material properties, in other words, there is a correlation between the material characteristics and the material properties, the second model 302 to which a material characteristics dataset 312 (a dataset indicating the material characteristics) is input and from which a material properties dataset 313 is output is prepared. Then, in order to make the material properties dataset 313 predictable from the manufacturing method recipe dataset 311 via an intermediate dataset that is the material characteristics dataset 312, the third model 303 to which a manufacturing method recipe dataset 311 is input and from which a material characteristics dataset 312 is output is prepared.
As described earlier, the “manufacturing method recipe” indicated by the manufacturing method recipe dataset 311 may include the material composition and/or the synthesis process and may specifically include, for example, one or a plurality of recipe items (such as a synthesis temperature) and a value of each of the one or plurality of relevant recipe items (typically a numerical value(s)).
The “material property/properties” indicated by the material properties dataset 313 means a property/properties indicated by a substance as a material (such as a structure) and may specifically include, for example, one or a plurality of property items (such as strength and a thermal expansion coefficient) and a value of each of the one or plurality of the property items (typically a numerical value(s)).
The “material characteristic(s)” indicated by the material characteristics dataset 312 means a physical and/or chemical state indicated by a substance as a material and may specifically include, for example, one or a plurality of characteristic items (such as crystal particle size and particle size distribution) and a value of each of the one or plurality of the characteristic items (typically a numerical value(s)). A feature(s) of each characteristic item may be included in the material characteristics dataset 312. For example, by using the second model 302, it is possible to predict the material properties which include a value(s) of one or a plurality of property items from the material characteristics including values of a plurality of characteristic items or to predict the material properties which include a value(s) of one or a plurality of property items from the material characteristics including a value of one characteristic item.
The material characteristics dataset 312 may be a dataset indicating a feature(s) calculated by the feature calculation unit 103 and may be stored in the data mart 107 and acquired from the data mart 107. One or a plurality of features indicated by the material characteristics dataset 312 are extracted from image data (such as SEM images) and spectral data by making use of data processing (such as processing using CNN) by the feature calculation unit 103. Incidentally, SEM stands for “Scanning Electron Microscope” and CNN stands for “Convolutional Neural Network.”
Regarding each of the first to third models 301 to 303, the relevant model may be linear regression (such as Ridge regression, Lasso regression, and Elastic Net regression), logistic regression, SVM (Support Vector Machine), a decision tree model (such as a random forest, XGBoost (Xtreme Gradient Boosting), and LightGBM (Light Gradient Boosting Machine)), a neural network (such as CNN, RNN (Recurrent Neural Network), ResNet (Residual Network)), Bayesian optimization, k-NN (k-Nearest Neighbor), or a combination of any two or more of these models. Preferably, for example, regarding each of the first to third models 301 to 303, the relevant model may be a random forest, XGBoost, LightGBM, CNN, Bayesian optimization, or a combination of any two or more of these models. For example, the following is expected:
In this embodiment, each of the first to third models 301 to 303 is a regression equation.
The learning unit 351 acquires learning data of the first model 301, specifically, one or more manufacturing method recipe datasets 311 and one or more material properties datasets 313 corresponding to the one or more manufacturing method recipe datasets 311 (S401). These datasets 311 and/or 313 may be input from the researcher 5A in learning processing, may be prepared in advance as learning data, may be acquired from data such as experimental data and/or various types of data 150, or may be a manufacturing method recipe dataset 311 (a manufacturing method recipe dataset 311 newly generated by the learning unit 351) indicating a manufacturing method recipe with narrowed-down recipe items or value ranges (or a manufacturing method recipe with expanded recipe items or value ranges) in a manufacturing method recipe indicated by a certain manufacturing method recipe dataset 311.
The learning unit 351 learns the first model 301 by using the acquired datasets 311 and 313 (S402). Specifically speaking, for example, the learning unit 351 learns the first model 301 by using the manufacturing method recipe dataset 311 and the material properties dataset 313 corresponding to that manufacturing method recipe dataset 311 (the manufacturing method recipe and the material properties of the material created by that manufacturing method recipe).
The learning unit 351 judges whether or not to terminate the learning of the first model 301 (S403). For example, if the accuracy (predicted accuracy) of the first model 301 is equal to or higher than a first accuracy threshold value (or if the accuracy of the first model 301 is higher than the accuracy of the second model 302 and the third model 303), the judgment result of S403 is true (S403: Yes). In this case, the processing of the learning unit 351 is terminated.
If the judgment result of S403 is false (S403: No), the learning unit 351 acquires learning data of the second model 302, specifically, one or more material characteristics datasets 312 and one or more material properties datasets 313 corresponding to the one or more material characteristics datasets 312 (S404). These datasets 312 and/or 313 may be input from the researcher 5A in the learning processing, may be prepared in advance as the learning data, may be acquired from data such as experimental data and/or various types of data 150, or may be a material characteristics dataset 312 (a material characteristics dataset 312 newly generated by the learning unit 351) indicating material characteristics with narrowed-down characteristic items or value ranges (or material characteristics with expanded characteristic items or value ranges) in material characteristics indicated by a certain material characteristics dataset 312.
The learning unit 351 learns the second model 302 by using the acquired datasets 312 and 313 (S405). Specifically speaking, for example, the learning unit 351 learns the second model 302 by using the material characteristics dataset 312 and the material properties dataset 313 corresponding to that material characteristics dataset 312 (the material characteristic(s) and the material property/properties of the material having such material characteristic(s)). The purpose of this learning is to find the material characteristic(s) having a strong correlation with the material property/properties, or in other words, to identify the material characteristic(s) corresponding to the material property/properties.
The learning unit 351 judges whether or not to terminate the learning of the second model 302 (S406). For example, if the accuracy (predicted accuracy) of the second model 302 is equal to or higher than a second accuracy threshold value, the judgment result of S406 is true (S406: Yes). If the accuracy of the second model 302 is equal to or higher than the threshold value, it means that an important characteristic item(s) and/or value(s) (feature(s)) has been identified. In other words, if no important characteristic item(s) and/or value(s) (feature(s)) has been identified, the accuracy of the second model 302 will typically not become equal to or higher than the threshold value.
If the judgment result of S406 is true, the learning unit 351 acquires learning data of the third model 303, specifically, one or more material characteristics datasets 312 identified upon the learning of the second model 302 and one or more manufacturing method recipe datasets 311 corresponding to the one or more material characteristics datasets 312 (S407). The manufacturing method recipe dataset(s) 311 corresponding to the material characteristics dataset(s) 312 may be, for example, the manufacturing method recipe dataset(s) 311 corresponding to the material properties dataset(s) 313 corresponding to the material characteristics dataset(s) 312. These datasets 312 and/or 311 may be datasets acquired from datasets used in this learning processing or may be a manufacturing method recipe dataset 311 (a manufacturing method recipe dataset 311 newly generated by the learning unit 351) indicating a manufacturing method recipe with narrowed-down recipe items or value ranges (or a manufacturing method recipe with expanded recipe items or value ranges) in a manufacturing method recipe indicated by a certain manufacturing method recipe dataset 311.
The learning unit 351 learns the third model 303 by using the acquired datasets 312 and 311 (S408). Specifically speaking, for example, the learning unit 351 learns the third model 303 by using the material characteristics dataset 312 and the manufacturing method recipe dataset 311 corresponding to that material characteristics dataset 312 (the material characteristic(s) and the manufacturing method recipe for creating a material having such material characteristic(s)).
Incidentally, the learning of the third model 303 (S407 and S408) may be performed in parallel with the learning of the second model 302 (S404 and S405). The material characteristics dataset 312 acquired in S407 may be a dataset indicating the feature(s) identified (narrowed down) upon the learning of the second model 302 or may be a material characteristics dataset 312 before such identification is performed.
The learning unit 351 judges whether or not to terminate the learning of the third model 303 (S409). If the accuracy (predicted accuracy) of the third model 303 is equal to or higher than a third accuracy threshold value, the judgment result of S409 is true (S409: Yes). Incidentally, the first to third accuracy threshold values described above may be the same value or different values.
If the judgment result of S409 is true, the learning unit 351 performs S401 again. In S401, the manufacturing method recipe dataset 311 identified (or newly generated) upon the learning of the third model 303 and the material properties dataset 313 corresponding to the above-mentioned manufacturing method recipe dataset 311 may be acquired.
According to the processing illustrated in
In this embodiment, “relearning” means that the processing of the learning unit 351 is terminated once and the processing of the learning unit 351 is performed again. The relearning may be started at a specified timing. The specified timing may be, for example, at least one of the following:
The inference by the inference unit 352 may be performed in parallel with the processing of the learning unit 351. Moreover, the dataset obtained upon the inference by the inference unit 352 may be used for the learning of any one of the models 301 to 303. Specifically speaking, for example, at least one of the following may be adopted.
The inference unit 352 judges whether or not the relearning has been performed (S501). If the judgment result of S501 is true (S501: Yes), the inference unit 352 invalidates a list (S502). The “list” herein used is data generated or updated upon the inference and data in which the relationship between material properties and a manufacturing method recipe and/or material characteristics is recorded. The list is stored in the storage apparatus 202 by the inference unit 352 (see
The inference unit 352 judges whether or not the learning of the first model 301 has been terminated (S503). If the judgment result of S503 is true (S503: Yes), the inference unit 352 performs the inference using the first model 301 (S504). Specifically speaking, for example, the inference unit 352 may acquire a plurality of manufacturing method recipe datasets 311 and acquire a material properties dataset 313 by inputting the manufacturing method recipe dataset 311 to the first model 301 for each acquired manufacturing method recipe dataset 311. Each of the plurality of manufacturing method recipe datasets 311 may be a dataset acquired from newly added experimental data or various types of data 150 or may be a dataset generated by changing a combination of recipe items or a value of each recipe item (for example, the inference unit 352 may increase manufacturing method recipe datasets 311 by repeatedly changing the manufacturing method recipes by, for example, increasing or decreasing the value of the synthesis temperature by a fixed value). Regarding each set of the input manufacturing method recipe dataset 311 and the acquired material properties dataset 313, the inference unit 352 records, in a list, the manufacturing method recipe indicated by the manufacturing method recipe dataset 311 and the material properties indicated by the material properties dataset 313. Moreover, the material property/properties may also be recorded in the list when the relevant material property/properties satisfy a specified condition(s) (for example, when the value of a certain property item is included within a specified value range).
If the judgment result of S503 is false (S503: No), the inference unit 352 judges whether or not the learning of the third model 303 has been terminated (S505). If the judgment result of S505 is true (S505: Yes), the inference unit 352 performs the inference using the second and third models 302 and 303 (S506). Specifically speaking, for example, the inference unit 352 acquires a material properties dataset 313 by inputting the material characteristics dataset 312 to the second model 302. In order to create a material having the material properties indicated by the acquired material properties dataset 313 (that is, in order to identify a manufacturing method recipe dataset 311 from which the material characteristics dataset 312 input to acquire the relevant material properties dataset 313 is output), the inference unit 352 may acquire the material characteristics dataset 312 by inputting the manufacturing method recipe dataset 311 to the third model 303. Regarding each set of the input manufacturing method recipe dataset 311, the acquired material characteristics dataset 312, and the acquired material properties dataset 313, the inference unit 352 records, in a list, the manufacturing method recipe indicated by the manufacturing method recipe dataset 311, the material characteristic(s) indicated by the material characteristics dataset 312, and the material property/properties indicated by the material properties dataset 313. In addition, the material characteristic(s) corresponding to the manufacturing method recipe and the material property/properties may also be recorded in the list. Moreover, the material property/properties may be recorded in the list when the relevant material property/properties satisfy a specified condition(s) (for example, when the value of a certain property item is within a specified value range). Furthermore, each of the plurality of manufacturing method recipe datasets 311 or material characteristics datasets 312 may be a dataset acquired from newly added experimental data or various types of data 150 or may be a dataset generated by changing a combination of respective items or a value of each item (for example, the inference unit 352 may increase manufacturing method recipe datasets 311 by repeatedly changing the manufacturing method recipes by, for example, increasing or decreasing the value of the synthesis temperature by a fixed value).
Incidentally, in S505, in addition to the judgment on whether or not the learning of the third model 303 has been terminated, the inference unit 352 may judge whether or not the accuracy of the inference using the second and third models 302 and 303 is equal to or higher than a threshold value. If this judgment result is also true, the inference unit 352 may perform S506 (the inference using the second and third models 302 and 303).
If the judgment result of S505 is false (S505: No), the inference unit 352 judges whether or not the learning of the second model 302 has been terminated (S507). If the judgment result of S507 is true (S507: Yes), the inference unit 352 performs the inference using the second model 302 and the third model 303 (S508). Specifically speaking, for example, the inference unit 352 firstly identifies a material characteristic(s) necessary to satisfy a target material property (a material characteristics dataset 312 as an input to output a material properties dataset 313 indicating the target material property) by using the second model 302. Then, the inference unit 352 uses the third model 303 to infer a manufacturing method recipe for obtaining the identified material characteristic(s) (specifically, to identify a manufacturing method recipe dataset 311 as an input to output the material characteristics dataset 312 indicating the identified material characteristic(s)). Regarding each set of the manufacturing method recipe dataset 311 as the input, the acquired material characteristics dataset 312, and the acquired material properties dataset 313, the inference unit 352 records, in a list, the manufacturing method recipe, the material characteristic(s), and the material property/properties. Incidentally, each of the plurality of manufacturing method recipe datasets 311 or the material characteristics datasets 312 may be a dataset acquired from newly added experimental data or various types of data 150 or may be a dataset generated by changing a combinations of recipe items or a value of each recipe item (for example, the inference unit 352 may increase the manufacturing method recipe datasets 311 by repeatedly changing the manufacturing method recipes by, for example, increasing or decreasing the value of the synthesis temperature by a fixed value).
If the judgment result of S507 is false (S507: No), the inference unit 352 performs the inference using two models, that is, the second and third models 302 and 303 or the first model 301 (S509). Specifically speaking, for example, the inference unit 352 performs the following (S509-1) or (S509-2). More specifically, for example, if the predicted accuracy of the second model 302 and the third model 303 is higher than the predicted accuracy of the first model 301, the inference unit 352 may perform (S509-1); and if the predicted accuracy of the second model 302 and the third model 303 is the same as or lower than the predicted accuracy of the first model 301, the inference unit 352 may perform (S509-2). Incidentally, in the following, each of the plurality of manufacturing method recipe datasets 311 or material characteristics datasets 312 may be a dataset acquired from newly added experimental data or various types of data 150 or may be a dataset generated by changing a combination of respective items or a value of each item (for example, the inference unit 352 may increase the manufacturing method recipe datasets 311 by repeatedly changing the manufacturing method recipes by, for example, increasing or decreasing the value of the synthesis temperature by a fixed value).
(S509-1) The inference unit 352 acquires a material properties dataset 313 by inputting a material characteristics dataset 312 to the second model 302. In order to create a material having the material property/properties indicated by the acquired material properties dataset 313 (in other words, in order to identify a manufacturing method recipe dataset 311 from which the material characteristics dataset 312 that is input to acquire the relevant material properties dataset 313 is to be output), the inference unit 352 acquires a material characteristics dataset 312 by inputting the manufacturing method recipe dataset 311 to the third model 303. Regarding each set of the input manufacturing method recipe dataset 311, the acquired material characteristics dataset 312, and the acquired material properties dataset 313, the inference unit 352 records, in a list, the manufacturing method recipe indicated by the manufacturing method recipe dataset 311, the material characteristic(s) indicated by the material characteristics dataset 312, and the material property/properties indicated by the material properties dataset 313.
(S509-2) The inference unit 352 acquires material properties dataset 313 by inputting manufacturing method recipe dataset 311 to the first model 301 (in actual experiments, material characteristics datasets 312 may also be collected to evaluate the material characteristics). Regarding each set of the input manufacturing method recipe dataset 311 and the acquired material properties dataset 313, the inference unit 352 records, in a list, the manufacturing method recipe indicated by the manufacturing method recipe dataset 311 and the material property/properties indicated by the material properties dataset 313.
The inference unit 352 judges whether or not to terminate the inference processing (S510). If the judgment result of S510 is true (S510: Yes), the processing is terminated. If the judgment result of S510 is false (S510: No), the processing returns to S501.
In the processing illustrated in
One embodiment has been described above, but this is the illustration of an example for the purpose of explaining this invention and it is not intended to limit the scope of this invention only to this embodiment. This invention can be implemented in various other embodiments.
For example, the AI unit 140 illustrated in
Moreover, for example, the “experimental data” collected by the collection unit 101 may include at least part of data resulting from a combinatorial experiment(s), data acquired from the electronic laboratory notebook 131 in which the past experimental data are accumulated, and data acquired from, for example, an external site(s). Furthermore, the “experimental data” may include calculation data (such as data predicted by first-principles calculations or separately by machine learning) or may include language data. Furthermore, at least one of the datasets 311 to 313 may also include calculation data or language data.
Furthermore, for example, the aforementioned description can be summarized as follows. The following summary may include a supplementary explanation of the aforementioned description and an explanation of variations.
The material creation support system 10 may include, for example, the interface apparatus 201 connected to a user apparatus which is an apparatus having an input device and a display device (such as the researcher terminal 11A), the storage apparatus 202 in which the models 301 to 303 are stored, and the processor 203 connected to the interface apparatus 201 and the storage apparatus 202. The processor 203 may implement the AI unit 108 and the IF unit (interface unit) 109 by, for example, executing computer programs. The AI unit 108 performs either the inference using the first model 301 or the inference using the second model 302 and the third model 303 according to the accuracy (such as the predicted accuracy) of at least one model of the first to third models 301 to 303 and generates or updates the list 600 (an example of recipe properties data that is data indicating the association between a manufacturing method recipe and material properties). The IF unit 109 may receive a query which designates a material property, identify a manufacturing method recipe corresponding to the property designated by the query based on the list 600, and provide the identified manufacturing method recipe.
The second model 302 is prepared based on a correlation between material characteristics and material properties and the third model 303 is prepared to make a material properties dataset 313 predictable from a manufacturing method recipe dataset 311 via an intermediate dataset that is a material characteristics dataset 312. The presence of the second and third models 302 and 303 can contribute to a proposal of a manufacturing method recipe for creating a material with the material property/properties desired by the researcher even if the quantity of experimental data is insufficient or the accuracy of the first model 301 is insufficient due to other reasons.
If the accuracy of the first model 301 is equal to or higher than the first accuracy threshold value or the accuracy of the first model 301 is higher than the accuracy of the second and third models 302 and 303, the AI unit 108 may generate or update the list 600 upon the inference using the first model 301. Accordingly, the inference for generating or updating the list 600 used to provide a manufacturing method recipe can be performed with a smaller computational load (without input or output of the intermediate dataset as the material characteristics dataset 312). Incidentally, the “accuracy of the second model 302 and the third model 303” is not the accuracy of each individual model, that is, the accuracy of the second model 302 and the accuracy of the third model 303, but the accuracy when both the second model 302 and the third model 303 are used.
The AI unit 108 may learn the first model 301 by using the manufacturing method recipe dataset 311 and the material properties dataset 313 identified or generated upon the learning or the inference of the second and third models 302 and 303. For example, it can be specified whether it is preferable or not when, upon the learning or the inference of the second and third models 302 and 303, a recipe item that is synthesis time is present or absent in the manufacturing method recipe. Thus, upon the learning or the inference of the second and third models 302 and 303, recipe items may be expanded or reduced and the values of the recipe items may be expanded or reduced and, as a result, it is expected to increase manufacturing method recipe datasets 311 which are preferred for the learning of the first model 301. Therefore, enhancement of the accuracy of the first model 301 can be expected.
For example, since learning datasets are acquired from experimental data, insufficient experimental data can result in insufficient learning datasets and, as a result, the accuracy of the models 301 to 303 may be insufficient. Moreover, it takes time for the experimental data to become sufficient and it is difficult to guarantee when it will become sufficient. Therefore, the AI unit 108 may perform the inference using the first model 301 and/or the inference using the second and third models 302 and 303 in parallel with the learning of the models 301 to 303. For example, as the learning is switched (depending on learning of which model has been terminated), the AI unit 108 may switch (select) the model to be used for the inference.
Upon the learning of the second model 302, the AI unit 108 may identify or generate one or more material characteristics datasets 312 as a factor to cause the accuracy of the second model 302 to become equal to or higher than the second accuracy threshold value from among a plurality of material characteristics datasets 312. Upon the learning of the third model 303, the AI unit 108 may identify or generate one or more manufacturing method recipe datasets 311 corresponding to the one or more material characteristics datasets 312 identified or generated upon the learning or the inference of the second model 302 from among a plurality of manufacturing method recipe datasets 311. The AI unit 108 may learn the first model by using the one or more manufacturing method recipe datasets 311 identified or generated upon the learning or the inference of the third model 303 and one or more material properties datasets 313 corresponding to the one or more manufacturing method recipe datasets 311. Accordingly, the material characteristics datasets 312 are narrowed down upon the learning or the inference of the third model 303, so that the accuracy of the third model 303 can be enhanced with a small computational load. Moreover, the manufacturing method recipe datasets 311 corresponding to such narrowed-down material characteristics datasets 312 are used to learn the first model 301, so that the accuracy of the first model 301 can be enhanced with a small computational load.
A material characteristics dataset 312 may be a dataset configured of features as numerical values of each characteristic item belonging to the material characteristics. Accordingly, the computational load on the learning and the inference can be reduced and the second and third models 302 and 303 can be simplified.
When relearning is performed, the AI unit 108 may discard the list 600 and newly generate a list 600 or update the list 600 upon the inference using the first model 301 after the relearning and/or the inference using the second model 302 after the relearning and the third model 303 after the relearning. Accordingly, it is possible to prevent the list generated or updated upon the inference using an old model whose the accuracy was not yet improved from being provided as a recipe by the IF unit 109.
Moreover, the following variation may be used as one variation. Specifically speaking, for example, as illustrated in
For example, there may be the following use case as one use case. Specifically speaking, the material characteristics may include (1) density and porosity quantified by the Archimedean method and (2) SEM image features (such as a composition area ratio and a composition area deviation). Characteristic items (characteristic factors) which may affect objective properties (material properties desired by the researcher) may be porosity, a crystal perimeter, a composition area ratio, and a composition area deviation (the “composition area deviation” is a feature indicating variations of the composition ratio of a plurality of segmented areas constituting one image (in this case, an SEM image)). The first model 301 may be Ridge regression (its inputs may be manufacturing conditions such as a composition ratio, material relay, and a firing temperature, and its outputs may be bending strength and a thermal expansion coefficient). The second model 302 may be Ridge regression or LightGBM (its inputs may be the aforementioned (1) density and porosity and (2) SEM image features, and its outputs may be bending strength and a thermal expansion coefficient). The third model 303 may be Ridge regression (its inputs may be manufacturing conditions such as a composition ratio, a material particle size, and a firing temperature, and its outputs may be the aforementioned (1) density and porosity and (2) SEM image features). In response to a query from the researcher 5A (a query which designates a material property), a manufacturing method recipe may be provided (may be displayed on the researcher terminal 11A). Moreover, any one of the models 301 to 303 may include a regression equation for each element as an output (such as an objective variable). Specifically speaking, any one of the models 301 to 303 may include one or more regression equations. Furthermore, although any of the models 301 to 303 may have a common dataset as an input, explanatory variables and weights are typically different for the respective regression equations for the models.
Moreover, the significance of the processing illustrated in
Specifically speaking, the purposes of the inference can be (1) to find a manufacturing method recipe for manufacturing a material having a desired material property/properties and (2) to find a manufacturing method recipe candidate(s) for adding experimental data in order to enhance the model accuracy. The inference of S504 in
A dataset useful or effective for model learning (such as a manufacturing method recipe) is selected from among the combinations identified upon the inference corresponding to the purpose (2) (such as a combination of a manufacturing method recipe and a material property/properties), a dataset obtained by actual experiments based on the aforementioned dataset is added as a learning dataset, and model learning is performed by using the added dataset. Since the dataset is augmented in this way, the model can be learned with better the accuracy.
In the case of Yes in S505, the accuracy of the inference using the second model 302 and the third model 303 can be regarded as sufficient. Therefore, an additional learning dataset can be prepared based on the dataset input and output upon the inference. In addition to the dataset input and output upon the inference, a dataset considered and generated by a researcher (such as an experimenter) may be used to prepare an additional learning dataset.
On the other hand, in the case of No in S505, the accuracy of the inference using the second model 302 and the third model 303 can be regarded as insufficient. Therefore, it is difficult to prepare the additional learning dataset based only on the dataset input and output upon the inference and the dataset considered and generated by the researcher may be also used to prepare the additional learning dataset.
Since the learning of the second model 302 is performed before the learning of the third model 303, it is more efficient to perform the judgment of S505 before the judgment of S507. Incidentally, the order of judgments may not be limited to the sequential order indicated in
The following expressions are possible based on the above description.
If the accuracy of the first model 301 is lower than the first threshold value or the accuracy of the first model 301 is lower than the accuracy of the second model 302 and the third model 303, the AI unit 108 may perform the inference using the second model 302 and the third model 302.
The AI unit 108 may learn the second model 302 before the third model 303. The AI unit 108 may prioritize using the dataset input or output upon the inference using the second model 302 and the third model 303 to learn at least one of the first to third models regarding a second case rather than a first case.
The first case is the case where the inference using the second model 302 and the third model 303 is the inference performed when the accuracy of the third model 303 is lower than the third threshold value.
The second case is the case where the inference using the second model 302 and the third model 303 is the inference performed when the accuracy of the third model 303 is equal to or higher than the third threshold value.
When the accuracy of the second model 302 is lower than the second threshold value: and if the accuracy of the second model 302 and the third model 303 is higher than the accuracy of the first model 301, the AI unit 108 may perform the inference using the second model 302 and the third model 303; and if the accuracy of the second model 302 and the third model 303 is equal to or lower than the accuracy of the first model 301, the AI unit 108 may perform the inference using the first model 301.
Number | Date | Country | Kind |
---|---|---|---|
2022-037335 | Mar 2022 | JP | national |
This application is a continuation application of PCT/JP2023/009174 filed on Mar. 9, 2023, which claims the benefit of priority of Japanese Patent Application No. 2022-037335 filed on Mar. 10, 2022, the entire contents of which are incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/JP2023/009174 | Mar 2023 | WO |
Child | 18823832 | US |