This application claims the benefit of Korean Application No. 10-2023-0015357 filed Feb. 6, 2023, in the Korean Intellectual Property Office. All disclosures of the document named above are incorporated herein by reference.
The present invention relates to an apparatus for obtaining and providing sodium superion conductor material information using a machine learning method.
A lithium ion battery (LIB) is widely used as an energy storage device because of its advantages such as high energy density, low self-discharge rate, long lifespan, and high cell voltage. However, due to the high production cost and lack of lithium, sodium ion batteries (SIBs) using sodium, which have electrochemical properties similar to those of lithium but have abundant reserves, are being studied as next-generation batteries.
On the other hand, solid-state electrolytes (SSE) are attracting attention as a promising alternative to LIBs because of their low production cost unlike liquid electrolytes, the abundance of resources, and electrical/chemical properties similar to those of lithium ion batteries (LIBs). In particular, research on sodium ion solid electrolytes (SSE), which can increase energy density by using metal as an anode material and effectively suppress the formation of dendrites while solving the flammability problem of existing liquid electrolytes, is growing rapidly.
However, since solid electrolytes have lower ionic conductivity than conventional liquid electrolytes, research is being actively conducted to find solid electrolyte materials having ionic conductivity similar to that of liquid electrolytes.
Sodium (Na) Superlonic Conductor (“NASICON”) is one of the well-known sodium ion SSEs. NASICON has a chemical formula of Na1+xM2(AO4)3 (0≤x≤3), and has a structure, in which the AO4 tetrahedron and the MOs octahedron share a three-dimensional space. Na+ ions move through the empty space formed between the tetrahedral and octahedral structures. The high ionic conductivity and thermodynamic stability of the NASICON structure have sufficient properties to be used as SSE, and materials with ionic conductivity of 10−3 S/cm or more have been reported in several studies.
Since the NASICON structure was published, many studies have been conducted to explore NASICON materials with higher ionic conductivity and stability. In general, in the field of material search, synthesis is attempted by doping material with other elements or changing the experimental method, and the synthesized material is subjected to several trials and errors in the process of experimentation and analysis to measure ionic conductivity and interfacial stability. Although this traditional method is accurate, there are problems in that a lot of cost and time are incurred in the research process, and many variables occur depending on the experimental conditions even for the same material.
In order to solve this problem, many studies are being conducted to predict desired properties more quickly and accurately in the conventional material search field through high-throughput calculation and machine learning, but an apparatus for searching for sodium superion conductor material that provides sodium superion conductor material information having electrochemically stability and excellent ion conductivity while having a high level of accuracy and speed required in the technical filed, to which the present invention belongs, is not provided.
An object of the present invention is to provide an apparatus for searching for sodium superion conductor material that provides a sodium superion conductor material information having electrochemical stability and excellent ionic conductivity using high-throughput calculation and machine learning.
The present invention is also to provide an apparatus for searching for sodium superion conductor material having a machine learning model, which is a surrogate model as a sodium superion conductor material search model that generates classification information by classifying sodium superion conductor material information by receiving extracted feature information from test data.
The present invention is also to provide an apparatus for searching for sodium superion conductor material including a learning unit that provides learning data to a sodium superion conductor material search model, which is a machine learning model, and improves the accuracy of the model by training the sodium superion conductor material search model in a supervised learning method.
Another object of the present invention is to provide an apparatus for searching for a sodium superion conductor material that generates sodium superion conductor material information having an ion conductivity of 10−4 S/cm or more.
The problems to be solved by the present invention are not limited to the problems mentioned above, and the problems to be solved by the present invention that are not mentioned can be clearly understood by those of ordinary skill in the art to which the present invention belongs (“those skilled in the art”).
In order to solve the above technical problem, an apparatus for providing sodium superion conductor material information using a machine learning method according to an embodiment of the present invention comprises,
Further, the apparatus further comprises an extraction unit for receiving test data and extracting material characteristic information including chemical information therefrom.
Further, the apparatus further comprises a learning unit for training the sodium superion conductor material search model, wherein the learning unit receives learning data and uses it to improve precision in selecting the sodium superion conductor material information by training the sodium superion conductor material search model in a supervised manner.
Further, the apparatus further comprises an optimization unit for optimizing the preset threshold value of the classification unit, wherein the optimization unit optimizes the preset threshold value by adjusting accuracy, precision, and recall of the sodium superion conductor material search model.
According to the present invention, it is possible to provide an apparatus for searching for sodium superion conductor material that provides sodium superion conductor material information having electrochemical stability and excellent ion conductivity, and through this, an apparatus for searching for a sodium superion conductor material that generates sodium superion conductor material information having ionic conductivity of 10−4 S/cm or more can be provided.
The excellent and/or useful effects according to the present invention are not limited to the effects of the present invention described above, and those skilled in the art can readily recognize the excellent and/or useful effects of the present invention not explicitly disclosed herein based on the disclosure of the present specification. Thus, it should be understood that such useful effects are intentionally disclosed by this specification and are obviously included in the scope of the present disclosure.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
These and/or other aspects will become apparent and more readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings in which:
Hereinafter, the present invention will be described in detail.
The following embodiments combine elements and features of the embodiments in a predetermined form. Each component or feature may be considered optional unless explicitly stated otherwise. Each component or feature may be implemented in a form not combined with other components or features. In addition, various embodiments may be configured by combining some components and/or features. The order of operations described in various embodiments may be changed. Some components or features of one embodiment may be included in another embodiment, or may be replaced with corresponding components or features of another embodiment.
In the description of the drawings, procedures or steps that may obscure the gist of various embodiments are not described, and procedures or steps that can be understood by those skilled in the art are not described.
Throughout the specification, when a part is said to “comprising” or “including” a certain component, it means that it may further include other components, not excluding other components, unless otherwise stated. In addition, terms such as “ . . . unit,” and “module” described in the specification mean a unit that processes at least one function or operation, which can be implemented as hardware or software or a combination of hardware and software. Also, “a or an,” “one,” “the” and like terms used herein in the context of describing various embodiments (particularly in the context of the claims below) can be used to include both the singular and the plural unless otherwise indicated or clearly contradicted by context.
Hereinafter, embodiments according to various embodiments will be described in detail with reference to the accompanying drawings. The detailed description set forth below in conjunction with the accompanying drawings is intended to describe exemplary embodiments of various embodiments, and is not intended to represent a single embodiment.
In addition, specific terms used in various embodiments are provided to help understanding of various embodiments, and the use of these specific terms may be changed in other forms without departing from the technical spirit of various embodiments.
Meanwhile, each description and embodiment disclosed in this specification may also be applied to each other description and embodiment. That is, all combinations of various components disclosed herein fall within the scope of the present invention, and descriptions omitted in one embodiment may be interpreted in the same manner as described in another embodiment. In addition, it cannot be seen that the scope of the present invention is limited by the specific descriptions described below.
According to one aspect of the present invention, an apparatus for providing sodium superion conductor material information using a machine learning method according to the present invention may comprise:
According to an embodiment, the sodium superion conductor material search model used in the classification unit is a model, to which a machine learning method is applied, and may include detailed machine learning model units, and each detailed machine learning model unit can perform prediction several times for each of the verification data formed by receiving the material characteristic information of the sodium superion conductor material known through previous experiments. For example, the detailed machine learning model unit may be 9 machine learning model units of Naïve Bayes (“NB”), Logistic Regression (“LR”), Decision Tree (“DT”), Stochastic Gradient Descent (Stochastic Gradient Descent (“SGD”), K-Nearest Neighbor (“KNN”), Random Forest (“RF”), Gradient Boosint (“GB”), Light Gradient Boosting Machine (“LGBM”), and eXtreme Gradient Boosting (“XGB”), and the prediction may be 100 times.
In addition, for example, the material characteristic information may be 264 NASICON and LISICON materials previously known through experiments, and the verification data may be 3573 NASICON verification data, and may be a material whose synthesizability and stability have been verified through convex hull energy by using high-throughput density functional theory (DFT), and materials showing high ionic conductivity among the verification data may be predicted by each of the detailed machine learning model units.
In general, liquid electrolytes exhibit ionic conductivity in the range of 10−4 S/cm to 10−2 S/cm, but since SSE has a lower ionic conductivity than liquid electrolytes, the ionic conductivity that can actually be used as an electrolyte can be determined. For example, a criterion for a classification model may be set at σ=10−4 S/cm to determine whether it is a superionic or non-superionic conductor.
Classification may be performed while adjusting the random state from 0 to 99 100 times for each machine learning model unit for cross-verification. The average accuracy can be applied for 9 detailed machine learning model units, NB, LR, DT, SGD, KNN, RF, GB, LGBM, and XGB units on 100 different train-test sets, and even when the same data is used, the accuracy may be different depending on which detailed machine learning model unit is used. There are many reasons why a particular model has high accuracy, but it may be the result of certain data fitting a particular model and exhibiting high performance, or an error caused by overfitting. Therefore, a unit suitable for the purpose may be selected through a comparison of various detailed machine learning model units.
The material characteristic information is information shown in Table 1 below, and may be information obtained by combining stoichiometric characteristics and characteristics of elements extracted based on the chemical formulas of each material.
In addition, the apparatus for providing the sodium superion conductor material information using the machine learning method may further comprise an extraction unit for receiving test data and extracting material characteristic information including chemical information therefrom.
In addition, the apparatus for providing sodium superion conductor material information using the machine learning method may further comprise a learning unit for training the sodium superion conductor material search model, wherein the learning unit receives learning data and uses it to improve accuracy in selecting the sodium superion conductor material information by training the sodium superion conductor material search model in a supervised manner.
The extraction unit may extract the material characteristic information from the test data in order to extract the information shown in Table 1 above. In addition, the extraction unit may additionally extract information on each atom constituting the chemical formula from the test data as information different from the material characteristic information shown in Table 1, and for example, may additionally extract the information shown in Table 2 below. In this case, both the material characteristic information shown in Table 1 and the information shown in Table 2 below may be provided to the classification unit as material characteristic information.
In Table 2, D1, D2, and D3 mean Na, M1, and M2 of the chemical formula NaxM1M2(PO4)3, respectively, and among the information shown in Table 2, the Shannon ionic radius, Pauling scale electronegativity, and calculated volume may be generated as a characteristic by considering each part of the element. As can be seen from Table 2, nine characteristics can be generated through ionic radius, electronegativity, and effective volume of atoms, respectively, and the number of atoms in each of D1, D2, and D3 can be multiplied by previously calculated properties such as ionic radius, electronegativity, and volume to generate nine or more characteristics, and the difference value between the atomic radii of D1, D2, and D3, the standard deviation of the difference value and electronegativity can generate 6 more characteristics.
In addition, the apparatus for providing the sodium superion conductor material information using the machine learning method further comprises an optimization unit for optimizing a preset threshold of the classification unit, wherein the optimization unit may optimize the preset threshold value by adjusting accuracy, precision, and recall of the sodium superion conductor material search model.
Each of the accuracy, precision, and recall can be expressed by the following equation using each factor of the confusion matrix shown in
In the case of accuracy, if the data used for training the sodium superion conductor material search model is biased, there may be a problem that even if the machine learning model classifies inaccurately, it can show high performance if it predicts according to the bias of the data. Therefore, even if the accuracy of the machine learning model is high, the prediction of the verification data cannot be fully trusted.
In the case of precision, it represents the proportion of true predictions that are actually true, so predictions are reliable when precision is high. However, when only the precision is extremely high, machine learning predicts and provides the result only when it is definitely true, so there may be data that is not included in the predicted result even though it is actually true.
Considering the above, in order to increase the classification reliability of the machine learning model and include as many true data as possible in the classification information, the preset threshold value of the sodium superion conductor material search model may be adjusted in consideration of all of the accuracy, precision, and recall. For example, referring to
The Ewald summation used in the first verification unit for selecting information on the structure in an electrochemically stable state from the generated classification information can be used to determine whether a structure expected to be included in the sodium superion conductor material represented by the material information included in the classification information is in an electrochemically stable state.
The AIMD simulation used in the second verification unit for selecting and providing final information on the structure having high ionic conductivity from the information on the structure in the electrochemically stable state may be ab initio molecular dynamics (“AIMD”) simulation using the Vienna ab initio simulation package (“VASP”). For example, the Perdew-Burke-Emzerhof generalized gradient approximation can be applied as the exchange-correlation function, the plane wave cutoff can be 400 eV, and the gamma-centered 1×1×1 may be applied for k-point grid generation.
To discover the most reliable and accurate ML model, PyCaret is used to compare the performance of 9 detailed machine learning model units, NB, LR, DT, SGD, KNN, RF, GB, LGBM, and XGB units, and their subsequent Hyperparameter optimizations. Among them, the DT, LGBM, and GB models showed higher accuracy than others, and DT showed the highest accuracy. However, DT was excluded to reduce the risk of overfitting, and LGBM and GB were selected as the final detailed machine-learning model units. From the two detailed machine learning model units, the intersection ensemble was finally performed, and the intersection of materials classified as having superionic properties was performed 100 times and selected.
Then, material characteristic information for use in the machine learning model according to the present invention was selected. The additionally selected material characteristic information comprises characteristics of each element such as electronegativity, ionic radius of the element, and number of atoms. The average accuracy for this was 0.853, 0.847 and 0.829 for DT, GB and LGBM, respectively.
Then, the accuracy, precision, and recall of the model were evaluated in order to optimize the preset threshold value of the classification unit including the machine learning model according to the present invention. Referring to
In this embodiment, the recall was 0.7 or more, and the precision was selected to have the highest value within the range of the recall. In terms of accuracy, LGBM goes from 0.8311 to 0.8383 and GB goes from 0.8464 to 0.8460, indicating a slight increase or decrease. Referring to
As a result of using the apparatus according to the present invention, the number of materials selected more than 80 times was 389 (10.085%) in LGBM and 144 (4.01%) in GB, and the number of materials selected more than 90 times was 215 (5.99%) in LGBM and 111 in GB (3.09%). The number of the materials selected for all 100 times was 91 (2.53%) in LGBM and 14 (0.39%) in GB. The model performance was evaluated in the most conserved state by selecting materials, which were selected 100 times. Since the two detailed machine learning model units had different operating principles and showed slight differences in the number of classified materials, these two models were evaluated for verification under the most stringent conditions. As a result, seven materials were finally selected as sodium superionic conductor materials.
This was then evaluated using Ewald summation. Among the finally selected materials, the M position is composed of Ta, T, Sc, La, and Hf. The most common remaining element is Ta. That is, when Ta is used as a dopant of a NASICON-type LATP or LATP structure, ionic conductivity may be increased. The next most common elements are the trivalent cations of Y, Sc and La. These trivalent cations can be doped with the aim of occupying the tetravalent cation sites to increase the cell size and increase the ionic conductivity. Two of the seven materials contained Hf.
Then, the ionic conductivities at room temperature (RT) of the final selected materials were evaluated using AIMD simulations. Prior to calculation, the structural information of the screened materials was obtained by the Ewald summation method in the Pymatgen package. Among the above materials, three of the seven materials were excluded because AIMD simulation consistent results did not converge. For the remaining four materials, the diffusivity versus temperature relationship is shown in
As a result, it was confirmed that the four materials finally selected exhibit significant Na-ion diffusivity in the high-temperature region, exceed the determination criterion (σ>10−4 S/cm), and also have a trend that ionic conductivity and activation energy (Ea) are in inverse proportion to each other.
To summarize the entire process of this experimental example, starting from 3,573 NASIOCN materials, 7 superionic candidate materials were optimized with a preset threshold value from 0.5 to 0.7, and then the optimal machine learning model unit was selected using material characteristic information, and then classification information was generated using this. Four of each material represented by the generated classification information were confirmed to be superionic materials through AIMD.
That is, the present inventors proved that the sodium superionic conductor material search model according to the present invention was effective in selecting the superionic NASICON material. Surrogate models can play an important role in accelerating material selection and reducing costs by classifying NASICON-type SSE materials without unnecessary time-consuming high-throughput calculations or actual experiments performed. It can also be used to effectively classify the ionic conductivities of NASICON-type SSEs as well as other SSE materials targeting superionic materials with simple chemical properties of the materials.
Although the preferred embodiments of the present invention have been described in detail above, the scope of the present invention is not limited thereto, and various modifications and improvements of those skilled in the art using the basic concept of the present invention defined in the following claims are also belong to the scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
10-2023-0015357 | Feb 2023 | KR | national |