The present disclosure claims priority to Chinese Patent Application CN202110186760.6, entitled “Optimization decision-making method of an industrial process fusing domain knowledge and multi-source data” and filed on Feb. 18, 2021, the entire content of which is incorporated herein by reference.
The present disclosure relates to the field of artificial intelligence, and in particular, to an optimization decision-making method of an industrial process fusing domain knowledge and multi-source data.
A complex industrial manufacturing process is a multi-scale dynamic system involving the collaborative interaction of complex production factors such as people, equipment, material, technology and environment. Its optimization decision-making problem is generally coupled in complex ways with diversified dynamic production factors and is characterized by a complex mechanism, multiple objectives and constraints, multi-scale dynamic optimization, and the like, which makes it considerably difficult to solve.
As new technologies such as the Internet of Things, cloud computing and the mobile Internet have become increasingly widely used, the types and scales of data in the industrial environment have increased rapidly and accumulated to form big data of the industrial process. Such data is characterized by high sampling frequency, high information density, multi-source heterogeneity, multi-space and multi-time distribution, and the like. It contains rich innovative knowledge and represents the dynamic variation laws and a potential model of the optimization decision-making process.
Meanwhile, the industrial process also involves the long-term accumulated experience and knowledge of engineers, operators and domain experts, mechanism knowledge of internal changes represented by differential equations or chemical equations, process specifications formulated for different processes in the production process, and the like. Such diverse domain knowledge reflects a general description and a common rule of the production factors. A core requirement for making manufacturing production intelligent is to effectively use the rich knowledge contained in the big data of a manufacturing process together with the diverse domain knowledge, to construct an automation system for optimization decision-making knowledge of a complex industrial process, and to promote artificial intelligence technology so as to realize automation of knowledge-based work in intelligent manufacturing.
In order to solve the above technical problems or at least partially solve the above technical problems, the present disclosure provides an optimization decision-making method of an industrial process fusing the domain knowledge and the multi-source data, the method including:
S1: acquiring the domain knowledge of the industrial process by using probability soft logic, and building a domain rule knowledge base of the industrial process;
S2: fusing multi-source data semantics and multi-source data features to form a new semantic knowledge representation of the industrial process, and constructing a semantic knowledge base of the industrial process;
S3: under a posteriori regularization framework, utilizing the domain rule knowledge base of the industrial process and the semantic knowledge base of the industrial process to obtain an optimization decision-making model embedded with domain rule knowledge and obtain a posteriori distribution model; and
S4: migrating knowledge in the optimization decision-making model embedded with the domain rule knowledge into the posteriori distribution model through the knowledge distillation technology.
According to an embodiment of the present disclosure, the specific steps of acquiring the domain knowledge of the industrial process by using the probability soft logic include:
S11: expressing properties, states, attributes and correlations therebetween of different production factors in a manufacturing process by using an N-ary predicate so as to construct a first-order logic rule;
S12: defining a weighted first-order logic rule to express the relationship between an optimization decision-making problem and the cause of the optimization decision-making problem; and
S13: performing weight learning by using the probability soft logic to acquire the domain rule knowledge of the industrial process.
According to an embodiment of the present disclosure, a specific formula of the weighted first-order logic rule is
$$\forall D_1, D_2, \ldots, D_l, R.\; P_1(D_1, \ldots) \wedge P_2(D_2, \ldots) \wedge \cdots \wedge P_l(D_l, \ldots) \Rightarrow P_R(Q, R) : \lambda,$$
where $P_1$, $P_2$, $P_l$, $P_R$ are predicates; $D_1$, $D_2$, $D_l$, $R$ are variables; and $\lambda$ represents the weight. The weight indicates the importance of the first-order logic rule. That is, the larger the weight, the more important the first-order logic rule is. A specific value of the weight is a non-negative real number. Such a rule indicates that the states of the industrial process targets $D_1, D_2, \ldots, D_l$ under a certain condition cause the result of the optimization decision-making problem $Q$ to be $R$.
According to an embodiment of the present disclosure, a specific expression form of the domain rule knowledge base of the industrial process is $K_R = \{(r_i, \lambda_i)\}_{i=1}^{m}$,
where $m$ represents the size of the domain rule knowledge base, $r_i$ represents the i-th first-order logic rule, and $\lambda_i$ is the weight of the corresponding first-order logic rule.
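The weighted rule and the knowledge base described above may be illustrated by the following minimal Python sketch. It assumes the Lukasiewicz relaxation that probabilistic soft logic commonly uses to evaluate rules over soft truth values in [0, 1]; the atom names, truth values and the class `WeightedRule` are hypothetical and serve only as an illustration, not as the implementation of the present disclosure.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class WeightedRule:
    """A rule P1 AND ... AND Pl => PR carrying a non-negative weight (lambda)."""
    body: List[str]   # names of the body atoms (hypothetical)
    head: str         # name of the head atom (hypothetical)
    weight: float     # lambda >= 0

    def __post_init__(self) -> None:
        if self.weight < 0:
            raise ValueError("rule weight must be non-negative")


def lukasiewicz_and(values: List[float]) -> float:
    """Lukasiewicz conjunction for n arguments: max(0, sum - (n - 1))."""
    return max(0.0, sum(values) - (len(values) - 1))


def distance_to_satisfaction(rule: WeightedRule, truth: Dict[str, float]) -> float:
    """How far the implication body => head is from being fully satisfied."""
    body_value = lukasiewicz_and([truth[atom] for atom in rule.body])
    return max(0.0, body_value - truth[rule.head])


# Hypothetical soft truth values in [0, 1] for three grounded atoms.
truth = {
    "HighTemp(furnace)": 0.9,
    "LowFlow(pipe)": 0.8,
    "Result(Q, reduce_load)": 0.4,
}
rule = WeightedRule(["HighTemp(furnace)", "LowFlow(pipe)"], "Result(Q, reduce_load)", 2.0)
knowledge_base = [rule]  # plays the role of K_R with m = 1

# Weighted penalty that weight learning or inference would try to keep small.
penalty = rule.weight * distance_to_satisfaction(rule, truth)
```

In this sketch a larger weight makes a violated rule more costly, which matches the statement that a larger weight marks a more important rule.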
According to an embodiment of the present disclosure, the step of acquiring the multi-source data semantics includes:
S21: clustering the multi-source data by an unsupervised clustering method to extract the multi-source data semantics; and
the step of acquiring the multi-source data features includes:
S22: performing feature extraction on the multi-source data by using a convolutional autoencoder.
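Step S21 may be illustrated by the following minimal Python sketch. The disclosure does not name a particular unsupervised clustering method; plain k-means is used here purely as an assumed example, and the sample readings, the initial centroids and the semantic labels are hypothetical. The convolutional autoencoder of step S22 is not sketched.

```python
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, ...]


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points: List[Point], initial: List[Point], iterations: int = 10) -> List[int]:
    """Plain k-means; returns the cluster index assigned to each point."""
    centroids = list(initial)  # copy so the caller's list is not modified
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return labels


# Hypothetical two-dimensional sensor readings from one data source.
readings: List[Point] = [(1.0, 1.1), (0.9, 1.0), (5.0, 5.2), (5.1, 4.9)]
labels = kmeans(readings, initial=[(0.0, 0.0), (6.0, 6.0)])

# Hypothetical mapping from cluster index to a human-readable semantic label.
cluster_names: Dict[int, str] = {0: "normal_load", 1: "high_load"}
semantics = [cluster_names[lab] for lab in labels]
```

Each cluster index is turned into a semantic label, which is one simple way of reading "extracting the multi-source data semantics" from unlabeled data.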
According to an embodiment of the present disclosure, a specific form of the new semantic knowledge representation of the industrial process is $k = h_A : R$,
where the first half is a fusing feature $h_A$, with $h_A = (H_1, H_2, \ldots, H_N) : \{A_1, A_2, \ldots, A_N\}$; and $R$ is the decision-making semantics corresponding to the fusing feature $h_A$ and represents a specific decision-making result of the optimization decision-making problem. Further, $(H_1, H_2, \ldots, H_N)$ is a feature of compact optimization for the multi-source data $(X_1, X_2, \ldots, X_N)$; and $\{A_1, A_2, \ldots, A_N\}$ is the semantics of the multi-source data $(X_1, X_2, \ldots, X_N)$.
According to an embodiment of the present disclosure, a specific expression form of the semantic knowledge base of the industrial process is $K_S = \{k_i\}_{i=1}^{n}$, where $n$ represents the size of the semantic knowledge base, and $k_i$ represents the i-th knowledge element.
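The knowledge element and the semantic knowledge base described above may be represented, for example, by the following Python data structure. The class name `KnowledgeElement`, its field names and the sample values are hypothetical and are introduced only for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class KnowledgeElement:
    """One element k = (H1, ..., HN) : {A1, ..., AN} : R."""
    features: Tuple[Tuple[float, ...], ...]  # compact features H1..HN, one per source
    semantics: Tuple[str, ...]               # semantic labels A1..AN, one per source
    decision: str                            # decision-making semantics R

    def __post_init__(self) -> None:
        # Every feature H_j must be paired with exactly one semantic label A_j.
        if len(self.features) != len(self.semantics):
            raise ValueError("features and semantics must have the same length")

    def render(self) -> str:
        labels = ", ".join(self.semantics)
        return f"({len(self.features)} features):{{{labels}}}:{self.decision}"


# Hypothetical element built from two data sources.
k = KnowledgeElement(
    features=((0.12, 0.80), (0.45,)),
    semantics=("high_temperature", "stable_pressure"),
    decision="reduce_fuel_rate",
)

semantic_kb: List[KnowledgeElement] = [k]  # plays the role of K_S with n = 1
```

Keeping the features and their semantic labels in one immutable record mirrors the pairing of the fusing feature with its decision-making semantics.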
According to an embodiment of the present disclosure, the specific steps of obtaining the optimization decision-making model embedded with the domain rule knowledge and obtaining the posteriori distribution model include:
S31: defining a mapping relation by ϕi:{ril(X,Y)}l=1L→ by using the domain rule knowledge base KR={(ri,λi)}i=1m of the industrial process to encode the domain knowledge of the industrial process;
S32: under the posteriori regularization framework, introducing ϕi:{ril(X,Y)}l=1L→ in the form of constraints into a learning process of the model, converting the optimization decision-making problem thereof into a constraint-optimization problem,
where C, λi and ξi are constants; KL(q(Y|X)∥pθ(Y|X)) is a form of f(X,Y) under the posteriori regularization framework; and KL(⋅) denotes the Kullback-Leibler divergence; and
S33: obtaining an optimization decision-making knowledge inference model embedded with the domain rule knowledge of the industrial process by solving the constraint-optimization problem;
where q(Y|X) represents the optimization decision-making model embedded with the domain rule knowledge, and pθ(Y|X) is the posteriori distribution model.
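For illustration only, the following Python sketch shows one way a rule-embedded distribution q(Y|X) could be obtained from pθ(Y|X) over a small discrete set of decisions. It assumes the closed-form projection that is well known from the posterior-regularization literature, in which each candidate decision is down-weighted by an exponential penalty on the rules it violates. This form, the function name `project_to_rules` and all numbers are assumptions and are not taken from the present disclosure.

```python
import math
from typing import Dict, List


def project_to_rules(
    student: Dict[str, float],
    rule_values: Dict[str, List[float]],
    weights: List[float],
    C: float,
) -> Dict[str, float]:
    """Return q(y) proportional to p(y) * exp(-C * sum_i w_i * (1 - r_i(y))).

    student      -- p_theta(Y|X) for one input, as decision -> probability
    rule_values  -- soft truth value r_i in [0, 1] of each rule for each decision
    weights      -- rule weights (lambda_i), all non-negative
    C            -- regularization strength (assumed constant)
    """
    scores = {}
    for y, p in student.items():
        violation = sum(w * (1.0 - r) for w, r in zip(weights, rule_values[y]))
        scores[y] = p * math.exp(-C * violation)
    total = sum(scores.values())
    return {y: s / total for y, s in scores.items()}


# Hypothetical decisions with one rule that is violated only by "raise".
student = {"raise": 0.5, "hold": 0.3, "lower": 0.2}
rule_values = {"raise": [0.0], "hold": [1.0], "lower": [1.0]}
teacher = project_to_rules(student, rule_values, weights=[1.0], C=2.0)
```

Under these assumptions the decision that violates the rule loses probability mass, while the decisions that satisfy it gain mass, so the result stays close to pθ(Y|X) while respecting the rule more strongly.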
According to an embodiment of the present disclosure, the specific steps of migrating knowledge in the optimization decision-making model embedded with the domain rule knowledge into the posteriori distribution model include:
S41: defining the optimization decision-making model q(Y|X) embedded with the domain rule knowledge as a teacher network, and defining the posteriori distribution model pθ(Y|X) established by the multi-source data semantics as a student network;
S42: training the teacher network and the student network simultaneously by using the domain rule knowledge base KR of the industrial process and the semantic knowledge base KS of the industrial process, where a parameter θ is updated as follows:
where loss(⋅) is a loss function; σθ is a soft output of the student network pθ(Y|X); Sit is a soft output of the teacher network q(Y|X); and CRi is a real decision-making semantics; and
performing iteration training of a teacher-student network by using the formula
and the formula
so as to migrate the knowledge from the teacher network to the student network.
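The role of the three quantities named above (the soft output of the student network, the soft output of the teacher network and the real decision-making semantics) may be illustrated by the following Python sketch of a distillation objective. The weighted-sum form and the balancing parameter `pi` are assumptions borrowed from common knowledge-distillation practice; they are not a reproduction of the update formula of the present disclosure, and all sample values are hypothetical.

```python
from typing import Callable, Sequence

Loss = Callable[[Sequence[float], Sequence[float]], float]


def squared_loss(target: Sequence[float], output: Sequence[float]) -> float:
    """Sum of squared differences between a target vector and an output vector."""
    return sum((t - o) ** 2 for t, o in zip(target, output))


def distillation_loss(
    student_out: Sequence[float],
    teacher_out: Sequence[float],
    true_label: Sequence[float],
    loss: Loss = squared_loss,
    pi: float = 0.5,
) -> float:
    """(1 - pi) * loss(true, student) + pi * loss(teacher, student).

    pi is an assumed imitation strength: 0 ignores the teacher, 1 ignores the labels.
    """
    if not 0.0 <= pi <= 1.0:
        raise ValueError("pi must lie in [0, 1]")
    return (1.0 - pi) * loss(true_label, student_out) + pi * loss(teacher_out, student_out)


# Hypothetical soft outputs over two candidate decisions.
student_out = [0.6, 0.4]
teacher_out = [0.2, 0.8]
true_label = [0.0, 1.0]
mixed = distillation_loss(student_out, teacher_out, true_label)
```

Minimizing such an objective with respect to the student parameters pulls the student toward both the labeled decisions and the rule-regularized teacher, which is the sense in which knowledge is migrated from the teacher network to the student network.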
According to an embodiment of the present disclosure, a specific form of the loss function loss(⋅) may be chosen as follows:
when solving a classification problem, a cross-entropy loss function may be chosen; and
when solving a regression problem, a squared-error loss function may be chosen,
where, in both cases, y is a real value, and ŷ is a prediction value.
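The two loss choices may be written, for example, as in the following Python sketch. The standard textbook definitions are used; the exact form used in the present disclosure (for example, any scaling factor or averaging) is not assumed, and the clipping constant `eps` is an illustrative addition for numerical safety.

```python
import math
from typing import Sequence


def cross_entropy(y: Sequence[float], y_hat: Sequence[float], eps: float = 1e-12) -> float:
    """Classification loss: -sum_j y_j * log(y_hat_j), clipped to avoid log(0)."""
    return -sum(t * math.log(max(p, eps)) for t, p in zip(y, y_hat))


def squared_error(y: float, y_hat: float) -> float:
    """Regression loss: (y - y_hat) ** 2."""
    return (y - y_hat) ** 2


# Hypothetical one-hot label against a predicted class distribution.
classification_loss = cross_entropy([0.0, 1.0], [0.25, 0.75])

# Hypothetical real value against a predicted value.
regression_loss = squared_error(3.0, 2.5)
```

With a one-hot label the cross-entropy reduces to the negative log-probability assigned to the true class, so confident wrong predictions are penalized heavily.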
The above technical solutions provided by the embodiments of the present disclosure have the following advantages over the prior art.
In the method provided by the invention, firstly, unstructured domain knowledge such as expert experience, mechanism knowledge and process specifications is expressed by using the weighted first-order logic rule and the probability soft logic. That is, the domain rule knowledge is represented by the formula $\forall D_1, D_2, \ldots, D_l, R.\; P_1(D_1, \ldots) \wedge P_2(D_2, \ldots) \wedge \cdots \wedge P_l(D_l, \ldots) \Rightarrow P_R(Q, R) : \lambda$, so as to support intelligent inference of the optimization decision-making by a computer. Secondly, the method of the present disclosure provides a new form of knowledge representation, shown by the formula $k = (H_1, H_2, \ldots, H_N) : \{A_1, A_2, \ldots, A_N\} : R$, which enables compact expression of the massive multi-source data of the industrial process, gives feature semantics to each multi-source data, and converts the multi-source data into knowledge containing semantic information, so as to support the optimization decision-making of the industrial process. Thirdly, the optimization decision-making model embedded with the domain rule knowledge of the industrial process is built by using the built domain rule knowledge base and the semantic knowledge base of the industrial process. This enables effective combination of the domain knowledge and the multi-source data semantic knowledge so that they collaboratively drive the optimization decision-making of the industrial process, accurately reveals the internal mechanism and evolutionary characteristics of the optimization decision-making, and achieves knowledge automation of the optimization decision-making of the industrial process.
Further, the method of the present disclosure provides a feasible solution to compression of the optimization decision-making knowledge inference model embedded with the domain rule knowledge, may obtain an equivalent lightweight model, and provides a real-time online optimization for the optimization decision-making problem of the industrial process.
In addition, the present disclosure provides a solution framework for solving the optimization decision-making problem by combining the domain knowledge with the industrial big data, effectively uses the unstructured domain knowledge of the industrial process and massive data generated by sensor networks, enhances robustness and accuracy of solving the optimization decision-making problem, and lays a solid foundation for the knowledge automation of the industrial process.
The accompanying drawings herein, which are incorporated in and constitute a part of this specification, illustrate embodiments according to the present disclosure and together with the specification serve to explain the inventive principles.
For the purpose of more clearly illustrating the embodiments of the present disclosure or the technical solution in the prior art, a brief description of the accompanying drawings to be used in describing the embodiments or the prior art is given below. Persons of ordinary skill in the art may obtain other accompanying drawings from these accompanying drawings without any inventive effort.
For the purpose of making the objects, technical solutions and advantages of the present disclosure clearer, the technical solutions in the embodiments of the present disclosure will be described clearly and completely below in conjunction with the accompanying drawings of embodiment of the invention. Obviously, the described embodiments are a part of the embodiments of the invention, and not all of the embodiments of the invention. Based on the embodiments of the invention, all other embodiments obtained by ordinary persons skilled in the art without inventive work fall within the protective scope of the invention.
As shown in
S1: acquiring the domain knowledge of the industrial process by using probability soft logic, and building a domain rule knowledge base of the industrial process;
S2: fusing multi-source data semantics and multi-source data features to form a new semantic knowledge representation of the industrial process, and constructing a semantic knowledge base of the industrial process;
S3: under a posteriori regularization framework, utilizing the domain rule knowledge base of the industrial process and the semantic knowledge base of the industrial process to obtain an optimization decision-making model embedded with domain rule knowledge and obtain a posteriori distribution model; and
S4: migrating knowledge in the optimization decision-making model embedded with the domain rule knowledge into the posteriori distribution model through the knowledge distillation technology.
Further, as shown in
S11: expressing properties, states, attributes and correlations therebetween of different production factors in a manufacturing process by using an N-ary predicate so as to construct a first-order logic rule;
S12: defining a weighted first-order logic rule to express the relationship between an optimization decision-making problem and the cause of the optimization decision-making problem; and
S13: performing weight learning by using the probability soft logic to acquire the domain rule knowledge of the industrial process.
Further, based on the above solution, a specific formula of the weighted first-order logic rule is
$$\forall D_1, D_2, \ldots, D_l, R.\; P_1(D_1, \ldots) \wedge P_2(D_2, \ldots) \wedge \cdots \wedge P_l(D_l, \ldots) \Rightarrow P_R(Q, R) : \lambda,$$
where $P_1$, $P_2$, $P_l$, $P_R$ are predicates; $D_1$, $D_2$, $D_l$, $R$ are variables; and $\lambda$ represents the weight. The weight indicates the importance of the first-order logic rule. That is, the larger the weight, the more important the first-order logic rule is. The specific value of the weight is a non-negative real number. Such a rule indicates that the states of the industrial process targets $D_1, D_2, \ldots, D_l$ under a certain condition cause the result of the optimization decision-making problem $Q$ to be $R$.
Further, based on the above solution, a specific expression form of the domain rule knowledge base of the industrial process is $K_R = \{(r_i, \lambda_i)\}_{i=1}^{m}$, where $m$ represents the size of the domain rule knowledge base, $r_i$ represents the i-th first-order logic rule, and $\lambda_i$ is the weight of the corresponding first-order logic rule.
Further, as shown in
S21: clustering the multi-source data by an unsupervised clustering method to extract the multi-source data semantics; and
the step of acquiring the multi-source data features includes:
S22: performing feature extraction on the multi-source data by using a convolutional autoencoder.
Further, based on the above solution, a specific form of the new semantic knowledge representation of the industrial process is $k = h_A : R$,
where the first half is a fusing feature $h_A$, with $h_A = (H_1, H_2, \ldots, H_N) : \{A_1, A_2, \ldots, A_N\}$; and $R$ is the decision-making semantics corresponding to the fusing feature $h_A$ and generally represents a specific decision-making result of the decision-making problem. Further, $(H_1, H_2, \ldots, H_N)$ is a feature of compact optimization for the multi-source data $(X_1, X_2, \ldots, X_N)$; and $\{A_1, A_2, \ldots, A_N\}$ is the semantics of the multi-source data $(X_1, X_2, \ldots, X_N)$.
Further, based on the above solution, a specific expression form of the semantic knowledge base of the industrial process is $K_S = \{k_i\}_{i=1}^{n}$, where $n$ represents the size of the semantic knowledge base, and $k_i$ represents the i-th knowledge element.
Generally, the domain knowledge of the industrial process reflects natural variation laws of the optimization decision-making process, while the knowledge implied by the multi-source data reflects dynamic variation laws of the manufacturing process. The internal mechanism and evolutionary characteristics of the optimization decision-making problem of the industrial process may be accurately revealed by effectively combining the domain knowledge and the innovative knowledge implied by the multi-source data.
Further, as shown in
S31: defining a mapping relation by ϕi: {ril(X,Y)}l=1L→ by using the domain rule knowledge base KR={(ri,λi)}i=1m of the industrial process to encode the domain knowledge of the industrial process;
S32: under the posteriori regularization framework, introducing ϕi:{ril(X,Y)}l=1L→ in the form of constraints into a learning process of the model, converting the optimization decision-making problem thereof into a constraint-optimization problem,
where C, λi and ξi are constants; KL(q(Y|X)∥pθ(Y|X)) is a form of f(X,Y) under the posteriori regularization framework; and KL(⋅) denotes the Kullback-Leibler divergence; and
S33: obtaining an optimization decision-making knowledge inference model embedded with the domain rule knowledge of the industrial process by solving the constraint-optimization problem;
where q(Y|X) represents the optimization decision-making model embedded with the domain rule knowledge, and pθ(Y|X) is the posteriori distribution model.
Further, as shown in
S41: defining the optimization decision-making model q(Y|X) embedded with the domain rule knowledge as a teacher network, and defining the posteriori distribution model pθ(Y|X) established by the multi-source data semantics as a student network;
S42: training the teacher network and the student network simultaneously with the domain rule knowledge base KR of the industrial process and the semantic knowledge base KS of the industrial process, where a parameter θ is updated as follows:
where loss(⋅) is a loss function; σθ is a soft output of the student network pθ(Y|X); Sit is a soft output of the teacher network q(Y|X); and CRi is a real decision-making semantics; and
performing iteration training of a teacher-student network by using the formula
and the formula
so as to migrate the knowledge from the teacher network to the student network.
Further, based on the above solution, a specific form of the loss function loss(⋅) may be chosen as follows:
when solving a classification problem, a cross-entropy loss function may be chosen; and
when solving a regression problem, a squared-error loss function may be chosen,
where, in both cases, y is a real value, and ŷ is a prediction value.
Embodiments
The energy flow network of the iron and steel industry is a complex and huge system. The energy flow network is in a dynamic state in the production process and is coupled with the material flow. The built model of the material flow and the energy flow of the whole process is required to be dynamically self-adaptive to meet actual needs. The operational big data of the whole process of the iron and steel contains the dynamic variation laws of the material flow and the energy flow, while the dynamic simulation of virtual data of the whole process may reflect the evolution characteristics of physical entities. A method of accurately modeling the material flow and the energy flow of the whole process of the iron and steel by fusing knowledge learning is provided herein. The method mines the innovative knowledge implied by the operational big data of the whole process of the iron and steel, forms the domain rule knowledge base and the semantic knowledge base, builds the optimization decision-making knowledge inference model embedded with the domain rule knowledge based on the domain rule knowledge base and the semantic knowledge base, and intelligently corrects the built dual drive model of the material flow and the energy flow of the whole process of the iron and steel, so as to realize the dynamic self-adaptation of the model and improve the accuracy of the model.
Hereinafter, the 3 parts mentioned above will be described in detail, respectively.
(1) The dual drive model of the material flow and the energy flow of the whole process of the iron and steel
The dual drive model of the material flow and the energy flow of the whole process of the iron and steel mainly fuses the mechanism model and the data model of the material flow and the energy flow of the whole process of the iron and steel by means of a certain method, so that the mechanism model and the data model complement each other, thereby enhancing the accuracy of modeling. Since the material flow and the energy flow have complex structures during the production process of the iron and steel, and there are characteristics such as multi-dimensional correlation, multi-field cooperation, multi-phase coupling, and the like between the material flow and the energy flow, an accurately built fusing model is the foundation of achieving collaborative optimization of production operation of the iron and steel. On one hand, in the invention, a “node-connector” network modeling method is utilized to construct the equipment, units, processes, and the like involved in the material flow into “nodes” of different scales, and to construct the piping, hot-metal tanks, ladles, and the like involved in the energy flow into “connectors”, so as to form a mechanism model with a coupling network of the material flow and the energy flow. On the other hand, semantic features of multi-source heterogeneous space-and-time data of the production process are extracted, and the mapping mechanism of the semantic features of the material flow and the energy flow is explored to build the data model. Finally, the mechanism model and the data model of the material flow and the energy flow of the whole process of the iron and steel are fused to form the dual drive model mechanism-data-driven (X;θ), where θ is the model parameter.
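The “node-connector” network described above may be represented, for example, by the following minimal Python sketch. The class names `Connector` and `FlowNetwork`, the sample equipment names and the flow values are hypothetical; the sketch only illustrates the graph structure and does not model the underlying mechanism equations.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Connector:
    """A carrier such as piping or a ladle that links two nodes."""
    name: str
    source: str         # node the flow leaves
    target: str         # node the flow enters
    energy_flow: float  # hypothetical flow magnitude in arbitrary units


@dataclass
class FlowNetwork:
    """Nodes stand for equipment, units or processes; connectors link them."""
    nodes: Set[str] = field(default_factory=set)
    connectors: List[Connector] = field(default_factory=list)

    def connect(self, connector: Connector) -> None:
        self.nodes.update((connector.source, connector.target))
        self.connectors.append(connector)

    def net_flow(self, node: str) -> float:
        """Inflow minus outflow at a node."""
        inflow = sum(c.energy_flow for c in self.connectors if c.target == node)
        outflow = sum(c.energy_flow for c in self.connectors if c.source == node)
        return inflow - outflow


network = FlowNetwork()
network.connect(Connector("hot_metal_ladle", "blast_furnace", "converter", 120.0))
network.connect(Connector("steel_ladle", "converter", "caster", 100.0))

# Energy retained or dissipated at the converter in this hypothetical example.
converter_balance = network.net_flow("converter")
```

A per-node balance of this kind is one simple quantity that a mechanism model built on the coupling network could expose to the data model.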
(2) The operation optimization method based on knowledge inference model parameters
The manufacturing process environment of the iron and steel is complex, and the built model is continuously changing. With the method provided by the invention, the parameter θ of the dual drive model mechanism-data-driven (X;θ) of the material flow and the energy flow of the whole process of the iron and steel is intelligently corrected. Firstly, the operational big data of the whole process of the iron and steel and the simulation data of the dynamic simulation module of the material flow and the energy flow of the whole process of the iron and steel are mined for knowledge so as to form the domain rule knowledge base and the semantic knowledge base.
The representation of the domain rule is shown as follows:
$$\forall D_1, D_2, \ldots, D_l, R.\; P_1(D_1, \ldots) \wedge P_2(D_2, \ldots) \wedge \cdots \wedge P_l(D_l, \ldots) \Rightarrow P_R(Q, R) : \lambda,$$
where $P_1$, $P_2$, $P_l$, $P_R$ are predicates; $D_1$, $D_2$, $D_l$, $R$ are variables; and $\lambda$ represents the weight. The weight indicates the importance of the first-order logic rule. That is, the larger the weight, the more important the first-order logic rule is. The specific value of the weight is a non-negative real number. Such a rule indicates that the states of the industrial process targets $D_1, D_2, \ldots, D_l$ under a certain condition cause the result of the optimization decision-making problem $Q$ to be $R$.
The representation of the semantic knowledge is shown as follows:
$$k = (H_1, H_2, \ldots, H_N) : \{A_1, A_2, \ldots, A_N\} : R,$$
where the first half is the fusing feature $h_A$, with $h_A = (H_1, H_2, \ldots, H_N) : \{A_1, A_2, \ldots, A_N\}$; and $R$ is the decision-making semantics corresponding to the fusing feature $h_A$ and generally represents a specific decision-making result of a certain decision-making problem.
The domain rule knowledge base KR and the semantic knowledge base KS are built from the above, and the optimization decision-making knowledge inference model embedded with the domain rule knowledge is shown as follows:
q*(R|X) represents the optimization decision-making model embedded with the domain rule knowledge, which is used as the teacher network; pθ(R|H,X) is the optimization decision-making model driven by the multi-source data semantic knowledge, which is used as the student network; and ϕi: {ril(X,Y)}l=1L→ is a mapping function of the constraints of the domain rule.
Determining the parameter of the dual drive model of the material flow and the energy flow of the whole process of the iron and steel is the optimization decision-making problem of the invention, and this problem is by nature a regression problem. An LSTM model is chosen as the student network pθ(R|H,X). From this, the optimization decision-making knowledge inference model for the parameter of the dual drive model of the material flow and the energy flow of the whole process of the iron and steel is obtained so as to intelligently correct the model parameter in real time.
(3) The dynamic simulation of the material flow and the energy flow of the whole process of the iron and steel
The whole process of the iron and steel, such as sintering, coking, iron-making, steel-making, hot-rolling and cold-rolling, is modelled by utilizing the built dual drive model of the material flow and the energy flow of the whole process of the iron and steel to establish the dynamic simulation system. The dynamic simulation system performs dynamic interactive iteration mapping with the actual production process, so that a digital twin system of the material flow and the energy flow of the whole process of the iron and steel is formed. The dynamic simulation data generated by the digital twin system serves as a data source supporting the operation optimization of the knowledge inference model parameters.
In the invention, it should be noted that relational terms such as “first” and “second” are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply any such actual correlation or sequence between these entities or operations. Further, terms “include”, “comprise” or any other variation thereof are intended to cover non-exclusive inclusion, such that a process, method, article, or device that includes a series of elements includes not only those elements, but also other elements that are not explicitly listed, or elements inherent for such process, method, article, or device. Without further limitations, the elements defined by the statement “including a...” do not exclude the existence of other same elements in the process, method, article or device including the elements.
The above is only the specific embodiments of the invention, which enables persons skilled in the art to understand or realize the invention. Various modifications to these embodiments will be apparent to persons skilled in the art, and the general principles defined herein can be implemented in other embodiments without departing from the spirit or scope of the invention. Hence, the present disclosure will not be limited to these embodiments shown herein, but will conform to the widest scope consistent with the principles and novel features applied herein.
Number | Date | Country | Kind |
---|---|---|---|
202110186760.6 | Feb 2021 | CN | national |