The present disclosure belongs to the technical field of artificial intelligence in industry, and relates to an adaptive deep learning-based intelligent prediction method, apparatus, and device for a complex industrial system, and a storage medium.
In order to achieve closed-loop optimization decision-making in a production process, it is necessary to online predict production indicators representing quality, efficiency, and energy consumption of a product, and key process parameters. Due to a short time cycle of closed-loop optimization decision-making in an industrial process, a prediction model for the production indicators and the key process parameters is required to provide a predictive value in the decision-making time cycle. This requires that a training dataset of a deep learning prediction model should not be too large, and a training algorithm should not take too long. In addition, due to the complexity of a manufacturing process, the production indicators, the key process parameters, and the input and output variables of a related production process are complex dynamic systems. The dynamic system often has strong nonlinearity and strong coupling of a plurality of variables. A model structure and orders of the input and output variables are unknown or even changing, production boundary conditions such as production raw materials change, and a material flow, an information flow, and an energy flow in the production process act upon each other. This causes an unknown change to a characteristic of the dynamic system with production time, and as a result, the input and output data of the system is in changing, open, and uncertain information space. This makes it impossible to apply an existing deep learning technology with complete information space to the above complex industrial dynamic system to establish a prediction model for the complex industrial dynamic system.
The present disclosure is intended to resolve at least one of technical problems in the related art. Technical solutions of the present disclosure are as follows:
An adaptive deep learning-based intelligent prediction method for a complex industrial system includes following steps:
establishing a dynamic model for a complex industrial system;
establishing an offline deep learning prediction model by using the dynamic model;
establishing an online deep learning prediction model by using the offline deep learning prediction model;
establishing a deep learning correction model based on a structure that is the same as a structure of the online deep learning prediction model; and
correcting the online deep learning prediction model by using the deep learning correction model; where
the online deep learning prediction model is configured to predict a parameter of the complex industrial system in real time.
Further, preferably, the establishing a dynamic model for a complex industrial system includes: determining an input variable and an output variable of the dynamic model, where the output variable is a predicted variable; the establishing an offline deep learning prediction model by using the dynamic model includes: establishing the offline deep learning prediction model by using a long-short term memory (LSTM) network, using the input variable of the dynamic model as an input of the LSTM network, using output data of the dynamic model as labels, and determining a neuron quantity, a cell node quantity, and a network layer quantity of the LSTM network, and weight and bias parameters of each layer by using an offline training algorithm based on an error between the labels and an output of the offline deep learning prediction model; the establishing an online deep learning prediction model by using the offline deep learning prediction model includes: establishing the online deep learning prediction model by using an LSTM network, where an input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the online deep learning prediction model are all the same as those of the offline deep learning prediction model; using weight and bias parameters of each layer of the offline deep learning prediction model as initial values of weight and bias parameters of the corresponding layer of the online deep learning prediction model; and correcting weight and bias parameters of a last layer of the online deep learning prediction model online by using the online training algorithm based on an error between the labels and an output of the online deep learning prediction model; the establishing a deep learning correction model based on a structure that is the same as a structure of the online deep learning prediction model includes: establishing the deep learning correction model by using an LSTM network, where an input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the deep learning correction model are all the same as those of the online deep learning prediction model; and correcting weight and bias parameters of each layer of the deep learning correction model in real time by using a training algorithm based on an error between the labels and an output of the deep learning correction model; and the correcting the online deep learning prediction model by using the deep learning correction model includes: when a preset condition is met, replacing weight and bias parameters of each layer of the online deep learning prediction model with weight and bias parameters of the corresponding layer of the deep learning correction model; where historical data input into the deep learning correction model is more than historical data input into the online deep learning prediction model.
Further, preferably, the correcting weight and bias parameters of a last layer of the online deep learning prediction model online is specifically correcting some weight parameters and some bias parameters of the last layer of the online deep learning prediction model online.
Further, preferably, the complex industrial system is an alumina production system, and the online deep learning prediction model is configured to predict a caustic concentration detection error of the alumina production system in real time; and the caustic concentration detection error is a difference between a laboratory value of a caustic concentration and a caustic concentration measured by an online caustic concentration detection instrument.
An adaptive deep learning-based intelligent prediction apparatus for a complex industrial system includes:
a dynamic model establishment module configured to establish a dynamic model for a complex industrial system;
an offline deep learning prediction model establishment module configured to establish an offline deep learning prediction model by using the dynamic model;
an online deep learning prediction model establishment module configured to establish an online deep learning prediction model by using the offline deep learning prediction model;
a deep learning correction model establishment module configured to establish a deep learning correction model based on a structure that is the same as a structure of the online deep learning prediction model; and
a self-correction module configured to correct the online deep learning prediction model by using the deep learning correction model; where
the online deep learning prediction model is configured to predict a parameter of the complex industrial system in real time.
Further, preferably, the dynamic model establishment module determines an input variable and an output variable of the dynamic model, where the output variable is a predicted variable; the offline deep learning prediction model establishment module establishes the offline deep learning prediction model by using an LSTM network, uses the input variable of the dynamic model as an input of the LSTM network, uses output data of the dynamic model as labels, and determines a neuron quantity, a cell node quantity, and a network layer quantity of the LSTM network, and weight and bias parameters of each layer by using an offline training algorithm based on an error between the labels and an output of the offline deep learning prediction model; the online deep learning prediction model establishment module establishes the online deep learning prediction model by using an LSTM network, where an input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the online deep learning prediction model are all the same as those of the offline deep learning prediction model; uses weight and bias parameters of each layer of the offline deep learning prediction model as initial values of weight and bias parameters of the corresponding layer of the online deep learning prediction model; and corrects weight and bias parameters of a last layer of the online deep learning prediction model online by using the online training algorithm based on an error between the labels and an output of the online deep learning prediction model; the deep learning correction model establishment module establishes the deep learning correction model by using an LSTM network, where an input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the deep learning correction model are all the same as those of the online deep learning prediction model; and corrects weight and bias parameters of each layer of the deep learning correction model in real time by using a training algorithm based on an error between the labels and an output of the deep learning correction model; and when a preset condition is met, the self-correction module replaces weight and bias parameters of each layer of the online deep learning prediction model with weight and bias parameters of the corresponding layer of the deep learning correction model; where more historical data is input into the deep learning correction model compared with the online deep learning prediction model.
Further, preferably, the correcting weight and bias parameters of a last layer of the online deep learning prediction model online is specifically correcting some weight parameters and some bias parameters of the last layer of the online deep learning prediction model online.
Further, preferably, the complex industrial system is an alumina production system, and the online deep learning prediction model is configured to predict a caustic concentration detection error of the alumina production system in real time; and the caustic concentration detection error is a difference between a laboratory value of a caustic concentration and a caustic concentration measured by an online caustic concentration detection instrument.
An adaptive deep learning-based intelligent prediction device for a complex industrial system to implement the above intelligent prediction method, where the device includes an end subdevice, an edge subdevice, and a cloud subdevice;
the end subdevice is configured to collect input data and output data of the complex industrial system;
the edge subdevice is configured to predict a parameter of the complex industrial system in real time by using an online deep learning prediction model; and
the cloud subdevice is configured to train a deep learning correction model and correct the online deep learning prediction model by using the deep learning correction model.
A computer-readable storage medium storing a computer program is provided, where the computer program is executed by a processor to perform the above intelligent prediction method for a complex industrial system.
In order to resolve problems of low prediction accuracy and poor real-time prediction performance of the complex industrial system, the present disclosure establishes the offline deep learning prediction model, the online deep learning prediction model, the deep learning correction model, and a self-correction mechanism to achieve accurate real-time prediction of the complex industrial system.
The technical solutions in the embodiments of the present disclosure are clearly and completely described below with reference to the drawings in the embodiments of the present disclosure. Apparently, the described embodiments are merely a part rather than all of the embodiments of the present disclosure. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present disclosure without creative efforts should fall within the protection scope of the present disclosure.
S1: Establish a dynamic model for a complex industrial system.
Specifically, through mechanism analysis of an industrial process, a to-be-predicted production indicator or key process parameter is used as an output variable of the dynamic model for the industrial system, and an input and an output of the industrial process that affect the output variable are used as input variables of the dynamic model, historical output data and historical prediction error data of the dynamic model are used as input variables of the dynamic model, and an unknown constant n is used to represent an order of an unknown change in the input and output variables of the dynamic model. The dynamic model for the industrial system is expressed as follows:
s(k)=f(s(k−1), . . . ,s(k−n),yi(k), . . . ,yi(k−n+1),ui(k), . . . ,ui(k−n+1),Δs(k−1), . . . ,Δs(k−n)) (1)
In the above formula, f represents a nonlinear function of the unknown change; s(k) represents an output of the dynamic model at a kth time point; yi(k) represents an ith output of the industrial process at the kth time point, ui(k) represents an ith input of the industrial process at the kth time point, and i=1, . . . , m; and Δs(k−1)=s(k−1)−ŝ(k−1) represents a prediction error at a k−1th time point, in other words, a difference between an output s(k−1) of the dynamic model and an output ŝ(k−1) of a prediction model at the k−1th time point.
S2: Establish an offline deep learning prediction model by using the dynamic model.
Specifically, the offline deep learning prediction model is established by using an LSTM network, the input variable of the dynamic model is used as an input of the LSTM network, output data of the dynamic model is used as labels, and a neuron quantity, a cell node quantity, and a network layer quantity of the LSTM network, and weight and bias parameters of each layer are determined by using an offline training algorithm based on an error between the labels and an output of the offline deep learning prediction model.
The step S2 includes substeps S21 and S22.
Step S21: Establish the offline deep learning prediction model by an LSTM network structure, set an initial network layer quantity of the LSTM network to 1, and determine the neuron quantity and the cell node quantity of the LSTM network by using a training algorithm based on a difference between the labels and the output of the offline deep learning prediction model.
Specifically, the input variable in the formula (1) is selected as an input xj(k+j−n) (j=1, . . . , n) of a jth single neuron, with an order n being the neuron quantity, in other words,
x
j(k+j−n)=[s(k+j−n−1),yi(k+j−n),ui(k+j−n),Δs(k+j−n−1))]T (2)
In the above formula, j=1, . . . , n; i=1, . . . , m.
The output data s(k) of the dynamic model (which is represented by the formula (1)) for the industrial system is used as the labels, the input and output data of the formula (1) is used to form a big data sample. The offline training algorithm is used to minimize the difference between the labels and the output of the offline deep learning prediction model, thereby determine the neuron quantity n and the cell node quantity
Step S22: Set the neuron quantity and the cell node quantity of the LSTM network to fixed values, and change the network layer quantity of the LSTM network, specifically, select the network layer quantity of the LSTM network based on differences between the labels and the output of the offline deep learning prediction model that correspond to different network layer quantities.
Specifically, the neuron quantity n and the cell node quantity
S3: Establish an online deep learning prediction model by using the offline deep learning prediction model.
Specifically, the online deep learning prediction model is established by using an LSTM network, where an input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the online deep learning prediction model are all the same as those of the offline deep learning prediction model; weight and bias parameters of each layer of the offline deep learning prediction model are used as initial values of weight and bias parameters of the corresponding layer of the online deep learning prediction model; and weight and bias parameters of a last layer of the online deep learning prediction model are corrected online by using a time sequence N with a fixed data amount, to ensure that the online deep learning prediction model completes a prediction algorithm within a determined optimization decision-making time period. The training algorithm is used to determine the N by minimizing the prediction error. It is obtained, by using a dataset with a time sequence length of the N and a recursive algorithm, that a time sequence of input data of the online deep learning prediction model at the kth time point is (k−N+1), . . . , k and a time sequence of the input data of the online deep learning prediction model at a (k+1)th time point is (k−N+2), . . . , (k+1). The online deep learning prediction model is configured to predict a parameter of the complex industrial system in real time.
S4: Establish a deep learning correction model based on a structure that is the same as a structure of the online deep learning prediction model.
Specifically, the deep learning correction model is established by using an LSTM network. An input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the deep learning correction model are all the same as those of the online deep learning prediction model. Input data of the model (which is represented by the formula (1)) at a current time points and all previous time points is used as input data of the deep learning correction model, all weight and bias parameters of each layer of the deep learning correction model are trained to obtain a predictive value {tilde over (s)}(k) and a prediction error Δ{tilde over (s)}(k)=s(k)−{tilde over (s)}(k) of the deep learning correction model.
S5: Correct the online deep learning prediction model by using the deep learning correction model.
Specifically, the deep learning correction model is used to adaptively correct the online deep learning prediction model, and an upper limit of a prediction error range is set to δ. When a prediction error of the online deep learning prediction model meets |Δs(k)|≥δ, and the prediction error of the deep learning correction model meets |Δ{tilde over (s)}(k)|<δ, weight and bias parameters of each layer of the deep learning correction model are used to replace weight and bias parameters of the corresponding layer of the online deep learning prediction model to ensure that the prediction error of the online deep learning prediction model is within a specified prediction error range, namely |Δ{tilde over (s)}(k)|<δ.
Further, in an embodiment, the intelligent prediction method for a complex industrial system can be used to predict a caustic concentration detection error of an alumina production system.
Alumina has excellent properties such as high hardness and high melting point, and is commonly used to smelt metallic alumina and manufacture refractory materials. The alumina is a strategic resource that plays a supporting role in the military industry, aerospace, and national economy. At present, a main method for producing the alumina is the Bayer process. According to the Bayer process, crushed bauxite with lime and a caustic solution added according to a required ratio is usually ground. Then, bauxite is dissolved from the caustic solution at a certain temperature and pressure to produce a sodium aluminate solution. An aluminum hydroxide crystal is decomposed and precipitated by purifying the sodium aluminate solution, cooling a purified sodium aluminate solution, adding a seed crystal to the sodium aluminate solution, and stirring the sodium aluminate solution. A precipitated aluminum hydroxide is separated, washed, and roasted to obtain the alumina. A mother liquor (mainly composed of caustic) obtained by separating the aluminum hydroxide undergoes an evaporation process to dissolve new bauxite for a next cycle.
A caustic concentration of an alumina solution is a key process indicator in an evaporation process of the alumina, which is related to final product quality of the alumina. Daily caustic concentration detection is to obtain an accurate caustic concentration value through manual sampling based on a fixed cycle and subsequent testing. However, due to a long sampling interval and long testing time, the caustic concentration cannot be detected in a timely manner, which makes it impossible to achieve operation optimization and control of the evaporation process.
In order to achieve the operation optimization and control of the evaporation process, some alumina enterprises have introduced an online caustic concentration measurement apparatus at an expensive price. In actual production, a grade change of the bauxite and an operation change of a production process of the bauxite result in a significant difference between a caustic concentration measured by the measurement apparatus and a laboratory result. As a result, the caustic concentration measured by the measurement apparatus cannot be used. Due to unknown strong nonlinearity of a dynamic property of the error, an unknown model order, frequent fluctuations in production boundary conditions such as raw materials, and interactions between various process flows and materials, a characteristic of an error dynamic system undergoes an unknown change with production time. As a result, input and output data of the system is in changing, open, and uncertain information space. This makes it impossible to apply an existing deep learning technology with complete information space to a dynamic system for predicting a caustic concentration error in the evaporation process of the alumina. In addition, due to a short time cycle of operation optimization decision-making in an industrial process of producing the alumina, a prediction model for production indicators and key process parameters is required to provide a predictive value in the decision-making time cycle. This requires that a training dataset of the deep learning prediction model should not be too large, and the training algorithm should not take too long.
In order to resolve problems of low prediction accuracy and poor real-time prediction performance of the alumina production system, the offline deep learning prediction model, the online deep learning prediction model, the deep learning correction model, and a self-correction mechanism have been established to achieve accurate real-time prediction of the alumina production system.
S1′: Establish a dynamic model for a detection error between a laboratory value of a caustic concentration and a caustic concentration measured by an online caustic concentration detection instrument.
Specifically, since the online caustic concentration detection instrument takes a refractive index and a temperature of an alumina solution as inputs, and generates a measured concentration value by using the model. Therefore, the refractive index and the temperature of the alumina solution are used as inputs of the dynamic model for the detection error, and a historical difference between the laboratory value of the caustic concentration and the caustic concentration measured by the caustic concentration detection instrument is used as an input of the dynamic model for the detection error. An unknown constant n is used to represent an unknown order of input and output variables of the dynamic model, and the dynamic model for the detection error of the caustic concentration is established as follows:
Δr(k)={circumflex over (f)}(y1(k), . . . ,y1(k−n+1),y2(k), . . . ,y2(k−n+1),Δr(k−1), . . . ,Δr(k−n)) (3)
In the above model, {circumflex over (f)} represents a nonlinear function of an unknown change; y1(k) represents a refractive index of the alumina solution at a kth time point; y2(k) represents a temperature of the alumina solution at the kth time point; and Δr(k)=r(k)−
S2′: Establish an offline deep learning prediction model by using the dynamic model for the detection error.
Specifically, the offline deep learning prediction model is established by using an LSTM network, the input variable of the dynamic model for the detection error is used as an input of the LSTM network, output data of the dynamic model for the detection error is used as labels, and a neuron quantity, a cell node quantity, and a network layer quantity of the LSTM network, and weight and bias parameters of each layer are determined by using an offline training algorithm based on an error between the labels and an output of the offline deep learning prediction model.
The step ST includes substeps S21′ and S22′.
Step S21′: Establish the offline deep learning prediction model by an LSTM network structure, set an initial network layer quantity of the LSTM network to 1, and determine the neuron quantity and the cell node quantity of the LSTM network by using a training algorithm based on the difference between the labels and the output of the offline deep learning prediction model.
Specifically, a jth input variable in the formula (3) is selected as an input xj(k+j−n) (j=1, . . . , n) of a P single neuron, namely:
x
j(k+j−n)=[y1(k+j−n),y2(k+j−n),Δr(k+j−n−1)]T (4)
In the above formula, j=1, . . . , n, and n represents the neuron quantity.
The neuron quantity n and the cell node quantity
An LSTM network neural network with a layer quantity being 1 is selected, and an error between labels Δr(k) at the kth time point and an output Δ{circumflex over (r)}0(k) of the offline deep learning prediction model at the kth time point is minimized by using a big data sample constituted by the input and output variables shown in the formula (3), taking an error Δr(k) between the laboratory value of the caustic concentration and the caustic concentration measured by the measurement instrument as the labels, and using the training algorithm, to determine the n and the
An objective function of the training algorithm is as follows:
In the above objective function, M represents an amount of training data.
The labels Δr(k) is obtained according to a following formula:
Δr(k)=r(k)−
A predictive value Δ{circumflex over (r)}0(k) of the offline deep learning prediction model is a weighted expression of an output h(k) of an nth neuron:
Δ{circumflex over (r)}0(k)=Wd×h(k)+bd (7)
In the above expression, h(k) represents an
h(k)=ok*tanh(C(k)) (8)
In the above formula, ok represents an input of an output gate, and ok represents an
o
k=σ(Wo·[h(k−1),xj(k)]T+bo) (9)
In the above formula, h(k−1) represents an output of an (n−1)th neuron, [h(k−1), xj(k)]T represents an (
C(k) represents a long-term memory state, C(k) represents an it
ci(k) represents an ith element of the vector C(k), and i∈[1, L
C(k)=fk*C(k−1)+ik* (10)
In the above formula, fk, ik, and each represent an
f
k=σ(Wf·[h(k−1),xj(k)]T+bf)
i
k=σ(Wi·[h(k−1),xj(k)]T+bi)
=tanh(WC·[h(k−1),xh(k)]T+bC) (11)
In the above formulas, Wf, Wi, and WC are connection weights of an LSTM network cell, which are all the
It is specified that n=1,2L 22 and
Step S22′: Constantly set the neuron quantity n of the offline deep learning prediction model to 20 and the cell node quantity
The objective function of the training algorithm is shown in the formula (5), and an expression of the labels is shown in the formula (6). The predictive value Δ{circumflex over (r)}0(k) of the offline deep learning prediction model is a weighted expression of an output hL(k) of a 20th neuron of an Lth layer of the LSTM network:
Δ{circumflex over (r)}0(k)=WdL×hL(k)+bdL (12)
In the above expression, hL(k) represents a 180×1 vector, WdL represents a weight parameter, WdL represents a 1×180 vector, and bdL represents a bias parameter.
h
L(k)=okL*tanh(CL(k)) (13)
In the above formula, okL represents an input of the output gate, and okL represents a 180×1 vector.
o
k
L=σ(WoL·[hL(k−1),hL−1(k)]T+boL) (14)
In the above formula, hL(k −1) represents an output of a 19th neuron of an Lth layer of the LSTM network neural network, and hL−1(k) represents an output of a 20th neuron of an (L−1)th layer of the LSTM network neural network, and an input of the 20th neuron of the Lth layer of the LSTM network.
CL(k) represents a long-term memory state, and CL(k) represents a 180 ×1 vector.
C
L(k)=fkL*CL(k−1)+ikL* (15)
In the above formula, fkL, ikL, and each represent a 180×1 vector, respectively, which are calculated according to following formulas:
f
k
L=σ(WfL·[hL(k−1),hL−1(k)]T+bfL)
i
k
L=σ(WiL·[hL(k−1),hL−1(k)]T+biL)
=tanh(WCL·[hL(k−1),hL−1(k)]T+bCL) (16)
In the above formulas, connection weights WfL, WiL, WCL, and WoL of an LSTM network cell each are a 180×360 matrix, and biases bfL, biL, bCL, and boL of the LSTM network cell are a 180×1 vector, respectively.
It is specified that L=1,2,3,4, and the formulas (5), (6), and (12) to (16) are used to minimize the formula (5) by using the gradient descent algorithm. An experimental result is shown in Table 1. When the layer quantity L of the LSTM network neural network is 2, highest prediction accuracy is achieved and training time is short. Therefore, it is determined that there are two neural network layers, and connection weight and bias parameters of each layer of the offline deep learning prediction model are also determined.
S3′: Establish an online deep learning prediction model by using the offline deep learning prediction model.
Specifically, the online deep learning prediction model is established by using an LSTM network. An input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the online deep learning prediction model are all the same as those of the offline deep learning prediction model. Weight and bias parameters of each layer of the offline deep learning prediction model are used as initial values of weight and bias parameters of the corresponding layer of the online deep learning prediction model. A connection weight Wd2 and a bias bd2 of a second layer of the online deep learning prediction model are corrected online.
The online deep learning prediction model is as follows:
Δ{circumflex over (r)}1=Wd2(k)×h2(k)+bd2(k) (17)
In the above online deep learning prediction model, Wd2(k) represents a correction value of Wd2 at the kth time point, Wd2(k)∈, bd2(k) represents a correction value of bd2 at the kth time point, and h2(k) represents an output of a last neuron in a second layer of the LSTM network cell. To ensure that a prediction algorithm is completed online within a specified prediction cycle, a length N of a time sequence window of input data of the online deep learning prediction model is determined through traversal.
An objective function is as follows:
Correction algorithms for the Wd2 and the bd2 are as follows:
It is specified that N=500,L,900. The Wd2 and the bd2 are corrected by using the above algorithms. An experimental result of the prediction error is calculated by using the formulas (17) and (18), as shown in
Correspondingly, an online deep learning prediction model for a caustic concentration detection error at a (k+1)th time point is as follows:
Δ{circumflex over (r)}1(k+1)=Wd2(k+1)×h2(k+1)+bd2(k+1) (21)
The online deep learning prediction model at the (k+1)th time point uses input data of a time sequence (k−818), (k−817), . . . , (k+1) with N=820 at the (k+1)th time point, and uses following algorithms to correct the weight parameter Wd2(k+1) and the bias parameter bd2(k+1). The predictive value Δ{circumflex over (r)}1(k+1) of the caustic concentration detection error at the (k+1)th time point is obtained according to the formula (21):
In the above algorithms, η represents a learning rate of parameter correction in the online deep learning prediction model, and η=0.0005.
S4′: Establish a deep learning correction model based on a structure that is the same as a structure of the online deep learning prediction model.
Specifically, the deep learning correction model is established by using an LSTM network. An input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the deep learning correction model are all the same as those of the online deep learning prediction model. All input data of the model (which is represented by the formula (3)) at the current VII time point and all previous time points k,L,2,1 is used as input data of the deep learning correction model, and all weight and bias parameters of a first layer and a second layer of the deep learning correction model are corrected by using a following objective function and training algorithm.
The objective function is as follows:
In the above objective function, k represents the current time point, and Δ represents an output of the deep learning correction model at a tth time point.
Correction algorithms for the Wd2, bd2, Wo2, and bo2 are as follows:
In the above algorithms, η represents a learning rate of parameter correction in the correction model, and η=0.0005. The above correction algorithms can be used to correct remaining connection weight parameters Wf, Wi, WC, Wo, Wd, Wf2, Wi2, WC2 and bias parameters bf, bi, bC, bo, bd, bf2, bi2, bC2 of the first layer and the second layer of the deep learning correction model.
S5′: Correct the online deep learning prediction model by using the deep learning correction model.
Specifically, the online deep learning prediction model is adaptively corrected by using the deep learning correction model. An upper limit of a prediction error range is set to δ,δ=1.5 g/l, and a range at an ith sampling time point is set to [k, k−1,L k−99]. Within 100 consecutive sampling points at a latest time point, when a quantity of sampling points whose prediction errors |Δr(i)−Δ{circumflex over (r)}1(i)| do not exceed the upper limit of the range in the online deep learning prediction model is less than 99, and there are 99 sampling points whose prediction errors |Δr(i)−Δ)| do not exceed the upper limit of the range in the deep learning correction model, weight and bias parameters of each layer of the deep learning correction model are used to correct weight and bias parameters of each layer of the online deep learning prediction model to ensure that the prediction error of the online deep learning prediction model is within a specified prediction error range.
An effect of applying the prediction method for a caustic concentration detection error in the embodiments of the present disclosure to an evaporation process of an alumina plant in Shanxi is shown in Table 2.
An instrument measurement value in Table 2 is a value measured by a caustic concentration instrument online, and a compensated instrument value is a sum of the value measured by the caustic concentration instrument online and a predictive caustic concentration detection error output by the online deep learning prediction model. Table 2 takes statistics on a root mean square error (RMSE) between the instrument measurement value and the laboratory value of the caustic concentration, a qualification rate of the instrument measurement value within a specified error range of the production process, an RMSE between a compensated instrument value and the laboratory value of the caustic concentration, and a qualification rate of the compensated measurement value within a specified error range of the production process separately. It can be seen from Table 2 that after the prediction method for a caustic concentration detection error in the embodiments of the present disclosure is used to compensate for the instrument measurement value, the RMSE between the caustic concentration obtained through instrument measurement and the laboratory value of the caustic concentration can be reduced from 11.25 to 0.50, and the qualification rate can be increased from 10.75% to 99.62%, creating a condition for achieving closed-loop operation and optimization control of the evaporation process of the alumina.
In an embodiment, as shown in
The dynamic model establishment module is configured to establish a dynamic model for a complex industrial system.
The offline deep learning prediction model establishment module is configured to establish an offline deep learning prediction model by using the dynamic model.
The online deep learning prediction model establishment module is configured to establish an online deep learning prediction model by using the offline deep learning prediction model.
The deep learning correction model establishment module is configured to establish a deep learning correction model based on a structure that is the same as a structure of the online deep learning prediction model.
The self-correction module is configured to correct the online deep learning prediction model by using the deep learning correction model.
The online deep learning prediction model is configured to predict a parameter of the complex industrial system in real time.
In an embodiment, the dynamic model establishment module determines an input variable and an output variable of the dynamic model, where the output variable is a predicted variable. The offline deep learning prediction model establishment module establishes the offline deep learning prediction model by using an LSTM network, uses the input variable of the dynamic model as an input of the LSTM network, uses output data of the dynamic model as labels, and determines a neuron quantity, a cell node quantity, and a network layer quantity of the LSTM network, and weight and bias parameters of each layer by using an offline training algorithm based on an error between the labels and an output of the offline deep learning prediction model. The online deep learning prediction model establishment module establishes the online deep learning prediction model by using an LSTM network, where an input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the online deep learning prediction model are all the same as those of the offline deep learning prediction model; uses weight and bias parameters of each layer of the offline deep learning prediction model as initial values of weight and bias parameters of the corresponding layer of the online deep learning prediction model; and corrects weight and bias parameters of a last layer of the online deep learning prediction model online by using the online training algorithm based on an error between the labels and an output of the online deep learning prediction model. The deep learning correction model establishment module establishes the deep learning correction model by using an LSTM network, where an input of a single neuron, a neuron quantity, a cell node quantity, and a network layer quantity of the deep learning correction model are all the same as those of the online deep learning prediction model; and corrects weight and bias parameters of each layer of the deep learning correction model in real time by using a training algorithm based on an error between the labels and an output of the deep learning correction model. When a preset condition is met, the self-correction module replaces weight and bias parameters of each layer of the online deep learning prediction model with weight and bias parameters of the corresponding layer of the deep learning correction model. Historical data input into the deep learning correction model is more than historical data input into the online deep learning prediction model.
In an embodiment, the correcting weight and bias parameters of a last layer of the online deep learning prediction model online is specifically correcting some weight parameters and some bias parameters of the last layer of the online deep learning prediction model online.
In an embodiment, the complex industrial system is an alumina production system, and the online deep learning prediction model is configured to predict a caustic concentration detection error of the alumina production system in real time. The caustic concentration detection error is a difference between a laboratory value of a caustic concentration and a caustic concentration measured by an online caustic concentration detection instrument.
For specific limitations on the intelligent prediction apparatus for a complex industrial system, reference may be made to the above limitations on the intelligent prediction method for a complex industrial system. Details are not described herein again. The modules of the intelligent prediction apparatus for a complex industrial system may be implemented in whole or in part by software, hardware, or any combination thereof. The modules may be embedded in or independent of a processor of a computer device in a form of hardware, or stored in a memory of the computer device in a form of software, such that the processor can easily invoke and execute corresponding operations of the modules.
In an embodiment, as shown in
In an embodiment, a computer-readable storage medium is provided. The computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement the intelligent prediction method for a complex industrial system in the above embodiments.
Those skilled in the art may combine different embodiments described in this specification and characteristics of the different embodiments without mutual contradiction.
To sum up, according to the intelligent prediction method, apparatus, and device for a complex industrial system in the embodiments of the present disclosure, in order to resolve problems of low prediction accuracy and poor real-time prediction performance of the complex industrial system, the offline deep learning prediction model, the online deep learning prediction model, the deep learning correction model, and a self-correction mechanism have been established to achieve accurate real-time prediction of the complex industrial system.
The above are merely specific implementations of the present disclosure, and the protection scope of the present disclosure is not limited thereto. Any modification or replacement easily conceived by those skilled in the art within the technical scope of the present disclosure should fall within the protection scope of the present disclosure. Therefore, the protection scope of the present disclosure shall be subject to the protection scope of the claims.
Number | Date | Country | Kind |
---|---|---|---|
202011435304.2 | Dec 2020 | CN | national |
The present application is a Continuation Application of PCT Application No. PCT/CN2021/136373 filed on Dec. 8, 2021, which claims the benefit of Chinese Patent Application No. 202011435304.2 filed on Dec. 10, 2020. All the above are hereby incorporated by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2021/136373 | Dec 2021 | US |
Child | 18208881 | US |