The present disclosure relates to the technical field of soft sensor of industrial process product quality indicators, in particular to an industrial process soft sensor method based on a federated stochastic configuration network.
In order to lower production cost and improve production efficiency and quality, a soft sensor technology that can accurately predict the product quality index in real time is an important research direction in the field of complex industrial process control today, and has a far-reaching significance and a practical application value. Due to the problem of insufficient data in the field of complex industry, if multi-party enterprises do not exchange and integrate data, the effect indicators obtained by artificial intelligence model training and prediction are not ideal and difficult to apply. With the further development of big data, paying attention to data privacy and security has become a worldwide trend. Therefore, all countries are strengthening the protection of data security and privacy. A new law, General Data Protection Regulation (GDPR), recently introduced by the European Union shows that the increasingly strict management of user data privacy and security will be a world trend, which brings unprecedented challenges to the field of artificial intelligence. Federated learning, as a machine learning framework, can achieve a unified machine learning model trained from local data of multiple participants under the premise of protecting data privacy. Therefore, in privacy-sensitive scenarios (including financial industry, industry and many other data perception scenarios), the federated learning has shown excellent application prospects. At present, the federated learning is mainly combined with deep learning, but a deep algorithm itself has some bottleneck problems that are difficult to solve, such as falling into local minima, strong dependence on the setting of initial parameters, gradient disappearance and gradient explosion, so that it is difficult to give full play to the strong learning ability of neural networks. As an advanced single hidden layer random weight network with universal approximation property, a stochastic configuration network has been proved to have obvious advantages in compactness, fast learning and generalization performance by a large number of regression and classification experiments.
The objective of the present disclosure: in view of small amount of data of existing industrial process products and the difficulty in concentrating data training of all parties, the present disclosure provides an industrial process soft sensor method based on a federated stochastic configuration network, including the following steps:
In step 1, a total of K factories are set to participate in the federated training, and for the kth factory, nk groups of historical industrial process auxiliary data Xk and the corresponding product quality data Tk are obtained, denoted as {Xk,Tk}; the historical industrial process auxiliary data xik in the ith group of the kth factory contains d auxiliary process variables, the corresponding product quality data ti contains m product quality data, and if the value of i is 1 to nk, then an input sample matrix is Xk={x1k, x2k, . . . , xik, . . . xn
In step 1, all the K factories perform the same industrial process; and most of the same industrial processes adopt the same process flow and process equipment, which have characteristic similarity.
In step 1, the initialized parameters required for local stochastic configuration network learning include: a maximum number of hidden layer nodes Lmax, a maximum number of random configuration times Tmax, an expected tolerance ε, a hidden layer parameter random configuration range Y={λmin:Δλ:λmax}, a learning parameter r, an activation function g(.), and an initial residual e0=Tk, where λmin is a lower limit of allocation interval of random parameters, λmax is an upper limit of allocation interval of the random parameters, and Δλ is an incremental parameter of allocation interval of the random parameters.
Step 1 further includes:
Step 1 further includes: selecting the best candidate nodes from the candidate pool and uploading same to the central server, including weighted aggregation and greedy selection:
Step 2 includes:
In step 2, performing the greedy selection on the uploaded best candidate nodes by means of the central server includes:
Step 3 includes:
H
L
k
=[g((wL*)Tx1k+bL*),g((wL*)Tx2k+bL*), . . . ,g((wL*)TxNk+bL*)]T
βLk=(HLk)+Tk
Beneficial effects: compared with the prior art, the present disclosure has the advantages that the method uses a dynamically configured federated learning way to train the models, and establishes an industrial process product quality soft sensor model with optimal parameters and universal approximation property in a form of a construction method, which requires no complicated retraining process, and can ensure the accuracy of the models. Therefore, the present disclosure has good compactness and generalization performance.
The present disclosure will be further described in detail below with reference to the accompanying drawings and specific implementations, and the above-mentioned and/or other advantages of the present disclosure will become clearer.
The present disclosure provides an industrial process soft sensor method based on a federated stochastic configuration network. A fitting model structure used in the present disclosure is shown in
Step 1. Each factory selects 100 groups of historical data measured in a traditional hematite grinding process from a historical local database of grinding process. That is, each group contains five auxiliary process variable data, namely, ball mill current c1 and spiral classifier current c2, mill feeding capacity c3, mill inlet feed water flow c4 and classifier overflow concentration c5, and xik={ci1k, ci2k, . . . , ci5k} is used to represent input data after homogenization of the kth client and its corresponding product quality data, i.e., a grinding particle size value ti. ci5k represents the c5 auxiliary process variable data of the ith sample of the kth client. There are 10 factories participating in the training, with a total of 1000 groups of historical data, including 800 groups as training sets and 200 groups as test sets. Input samples are Xk={x1k, x2k, . . . xik, . . . x80k}, where xik={ci1k, ci2k, . . . , ci5k}∈5, and output samples are Tk={t1k, t2k, . . . , tik, . . . , t80k}.
Parameters required for federated stochastic configuration network soft sensor model learning are initialized. A maximum number of hidden layer nodes Lmax is equal to 100, a maximum number of configuration times Tmax is equal to 20, an expected tolerance ε is equal to 0.05, a hidden layer parameter random configuration range Y is {1:1:10}, a learning parameter r is equal to 0.99, an initial residual e0 is equal to T, and a Sigmoid (S-curve) function g(x)=1/(1+exp(−x)) is selected as an activation function.
During the construction of the federated stochastic configuration network, when the kth client adds the Lth node:
and
The candidate nodes are substituted into ξLk=Σq=1mξL,qk, respectively to obtain (ξL1)k, (ξL2)k, . . . (ξLj)k, j≤20, where ξL,qk represents a node supervision value of the nth random configuration in the Lth iteration in the client k; (ξL1)k, (ξL2)k, . . . (ξLj)k are a plurality of newly added candidate nodes that satisfy the supervision mechanism for constructing a single-node pool of candidate hidden layers;
A group of hidden layer parameters corresponding to the maximum value ξLk is found out from the node pool, which are the best hidden layer parameters (wLk) and (bLk)* that satisfy the supervision mechanism.
The best candidate nodes are uploaded to a central server, where different parameters are loaded based on different algorithms, and the different algorithms include weighted aggregation and greedy selection:
Step 2 includes:
Performing, by the central server, the greedy selection on the uploaded best nodes includes:
Step 3 includes:
After obtaining the global parameters wL* and bL*, newly added hidden layer outputs and the output weights βLk are calculated by each client and βLk is uploaded to the server for the weighted aggregation, including:
H
L
k
=[g((wL*)Tx1k+bL*),g((wL*)Tx2k+bL*), . . . ,g((wL*)TxNk+bL*)]T
βLk=[β1k,β2k, . . . ,βLk]T:=(HLk)+Tk
At this time, the residual of the stochastic configuration network for each client is: eLk=HLkβL−Tk.
Step 4. When the number of the hidden layer nodes of the federated stochastic configuration network exceeds 100 or a residual in a current iteration meets an expected tolerance of 0.05, no new nodes are added, and the modeling is completed. Otherwise, the flow returns to step 1 to continue to construct the network until preset requirements are met. Each client downloads a soft sensor model of grinding particle sizes based on the federated stochastic configuration network. Each client collects local data online and input same into the global soft sensor model.
Step 5. Each client collects the ball mill current c1, the spiral classifier current c2, the mill feeding capacity c3, the mill inlet feed water flow c4 and the classifier overflow concentration c5 in an online way, inputs same into the constructed soft sensor model of grinding particle sizes, and performs online estimation on the grinding particle sizes, i.e., {tilde over (x)}k=Hkβk, where {tilde over (x)}k is the product quality data estimated online by the client k.
The present disclosure provides the industrial process soft sensor method based on a federated stochastic configuration network. There are many methods and ways to implement the technical solution. The above description is only an exemplary implementation of the present disclosure. It should be pointed out that for a person of ordinary skill in the art, several improvements and modifications can be made without departing from the principles of the present disclosure, and these improvements and modifications should also be regarded as falling within the protection scope of the present disclosure. All the components not specified in this embodiment can be implemented by using existing technologies.
Number | Date | Country | Kind |
---|---|---|---|
202111054478.9 | Sep 2021 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2022/100744 | 6/23/2022 | WO |