The present invention belongs to the technical field of information, relates to the technologies of data-driven modeling, multi-objective optimization and edge-cloud cooperation, and is a distributed industrial energy operation optimization platform which is capable of automatically constructing intelligent models and algorithms. The platform is divided into three parts: a modeling terminal, a background service and a human-computer interface. The models like data pre-processing, energy generation-consumption-storage trend forecasting and optimal scheduling decision models are encapsulated in the modeling terminal as different visualization modules facing with multiple categories production scenarios, by dragging which the complex functional models can be realized conveniently. The background service is capable of automatically constructing the training samples and the production plans/manufacturing signals series according to the device model requirements of each edge side, interacts with the trained intelligent models through corresponding interfaces, and the computing results are saved in the specified relational database. The computing results are displayed through a friendly customer human-computer interface, and the real-time state of current working condition can also be adjusted, then provide feedback to the cloud server, which can quickly trigger the parameter updating of each edge side model to make the model quickly adapt to the change of the working condition, so as to realize the edge-cloud cooperation of the platform. The platform of the present invention has the advantages of conveniently operation, short modeling cycle and high computing efficiency, and can be widely applied to energy forecasting and optimal scheduling of the integrated energy system of an industrial park.
As the world energy crisis and the environmental pollution become more and more serious, the improvement of comprehensive energy utilization efficiency has become the focus of all countries in the world. The research on the integrated energy system containing various energy chains such as cold, heat, power and gas has gradually become one of the hot spots in the energy field (Sun Hongbin, Pan Zhaoguang, Guo Qinglai (2016). Research on Energy Management for Multi-Energy Flow: Challenges and Prospects [J]. Automation of Electric Power System, 40(15): 1-8). As an important part of social production and energy consumption, industrial parks have the advantages of complementary industrial processes and complementary energy supply and demand and can promote the formation of emerging industrial bases and boost local economic and social development, and thus gradually become important gathering places for domestic science and technology development. However, traditional industrial parks focus on production rather than energy, resulting in loose connection and independent control of various energy media, low comprehensive operational energy efficiency, high cost and great influence on the environment. On the premise of complete energy devices, accurate modeling, trend forecasting and optimal scheduling for energy generation-consumption-storage become an important means of energy conservation and emission reduction as well as cost reduction and efficiency increase (J. Zhao, C. Y. Sheng, W. Wang, W. Pedrycz, Q. L. Liu. (2017). Data-based predictive optimization for byproduct gas system in steel industry [J]. IEEE Transactions on Automation Science and Engineering, 14(4): 1761-1770).
The pipe network of the industrial energy system is widely distributed and complicated, and it is increasingly difficult to establish a model based on physical and chemical mechanisms, and the accuracy cannot be guaranteed with the aging of devices. With the rapid development of industry informatization, the energy control centers in the parks have accumulated massive operation data of production and energy, providing strong support for data-driven methods. In recent years, data-driven methods have been provided for modeling industrial energy systems, such as gas system (T. Y. Wang, Z. Y. Han, J. Zhao, W. Wang. (2018). Adaptive granulation-based prediction for energy system of steel industry [J]. IEEE Transactions on Cybernetics, 48(1): 127-138), steam system (S. Shamshirband, D. Petkovie, R. Enayatifar, A. H. Abdullah, D. Markovic, M. Lee, R. Ahmad. (2015). Heat load prediction in district heating systems with adaptive neuro-fuzzy method [J]. Renewable and Sustainable Energy Reviews, 48: 760-767) and power system (C. Dinesh, S. Makonin, I. V. Bajić. (2019). Residential Power Forecasting Using Load Identification and Graph Spectral Clustering [J]. IEEE Transactions on Circuits and Systems II: Express Briefs, 66(11): 1900-1904). In view of the feature that different energy response time scales are different, short-term energy modeling (M. Fliess, C. Join, C. Voyant. (2018). Prediction bands for solar energy: New short-term time series forecasting techniques [J]. Solar Energy, 166: 519-528) and long-term energy modeling (J. Zhao, Z. Y. Han, W. Pedrycz, W. Wang. (2016). Granular model of long-term prediction for energy system in steel industry [J]. IEEE Transactions on Cybernetics, 46(2): 388-400) are provided. However, the above research is mostly for modeling of single-energy systems without considering mutual coupling under the condition of multiple energy chains. For the optimal scheduling, researchers have studied single-energy systems such as internal by-product gas system (Li Hongjuan, Xiong Wenzhen. Forecasting and optimal scheduling of byproduct gas in steel industry, [J]. Steel, 2016, 51(8): 90-98), thermal system (Zhou Dan, Sun Ke, Zheng Chaoyang, Chen Xixiang, Zheng Weimin. (2020). Research on stochastic optimal scheduling model for electric-thermal integrated system considering thermal storage characteristics of heating system [J]. Renewable Energy Sources, 38(3): 380-387) and power system (Xu Qingshan, Ding Yifan, Zheng Aixia, (2018). Optimal scheduling model for power grid security considering demand response [J]. Control and Decision, 33(3): 549-556). In addition, the scheduling method for coupling of multiple gas media (F. Jin, J. Zhao, Z. Y. Han, W. Wang. (2018). A joint scheduling method for multiple byproduct gases in steel industry [J]. Control Engineering Practice, 80: 174-184) and the long-term scheduling method with discrete production characteristic processes (F. Jin, L. Q. Wang, J. Zhao, W. Wang, Q. L. Liu. (2020). Granular-causality-based byproduct energy scheduling for energy-intensive enterprise [J]. IEEE Transactions on Automation Science and Engineering, in press) are also studied correspondingly. The above methods provide a solution for the operation optimization of energy systems, but the global dynamic optimization of energy in the whole parks has not been realized.
At present, for the modeling of each energy system, researchers have developed corresponding software platforms, such as RT-LAB for power flow analysis (Tan Zhukui, Xu Yutao, Ban Guobang, Lv Qiansu, Yuan Xufeng, Xie Baiming, Cao Mingjie. (2019). Research on testing technology of control device based on RTLAB for flexible distribution network [J]. Power Big Data, 22(7): 1-8), APROS for dynamic simulation of heat supply network (R. Starkloff, F. Alobaid, K. Kamer, B. Epple, M. Schmitz, F. Boehm. (2015). Development and validation of a dynamic simulation model for a large coal-fired power plant [J]. Applied Thermal Engineering, 91: 496-506), TRNSYS (Qiu Liuliang. (2017). Operation simulation and optimal analysis of distributed energy system based on TRNSYS [D]. Shanghai University of Electric Power.) and Simulink for dynamic simulation of gas network (N. Voropai, E. Ukolova, D. Gerasimov, K. Suslov, P. Lombardi, P. Komarnicki. (2019). Simulation approach to integrated energy systems study based on energy hub concept [C]. IEEE Powertech Conference). However, the above platforms only provide the real-time simulation function of the system, and cannot judge the future operation trend of the energy systems or give reasonable optimal scheduling suggestions.
In order to improve the modeling efficiency and accuracy of the industrial integrated energy system and provide real-time, accurate and optimal guidance and suggestions for energy control personnel, the present invention proposes a distributed industrial energy operation optimization platform which is capable of automatically constructing intelligent models and algorithms. The platform comprises two parts: a software system and a hardware controller which are respectively deployed on the enterprise cloud and the edge side of a device, with the distributed operation architecture shown in
Starting from business scenarios, the modeling terminal is the core part of the whole platform, and highly integrates complex mathematical models in the form of visualization modules. In consideration of obvious difference in features of heterogeneous energy flow in the industrial production process, a general intelligent module for multiple production scenarios (e.g. steel making, iron making, etc.), multiple response scales (e.g. short term, medium term, etc.), multiple display modes (e.g., numerical, interval, etc.) and multiple working conditions switching (such as wind break, wind reduction, etc.) is provided. Complex models can be built and trained by logic splicing based on services by means of drag and drop. Meanwhile, an adaptive algorithm is embedded in each module to dynamically optimize parameters, thus ensuring the adaptive and generalization abilities of models. Then, the built models can be saved as separate “*.iail” files for repeated invocation. The modeling terminal is divided into a data pre-processing module, an energy generation-consumption-storage trend forecasting module, an optimal scheduling decision module, a parameter optimization configuration module and a result display module, and each module comprises a plurality of models for different working condition scenarios, wherein
The modeling terminal supports the following two interactive modes, as shown in
With regard to the two interactive modes of on-line model computing and off-line model computing, the former can better obtain patterns or features in new data because of running in the real-time on-line environment; and for the latter, the model is trained off-line and has good timeliness, but needs periodic updating. In addition, the platform has better compatibility and can run in major operating systems such as Windows7/10 and Linux. The platform supports customized participation and realizes the intervention in model training and computing processes by means of parameter setting. A model file is developed in Python and subjected to interface encapsulation through C++, providing an invocation interface for commonly used programming languages such as C# and Java.
The background service is respectively deployed on the cloud and the edge side. The edge side service performs real-time computing by reading and serializing information such as industrial operation data and production plans/signals, then building training samples according to the standardized format and invoking the saved “*.iail” model file interface, and saves the results in the relational database. The cloud service receives the foreground trigger information and the working conditions switching signals obtained by analysis of the edge side, so as to trigger the edge side model for self-updating to realize the adaptive adjustment of parameters of the model.
The human-computer interface realizes the friendly interface display of the computed result of the model. The interface is refreshed every minute to update the computed result in time. In the case of temporary adjustment of working conditions, the information is fed back to the cloud through human-machine interaction, and the cloud service will trigger the adaptive optimization of the parameters of the edge side model, so as to better meet the actual demand.
The technical solution of the present invention is as follows:
A distributed industrial energy resource operation optimization platform automatically building intelligent models and algorithms, comprises the following steps:
(1) A platform-based modeling terminal builds the required filtering, filling, forecasting and scheduling models by means of visual drag and drop, and saves the models as intelligent model files;
(2) A background service is deployed on the cloud and each edge side. The edge side service reads actual industrial data, builds training samples according to the standardized format, invokes a model file interface, performs real-time computing based on auxiliary information such as real-time data and production plans/signals, and saves the results in the relational database; and the cloud service receives the foreground trigger information and the working conditions switching signals obtained by analysis of the edge side, so as to trigger the edge side model for self-updating;
(3) A human-computer interface is deployed on each edge side, realizing the friendly interface display of the computed result of the model. In the case of temporary change of working conditions, the information can be fed back to the cloud service through human-machine interaction to trigger the self-updating of the parameters of the model.
The present invention has the following beneficial effects:
The present invention establishes a distributed industrial energy resource operation optimization platform automatically building intelligent models and algorithms, takes the intelligent model established by the modeling terminal as the basis for analysis and computing of the background service and the human-computer interface, and realizes high efficiency, precision, flexibility and safety of industrial systems by means of edge-cloud cooperation. It is verified through operation in actual industrial fields that the platform can greatly improve the development efficiency and accuracy of complex business logic models, reduce the development cost, and provide reliable support for the optimal scheduling of the integrated energy system.
To better understand the technical route and implementation solution of the present invention, with data pre-processing, generation-consumption-storage trend forecasting and multi-energy flow optimal scheduling processes of an energy system of a large domestic steel enterprise as an example, the algorithm of the modeling terminal and the hardware design of an embedded industrial controller of the background service on the edge side of the platform are described in detail. The energy system comprises multiple energy flows such as steam, gas and power, as shown in
(1) Modeling terminal: data pre-processing, energy generation-consumption-storage trend forecasting and multi-energy flow optimal scheduling model
The historical data of the capacity, the total generated flow and the total consumption flow of the gas system are subjected to filtering, missing value fill and outlier correction by a data pre-processing module, and trained as the input of a forecasting model. The parameters are updated adaptively, so as to obtain a forecasting model with the corresponding capacity. Furthermore, with gas capacity safety, steam demand and adjustable user load capacity as constraint conditions and with economy as the target, an optimal scheduling model of an integrated energy system can be built by logic splicing of the modules, and a reasonable scheduling proposal can be given. The following is the sub-working condition processing algorithm of each module required to build the optimal scheduling model of an integrated energy system.
1) Data Pre-Processing
Filtering
Considering that process data such as energy and production actually acquired in industry generally contain noise, it is necessary to filter the data first to reduce the influence of noise on data forecasting and other results. For real-time data (such as square wave data and data containing white noise) of different working condition scenarios, different intelligent methods are integrated into a filtering module to achieve targeted filtering for different feature data and improve the filtering reliability.
For working condition scenarios with square wave features in the production process, the feature data are obvious. The BFG consumption flow of a blast furnace hot blast stove shown in
wherein
For working condition scenarios with continuous production features, the data generally contain white noise. The present invention adopts an empirical mode decomposition (EMD) method to process the acquired data. In the method, selective reconstruction is carried out on the intrinsic mode function (IMF) to realize denoising and provide support for improving forecasting accuracy. The calculation steps are as follows:
Step 1: calculating all extreme points of a signal series. Respectively connecting local maximum points and minimum points through cubic spline curves to form an upper envelope and a lower envelope, and calculating a new signal h11(t) with low frequency signals removed, as shown in formula (2):
h11(t)=xfc(t)−
wherein
Step 2: supposing the (nfc)th IMF component IMFn
wherein rn
The data of oxygen flow of a blast furnace of a steel enterprise are adopted to conduct a contrast experiment.
Outlier Detection
Due to long transmission distance, complex pipe network and violent fluctuation in generation and consumption of the industrial energy system, data exceptions easily occur under the influence of environment and sensor in the data acquisition process. The present invention builds corresponding outlier detection models respectively for working condition scenarios with class period and aperiodic characteristics of the data.
Typical working conditions with aperiodic characteristics of production data (such as input flow of first-stage COG) are shown in
wherein wƒ
reflects the internal relation between sample data points. ui
The steps of outlier detection based on AFCM are as follows:
Step 1: performing AFCM clustering on a time series to be detected to obtain cod cluster centers which are defined as v1od, . . . , vc
Step 2: calculating the minimum distance between each data point in the time series and each cluster center;
Step 3: determining the threshold of an abnormal distance through a box figure. The distance value which is more than the threshold will be determined as an outlier. First, arranging the data from small to large, and respectively calculating an upper quartile Q3od and a lower quartile Q1od; then calculating an interquartile range IQRod=Q3od−Q1od; and finally, calculating an outlier cut-off point Qod=Q3od+1.5*IQRod.
The accuracy of the method is judged by a misjudgment ratio ηw
wherein nw
With the input flow of first-stage COG as an example, 30,000 samples are selected, the outlier detection results of the local outlier factor (LOF) and the K-Means method are shown in
The energy generation/consumption data of some production processes (such as blast furnace iron making process) are similar in form but not strictly identical in duration within each period. Such working conditions can be considered to have class period characteristics, and typical data are shown in
Step 1: dividing historical data samples into nod sample series [x1od, . . . , xn
Step 2: calculating the sum SumDi
Step 3: with the series Xod of a sample to be detected as the reference, scaling the center sample Yod based on DTW to obtain Y*od, wherein the scaling process meets formula (12):
wherein kod represents the number of points corresponding to the series Yod and the series Xod in the DTW process, and the symbol xi
Step 4: performing AFCM clustering on the scaled series Y*od to obtain a cluster center v*od, and calculating the minimum distance between each point in the sample to be detected and all cluster centers;
Step 5: obtaining a threshold for determining an abnormal distance by calculation through a box figure. The distance value which is more than the threshold will be determined as an outlier.
With the BFG input flow of No. 1 blast furnace as an example, 3,000 samples are selected, and the effect of outlier detection is shown in
Missing Value Fill
Due to bad measurement environment and serious noise interference in the industrial production field, acquisition points are easily missing at some moment due to temporary failure of sensors, which affects the subsequent analysis and computing of the model. Generally, the missing points may be discrete or continuous. In the process of filling missing points, in order to maximize the filling effect and minimize the error, it is necessary to adopt different methods to fill data respectively. In view of the above situation, the present invention proposes a filling method for multiple scenarios (discrete single-point missing and continuous missing). In the method, the basic features are analyzed based on the historical sample data and divided into different working condition scenarios, and different filling methods are respectively proposed for discrete single-point missing and continuous multi-point missing.
wherein yi
Under the working condition where the standard deviation is more than ε, data are filled by means of cubic spline interpolation. The index value of non-missing points in the sample series is {x1tb, . . . , xn
wherein jtb=1, 2, . . . , ntb−1. Sjtb(xtb) is a third expression on [xjtb, xj+1tb], ajtb, bjtb, cjtb and djtb are respectively the corresponding power coefficients of the expression, and Sjtb′(xj+1tb) and Sjtb′(xj+1tb) are respectively the first derivative and the second derivative of the expression Sjtb(xtb). The constraint conditions specify interpolation conditions as well as continuity at non-endpoints and continuity of derivatives. Then, the value at the missing moment can be calculated based on the fitted curve, so as to realize filling under such working conditions.
For continuous missing in data, an autoregressive integrated moving average model (ARIMA Model) is adopted to fill segments.
wherein Δd
AIC=2ktb−2 ln(Ltb) (16)
BIC=ktb ln(ntb)−2 ln(Ltb) (17)
wherein kth is the number of parameters of the model, nth is the number of samples, Ltb is a likelihood Function, and ln(⋅) represents a natural logarithm function. According to formula (16) and formula (17), a (ptb, qtb) combination minimizing AIC or BIC is searched within the range of [0, 5], i.e., the optimum parameter of the model where ptb and qtb respectively represent the two parameters of the autoregressive moving average model adopted in the invention, ptb is the number of autoregressive terms, and qtb is the number of moving average terms. The complete data segments before and after the missing segment are respectively taken as training samples, the data of the missing segment are forecast based on the model, and the average value of the two forecast results is taken as the fill value. In this way, the sample information can be fully utilized so as to improve the filling accuracy.
To sum up, the filling method comprises the following specific steps:
Step 1: judging the information features of time series data to be filled, comprising the miss rate of data, the missing situations of missing points, the periodicity of time series and the standard deviation of time series. If all the missing points in the time series are discrete missing, executing step 2; if all the missing points in the time series are continuous missing, executing steps 4-5; and if discrete missing points and continuous missing points both exist in the time series, executing step 2 first, and then executing steps 4-5 repeatedly for each continuous missing data segment.
Step 2: if the standard deviation of the series is less than ε, filling discrete points by means of moving average. Otherwise, executing step 3.
Step 3: carrying out filling by means of cubic spline interpolation.
Step 4: filling data based on the ARIMA model. Selecting complete series before and after the missing data segment as samples to forecast the data of the missing segment respectively, so as to calculate the average value of the two forecast results as the fill value.
Step 5: establishing an ARIMA model to forecast the missing data segment, and taking the average value of the two forecast results as the fill value.
Adopting the mean absolute error (MAE), the mean absolute percentage error (MAPE) and the root mean square error (RMSE) to measure the filling effect, wherein the three indexes are respectively shown in formula (18) to formula (20):
wherein Ttb is the calculation length, ytb(ttb) is a forecast value, and ydtb(ttb) is a measured value.
With the data of BFG use flow for cold rolling as an example, selecting the threshold ε=20 of the standard deviation according to the process experience. Adopting adjacent, spline interpolation (SI) and moving average (MA) respectively as comparison methods.
2) Energy Trend Forecast
Energy Generation-Consumption Forecasting for Class Period Feature Working Conditions
In the production process of the steel industry, because some processes (such as iron making and coking) have relatively mature production flows and fixed processes, energy generation/consumption has class period features, such as BFG generation flow and BFG consumption flow of a hot blast stove. In view of energy forecasting problems under such working conditions, the present invention proposes a combination forecasting model based on the echo state network (ESN) and the least squares support vector machine (LSSVM). The model can be respectively used for dynamic modeling and forecasting for data quality at different moments, and with the training accuracy as the evaluation index, the model with higher accuracy is taken as the final forecast result.
ESN is a novel recursive neural network structure, and has dynamic characteristics of nonlinear systems because of a large dynamical reservoir (DR) inside. The reservoir comprises a large number of randomly generated and sparsely connected neurons, contains the operating state of the system and has the short-term memory function, thus presenting better performance in time series forecasting. Network updating is shown in formula (21) and formula (22):
xss(kss+1)=ƒss(Winuss(kss+1)+Wxss(kss)+Wbackyss(kss)) (21)
yss(kss+1)=ƒout(Wout(uss(kss+1),xss(kss+1),yss(kss))) (22)
wherein ƒss(⋅) is a DR internal activation function, which is generally a Sigmoid function tanh( ), xss(kss) and uss(kss) are respectively the state variable of DR and the input vector of the model at the moment kss, and yss(kss) is the output of the model. Win is an input weight matrix. W is an internal neuron connection matrix of DR. To enable DR to have the dynamic memory ability, a sparse connectivity of 1%-5% is generally maintained, and the spectral radius is less than 1. Wback is a feedback matrix of output neurons and DR. ƒout(⋅) is an input and output unit activation function, which is generally a linear function. W′ is an output weight matrix.
Different from ESN, an LSSVM regressive model considers a set{xt
yss=wssφss(xss)+bss (23)
wherein φss(⋅) is a non-linear mapping function, and wss and bss are respectively a weight and a bias. The solution of the LSSVM model can be summarized as the optimization problem with constraints, as shown in formula (24):
wherein γss is a penalty coefficient, Nss is the total number of samples, and et
wherein Kss(xp
The Particle Swarm Optimization (PSO) algorithm is an evolutionary computing method based on bionics, which is highly favored due to advantages of high computing speed and the ability to jump from local optimal solutions and commonly used to optimize algorithm parameters. Each particle has two features of velocity and position, of which the iterative formulae are shown in formula (27) and formula (28) respectively:
vt
xt
wherein xt
The energy generation-consumption forecasting method for class period feature working conditions comprises the following specific steps:
Step 1: dividing historical sample data into a training set and a validation set, wherein the training set is used for training the model, and the validation set is used for validating the model accuracy;
Step 2: for the ESN method, optimizing the embedded dimensions of samples and the number of nodes in the reservoir by the PSO algorithm;
Step 3: for the LSSVM method, optimizing the embedded dimensions of samples and the intrinsic parameters (penalty coefficient and kernel parameter) of the algorithm by the PSO algorithm;
Step 4: evaluating the two methods based on the training accuracy, and taking the computed result of the method with higher accuracy as the forecast result of the model.
With the forecasting of the BFG generation flow of No. 1 blast furnace of a steel enterprise as an example, the accuracy of the forecasting method of the present invention is compared with that of the existing forecasting methods, and the forecast results are evaluated by MAPE and RMSE.
The above model considers the case where the historical data are complete. In the actual production process, it is difficult to avoid missing of data points as mentioned above. Although the filling method can ensure the integrity of samples, the reliability cannot be evaluated because the missing data points cannot be acquired. Aiming at this situation, the present invention proposes a method for direct modeling without filling missing points. The idea of ensemble learning is adopted to train the models based on the relevance vector machine (RVM) and Gaussian processes (GP) to give forecast results. In this way, the time of selecting the optimal filling method can be saved, and the generalization and robustness of the forecasting model under different working conditions can be improved.
Embodiments of the present invention are further described below. Due to low sensitivity of devices or changes in working conditions, time series data acquired in industrial fields generally have the characteristics of missing points and high noise. Therefore, such incomplete data cannot be directly used in time series forecasting.
Step 1: initial filling and data set building based on phase space reconstruction. Extracting time series data of a period of time from the real-time relational database of an industrial field. Detecting and recording positions of missing points in the time series, and filling all missing points with zero values. The function of phase space reconstruction is to build a training sample set. Supposing a discrete time series {x1sl, x2sl, . . . , xN
xj
wherein psl is an embedded dimension of the phase vector, and the matrix form of Nsl input samples is Xsl=[x1sl, x2sl, . . . , xN
Step 2: building of RVM model based on incomplete data set. Building an initial RVM model, i.e., building a functional relationship between input and output variables. Supposing the mapping relationship between the (isl)th function value yi
wherein ϕi
wherein α=[α0, α1 . . . , αN]T, and A=diag(α). Parameters α and β are considered as random variables which obey Gamma distribution. In the present invention, a noise variance vector is written as β=[β[m], β[m], . . . , β[m], β[obs], . . . , β[obs], β[obs]]T, and β[obs]=σ−2. In addition, β also can be written as β=[β[m]T, β[obs]T]T in the form of diagonal matrices B[m]=diag(β[m]), B[obs]=diag(β[obz]) and B=diag(β). A training output set is composed of missing points observable points, wherein xsl=[(s[m]sl)T, (x[obs]sl)T]T, s[m]sl and s[obs]sl respectively represent the vectors of missing points and observable points in the output set, the number of output missing points and that of output observable points are respectively Nslymiss and Nslyobs, and Nsl=Nslymiss+Nslyobs. In the present invention, the element β[m] in β[m] is defined as 108 to ensure that the noise variance of the corresponding missing output point is not affected. The likelihood function is shown in formula (32):
wherein Φd=[ϕsl1, ϕsl2, . . . , ϕslN
wherein ix
wherein a covariance matrix {tilde over (C)}=Φx[obs]A−1Φx
wherein isl=1, 2, . . . , Nsl, {tilde over (γ)}i=1−αi{tilde over (Σ)}ii, {tilde over (Σ)}=(A+β[obs]Φx
{circle around (1)} Initializing: α=[1/(Nsl)2, 1/(Nsl)2, . . . , 1/(Nsl)2]T, and β[obs] is initialized as the variance of 10 times xsl[obs]. The related vector set is initialized as a set of all input samples;
{circle around (2)} Updating mean values and covariance matrices of joint posterior probabilities of wsl and xsl[m];
{circle around (3)} Updating α and β[obs];
{circle around (4)} If αj
{circle around (5)} Calculating the mean posterior probability of each element in xsl[m] filling in the corresponding position of the missing input point, and further updating the kernel matrix Φ;
{circle around (6)} Calculating the value of the log edge likelihood function;
{circle around (7)} If the change of an element in a relative to the last iteration or the change of the value of the edge likelihood function is less than the given threshold, stopping the iterative process, thus obtaining a final forecasting model; otherwise, executing step {circle around (2)}.
Step 3: obtaining the forecast value of a future period of time based on RVM single-step iterative forecasting. When a new input xsl* (the input vector contains no missing point) is given, the value of β[obs] can be replaced with β[obs]. The forecast distribution of the corresponding output xsl* is approximately normal distribution, as shown in formula (37):
p(tsl*|tsl[obs])≈∫p(tsl*|xsl*,β*[obs])p(w|tsl[obs],α,β*[obs])dw=N(tsl*|μ*,σ*2) (37)
wherein μ*=ϕ*Tμ[w|obs], σ*2=(β*[obs])−1+ϕ*T{tilde over (Σ)}ϕ* and ϕ*=[1, K(xsl*, xsl1), K(xsl*, xsl2), . . . , K(xsl*, xslN
Step 4: building of LSSVM data set based on incomplete data set. After the parameters α and b are solved, using xsl*=αK(xsl, xsl*)+b to calculate an estimated value of the forecast quantity. Therefore, inputting data containing missing points filled by moving average into the LSSVM model for training, then forecasting the data containing missing points, replacing the original values of missing points with the forecast result, and repeating the above process until the change of the values of missing points is less than the set threshold, thus completing the data updating process.
Step 5: obtaining the forecast value of a future period of time based on GP forecasting. After updating the missing points by LSSVM, inputting the updated data into the GP model, wherein GP is a probability model with kernel. Supposing that the weight wsl and the noise εsl in formula (38) obey zero-mean prior distribution:
p(wsl)=N(wsl|0,α−1I)
p(εsl)=N(εsl|0,β−1) (38)
wherein α and β are hyperparameters of the weight and the noise distribution, and the output likelihood function can be expressed as formula (39):
p(xsl|Φ,wsl,β)=N(xsl|Φwsl,β−1I) (39)
According to Bayes formula, deducing the posterior distribution of the weight as shown in formula (40):
wherein mN=βΣΦTxsl and Σ=(αI+βΦTΦ)−1. Then selecting the Gaussian kernel function and maximizing the likelihood function, thus obtaining optimal hyperparameters.
Step 6: algorithm selection. Calculating the MAPE of the validation sets obtained by the RVM algorithm and the GP algorithm respectively, selecting the optimal algorithm, and saving and outputting the hyperparameter values and the information of the kernel function optimized by the corresponding algorithm.
Step 7: forecasting. After real data are acquired, using the optimal algorithm and the parameters thereof for forecasting.
Energy Generation-Consumption Forecasting for Aperiodic Feature Working Conditions
Another common working condition of production is energy consumption entirely based on the production process, but has no obvious periodic feature (such as cold rolling and hot rolling). In view of energy forecasting problems under such aperiodic feature working conditions, the present invention builds a bidirectional multi-layer long short-term memory forecasting model based on adaptive variational modal decomposition for noise reduction. First, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) is adopted to decompose energy series, so as to calculate the initial parameters of variational mode decomposition (VMD), which can reduce the noise of the original data, smooth the data and reduce the aperiodicity and uncertainty of the data. Then, the Bayesian optimization method is used to obtain the optimal VMD decomposition parameter M, and the decomposition parameters of the VMD method are determined according to the decomposition results, which can search better parameters within a fixed number of iterations and shorten the time of parameter optimization. The VMD method is used to reduce the noise of the series, then the data are divided into a training set and a validation set, and input/output samples are built. Furthermore, bidirectional multi-layer long short-term memory (LSTM) is built for training, which is used to capture deep features in the series and has better generalization effect than single LSTM. Meanwhile, the measure of early stopping is taken for parameter optimization of the forecasting model, the direction of grid search is from a simple model to a complex model, and search is stopped when the error range is less than a certain degree, so as to shorten the time of parameter optimization. With the structure of decomposition for noise reduction and complex network, the present invention can avoid, with a higher probability, the hysteresis phenomenon of traditional models in forecasting of aperiodic data, and has better generalization ability.
The method comprises the following specific implementation steps:
Step 1: preparing data.
Extracting time series data of BFG consumption flow for cold rolling of a period of time from the real-time relational database of an industrial field.
Step 2: acquiring VMD parameters optimized by the Bayesian optimization method and decomposing and reorganizing the data.
Different from EMD and derived methods thereof, the VMD method is improved based on Hibert transform and Wiener filtering. The variational mode decomposition method is a quasi-orthogonal completely non-recursive decomposition method, which applies a variational problem solving process to the field of signal decomposition, transforms recursive decomposition of the EMD method into non-recursive variational decomposition, and highlights the local features of signals by means of iteratively solving the center frequency and bandwidth of each mode function. VMD has better denoising effect, avoids the phenomenon of spectrum aliasing, and can distinguish series with similar frequencies accurately.
For the original series ƒns(t), the problem of constraint variation is shown in formula (41):
wherein Kns is the number of modes, uk
In order to solve the mode uk
wherein {circumflex over (ƒ)}ns(ωns), ui
Then the above updating process is repeated until the convergence condition of formula (46) is satisfied:
wherein εns is a threshold satisfying the convergence condition. The Bayesian parameter optimization process generally comprises two parts: First, the previous function must be selected to represent the hypothesis about the optimized function. With greater flexibility and tractability, Gauss processes are generally selected as a probability model for Bayesian parameter optimization. Second, Bayesian optimization needs an acquisition parameter for building an acquisition function from model posteriori to determine the next parameter combination to be estimated.
Supposing that a set of hyperparameters is Xns=xns1, xns2, . . . , xnsn
ƒobjns(Kd
wherein eoriginns, represents the error between the series composed of first Kk
Step 3: training a bidirectional multi-layer LSTM model. The parameters of the bidirectional multi-layer LSTM model are mainly as follows: the input dimension, the number of LSTM layers, the number of hidden nodes of each LSTM layer, the learning rate and the number of samples of a training batch. The parameter optimization method is grid search with early stopping, wherein the input dimension is optimized first, then the number of LSTM layers and the number of hidden nodes of each LSTM layer are optimized, and finally, the learning rate and the number of samples of a training batch are optimized. The specific steps are as follows:
{circle around (1)} Determining the optimization range and granularity of the parameters, the optimization direction is decreasing model complexity, i.e., the number of LSTM layers is decreased, the number of hidden nodes of each LSTM layer is decreased, and the objective function is the root-mean-square error;
{circle around (2)} Recording errors between searches during grid search, and stopping search when the error is less than the threshold ò;
{circle around (3)} When optimizing the number of LSTM layers and the number of hidden nodes of each LSTM layer, setting the learning rate and the number of samples of a training batch to the maximum values to shorten the search time, and then using the optimized parameters to train the bidirectional multi-layer LSTM model, thus obtaining a forecasting model.
Step 4: carrying out forecasting based on the latest data acquired from the database.
As the VMD method requires a certain amount of data to ensure the stability of decomposition results, the present invention takes the latest 1,000 sample points, performs VMD decomposition and reorganization to reduce noise first, and then takes the first input dimension sample points as the input to forecast points at the next moment.
With the data forecasting of BFG consumption flow for cold rolling as an example,
For the aperiodic feature time series containing missing points, the present invention first fills the missing data by cubic spline interpolation, as shown in formula (15) and formula (16), and then adopts a forecasting method for complete aperiodic feature data to forecast such data.
With the data of BFG consumption flow for hot rolling as an example,
Forecasting of Gas Reserves for Factor Model
Since the capacity of a gas holder is taken as an important sign of balance of a gas system, the forecasting accuracy thereof plays an extremely important role. To solve the short-term forecasting problem, the present invention proposes a forecasting method considering the capacity fluctuation factor. First, the experimental data acquired in an industrial field are divided into a training set and a validation set, and a relevance vector machine model, a Gaussian process model and a least squares support vector machine model are established respectively; and an optimal model is selected through the error evaluation index for forecasting. The present invention adopts the idea of ensemble algorithm to build a factor model forecasting method integrating the RVM algorithm, the GP algorithm and the LSSVM algorithm, combines multiple machine learning algorithms, and compares validation errors of the models to finally obtain a stable model with good performance in all aspects, which greatly improves the forecasting accuracy and the algorithm stability.
The present invention comprises the following specific implementation steps:
Step 1: pre-processing data.
Extracting the input flow of 4 blast furnace gas pipe networks of a period of time and the consumption flow of 24 gas consumption links from the real-time relational database of an industrial field as input factors of the model, and adding the sum of the capacities at the previous moment to the input factors; and supposing that the sum of capacities at the current moment is the output value. Meanwhile, normalizing all the input data and the output data. Selecting the first 70% of samples as training samples and the remaining 30% as validation samples.
Step 2: building a relevance vector machine regressive model;
Step 3: building a Gaussian process regressive model;
Step 4: building an LSSVM model;
Step 5: selecting an optimal model, and forecasting the estimated value of a forecast point of a future period of time.
After the output estimated values of the validation sets of the RVM model, the GP model and the LSSVM model are respectively obtained by using the validation sets, the present invention compares the model performance by using the error evaluation index MAPE, so as to obtain the optimal model. The smaller the MAPE value is, the more accurate the forecast effect of the model is.
3) Scheduling of multiple by-product gas systems The balance of supply and demand of each system is analyzed based on the trend forecast result of the generation-consumption-storage unit of each gas system. When energy safety events occur, it is necessary to give a reasonable scheduling interval according to the safety constraints of the gas capacity, so as to ensure the safe operation of the by-product gas system. First, according to the capacity operation states of the blast furnace gas (BFG) system, the coke oven gas (COG) system and the linz-donawitz converter gas (LDG) system, the balance of each system is judged with system safety indexes as constraints to give scheduling suggestions (whether adjustment is needed and the corresponding adjustable range) of each single-medium system. Second, with the adjustable range of each single-gas medium and the load capacity of each adjustable user (such as power plant and boiler) as constraints, the comprehensive adjustable range of multiple gas media is calculated. Finally, the adjustable range for optimal scheduling is obtained by integrating the adjustable range of each pipe network and the adjustable range of multiple media. The calculation method for the adjustable range of each stage is as follows:
{circle around (1)} Solving the adjustable range of a single medium
The calculation process of the adjustable range of a single medium is shown in
{circle around (2)} Solving the adjustable range of multiple media
According to the principle that the total calorific value of gas used by a multi-media adjusting user is constant, the use range of each medium within the adjustable range under the multi-media adjusting user is calculated as shown in formula (48):
Total_cal=bfg_cal×x+cog_cal×y+ldg_cal×z (48)
wherein Total_cal is the gross calorific value of the multi-media adjusting user, and bfg_cal, cog_cal and ldg_cal are respectively the calorific value of each gas medium. Formula (49) can be obtained by the elimination method:
The value ranges [x1s′, x2s′], [y1s′, y2s′] and [z1s′, z2s′] of xs, ys and zs in the case of ys∈[y1s, y2s], zs∈[z1s, z2s] and xs∈[x1s, x2s] are solved, that is the adjustable range of each gas medium after multi-media adjustment is considered. Then, the final adjustable ranges [x1s″, x2s″], [y1s″, y2s″] and [z1s″, z2s″] for optimal scheduling are obtained through interval differencing, as shown in formulae (50)-(52) respectively:
[x1s″,x2s″]=[x1s,x2s]−[x1s′,x2s′] (50)
[y1s″,y2s″]=[y1s,y2s]−[y1s′,y2s′] (51)
[z1s″,z2s″]=[z1s,z2s]−[z1s′,z2s′] (52)
4) Collaborative Optimization and Scheduling of Comprehensive Energy
The combined heat and power (CHP) system of the steel enterprise is shown in
Considering that the adjustable range of byproduct gas (BFG and COG) and the total steam demand within a forecasting period t0-t1 have been determined based on the aforementioned forecasting model, the energy consumption of combined heat and power sets and boilers can be optimally distributed to minimize the energy cost (the sum of cost of self-produced energy (such as BFG, COG and steam) and cost of self-generating electricity (coal price)) and maximize the comprehensive energy efficiency, and the corresponding objective functions are respectively shown in formula (53) and formula (54):
max Jeffc=(KHsteamFg+KHelePtotalg)/Σi
wherein Jco represents energy cost, g represents generating capacity, and u represents consumption. Since the energy cost of an energy center is calculated independently from that of other plants (iron works, steel works, etc.), the cost of the energy center is taken as the target for optimization in this example
is the sum of consumption costs of byproduct gas and purchased coal, and qi
The constraint conditions in formula (55) can be described as follows:
c1-c3: X1 and X2 are respectively the total flow of BFG and the total flow of COG input by each boiler, and X3 is the total consumption of coal. δB, δC and δM are respectively the corresponding loss rates;
c4-c6: the relationship between the input and the output of each device, and ƒ(⋅) is the corresponding fitting relationship;
c7-c8: balance of generation and consumption of high pressure steam and medium pressure steam;
c9: constraints on self-generating capacity;
c10-c12: balance constrains on steam pipe network, Fr is the total recovery amount of waste heat, and δs is the loss rate of steam;
c13: constraints on pressure of steam pipe network. PaL and PaH are respectively the upper limit and the lower limit of the pressure of a steam pipe network in the south area, and Pa0 represents the initial state of the pressure; and Kpa is the conversion coefficient of flow to pressure;
Then the swarm intelligence optimization method is adopted for solution, thus obtaining a scheduling scheme with optimal economy.
With certain scheduling as an example, the scheduling effects of BFG, COG and LDG systems are shown in
(2) Background Service: Computing Trigger Mechanism with Edge-Cloud Cooperation
The background service is divided into two parts: the cloud service is deployed on the cloud server and receives parameter updating instructions triggered by edge side computing service and human-machine interaction. The background service on the edge side reads and serializes required real-time samples of each energy system and auxiliary information such as production plan messages and production signals from the relational database, takes the processed data samples and relevant information such as plans as input of the “*.iail” model, and forecasts the trend of concerned users in a future period of time to invoke an optimal scheduling model to compute scheduling suggestions and display same on the human-machine interface of the human-computer interface. In general, the background service updates the computed results every 3 minutes to ensure timeliness of data and adaptability of the system.
(3) Human-computer interface: display of computed result and human-machine interaction
The human-computer interface reads the results saved in the database and displays same on the human-machine screen with the refresh frequency of 1 minute. In the case of unplanned temporary changes (such as temporary wind break of blast furnace) in working conditions of production, operators can trigger feedback information to the cloud service through the human-machine interface, and the cloud service will trigger the edge side model to realize self-updating of parameters. Meanwhile, interaction with artificial knowledge data such as forecasting accuracy, scheduling results and scheduling rules through a standardized template can also be realized through an interface of the human-computer interface, and the background service will realize self-updating of the rule base according to artificial knowledge. The practicability and the computing accuracy of the platform are further improved by means of blended learning of human-machine integration.
Number | Date | Country | Kind |
---|---|---|---|
202110478283.0 | Apr 2021 | CN | national |
Number | Name | Date | Kind |
---|---|---|---|
7552100 | Chen | Jun 2009 | B2 |
8321187 | Kaufman | Nov 2012 | B2 |
10372118 | Li | Aug 2019 | B2 |
10747213 | Ryu | Aug 2020 | B2 |
11003175 | Xenos | May 2021 | B2 |
Entry |
---|
Tao, L., et al. “A Hybrid LSSVM Model with Empirical Mode Decomposition and Differential Evolution for Forecasting Monthly Precipitation” American Meteorological Society, pp. 159-176 (2017) (Year: 2017). |
Pena, J., et al. “Optimal Scheduling of a By-Product Gas Supply System in the Iron-and Steel-Making Process Under Uncertainties” Computers & Chemical Engineering, vol. 125, pp. 351-364 (2019) (Year: 2019). |
Fung, D. S. “Methods for the estimation of missing values in time series” Thesis, Edith Cowan U. (2006) (Year: 2006). |
Sun et al., “Research on Energy Management for Multi-Energy Flow: Challenges and Prospects,” (2016) Automation of Electric Power System, 40(15): 1-8, 9 pages. |
Zhao et al., “Data-Based Predictive Optimization for Byproduct Gas System in Steel Industry,” IEEE Transactions on Automation Science and Engineering, vol. 14, No. 4, Oct. 2017, 10 pages. |
Wang et al., “Adaptive Granulation-Based Prediction for Energy System of Steel Industry,” IEEE Transactions on Cybernetics, vol. 48, No. 1, Jan. 2018, 12 pages. |
Shamshirband et al., “Heat load prediction in district heating systems with adaptive neuro-fuzzy method,” Renewable and Sustainable Energy Reviews 48 (2015) 760-767, 8 pages. |
Dinesh et al., “Residential Power Forecasting Using Load Identification and Graph Spectral Clustering,” IEEE Transactions on Circuits And Systems—II: Express Briefs, vol. 66, No. 11, Nov. 2019, 5 pages. |
Fliess et al., “Prediction bands for solar energy: New short-term time series forecasting techniques,” Solar Energy 166 (2018) 519-528, 10 pages. |
Zhao et al., “Granular Model of Long-Term Prediction for Energy System in Steel Industry,” IEEE Transactions on Cybernetics, vol. 46, No. 2, Feb. 2016, 13 pages. |
Li et al., “Prediction and optimal operation on byproduct gas system in steel enterprises,” Iron and Steel vol. 51, No. 8, Aug. 2016, 9 pages. |
Dan et al., “Research on stochastic optimal scheduling model for electric-thermal integrated system considering thermal storage characteristics of heating system,” (2020) Renewable Energy Sources, 38(3): 380-387, 8 pages. |
Xu et al., “Safe and optimized scheduling of power system considering demand response,” Control and Decision, vol. 33, No. 3, Mar. 2018, 8 pages. |
Jin et al., “A joint scheduling method for multiple byproduct gases in steel industry,” Control Engineering Practice 80 (2018) 174-184, 11 pages. |
Jin et al., “Granular-Causality-Based Byproduct Energy Scheduling for Energy-Intensive Enterprise,” IEEE Transactions on Automation Science and Engineering, vol. 17, No. 4, Oct. 2020, 12 pages. |
Tan et al., “Research on testing technology of control device based on RTLAB for flexible distribution network,” (2019) Power Big Data, 22(7): 1-8, 8 pages. |
Starkloff et al., “Development and validation of a dynamic simulation model for a large coal-fired power plant,” Applied Thermal Engineering 91 (2015) 496-506, 11 pages. |
Liuliang QIU, “Operation simulation and optimal analysis of distributed energy system based on TRNSYS,” (2017) Dissertation for Master's Degree, Shanghai University of Electric Power, 75 pages. |
Voropai et al., “Simulation Approach to Integrated Energy Systems Study Based on Energy Hub Concept,” (2019) IEEE Powertech Conference, 5 pages. |