The present application relates to complex industrial process parameter detection, and more particularly to a method for detecting a dioxin (DXN) emission concentration of a municipal solid waste incineration process based on multi-level feature selection.
Grate furnace-based municipal solid waste incineration (MSWI) is a widely used technique for household waste treatment and recycling[1-2]. As of 2017, there are 303 MSWI power plants in China, in which 220 MSWI power plants use the grate furnace. Most of the imported MSWI equipment is generally manually controlled during the operation, which causes unstable operation and failure to optimize the control[3]. In developing countries, there is an extremely urgent need to control pollutant emission caused by MSWI[4-5]. Dioxin (DXN) is a highly toxic pollutant[6] discharged from MSWI process and is the main cause of the “Not in my back yard” (NIMBY) effect. DXN, referred to as the most toxic pollutant in the century[7], is a general term for persistent organic pollutants composed of polychlorinated dibenzo-para-dioxins (PCDDs), polychlorinated dibenzofurans (PCDFs) and certain polychlorinated biphenyls with dioxin-like properties. DXN has a significant accumulation and amplification effect in organisms[8-9].
Currently, MSWI companies mainly focus on how to minimize DXN emissions by optimizing and controlling operating parameters[10], so it is necessary to realize online measurement of the DXN emission concentration so as to optimize the MSWI process. There are three typical detection methods for the DXN emission concentration: 1) an offline direct detection method, 2) an indicator/related substance online indirect detection method and 3) an online direct detection method. The first method requires a specialized laboratory and associated laboratory analysis equipment, having a lag time of month/week. The second method is performed through three steps. First, flue gas is collected online. Second, a concentration of the indicator/related substance is detected. Third, the DXN emission concentration is indirectly calculated based on a mapping model. In addition, the second method needs expensive and complicated online laboratory analysis equipment and has a lag time of day/hour. The third method does not require the laboratory analysis equipment and has a lag time of minute/second[11]. The present application mainly focuses the third method.
The online direct detection method of the DXN emission concentration is oriented toward input features selected based on mechanism and experience in the current research. Literatures[12-14] use small sample data of different types of incinerators and build models based on linear regression, artificial neural network (ANN), selective ensemble (SEN) least squares-support vector machine (LS-SVM), etc. Literature[15] uses data from more than four years of actual processes in an incineration plant in Taiwan, combines correlation analysis, principal component analysis (PCA) and artificial neural network (ANN), selects 13 variables from 23 readily detected process variables to establish a soft sensing DXN model, and concludes that the input features with a large contribution rate are injection frequency of activated carbon, a concentration of HCL gas emitted from a chimney and a temperature in a mixing chamber. Literature[16] adopts input variables, including a furnace temperature, a flue gas temperature at a boiler outlet, flue gas flow and concentrations of SO2, HCL and particles, to establish a prediction model for the DXN emission concentration and toxicity equivalent based on support vector machine (SVM). Variables of the actual MSWI process have hundreds of dimensions, and are related to DXN generation, absorption and emission in different degrees[17]. However, none of the above processes performs feature selection by combining multi-phase characteristics of the MSWI process and the collinearity among these variables. In addition, labeled samples of DXN soft measurement are difficult to be obtained. Thus, feature selection of small sample high-dimensional data should be elevated to an important position during modeling.
The object of feature selection is to remove irrelevant and redundant features and retain only important features. In order to eliminate the irrelevant features, the degree of correlation between a single feature (independent variable) and the DXN emission concentration (dependent variable) in the MWSI process should be considered. Literature shortens the calculation time and simplifies the modeling by reducing the dimensionality of high-dimensional data via correlation coefficients. Literature[19] discloses a multi-objective semi-supervised feature selection method based on correlation coefficients. However, the linear method based on correlation coefficient proves to hardly describe the complex and arbitrary mapping relationship between the independent variable and the dependent variable[20]. Literature[21] points out that mutual information has good performance in characterizing the correlation between features. Literature[22] proposes a feature selection method based on individual optimal mutual information. Literature[23] proposes a feature selection method based on conditional mutual information, which can effectively evaluate previously selected features. It can be seen that both the correlation coefficient and the mutual information can characterize the correlation between the independent variable and the dependent variable[24-25]. The correlation coefficient focuses on linear relationships, while the mutual information focuses on nonlinear relationships[26-27]. For the actual complex industrial process, the mapping relationship between the independent variable and the dependent variable is difficulty characterized by only using a linear or non-linear relationship. Moreover, none of the above methods considers the approach of performing adaptive feature selection.
After obtaining a single input feature that has a good correlation with DXN, it is required to consider the redundancy among many process variables in the MSWI process, so as to eliminate the redundant features. Literature[28] expresses the redundancy between selected features and current features using the correlation coefficient. Literature[29] solves the problem of collinearity between variables by PCA, but the extracted latent variables can destroy the physical meaning of original features. Literature[30] solves multicollinearity by improving ridge regression. Literature[31] verifies that partial least squares (PLS) has good explanation and decomposition abilities for the multicollinearity between input features. Literature[32] proposes a feature selection method based on genetic algorithm-based partial least squares (GAPLS) algorithm. The feature selection method combines global optimization search capabilities of genetic algorithm (GA) and multicollinearity processing capabilities of PLS. Tang et al. disclose that GAPLS has good selectivity for high-dimensional spectral data[33], however, GA has randomness for small sample high-dimensional data, leading to different results for each feature selection. Therefore, it is necessary to perform statistics on the features selected multiple times to improve robustness and interpretability.
The above feature selection processes are performed based on data drive, and the limited sample size may produce deviations. Based on the existing research results and prior knowledge, there is need to augment the important features with clear mechanism meaning so as to make an online detection method more interpretable and in line with DXN emission characteristics of the MSWI process, thereby providing support for subsequent optimization control research.
It can be seen from
In the above process, DXN is present in incineration ash, the fly ash and the exhaust gas, where the amount of the incineration ash is largest. The amount of the fly ash is slightly smaller than that of the incineration ash. The DXN concentration of the incineration ash is relatively low. The DXN concentration of the fly ash is higher than that of the incineration ash. The DXN concentration of the exhaust gas is highest. The incineration ash and the fly ash require special treatment. The exhaust gas is of two types: incomplete garbage combustion-generated and synthesis reaction-generated[34]. In order to ensure that toxic organic matters are effectively decomposed, the flue gas temperature during the incineration should reach at least 850° C. and be kept fir at least 2 seconds. During the flue gas treatment, the lime and the activated carbon are injected into the reactor to remove acid gas and adsorb DXN and certain heavy metals, and then the flue gas is filtered by the bag dust collector and discharged into the chimney through the draft fan. In addition, a DXN memory effect in the flue gas treatment leads to an increase in the emission concentration. Generally, DXN generation and absorption-related process variables in the furnace incineration and flue gas treatment are stored in seconds by an on-site distributed control system. The concentration of readily detectable gases (CO, HCL, SO2, NOx, HF, etc.) in the exhaust gas is detected in real time by an online detection instrument. Incineration plants or environmental protection authorities usually perform the DXN concentration detection for the exhaust gas by an off-line direct detection method monthly or quarterly.
Accordingly, the DXN emission concentration online detection has the following difficulties. An original DXN content of MSW is unknown. The mechanism in the DXN generation and absorption stage is complicated and unclear. The DXN memory effect during the flue gas treatment leads to uncertainty in measurement. Therefore, it is very necessary to perform feature selection on input features for each sub-process of the MSWI process.
In order to overcome the above-mentioned shortcomings in the prior art, the present application provides a method for detecting a dioxin (DXN) emission concentration in a MSWI process based on multi-level feature selection. Feature selection of input features is performed for each sub-process of the MSWI process, so as to detect the DXN emission concentration of the MSWI process. The method has good interpretability, conforms to DXN emission characteristics of the MSWI process and provides support for subsequent optimization control research.
The technical solutions of the present application are described as follows.
The present application provides a method for detecting a dioxin (DXN) emission concentration in a MSWI process based on multi-level feature selection, comprising:
In an embodiment, the method further comprises:
In an embodiment, the DXN detection model comprises input data and output data;
In an embodiment, the step of obtaining the correlation coefficient value comprises:
In an embodiment, the step of obtaining the mutual information value comprises:
In an embodiment, the step of obtaining the comprehensive evaluation value comprises:
represents a (pi)cor_misel-th candidate feature of the i-th sub-process; and a correlation coefficient value of the (pi)corr_misel-th candidate feature is
and a mutual information value of the (pi)corr_misel-th candidate feature is
represents a standardized correlation coefficient value of the pcorr_misel-th candidate feature of the i-th sub-process; and
represents a standardized mutual information value of the pcorr_misel-th candidate feature of the i-th sub-process;
expressing
as
In an embodiment, kicorr is equal to 0.5; and kimi is equal to 0.5.
In an embodiment, the step of obtaining the comprehensive evaluation value of the candidate input features according to the correlation coefficient value and the mutual information value comprises:
as comprehensive evaluation value-selected input feature; and expressing the variables as:
In an embodiment, the step of arranging the first-level features in series comprises:
represents a p1stsel-th feature in a first-level feature selection set;
represents the number of all of the first-level features; and X1stsel represents single feature correlation-based first-level feature obtained by serially combining the first-level features of all of the sub-processes.
In an embodiment, a strategy of selecting the second-level features comprises:
accordingly, recording all p1stsel-th features of the first-level features as
In an embodiment, the step of selecting the second-level features comprises:
wherein
is the number of times that the p1stsel-th feature in the first-level features is selected.
In an embodiment, the population size is 20; the maximum genetic algebra is 40; a maximum number of latent variables of the PLS algorithm is 6; and the mutation probability is 0.005.
In an embodiment, the step of selecting the third-level features comprises:
that all the p1stsel-th features in the first-level features are selected, setting a scale factor as fDXNRMSE; determining a lower limit of a threshold configured to select the third-level features as θDXNdownlimit; calculating θDXNdownlink according to:
represents the number of times that the p1stsel-th feature in the first-level features is selected by running the GAPLS algorithm J times; μp represents a threshold selection criterion for selecting the third-level features;
In an embodiment, the step of establishing the PLS algorithm-based DXN detection model comprises;
In an embodiment, variables of the PLS algorithm-based DXN detection model have 287 dimensions.
In an embodiment, weight factors ficorr, fimi and ficorr_mi of feature selection of the correlation coefficient value and the mutual information value of the first-level features are 0.8.
In an embodiment, there are 132 feature variables selected by the comprehensive evaluation value; for the selected 132 process variables based on the single feature correlation, an optimal process variable combination is determined using the GAPLS algorithm so as to remove redundant features.
The present application has the following beneficial effects.
In the method of the present application, feature selection of input features is performed for each sub-process of the MSWI process, so as to detect the DXN emission concentration of the MSWI process based on multi-level feature selection. The method has good interpretability, conforms to DXN emission characteristics of the MSWI process and provides support for subsequent optimization control research.
The present application will be further described below with reference to the accompanying drawings, so that the present application is more understandable. The accompanying drawings disclosed herein are merely illustrative and not intended to limit the present application.
In the drawings: 1, unloading hall; 2, storage tank; 3, claw; 4, incinerator feed hopper; 5, grate; 6, slag tank; 7, steam turbine set; 8, reactor; 9, lime storage tank; 10, activated carbon storage tank; 11, fly ash storage bin; 12, bag dust collector; 13, mixer; 14, water tank; 15, draft fan; and 16, chimney.
The present application will be further described below with reference to the accompanying drawings to clearly and completely illustrate the technical solutions of the embodiments. It is apparent that the embodiments below are merely preferred embodiments of the present application and are not intended to limit the invention. Any other embodiments made by those skilled in the art based on the embodiments disclosed herein without sparing any creative efforts should fall within the scope of the invention.
The method includes the following steps.
S101) A grate furnace-based municipal solid waste incineration (MSWI) process is divided into a plurality of sub-processes based on incineration process. The plurality of sub-processes include an incineration treatment sub-process, a boiler operation sub-process, a flue gas treatment sub-process, a steam electric power generation sub-process, a stack emission sub-process and a common resource supply sub-process.
S102) A correlation coefficient value and a mutual information value between each of original input features of the sub-process and the DXN emission concentration are obtained. Then a comprehensive evaluation value of candidate input features is obtained according to the obtained correlation coefficient value and the obtained mutual information value, thereby obtaining first-level features of all of the sub-processes.
S103) The first-level features are selected and statistically processed by adopting a GAPLS-based feature selection algorithm and according to redundancy between different features, thereby obtaining second-level features of all of the sub-processes.
S104) The first-level features and the second-level features are screened based on statistical results within a preset threshold range, thereby obtaining the third-level features of all of the sub-processes
S105) A DXN detection model based on a partial least squares (PLS) algorithm is obtained according to model prediction performance and the third-level features. The DXN emission concentration is detected by the obtained PLS algorithm-based DXN detection model.
Specifically, the goal of feature selection in the present application is to improve the prediction performance and interpretability of a soft sensing model. The concentration detection method of the present application belongs to environmental protection fields, particularly to complex industrial process parameter detection. In the present embodiment, a method for detecting a dioxin (DXN) emission concentration in a MSWI process based on multi-level feature selection is provided. Firstly, from the perspective of the correlation between a single input feature and the DXN emission concentration, a comprehensive evaluation value index is constructed by combining the correlation coefficient and the mutual information, so as to realize the first-level feature selection of process variables of a monitored sub-process in the MSWI process. Secondly, from the perspective of multiple feature redundancy and feature selection robustness, running the GAPLS-based feature selection algorithm multiple times is performed to achieve the second-level feature selection based on the selected first-level features. Finally, by the combination of the number of times that previously selected features are selected, the model prediction performance and mechanism, the third-level feature selection is achieved based on the selected second-level features. The DXN emission concentration detection model can be established based on the obtained features. The method provided herein is verified to be effective by multi-year DXN monitoring data of an incineration plant.
Compared to the prior art, in the method of the present embodiment, feature selection of input features is performed for each sub-process of the MSWI process, so as to detect the DXN emission concentration of the MSWI process. The method has good interpretability, conforms to DXN emission characteristics of the MSWI process and provides support for subsequent optimization control research.
Specifically, the method further includes a step of arranging the first-level features in series after obtaining the first-level features of a 1 of the sub-processes, so as to obtain the first-level features based on single feature correlation.
Specifically, the DXN detection model includes input data and output data.
The input data is expressed as X∈RN×P and includes N samples as row data and P variables as column data. The input data is derived from respective sub-processes of the MSWI process. Monitoring data of an i-th sub-process is obtained by using a programmable logic controller (PLC) device or a distributed control system (DCS) device installed on site and is expressed as Xi∈RN×P
X=[X1, . . . ,Xi, . . . ,XI]={Xi}i=1I (1)
P=P1+. . . +Pi+. . . +PI=Σi=1IPi (2)
The output data is expressed as y={yn}n=1N∈RN×1 and includes N samples; and ŷ represents a predicted value.
Specifically, the step of obtaining the correlation coefficient value is performed through the following steps.
Specifically, the step of obtaining the mutual information value includes the following steps.
Specifically, the step of obtaining the comprehensive evaluation value is performed through the following steps.
represents a (pi)corr_misel-th candidate feature of the i-th sub-process; and a correlation coefficient value of the (pi)corr_misel-th candidate feature is
and a mutual information value of the (pi)corr_misel-th candidate feature is
represents a standardized correlation coefficient value of the pcorr_misel-th candidate feature of the i-th sub-process; and
represents a standardized mutual information value of the pcorr_misel-th candidate feature of the i-th sub-process.
and are expressed as follows:
Specifically, kicorr is equal to 0.5; and kimi is equal to 0.5.
Specifically, the step of obtaining the comprehensive evaluation value of the candidate input features according to the correlation coefficient value and the mutual information value is performed through the following steps.
are selected as comprehensive evaluation value-selected input features, and recorded as:
Specifically, the step of arranging the first-level features in series is performed through the following steps.
The first-level features are arranged in series to obtain the first-level features X1stsel based on the single feature correlation;
represents a p1stsel-th feature in a first-level feature selection set;
represents the number of all of the first-level features; and X1stsel represents single feature correlation-based first-level feature obtained by serially combining the first-level features of all of the sub-processes.
Specifically, a strategy of selecting the second-level features is described as follows.
The first-level features X1stsel are inputted into a GAPLS algorithm. After running the GAPLS algorithm J times, the second-level features (X2ndsel)j are outputted. Then the number of times that the respective inputted first-level features are selected is outputted. The second-level features that are selected Jsel times are statistically processed. When a GAPLS model prediction error is smaller than a prediction error average obtained by running the GAPLS algorithm J times, a second-level feature is selected.
The number of times that a p1stsel-th feature is selected is recorded as
accordingly, all P1stsel-th features of the first-level features are recorded as
J is the number of times that the GAPLS algorithm runs. Jsel is the number of GAPLS models prediction errors of which are smaller than a prediction error average. (X2ndsel)j represents multiple feature redundancy-based second-level features selected by jth run of the GAPLS algorithm.
Specifically, the step of selecting the second-level features is performed through the following steps.
is the number of times that the p1stsel-th feature in the first-level features is selected.
Specifically, the population size is 20. The maximum genetic algebra is 40. A maximum number of latent variables of the PLS algorithm is 6. The mutation probability is 0.005.
Specifically, the step of selecting the third-level features is performed through the following steps.
According to the number of times
that all the p1stsel-th features in the first-level features are selected, a scale factor is set as fDXNRMSE. A lower limit of a threshold configured to select the third-level features recorded as θDXNdownlimit and calculated according to:
A maximum value of the number of times that all the p1stsel-th features in the first-level features are selected is found based on an upper limit θDXNuplimit of the threshold configured to select the third-level features,
The threshold is recorded as θDXN3rd and is between θDXNdownlimit and θDXNuplimit. The third-level features are obtained according to
represents the number of times that the p1stsel-th feature in the first-level features is selected by running the GAPLS algorithm J times; μp represents a threshold selection criterion for selecting the third-level features.
Feature variables of μp=1 are sequentially stored in X3rdsel_temp. The RMSE is calculated. X3rdsel_temp serves as input variables in the establishment of the PLS algorithm-based DXN detection model. X3rdsel represents the third-level features selected from X1stsel based on a feature selection threshold θ3rd and prior knowledge.
Specifically, the step of establishing the DXN detection model based on the PLS algorithm is implemented through the following steps.
Values of the threshold θDXN3rd between θDXNdownlimit and θDXNuplimit are increased one by one so as to establish a plurality of first temporary PLS algorithm-based DXN detection model.
A second temporary PLS algorithm-based DXN detection model is selected from the plurality of first temporary PLS algorithm-based DXN detection models. The second temporary PLS algorithm-based DXN detection model has a minimum value of RMSE.
Checking the input features of the DXN emission concentration detection model is performed to determine whether the input features comprises concentrations of CO, HCL, O2 and NOx emitted from a chimney. At the same time, features in the common resource supply Rib-process are removed. If the input features do not include concentrations of CO, HCL, O2 and NOx, the third-level features are additionally selected to obtain selected three-level features X3rdsel, thereby varying the number of features that are selected and establishing the PLS algorithm-based DXN detection model based on prior knowledge.
Specifically, variables of the PLS algorithm-based DXN detection model have 287 dimensions.
Specifically, weight factors ficorr, fimi and ftcorr_mi of feature selection of the correlation coefficient value and the mutual information value of the first-level features are 0.8.
Specifically, there are 132 feature variables selected by the comprehensive evaluation value. For the selected 132 process variables based on the single feature correlation, an optimal combination of the process variables is determined using the GAPLS algorithm so as to remove redundant features.
The present embodiment provides a method for detecting a dioxin (DXN) emission concentration in a MSWI process based on multi-level feature selection, which is implemented through the following specific steps.
A municipal solid waste incineration (MSWI) process is divided into six sub-processes based on an incineration process. The six sub-processes include an incineration treatment sub-process, a boiler operation sub-process, a flue gas treatment sub-process, a steam electric power generation sub-process, a stack emission sub-process and a common resource supply sub-process.
In the present application, the input data of the DXN detection model is expressed as X∈RN×P and includes N samples as row data and P variables as column data. The input data is derived from respective sub-processes of the MSWI process. Monitoring data of an i-th sub-process is obtained by using a programmable logic controller (PLC) device or a distributed control system (DCS) device installed on site and is expressed as Xi∈RN×P
X=[X1, . . . ,Xt, . . . ,XI]={Xi}i=1I (1)
P=P1+. . . +Pi+. . . +PI=Σi=1IPi (2).
I represents the number of the sub-processes. Pi represents the number of input features in the i-th sub-process, and the input features are variables derived from the monitoring data.
Accordingly, output data of the DXN detection model is expressed as y={yn}n=1N∈RN×1 and includes N samples as row data.
Obviously, the input/output data of the model is quite different in a time scale, and thus N<<P.
In order to make the following description understandable, Xt is modified as:
The present application provides a DXN emission concentration detection strategy for a MSWI process based on multi-level feature selection.
As shown in
represents the number of times that the p1stsel-th feature in the first-level features is selected. X3rdsel represents a third-level feature selected from X1stsel in the light of a feature selection threshold θ3rd and prior knowledge. Mparx represents parameters of the detection model. ŷ represents a predicted value.
In the method of the present embodiment, the algorithm is realized through the following steps.
Step 1.1) An original correlation coefficient value between each of the original input features and the DXN emission concentration is calculated. For example, an original correlation coefficient value between a p-th input feature (xp
Step 1.2) The original correlation coefficient value (ξcorr_orip
(ξcorrp
Step 1.3) Steps (1.1)-(1.2) are repeated for respective original input features until correlation coefficients for all of the original input features are obtained and recorded as {ξcorrp
Step 1.4) A weight factor of the i-th sub-process is set as ficorr. A threshold θicorr configured to select input features based on the correlation coefficients is calculated according to:
Step 1.5) The p-th input feature of the i-th sub-process is selected according to rules as follows:
Step 1.6) A feature (xp
Step 1.7) Steps (1.1)-(1.6) are performed for all of the original input features of the i-th sub-process; and the selected candidate features are recorded as:
Step 1.8) Steps (1.1)-(1.7) are repeated for all the sub-processes; and correlation coefficient measurement-selected features are recorded as {(Xcorrsel)i}t=1J.
Step 2.1) A mutual information value between each of the original input features and the DXN emission concentration is calculated. For example, a mutual information value between the p-th input feature (xp
Step 2.2) Step (2.1) is repeated for the respective original input features until mutual information values of all of the original input features are obtained. The obtained mutual information values are recorded as {ξmip
Step 2.3) A weight factor of the i-th sub-process is set as fimi. A mutual information-related threshold θimi is calculated according to:
Step 2.4) The p-th input feature of the i-th sub-process is selected according to rules as follows:
Step 2.5) A feature (xp
Step 2.6) Steps (2.1)-(2.5) are repeated for all of the input features of the i-th sub-process. The selected candidate features are recorded as:
Step 2.7) Steps (2.1)-(2.6) are repeated for all the sub-processes. Mutual information measurement-selected features are recorded as {(Xmisel)i}i=1I.
1.3 Single Feature Correlation Measurement Based on a Comprehensive Evaluation Value
Step 3.1) For the i-th sub-process, the intersection of the mutual information-selected features (Xmisel)i and the correlation coefficient-selected features (Xcorrsel)i is performed according to Equation (15), thereby obtaining a comprehensive evaluation value-selected candidate feature set
represents a (pi)corr_misel-th candidate feature of the i-th sub-process; and a correlation coefficient value of the (pi)corr_misel-th candidate feature is
and a mutual information value of the (pi)corr_misel-th candidate feature is
Step 3.2) Normalization is performed according to Equations (16) and (17) so as to eliminate size differences of the correlation coefficient value and mutual information value of the different input features;
represents a standardized correlation coefficient value of the pcorr_misel-th candidate feature of the i-th sub-process; and
represents a standardized mutual information value of the pcorr_misel-th candidate feature of the i-th sub-process.
Step 3.3) A comprehensive evaluation value of the candidate input features is defined as
and can be expressed as
Step 3.4) Steps (3.1)-(3.3) are repeated until comprehensive evaluation values of all of the candidate input features are obtained and recorded as
Step 3.5) A weight factor of the i-th sub-process is set as ficorr_mi. A comprehensive evaluation value-related threshold θi1stsel is calculated according to
Step 3.6) A (pi)corr_misel-th candidate input feature of the i-th sub-process is selected according to rules as follows:
Step 3.7) Steps (3.5)(3.6) are performed for all the original candidate input features. Variables of
are selected as comprehensive evaluation value-related input feature and expressed as:
(X1stsel)i=[(x1)i, . . . ,(xp
Step 3.8) Steps (3.5)-(3.7) are repeated until the selection of the first-level features of all the sub-processes is completed.
Step 3.9) The first-level features are arranged in series to obtain the first-level features X1stsel based on the single feature correlation;
represents a p1stsel-th feature in a first-level feature selection set; and
represents the number of all of the first-level features.
2. Second-Level Feature Selection Based on Multiple Feature Redundancy
In the first-level feature selection, only the correlation between a single input feature and the DXN emission concentration is considered, and the redundancy between multiple features is not considered. For the second-level feature selection, GAPLS-based feature selection algorithm is used and the redundancy between multiple features is considered. In the consideration that DXN emission concentration modeling has small sample size and the genetic algorithm (GA) has randomness, provided herein is a second-level feature selection strategy based on multiple feature redundancy according an embodiment of the present application, as shown in
It can be seen from
and accordingly, all P1stsel-th features of the first-level features are recorded as
J is the number of times that the GAPLS algorithm runs, and the GAPLS algorithm generally runs more than 100 times. Jsel is the number of GAPLS model J prediction errors smaller than a prediction error average obtained by running the GAPLS algorithm J times.
The second-level feature selection is performed through the following steps.
Step 1) The number of times that the GAPLS algorithm runs is set as J. GAPLS algorithm parameters are set. A population size, maximum genetic algebra, mutation probability, a crossover method and a number of latent variables of the PLS algorithm are initialized and generally set to 6. Let j=1 and the selection of the second-level features is started.
Step 2) Whether the GAPLS algorithm runs J times is determined. If yes, step (11) continues. If no, step (3) continues.
Step 3) Binary encoding for features is performed, where a length of a chromosome is the number of input features. 1 implies that a feature is selected. 0 implies that no feature is selected.
Step 4) Random initialization is performed on population.
Step 5) The fitness of the population is evaluated. A root mean square error of cross-validation (RMSECV) is calculated using a leave-one-out cross-validation method. The smaller the RMSECV, the better the fitness.
Step 6) Whether a termination condition of the maximum genetic algebra is reached is determined. If no, step (7) continues. If yes, step (9) continues.
Step 7) Genetic operations including selection, crossover and variation are performed through an elite substitution strategy, that is, individuals with poor fitness are replaced with individuals with good fitness. The crossover is performed through single point crossover. The genetic variation is performed through single point mutation.
Step 8) A new population is obtained and step (5) continues.
Step 9) An optimal individual is obtained by running the GAPLS algorithm J times. Further, decoding is performed to obtain selected second-level features (X2ndsel)j.
Step 10) Let j=j+1, and step (2) continues.
Step 11) An average value of root mean square errors (RMSE) of a prediction model is calculated by running the GAPLS algorithm J times. The number of the root mean square errors of the GAPLS model that are larger than the average value is recorded as Jsel. The second-level features that are selected Jsel times is processed by counting the number of times that the P1stsel-th feature in the first-level features is selected
is the number of times that the p1stsel-th feature in the first-level features is selected.
3. Third-level feature selection and modeling based on model prediction performance
According to the number of times
that all the p1stsel-th features in the first-level features are selected and a scale factor fDXNRMSE that has a default value of 1, a lower limit of a threshold configured to select the third-level features is set as θDXNdownlimit and calculated according to:
A maximum value (fDXNRMSE)max and a minimum value (fDXNRMSE)min of fDXNRMSE are calculated according to
A maximum value of the number of times at all the p1stsel-th features in the first-level features are selected is found based on an upper limit θDXNuplimit of the threshold configured to select the third-level features,
The threshold is recorded as θDXN3rd and is between θDXNdownlimit and θDXNuplimit. The third-level feature selection is performed according to
represents the number of times that the p1stsel-th feature in the first-level features is selected by running the GAPLS algorithm J times. μp represents a threshold selection criterion for selecting the third-level features. Feature variables of μp=1 are sequentially stored in X3rdsel_temp. The RMSE is calculated. X3rdsel_temp serves as input variables in the establishment of the PLS algorithm-based DXN detection model. X3rdsel represents the third-level features selected from X1stsel based on a feature selection threshold θ3rd and empirical knowledge.
Values of the threshold θDXN3rd between θDXNdownlimit and θDXNuplimit are increased one by one so as to establish a plurality of first temporary PLS algorithm-based DXN detection model.
A second temporary PLS algorithm-based DXN detection model is selected from the plurality of first temporary PLS algorithm-based DXN detection models. The selected second temporary PLS algorithm-based DXN detection model has a minimum value of RMSE.
The input features of the WONT emission concentration detection model are checked to determine whether the input features include concentrations of CO, HCL, O2 and NOx emitted from a chimney. Features in the common resource supply sub-process are removed. If the input features do not include concentrations of CO, HCL, O2 and NOx, the third-level features are additionally selected to obtain features X3rdsel selected from the third-level features, thereby varying the number of features that are selected and establishing the PLS algorithm-based DXN detection model based on prior knowledge.
In summary, the multi-level feature selection provided in the present application has the following process.
The principle of the method of the present embodiment will be described below in combination with implementation data.
1. Modeling Data Description
The method provided in the embodiment of the present application is implemented in a grate furnace-based MSWI plant in Beijing. The method includes 34 DXN emission concentration detection samples, and variables that include all process variables of the MSWI process has 287 dimensions. It can be seen that the number of input features far exceeds the number of modeling samples, and thus it is very necessary to reduce dimensionality of the variables. In the present embodiment, six sub-processes includes an incineration treatment sub-process, a boiler operation sub-process, a flue gas treatment sub-process, a steam electric power generation sub-process, a stack emission sub-process and a common resource supply sub-process, which are respectively marked as incineration, boiler, flue gas, steam, stack and common.
2. Modeling Results
2.1 Feature Selection Results Based on Single Feature Correlation
For different sub-processes, feature selection weight factors ficorr, fisel and ftcorr_mi of the correlation coefficient and the mutual information are 0.8. kicorr is equal to 0.5. kimi is equal to 0.5. Correlation coefficient values, mutual information values and comprehensive evaluation values of process variables selected by the incineration treatment sub-process are shown in
It can be seen from
Four conclusions can be obtained from
(1) The stack emission sub-process has mean values of 0.2816, 0.7401 and 0.2500 for the correlation coefficient values, the mutual information values and the comprehensive evaluation values, and these mean values of the stack emission sub-process are higher than those of other sub-processes. In the stack emission sub-process, concentrations of gases such as HCL, O2, NOx and CO emitted with DXN from the chimney are measured, which is consistent with DXN generation mechanism and DXN emission detection disclosed in literatures.
(2) For the incineration treatment sub-process, its correlation coefficient values have a maximum value of 0.6760, which is higher than that of other sub-processes. For the incineration treatment sub-process, its mutual information values have a maximum value of 0.8665, which is higher than that of other sub-process. For the stack emission sub-process, its comprehensive evaluation values have a maximum value of 0.2877, which is higher than other sub-processes. Therefore, the incineration treatment sub-process, the stack emission sub-process are related to the DXN generation process.
(3) For the common resource supply sub-process, its correlation coefficient values, mutual information values and comprehensive evaluation values each have a minimum value that is smallest among different sub-processes. In terms of mechanism, the common resource supply sub-process is not directly related to the material flow produced by DXN. However, it can be seen from measurement results of single feature correlation that the correlation coefficient value and the mutual information value between some process variables of the common resource supply sub-process and DXN are relatively large.
(4) The above statistics show that DXN emission industrial data has a certain degree of reliability. From the perspective of single feature correlation, the top three systems are related to DXN generation, adsorption and emission. However, some process variables of other sub-processes are also highly correlated with the DXN emission concentration from the data perspective, and thus the final feature selection should be performed by combining mechanism knowledge.
With reference to
2.2 Feature Selection Results Based on Multiple Feature Redundancy
For the 132 process variables based on single feature correlation, an optimal process variable combination is determined using the GAPLS algorithm for the redundant feature removal.
The GAPLS algorithm adopts the operating parameters of a population size 20, a maximum genetic algebra 40, a maximum number of latent variables (LV) 6, a genetic variation rate 0.005, a window width 1, a convergence percentage 98% and a variable initialization percentage 30%.
After the GAPLS algorithm runs 100 times with the above parameters, RMSE statistical results of the prediction model are obtained and shown in
It can be seen from the statistical results of
Further, the number of times that the 132 process variables are calculated. Statistical results of the number of times that the multi-feature related process variables are selected are shown in
(1) The average number of times that all 132 process variables are selected is 13. A process variable that has the largest selection times is from the common resource supply sub-process.
(2) The stack emission sub-process has four process variables, and these four process variables have largest single feature correlation. The maximum number of times that respective four process variables are selected is only 6, so it can be concluded that there is a difference between the selection results based on multiple feature redundancy and the single feature correlation. It also can be concluded that the GAPLS algorithm has randomness.
(3) The data-driven feature variable selection is flawed, and it is required to supplement mechanism knowledge.
2.3 Feature Selection Results Based on Model Prediction Performance
Based on the above GAPLS running results, a feature selection threshold is set to be in a range of 13-48.
According to the relationship between the feature selection threshold and the prediction performance, the threshold is set to be 18, and the number of selected process variables is 39. The process variables selected based on the model prediction performance in the respective sub-processes are shown in
It can be seen from
According to a relationship between the number of LVs and the RMSE of the prediction performance, when the number of LVs is 2, the training RMSE is 0.01375 and the testing RMSE is 0.01929. Latent variable contribution rates are extracted from different latent variables (LV).
According to DXN generation mechanism, the steam electric power generation sub-process and the common resource supply sub-process are weakly correlated to the DXN emission concentration. The stack emission sub-process is related to DXN. By combining the mechanism, four process variables of the stack emission sub-process are added as input features. The four process variables are concentrations of HCL, O2, NOx and CO emitted from the chimney.
The above-mentioned 18 process variables selected based on the combination of data drive and mechanism are used to establish the PLS model.
According to a relationship between the number of LVs and the RMSE of the prediction performance, when the number of LVs is 2, the training RMSE is 0.01638 and the testing RMSE is 0.02048. Variables extracted by different LVs and LV contribution rates are shown in
It can be seen from
3. Comparison and Discussion
It can be seen from the above that the method provided herein can reasonably consider the contribution of correlation coefficients and mutual information measures. A soft-sensing model based on the different input features is established using the PLS algorithm.
From the above results, it can be seen that, with the same number of LV, PLS modeling methods based on the different input features have similar prediction performance for testing data, but have a significant gap in the dimensionality reduction of the input features. Dimensions of the input features are listed in descending order. The original features have 287 dimensions. The input features based on mutual information have 235 dimensions. The input features based on correlation coefficients have 153 dimensions. The input features based on comprehensive evaluation values have 98 dimensions. The input features based on both of mechanism and the data drive in this application have 18 dimensions. It can be seen that the number of features in the method provided herein has been reduced by 16 times. Therefore, the method in the present application can effectively establish an interpretable soft-sensing model with clear physical meaning. It also shows that the analysis of industrial process data needs to be combined with mechanism knowledge for the implementation.
Multiple feature selection coefficients are involved in the feature selection of the present application. The influence of these coefficients on the feature selection results and model prediction performance requires to be profoundly analyzed. In addition, the modeling method used in this application is a simple linear model, and the selected features are linear and nonlinear mixed features. Therefore, a more reasonable modeling strategy remains to be studied. It is also needed to further explore the approach of measuring the reliability of the industrial process data. In view of the input features with clear mechanism knowledge, it is necessary to consider the use of prior knowledge in the initialization of the genetic algorithm, so as to select process variables with strong mechanism correlation, such as the concentration of CO emitted from the chimney.
In order to address the problems that DXN, as a highly toxic by-product of the MSWI process, has complicated and unclear generation and emission mechanism and is hardly detected online in real time, and high-dimensional input features used for DXN detection fail to be effectively selected, and there are a limited modeling sample size. The present application provides a method for detecting the DXN emission concentration in the MSWI process based on multi-level feature selection, which has the following advantages.
(1) Comprehensive evaluation value indicators are defined to perform single feature selection and measurement based on correlation.
(2) A feature selection method by running GAPLS multiple times for multiple feature redundancy is provided.
(3) Based on the model prediction performance, data drive and mechanism knowledge are combined to select the final input features, so as to establish a detection model. The method provided in the present application is verified to be effective by an incineration plant.
References cited in the specification are listed as follows:
It can be understood by those skilled in the art that, all or part of steps of the method disclosed in the present application can be completed by relevant hardware under the instructions of a program. The program is stored on a storage medium which includes several instructions to cause a computing device (such as a single-chip microcomputer, a chip, etc), or a processor to execute all or part of the steps of the method in the embodiments of the present application. The storage media is selected from various media that can store program codes consisting of a USB flash disk, a mobile hard disk, a Read-Only Memory (ROM), a Random-Access Memory (RAM), a diskette and an optical disc.
It should be understood by those skilled in the art that, in actual applications, various changes can be made without departing from the spirit and scope of the disclosure as claimed.
It should be noted that terms used herein are only for the purpose of description and are not intended to limit the present application. Unless otherwise specified, terms of a singular form also include a plural form. In addition, the terms “comprise” and/or “include” used in the specification are intended to indicate the presence of features, steps, operations, devices, components, and/or a combination thereof.
Unless otherwise specified, the relative arrangement of components and numerical expressions and numerical values in steps in the embodiments are not intended to limit the scope of the present application. At the same time, it should be understood that, the words used in the specification are words of description rather than limitation. The techniques, methods and equipment known to those skilled in the art may not be discussed in detail, but can be regarded as a part of the disclosure as claimed under certain cases. Any specific value disclosed in an embodiment is merely illustrative and is not as a limitation, and thus can be modified in other embodiments. It should be noted that similar numbers and letters indicate similar items in the accompanying drawings. Therefore, once an item is defined in an accompanying drawing, and there is no need to further define it in the subsequent accompanying drawings.
The embodiments disclosed in the present application are merely preferred embodiments. Any changes, modifications and replacements made by those skilled in the art without departing from the spirit of the invention are defined by the scope of the appended claims and equivalents thereof.
Number | Date | Country | Kind |
---|---|---|---|
201910397710.5 | May 2019 | CN | national |
This application is a continuation of International Patent Application No. PCT/CN2019/107216, filed on Sep. 23, 2019, which claims the benefit of priority from Chinese Patent Application No. 201910397710.5, filed on May 14, 2019. The content of the aforementioned applications, including any intervening amendments thereto, is incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
9875440 | Commons | Jan 2018 | B1 |
20150278703 | Liu | Oct 2015 | A1 |
Number | Date | Country |
---|---|---|
103366100 | Oct 2013 | CN |
107944173 | Apr 2018 | CN |
108062566 | May 2018 | CN |
108090317 | May 2018 | CN |
108549792 | Sep 2018 | CN |
109190660 | Jan 2019 | CN |
109492319 | Mar 2019 | CN |
109583115 | Apr 2019 | IN |
2005008572 | Jan 2005 | WO |
Entry |
---|
Hui Cao, Xingyu Yan, Yaojiang Li, Yanxia Wang, Yan Zho, Sanchun Yang, “A Component Prediction Method for Flue Gas of Natural Gas Combustion Based on Nonlinear Partial Least Squares Method,” Mar. 19, 2014, Hindawi Publishing Corporation, The Scientific World Journal, vol. 2014, pp. 1-5. (Year: 2014). |
Zhenhao Tang, Haiyang Zhang “Modeling NOx emission of coal-fired boiler with differential evolution optimized least square support vector machine,” 2018, Chinese Control And Decision Conference, pp. 3364-3367. (Year: 2018). |
Tang Jian et al. Soft Sensor method of Mill Load for Grinding Process based on GA-PLS from Spectral Data using Feature Selection, Jul. 2010. |
Yujing Sun et al. Correlation Feature Selection and Mutual Information Theory Based Quantitative Research on Meteorological Impact Factors of Module Temperature for Solar Photovoltaic Systems, Dec. 2016, No. 1, vol. 10. |
Hassan A. Arafat et al. Environmental performance and energy recovery potential of five processes for municipal solid waste treatment. |
Hui Zhou et al. A review of dioxin-related substances during municipal solid waste incineration. |
A. Mukherjee et al. A Review on Technologies of Removal of Dioxins and Furans from Incinerator Flue Gas. |
Yuanan Hu et al. The growing importance of waste-to-energy (WTE) incineration in China's anthropogenic mercury emissions: Emission inventories and reduction strategies, 2018. |
Tao Huang et al. Ultrasound-enhanced electrokinetic remediation for removal of Zn, Pb, Cu and Cd in municipal solid waste incineration fly ashes, 2018. |
P. H.-Jonas et al. The Global Exposure of Man to Dioxins: A Perspective on Industrial Waste Incineration, 1993,No. 8, vol. 26. |
Jing Bai et al. Mechanism and kinetics study on the ozonolysis reaction of 2,3,7,8-TCDD in the atmosphere, 2014. |
Yu Mingfeng et al. The research of PCDD/Fs emission characteristics in flue gas from municipal solid waste incinerations, May 2018, No. 5, vol. 38. |
T. Gouin et al. Variability of concentrations of polybrominated diphenyl ethers and polychlorinated biphenyls in air: implications for monitoring, modeling and control, 2005. |
Hai-Jun Zhang et al. Influence of variation in the operating conditions on PCDD/F distribution in a full-scale MSW incinerator, 2008. |
Qiao Junfei et al. Dioxin Emission Concentration Measurement Approaches for Municipal Solid Wastes Incineration Process:A Survey, Jun. 2020, No. 6, vol. 46. |
Tang J, Qiao J F, Guo Z H. Soft Sensing of Dioxin Emission Concentration Based on Potential Characteristic Selective Integrated Modeling[J]. Acta Automatica Sinica, in trial. |
Ni-Bin Chang et al. Statistical Modelling for the Prediction and Control of PCDDs and PCDFs Emissions From Municipal Solid Waste Incinerators, 1995. |
Ni-Bin Chang et al.Prediction of PCDDs/PCDFs emissions from municipal incinerators by genetic programming and neural network modeling, 2000. |
Tang Jian, Dioxin emission concentration soft measuring approach of municipal solid waste incineration based on selective ensemble kernel learning algorithm, 2019, No. 2, vol. 70. |
Sond Bunsan et al. Modeling the dioxin emission of a municipal solid waste incinerator using neural networks, 2013. |
Xiao Xiaodong et al. Prediction of dioxin emissions in flue gas from waste incineration based on support vector regression, Aug. 2017, No. 8, vol. 35. |
Tang Jian et al. Dioxin emission concentration soft measurement based on multi-source latent feature selective ensemble modeling for municipal solid waste incineration process,Month, 201X, No. X, vol. X. |
Abul Hasnat et al.Feature Selection in Cancer Microarray Data using Multi-Objective Genetic Algorithm combined with Correlation Coefficient, 2016. |
Frederico Coelho et al.Multi-Objective Semi-Supervised Feature Selection and Model Selection Based on Pearson's Correlation Coefficient. |
Roberto Battiti,Using Mutual Information for Selecting Features in Supervised Neural Net Learning, Jul. 1994, No. 4, vol. 5. |
Jorge R. Vergara • Pablo A. Estevez, A review of feature selection methods based on mutual information,2014. |
Anil K. Jain et al. Statistical Pattern Recognition: A Review, Jan. 2000, No. 1, vol. 22. |
Francois Fleuret, Fast Binary Feature Selection with Conditional Mutual Information, 2004. |
Pablo A. Estévez, Normalized Mutual Information Feature Selection,Feb. 2009, No. 2, vol. 20. |
Fatemeh Amiri et al. Mutual information-based feature selection for intrusion detection systems, 2011. |
Sara Mohammadi et al. Multivariate correlation coefficient and mutual information-based feature selection in intrusion detection,2017, No. 5, vol. 26. |
Hanchuan Peng et al.Feature Selection Based on Mutual Information:Criteria of Max-Dependency, Max-Relevance, and Min-Redundancy, Aug. 2005, No. 8, vol. 27, p. 5 and 59. |
Tihonov A N. O (Solution of incorrectly formulated problems and the regularization method), 1963. |
S. Wold et al. The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses,Sep. 1984, No. 3, vol. 5. |
R. Leardi et al. Genetic Algorithms as a Strategy for Feature Selection,1992, vol. 6. |
Tang Jian et al. Soft sensing mill load in grinding process by time/frequency information fusion, May 2012, No. 5, vol. 29. |
Number | Date | Country | |
---|---|---|---|
20210033282 A1 | Feb 2021 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2019/107216 | Sep 2019 | US |
Child | 17038723 | US |