Fault monitoring method for sewage treatment process based on fuzzy width adaptive learning model

TECHNICAL FIELD

The present invention relates to the technical field of data-driven fault monitoring, in particular to a fault monitoring technology for sewage treatment processes. The fault monitoring method of the present invention, intended for intermittent processes, is applied specifically to fault monitoring of the sewage treatment process.

BACKGROUND ART

In recent years, with the advancement of science and technology, industrialization and urbanization have accelerated, the population has continued to grow, and pollution of the ecological environment has become increasingly serious. Water is an essential resource for human life, and because of its shortage and serious pollution, protecting the water environment has become a consensus of today's society. People are seeking practical ways to protect the water environment, and sewage treatment plants, as a key link in the water protection chain, have received extensive attention. The primary purpose of a sewage treatment plant is to ensure that its final product, the effluent, meets the prescribed discharge standards, and on that basis to improve treatment efficiency and save costs. The prerequisite for achieving this goal is that the plant operates in a safe and stable state. Therefore, it is necessary to establish an effective process monitoring scheme to detect abnormal phenomena in time.

At present, machine-learning fault monitoring methods based on neural networks are widely used in sewage treatment processes, such as DBN (Deep Belief Network), DBM (Deep Boltzmann Machine), ANN (Artificial Neural Network) and AE (Auto-encoder). The currently popular deep neural networks are very powerful, but because of the large number of hyperparameters and the complex structures involved, training the network is very time-consuming. In addition, the complexity of the deep structure makes its theoretical analysis very difficult. At present, most research focuses on adjusting parameters such as network weights or stacking more layers to obtain higher accuracy, which takes a lot of time and requires expensive equipment and other hardware resources. Modern fast-paced industrial production can hardly meet these conditions.

SUMMARY

In order to solve the above problems, a fault monitoring method for the sewage treatment process based on a fuzzy width adaptive learning model is proposed. The model includes a set of first-order TS fuzzy subsystems and an enhanced node layer. First, each fuzzy subsystem processes the input data, and the processed intermediate values are concatenated as vectors and sent to the enhanced node groups for nonlinear conversion. Second, the output of the enhancement layer and the defuzzification output of the fuzzy subsystems are used to generate the final model output. The parameters of the model include the connection weights between the enhanced node groups and the final output, and the coefficients of the consequent parts of the fuzzy rules in each fuzzy subsystem, which can be calculated quickly by pseudo-inverse operations. Unlike a traditional deep network, this method only needs to reconstruct the model incrementally, without retraining the entire network from scratch. On the premise that the accuracy meets the requirements, the ability of the fuzzy width learning system to continuously adjust itself as the production process changes makes online fault monitoring of the sewage treatment process possible.

The technical scheme and implementation steps adopted by the present invention include “network training phase” and “online monitoring phase”. The specific steps are as follows:

A. Network Training Stage:

1) Collect historical data {X, Y} of the sewage treatment process as training data, where $X = (x_{1}, x_{2}, \dots, x_{N})^{T} \in ℝ^{N \times M}$ and $Y = (y_{1}, y_{2}, \dots, y_{N})^{T} \in ℝ^{N \times C}$. Y is used as label data: the label is 0 during normal periods of the process and 1 during fault periods. N is the number of input samples, M is the number of variables, and C is the dimension of the corresponding output (in this method the output dimension is 2, representing the normal and faulty states).

2) For the input data $X = (x_{1}, x_{2}, \dots, x_{N})^{T} \in ℝ^{N \times M}$, the s-th sample is $x_{s} = (x_{s1}, x_{s2}, \dots, x_{sM})$, s=1, 2, . . . , N. The model has n fuzzy subsystems and m enhanced node groups, and the i-th fuzzy subsystem contains K fuzzy rules. The first-order TS fuzzy model is used. Taking the i-th fuzzy subsystem as an example (i=1, 2, . . . , n), when the input is the s-th training sample, the output of the k-th fuzzy rule of the i-th fuzzy subsystem is:

$z_{sk}^{i} = f_{k}^{i} (x_{s 1}, x_{s 2}, \dots, x_{s M}) = \sum_{t = 1}^{M} α_{kt}^{i} x_{st}$

Wherein f_kⁱ is a polynomial of the first degree, α_ktⁱ is a parameter initialized from the uniform distribution on [0,1], t=1, 2, . . . , M, where M is the number of variables, and k=1, 2, . . . , K, where K is the number of fuzzy rules.
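As an illustration of the rule output in step 2), the following NumPy sketch evaluates z for all samples and all rules of one subsystem in a single matrix product. The function name `rule_outputs`, the toy sizes and the random seed are assumptions made for this example and are not part of the method.

```python
import numpy as np

def rule_outputs(X, alpha):
    """Return z with z[s, k] = sum_t alpha[k, t] * X[s, t] for one fuzzy subsystem.

    X     : (N, M) array of input samples
    alpha : (K, M) array of first-order coefficients, one row per fuzzy rule
    """
    return X @ alpha.T  # shape (N, K)

rng = np.random.default_rng(0)               # seed chosen only for reproducibility
N, M, K = 4, 3, 2                            # toy sizes (hypothetical)
X = rng.normal(size=(N, M))
alpha = rng.uniform(0.0, 1.0, size=(K, M))   # uniform on [0, 1], as in step 2)
z = rule_outputs(X, alpha)
print(z.shape)
```

Each row of `z` holds the K rule outputs for one sample, so the same array can be reused when the weighted strengths are applied.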

3) For the s-th training sample, calculate the weighted strength ω_skⁱ of the k-th fuzzy rule of the i-th fuzzy subsystem:

$ω_{sk}^{i} = \frac{τ_{sk}^{i}}{\sum_{k = 1}^{k_{i}} τ_{sk}^{i}}$

Where τ_skⁱ represents the unnormalized firing strength of the k-th fuzzy rule of the i-th fuzzy subsystem (k=1, 2, . . . , K) when the input training sample is x_s=(x_s1, x_s2, . . . , x_sM). It is computed as follows:

$τ_{sk}^{i} = \prod_{t = 1}^{M} μ_{kt}^{i} (x_{st})$

Where μ_ktⁱ(x_st) is the Gaussian membership function corresponding to the input x_st (t=1, 2, . . . , M), expressed as:

$μ_{kt}^{i} (x_{st}) = e^{- {(\frac{x_{st} - c_{kt}^{i}}{δ_{kt}^{i}})}^{2}}$

Wherein c_ktⁱ and δ_ktⁱ are parameters: the value of c_ktⁱ is the cluster center obtained by C-means clustering, and δ_ktⁱ=1.
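The three formulas of step 3) can be sketched together as follows. This is a minimal illustration, assuming the centers are already available as a (K, M) array; the function name `normalized_firing_strengths` and the toy values are hypothetical.

```python
import numpy as np

def normalized_firing_strengths(X, centers, widths):
    """Compute omega[s, k] for one fuzzy subsystem.

    X       : (N, M) input samples
    centers : (K, M) membership centers c_kt
    widths  : (K, M) membership widths delta_kt (all ones in the text)
    """
    diff = (X[:, None, :] - centers[None, :, :]) / widths[None, :, :]
    mu = np.exp(-diff ** 2)          # Gaussian memberships, shape (N, K, M)
    tau = mu.prod(axis=2)            # product over the M variables, shape (N, K)
    return tau / tau.sum(axis=1, keepdims=True)  # normalize over the K rules

X = np.array([[0.0, 0.0], [1.0, 1.0]])
centers = np.array([[0.0, 0.0], [1.0, 1.0]])
widths = np.ones_like(centers)
omega = normalized_firing_strengths(X, centers, widths)
print(omega)
```

In this toy case the first sample sits exactly on the first center, so its first rule has strength 1 before normalization and the second rule has strength exp(-2). For a large number of variables the product of memberships can underflow, so a practical implementation may prefer to work with sums of logarithms.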

4) Calculate the output Zⁿ of the n fuzzy subsystems:

$Z^{n} = (Z_{1}, Z_{2}, \dots, Z_{n}) \in ℝ^{N \times (k_{1} + k_{2} + \dots + k_{n})}$

Wherein, Z₁, Z₂, . . . , Z_n respectively correspond to the outputs of the n subsystems, and k₁, k₂, . . . , k_n respectively correspond to the numbers of fuzzy rules of the n fuzzy subsystems.

Taking the i-th fuzzy subsystem as an example, for all training samples, the output matrix Z_i is obtained by the following formula:

$Z_{i} = (Z_{1i}, Z_{2i}, \dots, Z_{Ni})^{T} \in ℝ^{N \times K}, \quad i = 1, 2, \dots, n$

Wherein, Z_1i, Z_2i, . . . , Z_Ni respectively represent the output vectors of the i-th fuzzy subsystem for the N training samples. Taking the s-th sample as an example, the output vector of the i-th fuzzy subsystem is obtained by the following formula:

$Z_{si} = (ω_{s1}^{i} z_{s1}^{i}, ω_{s2}^{i} z_{s2}^{i}, \dots, ω_{sK}^{i} z_{sK}^{i})$

Wherein, z_skⁱ and ω_skⁱ are obtained in 2) and 3) respectively, and k=1, 2, . . . , K.
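A short sketch of step 4), assuming the arrays of weighted strengths and rule outputs have already been computed for each subsystem. The helper names `subsystem_output` and `stack_subsystems` and the numbers are hypothetical.

```python
import numpy as np

def subsystem_output(omega, z):
    """Z_i: element-wise product omega[s, k] * z[s, k], shape (N, K)."""
    return omega * z

def stack_subsystems(outputs):
    """Z^n: concatenate the subsystem outputs column-wise."""
    return np.hstack(outputs)

# One sample, two rules per subsystem, two subsystems (toy values).
omega_1, z_1 = np.array([[0.25, 0.75]]), np.array([[2.0, 4.0]])
omega_2, z_2 = np.array([[0.5, 0.5]]), np.array([[1.0, 3.0]])

Z_1 = subsystem_output(omega_1, z_1)
Z_2 = subsystem_output(omega_2, z_2)
Z_n = stack_subsystems([Z_1, Z_2])
print(Z_n)
```

The width of `Z_n` equals the total number of fuzzy rules over all subsystems, which is the column dimension stated for Zⁿ.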

5) Input the output Zⁿ obtained in 4) into the m enhanced node groups mentioned in 2) for nonlinear conversion. The j-th enhanced node group has L_j neurons (j=1, 2, . . . , m). For the m enhanced node groups, the output matrix can be expressed as:

$H^{m} = (H_{1}, H_{2}, \dots, H_{m}) \in ℝ^{N \times (L_{1} + L_{2} + \dots + L_{m})}$

Wherein, L₁, L₂, . . . , L_m respectively represent the numbers of neurons contained in the m enhanced node groups, $H_{j} = ξ_{j}(Z^{n} W_{j} + β_{j}) \in ℝ^{N \times L_{j}}$ is the output matrix of the j-th enhanced node group, W_j represents the connection weights between the fuzzy subsystem outputs and the j-th enhanced node group, β_j represents the corresponding bias, the values of W_j and β_j are randomly generated from [0,1], and ξ_j is the activation function, ξ_j=tanh(·).
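The enhancement layer of step 5) can be illustrated with the sketch below. It assumes each group has its own weight matrix of shape (width of Zⁿ, L_j) and a bias vector of length L_j; the function name `enhancement_layer`, the group sizes and the seed are hypothetical.

```python
import numpy as np

def enhancement_layer(Z, weights, biases):
    """Concatenate H_j = tanh(Z @ W_j + beta_j) over all enhanced node groups."""
    groups = [np.tanh(Z @ W + b) for W, b in zip(weights, biases)]
    return np.hstack(groups)

rng = np.random.default_rng(0)          # seed chosen only for reproducibility
N, width = 5, 6                         # toy sizes (hypothetical)
group_sizes = [3, 4]                    # L_1, L_2
Z = rng.normal(size=(N, width))
weights = [rng.uniform(0.0, 1.0, size=(width, L)) for L in group_sizes]
biases = [rng.uniform(0.0, 1.0, size=L) for L in group_sizes]
H = enhancement_layer(Z, weights, biases)
print(H.shape)
```

Because W_j and β_j are drawn once and then kept fixed, this layer has no trainable parameters; only the output weights are solved for later.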

6) Calculate the defuzzification output of each fuzzy subsystem. For the s-th training sample x_s, the defuzzification output of the i-th fuzzy subsystem can be expressed as:

$F_{si} = \sum_{k = 1}^{k_{i}} ω_{sk}^{i} z_{sk}^{i} = \sum_{k = 1}^{k_{i}} (ω_{sk}^{i} \sum_{t = 1}^{M} α_{kt}^{i} x_{st})$

Introduce the parameter δ_kcⁱ into the consequent part of each fuzzy rule (c=1, 2, . . . , C). The parameter value is obtained by the pseudo-inverse operation, and C represents the dimension of the output. Since $Y \in ℝ^{N \times C}$ has C dimensions, each fuzzy subsystem is a multi-output model. After the parameter is introduced, the new output is:

$\begin{matrix} F_{si} = (\sum_{k = 1}^{k_{i}} ω_{sk}^{i} (\sum_{t = 1}^{M} δ_{k 1}^{i} α_{kt}^{i} x_{st}), \dots, \sum_{k = 1}^{K} ω_{sk}^{i} (\sum_{t = 1}^{M} δ_{kC}^{i} α_{kt}^{i} x_{st})) \\ = \sum_{t = 1}^{M} α_{kt}^{i} x_{st} (ω_{s 1}^{i}, ω_{s2}^{i}, \dots, ω_{sK}^{i}) (\begin{matrix} δ_{1 1}^{i} & \dots & δ_{1 C}^{i} \\ ⋮ & ⋱ & ⋮ \\ δ_{K 1}^{i} & \dots & δ_{K C}^{i} \end{matrix}) \end{matrix}$

Where s=1, 2, . . . , N, then, for all training samples X, the output of the i-th fuzzy subsystem is:

$F_{i} = {(F_{1 i}, F_{2 i}, \dots, F_{Ni})}^{T} \overset{Δ}{=} D Ω^{i} δ^{i} \in ℝ^{N \times C}$

Where $D = \operatorname{diag}\{\sum_{t = 1}^{M} α_{kt}^{i} x_{1 t}, \dots, \sum_{t = 1}^{M} α_{kt}^{i} x_{Nt}\}$, $Ω^{i} = (\begin{matrix} ω_{1 1}^{i} & \dots & ω_{1 K}^{i} \\ ⋮ & ⋱ & ⋮ \\ ω_{N 1}^{i} & \dots & ω_{NK}^{i} \end{matrix})$, $δ^{i} = (\begin{matrix} δ_{1 1}^{i} & \dots & δ_{1 C}^{i} \\ ⋮ & ⋱ & ⋮ \\ δ_{K 1}^{i} & \dots & δ_{K C}^{i} \end{matrix})$

Thus, the output of n fuzzy subsystems can be obtained:

$F^{n} = \sum_{i = 1}^{n} F_{i} = \sum_{i = 1}^{n} D Ω^{i} δ^{i} = D (Ω^{1}, Ω^{2}, \dots, Ω^{n}) (\begin{matrix} δ^{1} \\ ⋮ \\ δ^{n} \end{matrix}) = D Ω Δ \in ℝ^{N \times C}$

Wherein, $Ω = (Ω^{1}, Ω^{2}, \dots, Ω^{n}) \in ℝ^{N \times nK}$ is the matrix containing the weighted strengths ω_skⁱ obtained in 3), and $Δ = ((δ^{1})^{T}, \dots, (δ^{n})^{T})^{T} \in ℝ^{nK \times C}$.

7) Calculate the final output. The weight between the fuzzy subsystems and the output is set to 1, and the weight between the enhanced node groups and the output is set to $W_{e} \in ℝ^{(L_{1} + L_{2} + \dots + L_{m}) \times C}$. Multiply H^m obtained in 5) and Fⁿ obtained in 6) by their respective weights, then sum the results to obtain the final output:

$\hat{Y} = F^{n} + H^{m} W_{e} = D Ω Δ + H^{m} W_{e} = (D Ω, H^{m}) (\begin{matrix} Δ \\ W_{e} \end{matrix}) \overset{Δ}{=} (D Ω, H^{m}) W$

For a given training target Y, the matrix W can be calculated by pseudo-inverse:

W=(DΩ,H^m)⁺Y

At this point, the network model training is complete.
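The pseudo-inverse solution for W can be sketched with NumPy as follows. Here `A` stands in for the combined matrix (DΩ, H^m); it is filled with random values purely for illustration, and the function name `fit_output_weights` is hypothetical.

```python
import numpy as np

def fit_output_weights(A, Y):
    """Least-squares output weights W = pinv(A) @ Y."""
    return np.linalg.pinv(A) @ Y

rng = np.random.default_rng(0)       # seed chosen only for reproducibility
A = rng.normal(size=(20, 5))         # stand-in for (D Omega, H^m), 20 samples
W_true = rng.normal(size=(5, 2))     # two output columns: normal / fault
Y = A @ W_true                       # noise-free targets for the demonstration
W = fit_output_weights(A, Y)
print(np.allclose(W, W_true))
```

With noise-free targets and a full-column-rank `A`, the recovered weights match the ones used to generate `Y`. With real labels the same call returns the least-squares fit rather than an exact solution.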

B. Online Monitoring Stage:

8) During online monitoring, let the new data be $X_{new} = (x_{new,1}, x_{new,2}, \dots, x_{new,N})^{T} \in ℝ^{N \times M}$, with s-th sample $x_{new,s} = (x_{new,s1}, x_{new,s2}, \dots, x_{new,sM})$, s=1, 2, . . . , N. For the n fuzzy subsystems from the training part, taking the i-th subsystem as an example and using the first-order T-S fuzzy system:

$z_{new, sk}^{i} = f_{k}^{i} (x_{new, s 1}, x_{new, s 2}, \dots, x_{new, sM}) = \sum_{t = 1}^{M} α_{kt}^{i} x_{new, st}$

The value of α_ktⁱ is the value obtained in the training part.

9) For the s-th test sample, the weighted strength of the k-th fuzzy rule of the i-th fuzzy subsystem is as follows:

$ω_{new, sk}^{i} = \frac{τ_{new, sk}^{i}}{\sum_{k = 1}^{K} τ_{new, sk}^{i}}$

Where τ_new,skⁱ represents the strength of the k-th fuzzy rule of the i-th fuzzy subsystem, which is obtained by the following formula:

$τ_{new, sk}^{i} = \prod_{t = 1}^{M} μ_{kt}^{i} (x_{new, st})$

Where μ_ktⁱ(x_new,st) is the Gaussian membership function corresponding to the input x_new,st (t=1, 2, . . . , M), expressed as:

$μ_{kt}^{i} (x_{new, st}) = e^{- {(\frac{x_{new, st} - c_{kt}^{i}}{δ_{kt}^{i}})}^{2}}$

Wherein, c_ktⁱ and δ_ktⁱ are parameters, and their values are the same as the corresponding parameter values of the training part.

10) For all test samples, find the output Z_newⁿ of the n fuzzy subsystems:

$Z_{new}^{n} = (Z_{new,1}, Z_{new,2}, \dots, Z_{new,n}) \in ℝ^{N \times nK}$

The output Z_new,i of the i-th fuzzy subsystem can be expressed as follows:

$Z_{new,i} = (Z_{new,1i}, Z_{new,2i}, \dots, Z_{new,Ni})^{T} \in ℝ^{N \times K}, \quad i = 1, 2, \dots, n$

Taking the s-th test sample among all N samples as an example (s=1, 2, . . . , N), the output vector Z_new,si of the i-th fuzzy subsystem can be expressed as:

$Z_{new,si} = (ω_{new,s1}^{i} z_{new,s1}^{i}, ω_{new,s2}^{i} z_{new,s2}^{i}, \dots, ω_{new,sK}^{i} z_{new,sK}^{i})$

Wherein, z_new,skⁱ and ω_new,skⁱ are obtained in 8) and 9) respectively, k=1, 2, . . . , K.

11) The output Z_newⁿ obtained in 10) is subjected to nonlinear conversion.

Then for the m enhanced node groups, the output matrix of the enhanced layer can be expressed as:

$H_{new}^{m} = (H_{new,1}, H_{new,2}, \dots, H_{new,m}) \in ℝ^{N \times (L_{1} + L_{2} + \dots + L_{m})}$

Where $H_{new,j} = ξ_{j}(Z_{new}^{n} W_{j} + β_{j}) \in ℝ^{N \times L_{j}}$ is the output matrix of the j-th enhanced node group (j=1, 2, . . . , m), and the values of W_j and β_j are those of the network training part.

12) Calculate the defuzzification output of each fuzzy subsystem.

The parameter δ_kcⁱ (c=1, 2, . . . , C) is introduced into the consequent part of each fuzzy rule. For the s-th test sample, the defuzzification output vector of the i-th fuzzy subsystem can be expressed as:

$\begin{matrix} F_{new, si} = (\sum_{k = 1}^{K_{i}} ω_{new, sk}^{i} (\sum_{t = 1}^{M} δ_{k 1}^{i} α_{kt}^{i} x_{new, st}), \dots, \sum_{k = 1}^{K_{i}} ω_{new, sk}^{i} (\sum_{t = 1}^{M} δ_{kC}^{i} α_{kt}^{i} x_{new, st})) \\ = \sum_{t = 1}^{M} α_{kt}^{i} x_{new, st} (ω_{new, s 1}^{i}, ω_{new, s 2}^{i}, \dots, ω_{new, sK}^{i}) (\begin{matrix} δ_{11}^{i} & \dots & δ_{1 C}^{i} \\ ⋮ & ⋱ & ⋮ \\ δ_{K 1}^{i} & \dots & δ_{KC}^{i} \end{matrix}) \end{matrix}$

Then, for all test samples X_new, the output of the i-th fuzzy subsystem is:

$F_{newi} = {(F_{new, 1 i}, F_{new, 2 i}, \dots, F_{new, Ni})}^{T} \overset{Δ}{=} D_{new} Ω_{new}^{i} δ^{i} \in ℝ^{N \times C}$

Wherein $D_{new} = \operatorname{diag}\{\sum_{t = 1}^{M} α_{kt}^{i} x_{new, 1 t}, \dots, \sum_{t = 1}^{M} α_{kt}^{i} x_{new, Nt}\}$, $Ω_{new}^{i} = (\begin{matrix} ω_{new, 11}^{i} & \dots & ω_{new, 1 K}^{i} \\ ⋮ & ⋱ & ⋮ \\ ω_{new, N 1}^{i} & \dots & ω_{new, NK}^{i} \end{matrix})$, $δ^{i} = (\begin{matrix} δ_{11}^{i} & \dots & δ_{1 C}^{i} \\ ⋮ & ⋱ & ⋮ \\ δ_{K 1}^{i} & \dots & δ_{KC}^{i} \end{matrix})$

Thus, the output of n fuzzy subsystems can be obtained:

$F_{new}^{n} = \sum_{i = 1}^{n} F_{newi} = \sum_{i = 1}^{n} D_{new} Ω_{new}^{i} δ^{i} = D_{new} (Ω_{new}^{1}, \dots, Ω_{new}^{n}) (\begin{matrix} δ^{1} \\ ⋮ \\ δ^{n} \end{matrix}) = D_{new} Ω_{new} Δ \in ℝ^{N \times C}$

Wherein, $Ω_{new} = (Ω_{new}^{1}, \dots, Ω_{new}^{n}) \in ℝ^{N \times nK}$ is the matrix containing the weighted strengths ω_new,skⁱ obtained in 9), and $Δ = ((δ^{1})^{T}, \dots, (δ^{n})^{T})^{T} \in ℝ^{nK \times C}$.

13) Calculate the final output value. The weight between the enhanced node groups and the output is $W_{e} \in ℝ^{(L_{1} + L_{2} + \dots + L_{m}) \times C}$, whose value is obtained from the training phase, and the weight between the fuzzy subsystems and the output is 1. Multiply H_new^m obtained in 11) and F_newⁿ obtained in 12) by their respective weights, then sum them to get the final output:

$\hat{Y} = F_{new}^{n} + H_{new}^{m} W_{e} \overset{Δ}{=} (D_{new} Ω_{new}, H_{new}^{m}) W$

The value of W is the parameter matrix obtained in the network training part. The pseudo-inverse is given by:

$(D_{new} Ω_{new}, H_{new}^{m})^{+} = ((D_{new} Ω_{new}, H_{new}^{m})^{T} (D_{new} Ω_{new}, H_{new}^{m}))^{-1} (D_{new} Ω_{new}, H_{new}^{m})^{T}$
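The closed-form expression (AᵀA)⁻¹Aᵀ for the pseudo-inverse holds when the matrix A has full column rank. The sketch below checks this numerically against NumPy's SVD-based `numpy.linalg.pinv`; the matrix `A` is a random stand-in and its size is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)                 # seed chosen only for reproducibility
A = rng.normal(size=(12, 4))                   # tall random matrix, full column rank

pinv_normal_eq = np.linalg.inv(A.T @ A) @ A.T  # (A^T A)^{-1} A^T
pinv_svd = np.linalg.pinv(A)                   # SVD-based Moore-Penrose pseudo-inverse

print(np.allclose(pinv_normal_eq, pinv_svd))
```

When the columns of the matrix are nearly collinear, AᵀA becomes ill-conditioned, and an SVD-based or regularized solver is usually the safer choice in practice.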

14) The output value Ŷ is converted to 0 or 1 according to the one-hot encoding. An output of 0 means that the operation is in a normal state, and an output of 1 means that the operation is faulty, so that fault monitoring is realized.
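One way to perform the conversion in step 14) is to take the index of the larger of the two output columns. The sketch assumes column 0 encodes the normal state and column 1 the fault state; this column order and the function name `decode_state` are assumptions made for illustration.

```python
import numpy as np

def decode_state(Y_hat):
    """Map each row of the (N, 2) output to 0 (normal) or 1 (fault)."""
    return np.argmax(Y_hat, axis=1)

Y_hat = np.array([[0.9, 0.1],
                  [0.2, 0.7],
                  [0.6, 0.5]])
state = decode_state(Y_hat)
print(state)
```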

BENEFICIAL EFFECT

The present invention uses the historical data of the sewage treatment process to train the network model. When training the model, normal process data is labeled 0 and fault period data is labeled 1; the network parameters obtained by training are then used for online fault monitoring.

Compared with the currently popular deep networks, the present invention only needs to increase the number of enhanced nodes and can be reconstructed incrementally without retraining the entire network from scratch. On the premise that the accuracy meets the requirements, the present invention continuously and rapidly adjusts itself as the production process changes, realizes timely online fault monitoring of the sewage treatment process, and obtains good monitoring results.

DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram of the overall structure of the present invention;

FIG. 2 is a structural diagram of i-th fuzzy subsystem of the present invention;

FIG. 3 shows the monitoring result of fault 1;

FIG. 4 shows the monitoring results of fault 2;

FIG. 5 shows the monitoring results of fault 3;

FIG. 6 shows the monitoring results of fault 4;

FIG. 7 shows the logic block diagram of the hardware system on which this method is based.

PREFERRED EMBODIMENT

In order to solve the above-mentioned problems, a fault monitoring method for the sewage treatment process based on a fuzzy width adaptive learning model is proposed, which runs on online monitoring equipment. The whole device includes an input module, an information processing module, a console module, and an output visualization module. The proposed method is loaded into the information processing module, the network monitoring model is established with process data retained from actual industrial operation, and the established model is saved for online fault monitoring. In actual online monitoring of industrial processes, the real-time process variables collected by the plant's data sensors are first connected to the input module as the input of the monitoring equipment. A previously trained model is then selected through the console for monitoring, and the monitoring results are displayed in real time through the visualization module, so that on-site staff can take timely measures based on the visualized results and reduce the economic loss caused by process failures.

The sewage treatment process is extremely complex, involving not only a variety of physical and chemical reactions but also biochemical reactions. In addition, it is subject to various uncertain factors, such as changes in influent flow, water quality and load, which pose great challenges for establishing a sewage treatment monitoring model. The present invention uses the “Benchmark Simulation Model 1” (BSM1) developed by the International Water Association (IWA) to simulate the actual sewage treatment process in real time. The model consists of five reaction tanks (5999 m³) and a secondary settling tank (6000 m³); in addition, it has three aeration tanks. The aeration tank has 10 layers, is 4 meters deep, and covers an area of 1500 m². The reaction process includes internal and external reflux. The average sewage treatment flow rate is 20 000 m³/d, and the chemical oxygen demand is 300 mg/l.

This experiment takes BSM1 sewage data as the research object. The sampling interval is 15 minutes and the sampling period is 14 days, that is, 96 sampling points per day and 1344 sampling points in total. Sixteen variables are selected to monitor the operating status of the process, as shown in Table 1. One batch of normal data and 4 batches of fault data are used as training data to train the network offline, and 4 batches of fault data are used as test data to verify the effectiveness of the method. Two types of faults are set under rainy conditions: single faults and compound faults. In both cases the fault occurs on the eighth and ninth days. The detailed fault information is shown in Table 2.
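The sampling figures can be checked with a few lines of arithmetic. The sketch assumes sampling points are numbered from 1, which is consistent with the fault start and end times 673 and 864 listed in Table 2; the helper names are hypothetical.

```python
MINUTES_PER_DAY = 24 * 60
SAMPLING_INTERVAL_MIN = 15
DAYS = 14

samples_per_day = MINUTES_PER_DAY // SAMPLING_INTERVAL_MIN   # 96
total_samples = samples_per_day * DAYS                        # 1344

def first_sample_of_day(day):
    """1-based index of the first sampling point of the given day."""
    return (day - 1) * samples_per_day + 1

def last_sample_of_day(day):
    """1-based index of the last sampling point of the given day."""
    return day * samples_per_day

print(samples_per_day, total_samples)
print(first_sample_of_day(8), last_sample_of_day(9))
```

Under this numbering, the eighth day begins at sampling point 673 and the ninth day ends at sampling point 864.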

TABLE 1

Sewage variables

Number	Name of Variables	Unit
1	Discharge flux	m³ · d⁻¹
2	Discharge SI concentration	g COD · m⁻³
3	Discharge SS concentration	g COD · m⁻³
4	Discharge XI concentration	g COD · m⁻³
5	Discharge XS concentration	g COD · m⁻³
6	Discharge XBH concentration	g COD · m⁻³
7	Discharge XBA concentration	g COD · m⁻³
8	Discharge XP concentration	g COD · m⁻³
9	Discharge SO concentration	g (−COD) · m⁻³
10	Discharge SNO concentration	g N · m⁻³
11	Discharge SNH concentration	g N · m⁻³
12	Discharge SND concentration	g N · m⁻³
13	Discharge XND concentration	g N · m⁻³
14	Discharge SALK concentration	mol HCO₃⁻ · m⁻³
15	Discharge TSS concentration	g SS · m⁻³
16	Discharge Kjeldahl N concentration	g N · m⁻³

TABLE 2

Failure data settings

Failure code	Working condition	Amplitude	Failure type	Starting time	Ending time
1	Rainy	Muh = 3.5	Single	673	864
2	Rainy	Muh = 1	Single	673	864
3	Rainy	Muh = 2, BH = 0.45	Compound	673	864
4	Rainy	Muh = 2, BH = 0.6	Compound	673	864

The application of the present invention to fault monitoring of the above-mentioned BSM1 sewage treatment process is described in detail as follows:

A. Network Training Stage:

Step 1: Collect one batch of operating data under normal working conditions and 4 batches of fault data through the BSM1 platform as the training data set $X \in ℝ^{N \times M}$. Each batch has 1344 samples of 16 variables. The five batches together form the training set X (6720×16). Label the data: the label value is 0 at normal times and 1 at fault times, which gives Y (6720×2).
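The label matrix of Step 1 could be assembled as sketched below. Several details are assumptions made for illustration only: the two columns of Y are taken as a one-hot encoding (column 0 normal, column 1 fault), the fault window 673 to 864 is treated as 1-based and inclusive, and all four fault batches share that window. The function name `one_hot_labels` is hypothetical.

```python
import numpy as np

def one_hot_labels(n_samples, fault_start=None, fault_end=None):
    """Return an (n_samples, 2) one-hot label matrix for one batch."""
    state = np.zeros(n_samples, dtype=int)
    if fault_start is not None:
        # 1-based inclusive window (assumed), converted to a 0-based slice.
        state[fault_start - 1:fault_end] = 1
    return np.eye(2)[state]

normal_batch = one_hot_labels(1344)
fault_batches = [one_hot_labels(1344, 673, 864) for _ in range(4)]
Y = np.vstack([normal_batch] + fault_batches)
print(Y.shape)
```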

Step 2: The numbers of fuzzy subsystems, enhanced node groups, and fuzzy rules of each fuzzy subsystem are determined according to the actual situation. The model in this embodiment contains n=8 fuzzy subsystems and m=10 enhanced node groups, each group containing 10 neurons. Each fuzzy subsystem has 5 fuzzy rules, and the first-order T-S fuzzy model is adopted. Taking the i-th fuzzy subsystem as an example (i=1, 2, . . . , n), when the input is the s-th training sample, the output of the k-th fuzzy rule of the i-th fuzzy subsystem is:

$z_{sk}^{i} = f_{k}^{i} (x_{s 1}, x_{s 2}, \dots, x_{sM}) = \sum_{t = 1}^{M} α_{kt}^{i} x_{st}$

Wherein, f_kⁱ is a polynomial of the first degree, α_ktⁱ is a parameter that obeys the uniform distribution on [0,1], t=1, 2, . . . , M, where M is the number of variables, and k=1, 2, . . . , K, where K is the number of fuzzy rules.
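For the sizes quoted in this embodiment (n=8, 5 rules per subsystem, m=10 groups of 10 neurons, 6720 training samples, 2 outputs), the matrix dimensions follow directly from the formulas of the training stage. The bookkeeping below is only a consistency check; the variable names are hypothetical.

```python
n_subsystems = 8
rules_per_subsystem = 5
n_groups = 10
neurons_per_group = 10
n_samples = 6720
n_outputs = 2

fuzzy_width = n_subsystems * rules_per_subsystem      # columns of Z^n and of D*Omega
enhancement_width = n_groups * neurons_per_group      # columns of H^m

shapes = {
    "Z_n": (n_samples, fuzzy_width),
    "H_m": (n_samples, enhancement_width),
    "Delta": (fuzzy_width, n_outputs),
    "W_e": (enhancement_width, n_outputs),
    "W": (fuzzy_width + enhancement_width, n_outputs),
}
for name, shape in shapes.items():
    print(name, shape)
```

The combined regressor matrix therefore has 140 columns, so the pseudo-inverse is taken of a 6720×140 matrix.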

Step 3: For the s-th training sample, calculate the weighted strength ω_skⁱ of the k-th fuzzy rule of the i-th fuzzy subsystem:

$ω_{sk}^{i} = \frac{τ_{sk}^{i}}{\sum_{k = 1}^{k_{i}} τ_{sk}^{i}}$

$τ_{sk}^{i} = \prod_{t = 1}^{M} μ_{kt}^{i} (x_{st})$

Where μ_ktⁱ(x_st) is the Gaussian membership function corresponding to the input x_st (t=1, 2, . . . , M), expressed as:

$μ_{kt}^{i} (x_{st}) = e^{- {(\frac{x_{st} - c_{kt}^{i}}{δ_{kt}^{i}})}^{2}}$

Wherein c_ktⁱ and δ_ktⁱ are parameters: the value of c_ktⁱ is the cluster center obtained by C-means clustering, and δ_ktⁱ=1.

Step 4: Calculate the output Zⁿ of the n fuzzy subsystems:

$Z^{n} = (Z_{1}, Z_{2}, \dots, Z_{n}) \in ℝ^{N \times (k_{1} + k_{2} + \dots + k_{n})}$

Wherein, Z₁, Z₂, . . . , Z_n respectively correspond to the outputs of the n subsystems, and k₁, k₂, . . . , k_n respectively correspond to the numbers of fuzzy rules of the n fuzzy subsystems.

Each Z_i is formed from the products ω_skⁱ z_skⁱ, where z_skⁱ and ω_skⁱ are obtained in step 2 and step 3 respectively, and k=1, 2, . . . , K.

Step 5: Input the output Zⁿ obtained in step 4 into the m enhanced node groups mentioned in step 2 for nonlinear conversion. The j-th enhanced node group has L_j neurons (j=1, 2, . . . , m). For the m enhanced node groups, the output matrix can be expressed as:

$H^{m} = (H_{1}, H_{2}, \dots, H_{m}) \in ℝ^{N \times (L_{1} + L_{2} + \dots + L_{m})}$

Wherein, L₁, L₂, . . . , L_m respectively represent the numbers of neurons contained in the m enhanced node groups, $H_{j} = ξ_{j}(Z^{n} W_{j} + β_{j}) \in ℝ^{N \times L_{j}}$ is the output matrix of the j-th enhanced node group, W_j represents the connection weights between the fuzzy subsystem outputs and the j-th enhanced node group, β_j represents the corresponding bias, the values of W_j and β_j are randomly generated from [0,1], and ξ_j is the activation function, ξ_j=tanh(·).

Step 6: Calculate the defuzzification output of each fuzzy subsystem. For the s-th training sample x_s, the defuzzification output of the i-th fuzzy subsystem can be expressed as:

$F_{si} = \sum_{k = 1}^{k_{i}} ω_{sk}^{i} z_{sk}^{i} = \sum_{k = 1}^{k_{i}} (ω_{sk}^{i} \sum_{t = 1}^{M} α_{kt}^{i} x_{st})$

$\begin{matrix} F_{si} = (\sum_{k = 1}^{k_{i}} ω_{sk}^{i} (\sum_{t = 1}^{M} δ_{k 1}^{i} α_{kt}^{i} x_{st}), \dots, \sum_{k = 1}^{K} ω_{sk}^{i} (\sum_{t = 1}^{M} δ_{kC}^{i} α_{kt}^{i} x_{st})) \\ = \sum_{t = 1}^{M} α_{kt}^{i} x_{st} (ω_{s 1}^{i}, ω_{s 2}^{i}, \dots, ω_{sK}^{i}) (\begin{matrix} δ_{11}^{i} & \dots & δ_{1 C}^{i} \\ ⋮ & ⋱ & ⋮ \\ δ_{K 1}^{i} & \dots & δ_{KC}^{i} \end{matrix}) \end{matrix}$

Where s=1, 2, . . . , N, then, for all training samples X, the output of the i-th fuzzy subsystem is:

$F_{i} = {(F_{1 i}, F_{2 i}, \dots, F_{Ni})}^{T} \overset{Δ}{=} D Ω^{i} δ^{i} \in ℝ^{N \times C}$

Where $D = \operatorname{diag}\{\sum_{t = 1}^{M} α_{kt}^{i} x_{1 t}, \dots, \sum_{t = 1}^{M} α_{kt}^{i} x_{Nt}\}$, $Ω^{i} = (\begin{matrix} ω_{11}^{i} & \dots & ω_{1 K}^{i} \\ ⋮ & ⋱ & ⋮ \\ ω_{N 1}^{i} & \dots & ω_{NK}^{i} \end{matrix})$, $δ^{i} = (\begin{matrix} δ_{11}^{i} & \dots & δ_{1 C}^{i} \\ ⋮ & ⋱ & ⋮ \\ δ_{K 1}^{i} & \dots & δ_{KC}^{i} \end{matrix})$

Thus, the output of n fuzzy subsystems can be obtained:

$F^{n} = \sum_{i = 1}^{n} F_{i} = \sum_{i = 1}^{n} D Ω^{i} δ^{i} = D (Ω^{1}, Ω^{2}, \dots, Ω^{n}) (\begin{matrix} δ^{1} \\ ⋮ \\ δ^{n} \end{matrix}) = D ΩΔ \in ℝ^{N \times C}$

Wherein, $Ω = (Ω^{1}, Ω^{2}, \dots, Ω^{n}) \in ℝ^{N \times nK}$ is the matrix containing the weighted strengths ω_skⁱ obtained in step 3, and $Δ = ((δ^{1})^{T}, \dots, (δ^{n})^{T})^{T} \in ℝ^{nK \times C}$.

Step 7: Calculate the final output. The weight between the fuzzy subsystems and the output is set to 1, and the weight between the enhanced node groups and the output is set to $W_{e} \in ℝ^{(L_{1} + L_{2} + \dots + L_{m}) \times C}$. Multiply H^m obtained in step 5 and Fⁿ obtained in step 6 by their respective weights, then sum the results to obtain the final output:

$\hat{Y} = F^{n} + H^{m} W_{e} = D ΩΔ + H^{m} W_{e} = (D Ω, H^{m}) (\begin{matrix} Δ \\ W_{e} \end{matrix}) \overset{Δ}{=} (D Ω, H^{m}) W$

For a given training target Y, the matrix W can be calculated by pseudo-inverse:

W=(DΩ,H^m)⁺Y

At this point, the network model training is complete.

B. Online Monitoring Stage:

Step 8: During online monitoring, let the new data be $X_{new} = (x_{new,1}, x_{new,2}, \dots, x_{new,N})^{T} \in ℝ^{N \times M}$, with s-th sample $x_{new,s} = (x_{new,s1}, x_{new,s2}, \dots, x_{new,sM})$, s=1, 2, . . . , N. For the n fuzzy subsystems from the training part, taking the i-th subsystem as an example and using the first-order T-S fuzzy system:

$z_{new, sk}^{i} = f_{k}^{i} (x_{new, s 1}, x_{new, s 2}, \dots, x_{new, sM}) = \sum_{t = 1}^{M} α_{kt}^{i} x_{new, st}$

The value of α_ktⁱ is the value obtained in the training part.

Step 9: For the s-th test sample, the weighted strength of the k-th fuzzy rule of the i-th fuzzy subsystem is as follows:

$ω_{new, sk}^{i} = \frac{τ_{new, sk}^{i}}{\sum_{k = 1}^{K} τ_{new, sk}^{i}}$

Where τ_new,skⁱ represents the strength of the k-th fuzzy rule of the i-th fuzzy subsystem, which is obtained by the following formula:

$τ_{new, sk}^{i} = \prod_{t = 1}^{M} μ_{kt}^{i} (x_{new, st})$

Where μ_ktⁱ(x_new,st) is the Gaussian membership function corresponding to the input x_new,st (t=1, 2, . . . , M), expressed as:

$μ_{kt}^{i} (x_{new, st}) = e^{- {(\frac{x_{new, st} - c_{kt}^{i}}{δ_{kt}^{i}})}^{2}}$

Wherein, c_ktⁱ and δ_ktⁱ are parameters, and their values are the same as the corresponding parameter values of the training part.

Step 10: For all test samples, find the output Z_newⁿ of the n fuzzy subsystems:

$Z_{new}^{n} = (Z_{new,1}, Z_{new,2}, \dots, Z_{new,n}) \in ℝ^{N \times nK}$

The output Z_new,i of the i-th fuzzy subsystem can be expressed as follows:

$Z_{new,i} = (Z_{new,1i}, Z_{new,2i}, \dots, Z_{new,Ni})^{T} \in ℝ^{N \times K}, \quad i = 1, 2, \dots, n$

The s-th row Z_new,si (s=1, 2, . . . , N) is formed from the products ω_new,skⁱ z_new,skⁱ, where z_new,skⁱ and ω_new,skⁱ are obtained in step 8 and step 9 respectively, k=1, 2, . . . , K.

Step 11: The output Z_newⁿ obtained in step 10 is subjected to nonlinear conversion.

Then for the m enhanced node groups, the output matrix of the enhanced layer can be expressed as:

$H_{new}^{m} = (H_{new,1}, H_{new,2}, \dots, H_{new,m}) \in ℝ^{N \times (L_{1} + L_{2} + \dots + L_{m})}$

Step 12: Calculate the defuzzification output of each fuzzy subsystem.

The parameter δ_kcⁱ (c=1, 2, . . . , C) is introduced into the consequent part of each fuzzy rule. For the s-th test sample, the defuzzification output vector of the i-th fuzzy subsystem can be expressed as:

$\begin{matrix} F_{new, si} = (\sum_{k = 1}^{K_{i}} ω_{new, sk}^{i} (\sum_{t = 1}^{M} δ_{k 1}^{i} α_{kt}^{i} x_{new, st}), \dots, \sum_{k = 1}^{K_{i}} ω_{new, sk}^{i} (\sum_{t = 1}^{M} δ_{kC}^{i} α_{kt}^{i} x_{new, st})) \\ = \sum_{t = 1}^{M} α_{kt}^{i} x_{new, st} (ω_{new, s 1}^{i}, ω_{new, s 2}^{i}, \dots, ω_{new, sK}^{i}) (\begin{matrix} δ_{11}^{i} & \dots & δ_{1 C}^{i} \\ ⋮ & ⋱ & ⋮ \\ δ_{K 1}^{i} & \dots & δ_{KC}^{i} \end{matrix}) \end{matrix}$

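Then, for all test samples X_new, the output of the i-th fuzzy subsystem is:

$F_{newi} = {(F_{new, 1 i}, F_{new, 2 i}, \dots, F_{new, Ni})}^{T} \overset{Δ}{=} D_{new} Ω_{new}^{i} δ^{i} \in ℝ^{N \times C}$

Thus, the output of the n fuzzy subsystems can be obtained:

$F_{new}^{n} = \sum_{i = 1}^{n} F_{newi} = \sum_{i = 1}^{n} D_{new} Ω_{new}^{i} δ^{i} = D_{new} (Ω_{new}^{1}, \dots, Ω_{new}^{n}) (\begin{matrix} δ^{1} \\ ⋮ \\ δ^{n} \end{matrix}) = D_{new} Ω_{new} Δ \in ℝ^{N \times C}$

Wherein, $Ω_{new} = (Ω_{new}^{1}, \dots, Ω_{new}^{n}) \in ℝ^{N \times nK}$ is the matrix containing the weighted strengths ω_new,skⁱ obtained in step 9, and $Δ = ((δ^{1})^{T}, \dots, (δ^{n})^{T})^{T} \in ℝ^{nK \times C}$.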

Step 13: Calculate the final output value. The weight between the enhanced node groups and the output is $W_{e} \in ℝ^{(L_{1} + L_{2} + \dots + L_{m}) \times C}$, whose value is obtained from the training phase, and the weight between the fuzzy subsystems and the output is 1. Multiply H_new^m obtained in step 11 and F_newⁿ obtained in step 12 by their respective weights, then sum them to get the final output:

$\hat{Y} = F_{new}^{n} + H_{new}^{m} W_{e} \overset{Δ}{=} (D_{new} Ω_{new}, H_{new}^{m}) W$

The value of W is the parameter matrix obtained in the network training part. The pseudo-inverse is given by:

$(D_{new} Ω_{new}, H_{new}^{m})^{+} = ((D_{new} Ω_{new}, H_{new}^{m})^{T} (D_{new} Ω_{new}, H_{new}^{m}))^{-1} (D_{new} Ω_{new}, H_{new}^{m})^{T}$

Step 14: The output value Ŷ is converted to 0 or 1 according to the one-hot encoding. An output of 0 means that the operation is in a normal state, and an output of 1 means that the operation is faulty, so that fault monitoring is realized.
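The training and monitoring steps can be put together in a compact end-to-end sketch on synthetic data. This is an illustration under stated assumptions, not the patented implementation: the fuzzy part of the regressor matrix is taken to be the stacked weighted rule outputs Zⁿ (one reading of the DΩ term), membership centers are drawn from random training samples instead of C-means clustering, the model sizes are toy values, and all function and variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)   # seed chosen only for reproducibility

def fuzzy_layer(X, alpha, centers, width=1.0):
    """Weighted rule outputs omega * z of one subsystem, shape (N, K)."""
    z = X @ alpha.T                                       # first-order rule outputs
    diff = (X[:, None, :] - centers[None, :, :]) / width
    tau = np.exp(-diff ** 2).prod(axis=2)                 # firing strengths
    omega = tau / tau.sum(axis=1, keepdims=True)          # normalized strengths
    return omega * z

def features(X, model):
    """Regressor matrix: fuzzy outputs followed by enhancement outputs."""
    Z = np.hstack([fuzzy_layer(X, a, c) for a, c in model["fuzzy"]])
    H = np.hstack([np.tanh(Z @ W + b) for W, b in model["enhance"]])
    return np.hstack([Z, H])

def train(X, Y, n=2, K=3, m=2, L=4):
    M = X.shape[1]
    fuzzy = [(rng.uniform(0, 1, (K, M)),                      # alpha
              X[rng.choice(len(X), K, replace=False)])        # centers (assumed choice)
             for _ in range(n)]
    enhance = [(rng.uniform(0, 1, (n * K, L)), rng.uniform(0, 1, L))
               for _ in range(m)]
    model = {"fuzzy": fuzzy, "enhance": enhance}
    model["W"] = np.linalg.pinv(features(X, model)) @ Y       # pseudo-inverse solve
    return model

def predict(X, model):
    """Return 0 (normal) or 1 (fault) for each sample."""
    return np.argmax(features(X, model) @ model["W"], axis=1)

# Synthetic two-class data: 60 "normal" and 40 "fault" samples, 3 variables.
X = np.vstack([rng.normal(0.0, 0.5, size=(60, 3)),
               rng.normal(1.5, 0.5, size=(40, 3))])
state = np.r_[np.zeros(60, dtype=int), np.ones(40, dtype=int)]
Y = np.eye(2)[state]

model = train(X, Y)
pred = predict(X, model)
print("training agreement:", np.mean(pred == state))
```

Only `model["W"]` is fitted; the coefficients, centers, and enhancement weights are fixed after initialization, which is why training reduces to a single pseudo-inverse.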

The above steps are the specific application of the method of the present invention to fault monitoring of the sewage treatment process on the BSM1 platform. In the experiment, two types of faults under rainy weather conditions are used, namely single faults and compound faults, and in both cases the fault occurs on the eighth and ninth days. The experimental results are shown in FIG. 3 to FIG. 6. The abscissa of each figure represents the sampling point, and the ordinate is the output result, a judgment of 0 or 1, where 0 means normal operation and 1 means malfunction. The performance indicators are the false alarm rate (FAR), the missed alarm rate (MAR) and the accuracy rate (ACC), among which:

FAR = number of false alarms / total normal time × 100%
MAR = number of missed alarms / total failure time × 100%
ACC = (number of false alarms + number of missed alarms) / total sampling time × 100%
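The three indicators can be computed from the true and predicted 0/1 states as sketched below. ACC is implemented exactly as defined above, that is, as the share of misclassified samples, so lower values are better. The function name `alarm_rates` and the toy sequences are hypothetical.

```python
import numpy as np

def alarm_rates(y_true, y_pred):
    """Return (FAR, MAR, ACC) in percent, following the definitions in the text."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    false_alarms = np.sum((y_true == 0) & (y_pred == 1))
    missed_alarms = np.sum((y_true == 1) & (y_pred == 0))
    far = 100.0 * false_alarms / np.sum(y_true == 0)
    mar = 100.0 * missed_alarms / np.sum(y_true == 1)
    acc = 100.0 * (false_alarms + missed_alarms) / y_true.size
    return far, mar, acc

y_true = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
y_pred = [0, 1, 0, 0, 0, 0, 1, 0, 1, 1]
far, mar, acc = alarm_rates(y_true, y_pred)
print(far, mar, acc)
```

In this toy sequence there is one false alarm among six normal samples and one missed alarm among four fault samples.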

Considering the false alarm rate (FAR) or the missed alarm rate (MAR) alone cannot fully reflect monitoring performance, so FAR and MAR should be considered together. When FAR and MAR are both low, that is, when ACC as defined above is low, the monitoring performance is considered better.

In view of the frequent rainy weather in summer in North China, monitoring experiments are conducted specifically under rainy conditions. The experimental faults are divided into single faults and compound faults. For single faults, each subsystem has 8 fuzzy rules and there are 10 enhanced node groups. For fault 1, the fault amplitude is muh=3.5, the start time is 673, and the end time is 864. The monitoring results are shown in FIG. 3. There are no false alarms and no missed alarms, so the monitoring performance is good; the network training time is 4.85 s, and the occurrence of the fault is detected in time. For the same fault type, the fault amplitude is changed to muh=1 with the same fault occurrence time (fault 2). The monitoring results are shown in FIG. 4. The network training time is 4.67 s, and the fault is detected immediately at its onset, which indicates good monitoring performance.

Subsequently, the fault type under rainy conditions is changed to a compound fault for verification. The system has a total of 10 fuzzy subsystems, each containing 8 fuzzy rules, and 10 enhanced node groups. For fault 3, the amplitude is muh=2, BH=0.45, and the false alarm rate is 0. Around sampling point 720 there are 5 missed alarms, giving a missed alarm rate of 2.62%. Taking false alarms and missed alarms together, ACC is 0.37%, so the overall monitoring effect is good; the time used is 7.85 s, and the occurrence of the fault is detected in time. For fault 4, the fault amplitude is changed to muh=2 and BH=0.6; the monitoring effect does not change, the detection performance remains good, and the network training time is 8.07 s. Judging from the monitoring results for the above 4 faults, compared with deep neural networks the method of the present invention requires less network training time while achieving a satisfactory monitoring effect, and it greatly reduces the consumption of hardware resources in the sewage treatment plant: ordinary computers can do the job, which saves costs and enables timely online monitoring of faults in the sewage treatment process.
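As a consistency check on the percentages reported for fault 3: if the 5 misclassified samples fall inside the fault period, and the fault period is assumed to contain 191 samples (an assumption inferred from the start and end times 673 and 864, not stated in the text), the indicators work out as follows:

$MAR = \frac{5}{191} \times 100\% \approx 2.62\%, \qquad ACC = \frac{0 + 5}{1344} \times 100\% \approx 0.37\%$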

	Number	Date	Country
Parent	PCT/CN2019/124479	Dec 2019	US
Child	17508911		US
