This application claims priority to Chinese Patent Application No. 201410602859.X, filed on Nov. 2, 2014, entitled “a Soft-Computing Method for the Effluent Total Phosphorus Based on a Self-Organizing PSO-RBF Neural Network,” which is hereby incorporated by reference in its entirety.
Implementations here related to control environment engineering, more specifically related to methods and systems for determining effluent total phosphorus (TP) concentrations in the urban wastewater treatment process (WWTP).
Biogeochemical characteristics of phosphorus play a significant role in eutrophication processes. Phosphorus may accumulate in lake sediments during heavy loading periods and release from sediments into the overlying water after the external loading is reduced. The released phosphorus sustains the eutrophication processes and cycles between overlying water and sediments through algal growth, organic deposition, decomposition and release. Therefore, phosphorus is generally recognized as the limiting factor in the process of eutrophication. Restoration efforts to control phosphorus from WWTP into rivers are considered to be important strategies for decreasing cyanobacterial risks in the environment.
To reduce levels of phosphate, some design principles and various mechanisms are recently adopted to produce low effluent TP concentrations in urban WWTP. The effluent TP concentration is an index of water qualities in the urban WWTP. However, using conventional technologies, it is difficult to timely estimate the effluent TP concentration under closed loop control. The timely and/or online detection technology of effluent TP concentrations is a bottleneck for the control of the urban WWTP. Moreover, the real-time information of effluent TP concentrations can enhance the quality monitoring level and alleviate the current situation of wastewater to strengthen the whole management of WWTP. Therefore, the timely detection of effluent TP concentration owns both great economic benefit and environmental benefit.
Methods for monitoring the effluent TP concentration may include: spectrophotometry method, gas chromatography method, liquid chromatography method, electrode method, and mechanism model. However, the spectrophotometry method, gas chromatography method, liquid chromatography method and electrode method rely upon previously collected data analysis of primary variables. Some of the variables, such as gas chromatography method, require more than 30 minutes to obtain. This makes these approaches inadequate for real-time and/or online monitoring. The mechanism model studies the phosphorus dynamics to obtain the effluent TP concentration online based on the biogeochemical characteristics of phosphorus. However, significant errors may be incurred in the measurement of effluent TP concentrations. Moreover, because of the different conditions of every urban WWTP, a common model is difficult to be determined. Thus, technologies for timely monitoring effluent TP concentrations are not well developed.
Methods and systems are designed for effluent TP concentrations based on a PSO-SORBF neural network in various implementations. In various implements, the inputs are those variables that are easy to measure and the outputs are estimates of the effluent TP concentration. Since the input-output relationship is encoded in the data used to calibrate the model, a method is used to reconstruct it and then to estimate the output variables. In general, the procedure of soft-computing method comprise three parts: data acquisition, data pre-processing and model design. For various implementations, an experimental hardware is set up. The historical process data are routinely acquired and stored in the data acquisition system. The data may be easily retrieved in the method. The variables whose data are easy to measure by the instruments comprise: influent TP, oxidation-reduction potential (ORP) in the anaerobic tank, dissolved oxygen (DO) concentration in the aerobic tank, temperature in the aerobic tank, total suspended solids (TSS) in the aerobic tank, effluent pH, chemical oxygen demand (COD) concentration in the aerobic tank and total nutrients (TN) concentration in the aerobic tank. Then, data pre-processing and model design are developed to predict the effluent TP concentrations.
Various implementations adopts the following technical scheme and implementation steps:
A soft-computing method for the effluent TP concentration based on a PSO-SORBF neural network, its characteristic and steps include the following steps: (1) Selecting input variables, (2) Initializing the PSO-SORBF neural network, (3) training the PSO-SORBF neural network, and (4) setting the PSO-SORBF neural network.
Remarkable characteristics of the data acquired in urban WWTP are redundancy and possibly insignificance. And the choice of the input variables that influence the model output is a crucial stage. Therefore, it is necessary to select the suitable input variables and prepare their data before using the soft-computing method. Moreover, variable selection comprise choosing those easy to measure variables that are most informative for the process being modelled, as well as those that provide the highest generalization ability. In various implementations, the partial least squares (PLS) method is used to extract the input variables for the soft-computing method.
In various implementations, a history data set {X, y} is used for the variable selection. Since the variables acquired from experimental hardware are influent TP, ORP, DO, temperature, TSS, effluent pH, COD and TN. X is an n×8 process variable matrix, and y is the dependent n×1 variable vector. The PLS method can model both outer and inner relations between X and y. For the PLS method, X and y may be described as:
where T, P and E are the score matrix, loading matrix and residual matrix of X, respectively. U, Q and F are the score matrix, loading matrix and residual matrix of y. ti, pi, ui and qi are the vectors of T, P, U and Q. In addition, the inner relationship between X and y is shown as follow:
ûi=biti,
b
i
=u
i
T
t
i
/t
i
T
t
l , (2)
where i=1, 2, . . . ,8, bi is the regression coefficients between the ti from X and ui from y. Then, the cross-validation values for the components in X and y are described as:
if Ri<ξ, ξ∈(0, 0.1), the ith component is the right input variable for the soft-computing model. Based on the PLS method, the selected input variables are influent TP, ORP, DO, T, TSS and effluent pH in various implementations.
The initial structure of PSO-SORBF neural network comprise three layers: input layer, hidden layer and output layer. There are 6 neurons in the input layer, K neurons in the hidden layer and 1 neuron in the output layer, K>2 is a positive integer. The number of training samples is T. The input vector of PSO-SORBF neural network is x(t)=[x1(t), x2(t), x3(t), x4(t), x5(t), x6(t)] at time t. x1(t) is the value of influent TP, x2(t) is the value of ORP, x3(t) is the value of DO, x4(t) is the value of temperature, x5(t) is the value of TSS, and x6(t) is the value of effluent pH at time t respectively. y(t) is the output of PSO-SORBF neural network, and yd(t) is the real value of effluent TP concentration at time t respectively. The output of PSO-SORBF neural network may be described:
where wk is the output weight between the kth hidden neuron and the output neuron, k=1, 2, . . . , K, K is the number of hidden neurons, and φk is the RBF of kth hidden neuron which is usually defined by a normalized Gaussian function:
φk(x(t))=e(−∥x(t)−μ
μk=[μk,1, μk,2, . . . , μk,6] denotes the center vector of the kth hidden neuron, σk is the width of the kth hidden neuron, ∥x(t)−μk(t)∥ is the Euclidean distance between x(t) and μk(t).
{circle around (1)} Initialize the acceleration constants c1 and c2, c1∈(0, 1), c2∈(0, 1), and the balance factor α∈[0, 1]. During the particle initialization stage, let the position of the ith particle in the searching space be represented as:
ai=[ui,1, σi,1, wi,1, μi,2, σi,2, wi,2 . . . μi,K, σi,K, wi,K], (6)
where ai is the position of ith particle, i=1, 2, . . . , s, and s is the total number of particles, s>2 is a positive integer. μi,k=[μi,k,1, μi,k,2, . . . , μi,k,6], σi,k, wi,k are the center, width and output weight of the kth hidden neuron in the ith particle, and the initial values are ∥μi,k∥<1, σi,k∈(0, 1), wi,k∈(0, 1). Ki is the number of hidden neurons in the ith particle. Simultaneously, initialize the velocity of particle:
vi=[vi,1, vi,2, . . . vi,D
where vi is velocity of ith particle, Di is the dimension of the ith particle, and Di=3Ki.
{circle around (2)} From the input of neural network x(t) and the dimensions Di of each particle, the fitness value of each particle may be calculated:
i=1, 2, . . . , s, Ki(t) is the number of hidden neurons in the ith particle at time t, T is the number of the training samples.
{circle around (3)} Calculate the inertia weight of each particle:
ωi(t)=γ(t)Ai(t), (10)
where ωi(t) is the inertia weight of the ith particle at time t, and
γ(t)=(C−S(t)/1000)−t,
S(t)=fmin(a(t))/fmax(a(t)),
A
i(t)=f(g(t))/f(ai(t)), (11)
C is a constant, and C∈[1, 5], fmin(a(t)), fmax(a(t)) are the minimum fitness value and the maximum fitness value at time t, and g(t)=[g1(t), g2(t), . . . , gD(t)] is the global best position, fmin(a(t)), fmax(a(t)) and g(t) may be expressed as:
where pi(t)=[pi,1(t), pi,2(t), . . . , pi,D(t)] is the best position of the ith particle:
{circle around (4)} Update the position and velocity of each particle:
where r1 and r2 are the coefficient of the particle and global best position respectively, r1∈[0, 1] and r2∈[0, 1].
{circle around (5)} Search the best number of hidden neurons Kbest according to the global best position g(t), and update the number of hidden neurons in the particles:
{circle around (6)} Import the training sample x(t+1), and repeat the steps {circle around (2)}-{circle around (5)}, then, stop the training process after all of the training samples are imported to the neural network.
(4) The Testing Samples are then Set to the Trained PSO-SORBF Neural Network.
The outputs of PSO-SORBF neural network is the predicting values of effluent TP concentration. Moreover, the program of this soft-computing method has been designed based on the former analysis. The program environment of the proposed soft-computing method comprise a Windows 8 64-bit operating system, a clock speed of 2.6 GHz and 4 GB of RAM. And the program is based on the Matlab 2010 under the operating system.
In some implementations, in order to detect the effluent TP concentration online and with acceptable accuracy, a method is developed in various implementations. The results demonstrate that the effluent TP trends in WWTP may be predicted with acceptable accuracy using the influent TP, ORP, DO, temperature, TSS, and effluent pH data as input variables. This soft-computing method can predict the effluent TP concentration with acceptable accuracy and solve the problem that the effluent TP concentration is difficult to be measured online.
This method is based on the PSO-SORBF neural network in various implementations, which is able to optimize both the parameters and the network size during the learning process simultaneously. The advantages of the proposed PSO-SORBF neural network are that it can simplify and accelerate the structure optimization process of the RBF neural network, and can predict the effluent TP concentration accurately. Moreover, the predicting performance shows that the PSO-SORBF neural network-based soft-computing method can match system nonlinear dynamics. Therefore, this soft-computing method performs well in the whole operating space.
Various implementations utilizes six input variables in this soft-computing method to predict the effluent TP concentration. In fact, it is in the scope of various implementations that any of the variables: the influent TP, ORP, DO, temperature, TSS, effluent pH, COD and TN, are used to predict the effluent TP concentration. Moreover, this soft-computing method is also able to predict the others variables in urban WWTP.
The detailed description is described with reference to the accompanying figures.
Various implementations of methods and systems are developed to predict the effluent TP concentration based on a PSO-SORBF neural network in various implementations. For the implementations, inputs of the neural network are variables that are easy to measure and outputs of the neural network are estimates of the effluent TP concentration. In general, the procedure of soft-computing method comprises three parts: data acquisition, data pre-processing and model design. For various implementations, an experimental hardware is set up as shown in
Various implementations adopts the following technical scheme and implementation steps for the effluent TP concentration based on a PSO-SORBF neural network. The characteristic and steps are described as follow.
Remarkable characteristics of the data acquired in urban WWTP are redundancy and possibly insignificance. And the choice of the input variables that influence the model output is a crucial stage. Therefore, it is necessary to select the suitable input variables and prepare their data before using the soft-computing method. Moreover, variable selection comprises choosing those easy to measure variables that are most informative for the process being modelled, as well as those that provide the highest generalization ability. In various implementations, the PLS method is used to extract the input variables for the soft-computing method.
The experimental data is obtained from an urban WWTP in 2014. There are 245 groups of samples which are divided into two parts: 165 groups of training samples and 80 groups of testing samples.
In various implementations, a history data set {X, y} is used for variable selection. Since the variables acquired from experimental hardware are influent TP, ORP, DO, temperature, TSS, effluent pH, COD and TN. X is a 165×8 process variable matrix, and y is the dependent 165×1 variable vector. The PLS method can model both outer and inner relations between X and y. For the PLS method, X and y may be described as follows:
where T, P and E are the score matrix, loading matrix and residual matrix of X, respectively. U, Q and F are the score matrix, loading matrix and residual matrix of y. ti, pi, ui and qi are the vectors of T, P, U and Q. In addition, the inner relationship between X and y is shown as follow:
ûi=biti,
b
i
=u
i
T
t
i
/t
i
T
t
i, (17)
where i=1, 2, . . . , 8, bi is the regression coefficients between the ti from X and ui from y. Then, the cross-validation values for the components in X and y are described as:
if Ri<ξ, ξ=0.01, the ith component is the right input variable for the soft-computing model. Based on the PLS method, the selected input variables are influent TP, ORP, DO, T, TSS and effluent pH in various implementations.
The initial structure of PSO-SORBF neural network, which is shown in
where wk is the output weight between the kth hidden neuron and the output neuron, k=1, 2, . . . , K, K is the number of hidden neurons, and φk is the RBF of kth hidden neuron which is usually defined by a normalized Gaussian function:
φk(x(t))=e(−∥a(t)−μ
μk denotes the center vector of the kth hidden neuron, σk is the width of the kth hidden neuron, ∥x(t)−μk(t)∥ is the Euclidean distance between x(t) and μk(t).
{circle around (1)} Initialize the acceleration constants c1 and c2, c1=0.4, c2=0.6, and the balance factor α=0.1. During the particle initialization stage, let the position of the ith particle in the searching space be represented as:
ai=[μi,1, σi,1, wi,1, μi,2, σi,2, wi,2 . . . μi,K
where ai is the position of ith particle, i=1, 2, . . . , s, and s is the total number of particles, s=3 is a positive integer. μi,k, σi,k, wi,k are the center, width and output weight of the kth hidden neuron in the ith particle, and the initial values of the center, width and output weight are randomly generated within (0, 1). K1=2, K2=3, K3=4. Initialize the velocity of particle:
vi=[vi,1, vi,2, . . . vi,D], (22)
where vi is velocity of ith particle, Di is the dimension of the ith particle, and Di=3Ki.
{circle around (2)} From the input of neural network x(t) and the dimensions Di of each particle, the fitness value of each particle may be calculated:
i=1, 2, . . . , s, Ki(t) is the number of hidden neurons in the ith particle at time t, T is the number of the training samples.
{circumflex over (3)} Calculate the inertia weight of each particle:
ωi(t)=γ(t)Ai(t), (25)
where ωi(t) is the inertia weight of the ith particle at time t, and
γ(t)=(C−S(t)/1000)−t,
S(t)=fmin(a(t))/fmax(a(t)),
A
i(t)=f(g(t))/f(ai(t)), (26)
C=2, fmin(a(t)), fmax(a(t)) are the minimum fitness value and the maximum fitness value, and g(t)=[g1(t), g2(t), . . . , gD(t)] is the global best position, fmin(a(t)), fmax(a(t)) and g(t) may be expressed as:
where pi(t)=[pi,1(t), pi,2(t), . . . , pi,D(t)] is the best position of the ith particle:
{circle around (4)} Update the position and velocity of each particle:
where r1 and r2 are the coefficient of the particle and global best position respectively, r1=0.75 and r2=0.90.
{circle around (5)} Search the best number of hidden neurons Kbest according to the global best position g(t), and update the number of hidden neurons in the particles:
{circle around (6)} Import the training sample x(t+1), and repeat the steps {circle around (2)}-{circle around (5)}, then, stop the training process after all of the training samples are imported to the neural network.
The training results of the soft-computing method are shown in
(4) The testing samples are then set to the trained PSO-SORBF neural network. The outputs of the PSO-SORBF neural network are the predicting values of effluent TP concentration. The predicting results are shown in
Number | Date | Country | Kind |
---|---|---|---|
201410602859.X | Nov 2014 | CN | national |