The present invention relates to the field of a networked control system (NCS), and in particular to an NCS time-delay compensation method based on predictive control.
A networked control system (NCS) is a closed-loop feedback control system in which actuators, sensors and controllers exchange signals through a network. The insertion of the network in the control loop makes the analysis of an NCS complex, especially induced time-delay in the network can impose severe influence on the system performance. At present, there is no method capable of accurately predicting and effectively compensating time-delay of the NCS.
To solve the above technical problem, the present invention provides an NCS time-delay compensation method based on predictive control to achieve an objective of accurately predicting and effectively compensating time-delay of an NCS. Therefore, the NCS time-delay compensation method based on predictive control has excellent development prospect.
To achieve the above objective, the present invention adopts the following technical solution:
an NCS time-delay compensation method based on predictive control comprises the following steps:
(1) acquiring random time-delay data in an NCS, and pre-processing the data;
(2) predicting the current time-delay by using a fuzzy neural network (FNN) optimized by a particle swarm optimization (PSO) algorithm;
(3) compensating the predicted time-delay by using an implicit proportional-integral-based generalized predictive control (PIGPC) algorithm;
(4) determining whether a preset work end time is up according to a clock in the NCS; if yes, ending the process; if no, returning to step (2).
In the technical solution, step (1) specifically comprises: collecting a group of continuous random time-delay data; utilizing first five time-delay values as an input sample 1 and a sixth time-delay value as an output sample 1 to form a first sample pair, then utilizing second to sixth time-delay values as an input sample 2 and a seventh time-delay value as an output sample 2 to form a second sample pair, by such analogy, forming 300 sample pairs as data for training the FNN.
In the technical solution, optimization of the FNN by the PSO algorithm at step (2) is: optimizing a connection weight and a membership function of the FNN by using the PSO algorithm, that is, setting elements of a position vector xi of a particle swarm to be expectation and standard deviation of the connection weight and the membership function of the FNN. Such optimization specifically comprises the following steps:
1), randomly initializing a position and a velocity of each particle, and determining parameters including population size, inertia weight and acceleration factor;
2), calculating a fitness value of each initial particle, wherein pi is the optimal position of the current particle, and pg is the optimal position of the particle swarm;
3), updating the position and the velocity of each particle according to formulae (5) and (6), and iterating to generate a new particle;
wherein mathematical expressions of the PSO algorithm are:
v
id(t+1)=wvid(t)+c1r1(pid−xid(t))+c2r2(pgd−xid(t)) (5)
x
id(t+1)=xid(t)+vid(t+1) (6)
In the expressions, i is the number of particles, i=1, 2, . . . , m; d is a space dimension, d=1, 2, . . . , D; t is time; w is the inertia weight and a nonnegative number; c1 and c2 are acceleration factors and are defined as constants; r1 and r2 are random numbers and are uniformly distributed between 0 and 1; pid is the optimal position of a particle i; pgi is the optimal position of a particle swam; xid(t) is the position of the particle i at the time t, xid(t+1) is the position of the particle i at the time t+1, and xid(t)∈[−xmax, xmax]; vid(t) is the velocity of the particle i at the time t, vid(t+1) is the velocity of the particle i at the time t+1, and vid(t) E [−vmax, vmax]; xmax and vmax are nonnegative numbers;
4), returning to step 2), and stopping iteration till the convergence criterion is met, wherein a global extremum is an optimal solution of a training problem.
In the technical solution, step (2) specifically comprises: inputting the 301st to 305th time-delay values as the input sample into the optimized FNN to obtain a first prediction value of network time-delay; then inputting the 302nd to 306th time-delay values as the input sample into the optimized FNN to obtain a second prediction value of the network time-delay; by such analogy, completing prediction on the network time-delay.
In the technical solution, the convergence criterion at step 4) is: a threshold of a fitness function is 0.001, and the fitness function is as follows:
In the formula, Ns is the number of training set samples, Yi is an expected output, and yi is an actual output.
In the technical solution, step (3) specifically comprises: according to the implicit PIGPC algorithm, obtaining a controller increment Δu(k) when the time-delay exists, further obtaining a control value u(k), and finally transmitting the optimal control value u(k) to an actuator.
According to the above technical solution, the NCS time-delay compensation method based on predictive control provided by the present invention predicts the random time-delay by utilizing the FNN optimized by the PSO, and then compensates the time-delay by utilizing the implicit PIGPC algorithm. Simulation results show: the time-delay prediction accuracy of the FNN optimized by the PSO algorithm is higher than that of the FNN optimized by the traditional backpropagation (BP) algorithm; the implicit PIGPC algorithm has better time-delay compensation effect in comparison with the implicit GPC algorithm; calculation time is shortened; the present invention is more suitable for actual use.
To describe the technical solutions in the embodiments of the present invention or the prior art more clearly, the following briefly introduces the accompanying drawings required for describing the embodiments or the prior art.
The following clearly and completely describes the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention.
The present invention provides an NCS time-delay compensation method based on predictive control. The method can accurately predict and effectively compensate NCS time-delay and has excellent development prospect.
1 FNN Optimized by the PSO Algorithm
1.1 FNN Model
An FNN is a product by combining fuzzy set theory and a neural network. The FNN not only has massive parallel processing capacity, self learning ability and adaptive ability of the neural network, but also can process fuzzy information and achieve a fuzzy inference function so as to have excellent performance.
An FNN prediction model is a multiple-input single-output four-layer network model, as shown in
The first layer is the input layer. The number of neurons of the input layer is the same as the number of input variables. An input vector is x=(x1, x2, . . . , xn)T. y is a variable of the output layer. An output variable is a future time-delay predicted value and only one, so the output layer only has one neuron. The jth fuzzy rule of a system can be expressed as follows:
R
j:if x1=X1j,x2=X2j, . . . ,xn=Xnj, then y=Yj (1)
In the formula, Yj is a result of the jth rule, and j is decided by the number of the neurons of the fuzzy rule layer; Xij is a fuzzy subset corresponding to the input variables, i=1, 2, . . . , n, and j is decided by the number of the fuzzy subsets of each input variable.
The second layer is the fuzzified layer for calculating a membership degree of each input variable. By utilizing a normal distribution function as the membership function, a calculation formula that the ith input variable meets the fuzzy subset Xii is as follows:
In the formula, mij is an expectation of the normal distribution function, σji is a standard deviation, and the membership of the input variable is completely decided by the parameters mij and σji.
The third layer is the fuzzy rule layer. In the design, connection weights of the fuzzified layer and the fuzzy rule layer are defined as follows according to the fuzzy set theory:
In the formula, μX
The third layer and the fourth layer of the FNN complete a fuzzy decision making function. If supposing a system has m rules, the design conducts a defuzzifying by using a weighing average method to obtain an output as follows:
In the formula, wj is the connection weight between the fuzzy rule layer and the output layer.
To train the FNN, parameters to be determined comprises the expectation mij and the standard difference σji of the normal distribution function as well as the connection weight wj. The neural network commonly uses a gradient descent algorithm. Such algorithm has a low speed of convergence, is easy to generate a local optimization problem, and is hard to obtain a global optimal solution. To solve this problem, the present invention optimizes parameters of the FNN by using the PSO algorithm having the global optimization performance.
1.2 PSO Algorithm
1.2.1 Mathematical Description of a PSO Algorithm
If an objective is searched in a certain D-dimensional space, a swarm consists of m particles. Each particle in the swarm has its position, the position of an ith particle is represented by xi=(xi1, xi2, . . . xiD), i=1, 2, . . . , m, and the velocity of the ith particle is a D-dimensional vector and is represented by vi=(vi1, vi2, . . . , viD). When each particle passes through a new position, the new position is compared with all positions through which the particle passes previously. After the ith particle flies to a certain position, an optimal position through which the ith particle passes is pi=(pi1, pi2, . . . , piD). The positions through which all particles in the swarm pass are compared to obtain optimal positions through which all particles pass, and the optimal positions are represented by pg=(pg1, pg2, . . . , pgD). Each particle changes the position according to the following formulae:
v
id(t+1)=wvid(t)+c1r1(pid−xid(t))+c2r2(pgd−xid(t)) (5)
x
id(t+1)=xid(t)+vid(t+1) (6)
In the formulae, i is the number of particles, i=1, 2, . . . , m; d is a space dimension, d=1, 2, . . . , D; t is time; w is an inertia weight and a nonnegative number; c1 and c2 are acceleration factors and are defined as constants; r1 and r2 are random numbers and are uniformly distributed between 0 and 1; pid is the optimal position of a particle i; pgi is the optimal position of a particle swam; xid(t) is the position of the particle i at the time t, xid(t+l) is the position of the particle i at the time t+1, and xid(t)∈[−xmax, xmax]; vid(t) is the velocity of the particle i at the time t, vid(t+1) is the velocity of the particle i at the time t+1, and vid(t)∈[−vmax, vmax]; xmax and vmax are nonnegative numbers.
1.2.2 Flow Chart of the PSO Algorithm
A flow chart of the PSO algorithm is shown in
1.2.3 Steps for Optimizing a Fitness Function and Parameters
A connection weight and a membership function of the FNN are optimized by using the PSO algorithm, that is, elements of a position vector xi of a particle swarm are set to be expectation and standard deviation of the connection weight and the membership function of the FNN. The training of the FNN is mainly used for making a parameter error in the network minimum, so the fitness function can be defined as follows:
In the formula, Ns is the number of training set samples, Yi is an expected output, and yi is an actual output.
Parameter optimization by the PSO algorithm specifically comprises the following steps:
1), randomly initializing a position and a velocity of each particle, and determining parameters including population size, inertia weight and acceleration factor;
2), calculating a fitness value of each initial particle, wherein pi is the optimal position of the current particle, and pg is the optimal position of the particle swarm;
3), updating the position and the velocity of each particle according to formulae (5) and (6) to generate a new particle;
4), returning to step 2), and stopping iteration till the convergence criterion is met, wherein a global extremum is an optimal solution of a training problem.
2 Design of an NCS Controller Based on an FNN-PSO and an Implicit PIGPC Algorithm
2.1 Implicit Proportional-Integral-Based Generalized Predictive Control (PIGPC) Algorithm
The GPC algorithm needs to calculate the Diophantine equation and rolling inverse optimization so as to have long calculation time and low efficiency. However, the implicit GPC algorithm is improved based on the standard GPC algorithm, does not need to calculate the Diophantine equation and the like, but cannot immediately feed back and has low robustness. Therefore, based on the GPC algorithm and the implicit GPC algorithm, an implicit PIGPC algorithm is provided. The implicit PIGPC algorithm has the advantages of combining a feedback structure of a PI controller and a strong prediction function of the GPC algorithm and does not needs to calculate the Diophantine equation so as to have high efficiency and robustness, as shown in
An objective function of the implicit GPC algorithm is:
J(k)=E{Σj=1p[e(k+j)]2+Σj=0N
wherein e(k+j)=ys(k+j)−y (k+j) represents an error between an expected value and an actual value.
Based on formula (8), an objective function of an implicit PIGPC algorithm is defined such that a proportional integral structure is provided as follows:
J(k)=E{Σj=1p[Kp[Δe(k+j)]2+Ki[e(k+j)]2]+Σj=0N
wherein Kp≥0 represents a proportion factor, and Ki≥0 represents an integral factor.
By considering a to-be-controlled object, a CARIMA model is utilized for description, and can be represented as:
wherein z−1 represents a backshift operator corresponding to an amount of a backward sensor sampling cycle, and is expressed by a formula: z−1y(k)=y(k−1), z−1u(k)=u(k−1); A, B, C are polynomial of z−1; B(z−1) may represent a time lag of the to-be-controlled object; to an m1 time lag system, b0˜bm
to provide the principle and method of the GPC, if supposing C(z−1)=1, a pulse transfer function between the input u and the output y of the control system is as follows:
C(z−1)=1 is substitute into the formula (10), the following can be obtained:
An output prediction model at the k+j time is as follows:
To solve a predicted output value when a system outputs y(k+j|k) at k+j time, the following Diophantine equation is considered:
1=Ej(z−1)A(z−1)Δ+z−jFj(z−1) (14)
G
j(z−1)=Ej(z−1)B(z−1)={tilde over (G)}j(z−1)+z−jHj(z−1) (15)
A prediction output at k+j time is as follows:
(k+j|k)={tilde over (G)}j(z−1)Δu(k+j−1|k)+Hj(z−1)Δu(k−1)+Fj(z−1)y(k)
if f(k+j)=Hj(z−1)Δu(k−1)+Fj(z−1)y(k),
a prediction output at the k+j time can also be represented as follows:
(k+j|k)={tilde over (G)}j(z−1)Δu(k+j−1|k)+f(k+j) (16)
A predicted value
e(k+j)=ys(k+j)−{tilde over (G)}j(z−1)Δu(k+j−1|k)−f(k+j) (17)
An error increment Δe(k+j) is defined as:
Δe(k+j)=[Δys(k+j)−Δf(k+j)]−[{tilde over (G)}j(z−1)Δu(k+j−1|k)−{tilde over (G)}j-1(z−1)Δu(k+j−2|k)] (18)
The formulae (17) and (18) are substituted into the objective function J(k)=E{Σj=1p[Kp[Δe(k+j)]2+Ki[e(k+j)]2]+Σj=0N
the following can be obtained:
So, the control increment Δu(k) can be represented as:
Δu(k)=RpΔE(k)+RiE(k) (20)
wherein ΔE(k)=Δys(k)−Δf(k), E(k)=ys(k)−f(k),
so a control value of the PIGPC is as follows:
u(k)=u(k−1)+RpΔE(k)+RiE(k) (21)
A control law of the PI controller is
ū(k)=P(k)+I(k)=Kpē(k)+(I(k−1)+Kiē(k)) (22)
ū(k−1)=Kpē(k−1)+I(k−1) (23)
By subtracting the formula (23) from the formula (22), the following is obtained:
Δū(k)=KpΔē(k)+Kiē(k) (24)
A control value of the PI controller is:
ū(k)=ū(k−1)+KpΔē(k)+Kiē(k) (25)
By comparing the formula (21) with the formula (25), it can be seen that the both have the same form, the PI controller and the implicit GPC algorithm are combined to obtain the structure of the implicit PIGPC controller.
Therefore, a control increment of the implicit PIGPC can be represented as:
Δu(k)=(λI+KpGiTSTSGi+KiGiTGi)−1[KpGiTST(Sys(k)−Sf(k))+KiGiT(ys(k)−f(k))] (26)
If Ω=KiI+KpSTSΔu(k), Δu(k) can be represented as:
Δu(k)=(λI+GiTΩGi)−1GiTΩ(ys(k)−f(k)) (27)
Compared with the GPC algorithm, there is no additional calculation, and the Diophantine equation and the matrix inversion do not need to be calculated for many times. Compared with the implicit GPC algorithm, PI control with the feedback structure is added, so, the efficiency is improved, and the algorithm is also optimized.
2.2 Design of Time-Delay Compensation Controller Based on an FNN-PSO and an Implicit PIGPC Algorithm
The key of a time-delay compensation strategy research is: the current network time-delay cannot be measured. The current time-delay is predicted by selecting a proper prediction method with the aid of historical data and future input.
The time-delay compensation strategy is implemented generally by the following steps:
(1) acquiring time-delay in the past time, and processing the past data;
(2) predicting the current time-delay according to the past data and the future input by using the FNN-PSO;
(3) obtaining a controller increment Δu(k) according to the implicit PIGPC algorithm, further obtaining a control value u(k), and finally transmitting the optimal control value u(k) to an actuator;
(4) determining whether a preset work end time is up; if yes, ending the process; if no, returning to step (2).
3 Prediction Model Building and Simulation Analysis
3.1 Prediction Model Building
Prediction of the network time-delay utilizes a rolling prediction method. First five time-delay values are utilized as an input sample 1, and a sixth time-delay value is utilized as an output sample 1 to form a first sample pair; second to sixth time-delay values are utilized as an input sample 2, and a seventh time-delay value is utilized as an output sample 2 to form a second sample pair; by such analogy, 300 sample pairs are formed to be utilized as data for training the FNN.
Then the 301st to 305th time-delay values as the input sample and the input vector are inputted into the trained FNN to obtain a first prediction value of network time-delay; then the 302nd to 306th time-delay values as the input sample are inputted into the optimized FNN to obtain a second prediction value of the network time-delay; by such analogy, 20 network time-delay predicted values are obtained.
(1) building and training an FNN: determining a structure of a neural structure by using a clustering algorithm, selecting the number of fuzzy memberships by utilizing a Kohonen network, determining that each input variable is divided into two fuzzy subsets, so that the structure of the FNN is 5-10-32-1; next training the FNN by using the training data;
(2) optimizing network parameters: initializing the PSO algorithm, and setting the population size to be 30, the acceleration factor to be c1=c2=2, the fitness threshold to be 0.001 and the maximum allowable iteration times to be 350;
(3) predicting the model: verifying the model by using test data and obtaining a final prediction result.
3.2 Simulation Results and Discussion
Two types of the FNNs are constructed. One type of the FNN utilizes the gradient descent algorithm to train the FNN while the other type of the FNN utilizes the PSO algorithm to train the FNN.
According to
To verify effectiveness of the time-delay compensation algorithm, a direct current servo system is selected as the to-be-controlled object
The type of the network selects Ethernet, the sampling period is 20 ms and the transmission rate is 80000 bits/s. The system adds turbulence nodes such that the network generate a load. The time-delay prediction utilizes the FNN optimized by the PSO.
The above illustration of the disclosed embodiments can enable a person skilled in the art to implement or practice the present invention. Various modifications to these embodiments are readily apparent to a person skilled in the art, and the generic principles defined herein may be practiced in other embodiments without departing from the spirit or scope of the present invention. Thus, the present invention is not limited to the embodiments shown herein but falls within the widest scope consistent with the principles and novel features disclosed herein.
Number | Date | Country | Kind |
---|---|---|---|
201910068775.5 | Jan 2019 | CN | national |
Number | Date | Country | |
---|---|---|---|
20200241487 A1 | Jul 2020 | US |