The present invention relates generally to the field of control systems and methods. More specifically, the present invention pertains to system model identification used for Automated Process Control (APC) and Model Predictive Control (MPC) controllers.
In control theory Advanced Process Control (APC) is a broad term composed of different kinds of process control tools, often used for solving multivariable control problems or discrete control problem. APC is composed of different kinds of process control tools, for example Model Predictive Control (MPC), Statistical Process Control (SPC), Run2Run (R2R), Fault Detection and Classification (FDC), Sensor control and Feedback systems. APC applications are often used for solving multivariable control or discrete control problems. In some instances an APC system is connected to a Distributed Control System (DCS). The APC application will calculate moves that are sent to regulatory controllers. Historically the interfaces between DCS and APC systems were dedicated software interfaces. Alternatively, the communication protocol between these systems is managed via the industry standard Object Linking and Embedding (OLE) for Process Control (OPC) protocol.
APC can be found in the (petro) chemical industries where it makes it possible to control multivariable control problems. Since these controllers contain the dynamic relationships between variables it can predict in the future how variables will behave. Based on these predictions, actions can be taken now to maintain variables within their limits. APC is used when the models can be estimated and do not vary too much. In the complex semiconductor industry where several hundred steps with multiple re-entrant possibilities occur, APC plays an important role for control the overall production. In addition, APC is more and more used in other industries. In the mining industry for example, successful applications of APC (often combined with Fuzzy Logic) have been successfully implemented. APC implementation is in more than 95% of cases done as a replacement of an old control such as a proportional-integral-derivative (HD) controllers.
APC performance, for example the performance of a MPC system, is significantly dependent on the quality of the target system model. Therefore, a very important part of APC design is system model identification by performing experiments on the system. It is well-known fact that these experiments represent the highest costs of APC implementation. The reason is that the experiments are traditionally done by step testing, requiring long duration, where normal system operation and production processes are significantly disrupted.
Step testing is typically performed by opening the control loop (leaving the process control on the manual control of an operator), waiting for the system to reach a steady state, making a step change in selected input and observing the behavior for eventual manual intervention preventing process limits crossing. This is repeated for individual inputs. The long duration of this type of testing is caused by the need to wait for steady state before step changes can be applied. The disruptions are caused by opening the control loop and applying step changes to the inputs.
a) and
In the following description, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration specific embodiments which may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the invention, and it is to be understood that other embodiments may be utilized and that structural, logical and electrical changes may be made without departing from the scope of the present invention. The following description of example embodiments is, therefore, not to be taken in a limited sense, and the scope of the present invention is defined by the appended claims.
The functions or algorithms described herein may be implemented in software or a combination of software and human implemented procedures in one embodiment. The software may consist of computer executable instructions stored on computer readable media such as memory or other type of storage devices. Further, such functions correspond to modules, which are software, hardware, firmware or any combination thereof. Multiple functions may be performed in one or more modules as desired, and the embodiments described are merely examples. The software may be executed on a digital signal processor, ASIC, microprocessor, or other type of processor operating on a computer system, such as a personal computer, server or other computer system.
The systems and methods described herein allow for automatic identification experiments in a closed loop, where the old control strategy, already tuned and tested, is utilized. The strategy is modified to inject additional signal optimized for identification. As described in more detail below, the experimenting time is reduced by performing only those system manipulations which explore model uncertainties important to potential degradation of controller performance by discrepancy between the system and the model. The disruptions are reduced by keeping the control loop closed, which eliminates waiting for steady state before applying steps to the inputs and reduces the risk of process limits crossing.
According to one example embodiment, the system has a single “control knob” parameter adjusting the energy of additional signal used to explore the system. This knob allows conducting identification experiment, where open-loop operation is not possible due to the limits on maximum disruption. The energy of additional signal can be set to meet the maximum allowable disruption requirements. The energy of additional signal is in a direct relation to the speed of identification related information gathering. It can be varied in time to follow the needs of system operators.
The optimal identification experiment design is a difficult non-convex optimization problem. According to one example embodiment, several simplifications are applied to get a practical system, and the system is based on a well-known Linear Quadratic (LQ) control, which is relaxed to allow limited deviation from LQ optimal trajectory. This deviation is used to inject additional signal optimized for identification.
The system and method described herein are useful to improve the system identification phase in the course of upgrading standard process control (such as implemented with PID controllers) to APC. The standard controller running in a specialized hardware or in an industrial computer is replaced by a controller imitating the previous control strategy in combination with the injection of additional signal optimized for efficient gathering of information for system identification. The additional signal energy control knob is used to adjust the speed of information gathering and related system disruption. Alternatively, the present technology can be used in already running controllers for on-demand process re-identification. It can be also used to design a “stand-alone” identification experiment (i.e. replace step testing), where some rough system model has to be known a priori or it can be obtained from short step testing or system knowledge.
As noted above, optimal input design for identification is a non-convex problem, which can be made tractable by simplification. According to one embodiment of the present technology, the simplification applied provides that in each sampling period two models are selected from parameters uncertainty (probability density function or “p.d.f.”), such that discriminating between these two models would bring the largest improvement in the model quality (improvement in parameters variance or control oriented model quality criterion). The input trajectory is then designed to cause the largest difference on the outputs of selected models, which would allow the identification process to efficiently distinguish between them.
The input design is based on a relaxed linear quadratic (LQ) controller (referred to herein as “LQID”), which allows limited deviation from LQ optimal input trajectory. This degree of freedom is used for perturbation causing the largest output difference between two models selected from the uncertainty. In addition to standard LQ weight parameters there is a single additional parameter k for continuous adjustment of the amount of additional energy used for the identification—setting this parameter to zero forces LQID to behave like a LQ controller. LQID starts with preliminary model p.d.f., which is updated with each new I/O data pair and used for input design.
LQID leads to quadratic programming with quadratic constraints (QPQC), where the minimized quadratic function has negative definite Hessian. This obviously non-convex optimization is shown to be easily solvable for general Hessian by polynomial root finding on the range, where the polynomial is guarantied to be monotonic and to have a single root.
Optimal input design can be formulated in multiple different ways. In one approach the input is designed to maximally reduce a selected metrics on the model p.d.f. In a first implementation the metrics are Euclidean so the input is designed to minimize the largest eigenvalue of parameters covariance in each step. Another selection would be for example to minimize control performance criterion variance with respect to model parameters uncertainty.
Another deviation from optimality is that the experiment is not completely planned ahead to get the largest decrease of selected metrics, which would be possible only for a priori known model. According to one embodiment, the approach is greedy and it designs the input for maximum decrease of the currently largest uncertainty.
According to one embodiment of the process and system disclosed herein, two “most distant, but still probable” models θ1 and θ2 are selected by maximizing their distance times their probability
The distance metrics ρ can be selected from the control point of view, where for example the difference between models on high frequencies is measured less than on low frequencies. In the first approach the distance is selected in the simplest form as Euclidean
ρ(θ1,θ2)=∥θ1−θ2∥2.
Assuming parameters probability
θ˜N({circumflex over (θ)},P),
then an example of maximized function (1) for model with two parameters is in
where A is the largest eigenvalue and ν is the corresponding eigenvector of P.
Determining Maximum/Minimum Difference Input Signal
The two models selected in the previous section are insufficiently discriminated by the current p.d.f. To discriminate between these models there can be found an input signal causing maximum difference between model outputs
d(y1,y2)=DKL(p(y|θ1,u)∥p(y|θ2,u)),
where DKL is Kullback-Leibler divergence. This general formulation is simplified for the first approach by using the point model selection 1Ji,2 and quadratic output distance
d(y1,y2)=∥y1−y2∥2.
Maximizing this distance without restrictions would lead to unlimited input u. Inversion problem can be solved instead: find the input signal causing the smallest difference between the models outputs and find basis, where orthogonal components with unit size are sorted by the norm of additional output difference they cause when added to the input. The output difference for given input u is
y
1
−y
2=(H1−H2)u+Γ1x1−Γ2x2, (2)
where H is Toeplitz matrix of impulse response and Γ is extended observability matrix
Introducing regularization to reduce solution sensitivity and substitutions Z=H1−H2, b=Γ1x1−Γ2x2, the minimum difference input signal up can be obtained as
where γ is regularization constant. The solution is the same with regularized least squares
u
p=(ZTZ+γI)−1ZTb.
The sorted basis can be obtained by singular value decomposition of Z
Z=UΣVT.
The columns of V contains the basis and corresponding diagonal elements in Σ indicate the quadratic norm of additional output difference.
An example for two second order systems with the same resonance frequency and different attenuations:
Assuming non-zero initial states x1, x2 then the input causing minimal output difference and sorted normalized basis of additional perturbation are in
Input signal optimality criterion for two models differentiation can constructed from up, V and Σ as
This is unconstrained negative definite QP minimization problem with unlimited optimal solution. Reasonable constraints based on LQ control relaxation will be shown in the following section.
As indicated above, the standard LQ controller is modified to allow additive identification optimal perturbation of input trajectory u along LQ optimal control. Assume LQ control criterion
with parameters Q, S, R. Price for the optimal control from state xt is given by Bellman function
Vt=xtTPxt
where P is the solution of algebraic Riccati equation. According to the optimality principle the LQ optimal input trajectory ū satisfies
J(ūtt+h−1)+Vt+h=Vt.
where h is the length of horizon. This equivalence constraint can be relaxed to allow certain degree of freedom for identification optimal perturbation
J(utt+h−1)+Vt+h≦Vt+k, kε0,∞), (4)
where k is optimality margin.
LQ control guarantees that Bellman function is reduced in every step for controllable autonomous system, which also assures asymptotic stability. Relaxed LQ defined by (4) does not guarantee Bellman function reduction in every step. However, it can be shown, that it guaranties Lyapunov (non-asymptotic) stability. This is exactly what is needed for identification because the guarantied reduction of Bellman function in every step would mean that system with zero Bellman function (x=0) cannot be excited for identification.
Relaxed LQ implies that h-steps non-optimal control from xt has to be cheaper than price for optimal control xtTPxt plus margin k. Reducing non-optimal control horizon to single step would not be more expensive than non-optimal control on h steps
J(utt)+Vt+1≦J(utt+h−1)+Vt+h≦Vt+k.
This implies that Bellman function would be reduced when
J(uit)>k,
x
i
T
Qx
t
+x
t
T
Su
t
+u
t
T
Ru
t
>k.
This leads to sufficient condition (neglecting lid, that for every k there exists elliptic set around origin
x
t
T
Qx
t
≦k,
where outside of this set Bellman function is reduced in each step. This guaranties Lyapunov stability of relaxed LQ control.
In one embodiment, relaxed LQ control can be realized as quadratic optimization constraint. Denoting
and expressing states on optimization horizon as
the equation (4) can be written as
X
T
T
T
t
T
Px
t
+k,
leading to quadratic constraint
Practical realization of relaxed LQ control with reference tracking and input differences weighting with criterion
is done, in one example embodiment, by standard extension of model and state with reference wt and input from previous time step ut−1
Mixing together relaxed LQ constraint (5) and input signal optimality criterion (3) gives LQID
This is quadratic programming problem with quadratic constraints (QPQC)
where H1 is negative definite, therefore optimal solution will always be on the boundaries, i.e. equivalent solution can be obtained with equality constraints
This non-convex optimization problem can be easily solved (Section 4).
LQID is applied to system
Two models, which need to be distinguished, are fixed to same model with parameters
sys1: ω1=0.2π, ξ1=1,
sys2: ω2=0.2π, ξ2=0.05.
Their frequency characteristics are similar except the resonance frequency area→the input signal causing highest difference on systems outputs is harmonic signal with frequency w=2π. The parameters of LQID are
Qr=10, RΔu=10, h=30, Ts=1.
The input design with negligible additive measurement noise (σe=0.001) is in
Notice that after k is increased from 0.001 to 0.1 in t=150 the reference is tracked and the disturbances are rejected with the input preferring 0.1 Hz harmonic, which is the most differentiating input signal.
Similar example as the previous with 4th order system and two resonance frequencies 0.1 Hz and 0.3 Hz. LQID input design is in
The LQID system is able to perform well even with unstable and non-minimum phase systems
Identification with LQID
In the preceding description, it was assumed that there are two models and it is uncertain, which is correct or closer to the real system. The input was designed to optimally distinguish between these two systems, while also respecting control criterion. However, the real system model was assumed as known. In the ensuing description the LQID will be used with recursive identification to design input signal, which would most efficiently identify the system, i.e. reducing maximal eigenvalue of parameters covariance on finite interval with limited energy of LQ optimal trajectory perturbation.
Process 1 (Identification with LQID)
System
is sampled with Ts=1 s and its output is measured with additive white noise with standard deviation σe=1. The system is modeled as the second order ARX model
In the pre-identification phase the system is insufficiently excited (on purpose) by one and quarter period of harmonic signal with frequency 0.025 Hz and noise e(t)˜N(0, 1) (
QP with Quadratic Constraints
Quadratic programming with quadratic constraints (QPQC)
is optimization problem on the surface of hyper-ellipsoid. This problem is apparently non-convex in the primary formulation. However, it can be reformulated to convex problem even for criterion with arbitrary Hessian.
To simplify QPQC the optimization on the surface of general ellipsoid is transformed to optimization on the surface of unit sphere. This can be done by transforming constraints to “centralized” form
(x−x*)TH2(x−x*)+d2=0,
where
Decomposing H2 by eigen-decomposition to VDVT=VD1/2D1/2VT and dividing by −d2
(x−x*)TVD1/2(−d2)1/2(−d2)−1/2D1/2VT(x−x*)−1=0.
Introducing substitution
T
Transformed optimization problem is
where
H=−d
2
D
−1/2
V
T
H
1
VD
−1/2,
f=√{square root over (−d2)}(2x*TH1VD−1/2+f1TVD−1/2).
Notice that feasible set is empty if d2>o.
The optimization
is solved by transformation to an equivalent convex problem. These transformations are done such that the position of the minimum on the feasible set is not changed:
The solution is divided for two cases:
Quadratic criterion function is extended with term ααTx
f(
Varying α in the range (−min(eig(H)), ∞) preserves the positive definiteness of H+αI and changes the position of unconstrained minimum
from infinity to origin. Finding α* such that ∥x*(α*)∥=1 gives also the optimal solution of the original non-covex problem as x*(α*). This solution satisfies the constraints and additional term does not change the position of minimum on the feasibility set (
Solving ∥x*(α*)∥=1 is straightforward, because ∥x* (α)∥ is a monotonous function on the range αε(−min(eig(H)), ∞) with function value from ∞ to 0 respectively (can be shown by second derivative). However, numerical problems may occur in direct application of Newton method and it may be safer to use for example interval halving (finite interval may be obtained by substitution
The solution for non-positive definite H is similar to the previous. This time the original criterion function is multiplied by α>0 and term xTx is added
f(
Varying α in the range (0.1/min(eig(H))) assures positive definiteness of f(
from infinity to origin (
A more complex example with indefinite H
And an example of QPQC
In one embodiment, the system and process avoids mixing control and identification criterion into single criterion on purpose, because there is no reasonable way to make it correctly. It would involve mixing of negative and positive definite quadratic functions and the solution would abruptly change at point of mixture Hessian eigenvalue crossing zero.
Instead control criteria are formulated as relaxed constraints. On the other hand the LQID formulation still “mixes” the criterions as the cheaper directions from LQ criterion are wider than more expensive directions in quadratic constraints.
Application of the same approach to MPC would require negative and positive definite criterions mixing. There is no reasonable solver for indefinite QP with linear constraints.
Box constraints on the inputs can be approximately implemented by inspection and modification of quadratic constraints hyper ellipse to fit box constraints.
Simple modification can be used to emphasize identification on low frequencies by adding low-pass filter to finding minimum difference input signal and basis by changing (2) into
Δy(u)=HLP(y1−y2)
where HLP is Toeplitz matrix of low-pass filter impulse responses. This allows to design input signal for identification on low frequencies only.
Referring now to
Referring now to
Computer-readable instructions stored on a computer-readable medium are executable by the processing unit 202 of the computer 210. A hard drive, CD-ROM, and RAM are some examples of articles including a computer-readable medium.
The Abstract is provided to comply with 37 C.F.R. §1.72(b) is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims.