Method for real-time nonlinear system state estimation and control

Description

FIELD OF INVENTION

The invention pertains to the field of nonlinear system state estimation and control in which the nonlinear system is describable by a set of nonlinear differential equations. More specifically the invention relates to the use of extended Kalman filtering (EKF) type techniques for state estimation and system control.

BACKGROUND TO THE INVENTION

Observation of a physical system for understanding its behavior requires obtaining access to key parameters (state variables) within the system. Often, these state variables are not directly available for observation so that they must be inferred from indirect and noisy measurements. Optimal linear estimation theory has been developed for estimating these state variables by producing a minimal error estimate from their contaminated images. The need for optimal estimation technology is widespread and includes such diverse applications as monitoring system behavior in hostile environments, estimating system parameters for system modeling, estimating (detecting) messages in communication systems, remote measuring systems, and controlling of physical systems.

Optimal estimation techniques are based on statistical signal processing (filtering) techniques for extracting a statistical estimate of the desired state variables from inaccurate and distorted measurements by minimizing a prescribed error function. The form of the error function determines the nature of the estimator optimality.

Kalman filtering (Kalman, R. E., “A New Approach to Linear Filtering and Prediction Problems”, Trans. ASME, J. Basic Eng., Vol. 82D, pp. 34-45, March 1960) is an optimal filtering technique commonly used for estimating state variables of a linear system. Kalman filtering is a time domain operation that is suitable for use in estimating the state variables of linear time-varying systems that can be described by a set of linear differential equations with time-varying coefficients (linear differential equations with constant coefficients being a special case). Although Kalman filtering has found application in state estimation in many systems that may be approximately described as linear, the basic Kalman filter technique can not adequately accommodate general nonlinear systems. Because nonlinear systems are common, attempts have been made to adapt the Kalman filter to estimation of states in nonlinear systems by quasi-linearization techniques. These adaptations, when restricted to computationally feasible methods, result in sub-optimal estimators which do not yield a minimal (optimal) error estimation.

Because it is desirable to use digital computers for applying Kalman filter techniques, discrete-time (time-sampled) adaptations have been developed. Discrete Kalman filtering is ideally suited for estimation of states in discrete-time systems that can be properly described by a set of finite difference equations with discrete time-varying coefficients. However, because of the strong incentives for incorporating digital signal processing techniques for continuous-time Kalman filter estimation, extensive use has been made of discrete Kalman filters in continuous-time system state estimation.

Because Kalman filters (and other state variable estimators, such as the Luenberger observer (Luenberger, D. G., “Observing the State of a Linear System”, IEEE Trans. On Military Electronics, pp. 74-80, April 1964)) are based on a model of a system whose states are to be estimated, the use of a discrete Kalman filter for estimating the states of a continuous system implies modeling the continuous system as a discrete time-sampled system in which integrators are replaced by accumulators and continuous time-varying coefficients are replaced by discrete time-varying coefficients. In addition to propagating the state estimates, estimators may propagate state estimate error covariance using a model that may be a different approximation to the original continuous system model. As the time interval between sampled data points is increased, the approximation error for each model increases and may result in model behaviors that departs drastically from the actual system behavior.

When the effects of modeling continuous time systems by using discrete-time models are combined with the inaccuracies of quasi-linear model approximations to nonlinear systems, estimator error stability can be severely impaired. Loss of stability means severe departure between the actual state values and the estimates. Increasing the sampling rate of the model produces smaller finite difference intervals with improved performance but often at an unacceptable cost for the increased computational burden.

Optimal state estimators for linear systems form the basis for optimal control in the so-called Linear Quadratic Gaussian (LQG) control problem (Reference: Applied Optimal Control, A. E. Bryson and Y. C. Ho, John Wiley & Sons, 1975). System state variables are estimated using optimal estimation techniques and then a quadratic objective performance criterion is applied to establish a control design strategy. Kalman filtering is commonly used as a means for estimating the state variables of a system. When the estimated state variables are combined, using the control law based on the objective performance criterion (or performance index), optimal LQG control system results.

The objective performance criterion summarizes the objectives of the system by the performance index, J, which is a mathematical expression that must be minimized in order to meet the objectives of the system. For example, the performance index can represent the final error, total time, or the energy consumed in meeting the system objective. The various possible performance indices may be used individually or in combination so that more than one objective is satisfied.

FIG. 1

is a block diagram of a basic state variable control system

10

in which the physical plant

11

that is to be controlled has a vector (multichannel) input, u(t), and a state vector, x(t), with vector elements corresponding to the set of state variables representing the attributes of plant

11

that are to be controlled. State vector x(t) is generally not directly accessible but through the measurement vector, z(t), which are available from a set of appropriate sensors in sensor unit

12

. The measurement vector may have more or less elements than the state vector. These measurements can also be contaminated by measurement noise due to the system environment and/or due to sensor noise. Typically, the sensors are transducers that convert the various physical elements of the state vector representing diverse physical activity (e.g. heat, light, mass, velocity, etc.) into electrical signal representations suitable for additional signal processing. State observer

13

accepts vector z(t) as a noisy and distorted representation of the state vector x(t) from which an estimated state vector, {circumflex over (x)}(t), of the state vector is made. The estimated state vector, {circumflex over (x)}(t), is a statistical estimate of necessity because of the stochastic nature of the random noise introduced into the measurement process and into the dynamics. The estimated state vector, {circumflex over (x)}(t), is then used by controller (−C(t))

14

to form a suitable input vector, u(t), to drive plant

11

. Because of the stochastic nature of real control systems, the performance index, J, must be expressed as an average value, {overscore (J)}=E{J}, where E{.} is the expectation operator. In order to accommodate time-varying plants, i.e. plants that change physical attributes as a function of time, controller

14

may also be a time-varying controller based upon a time-varying performance index, {overscore (J)}(t), and a time varying state observer

13

.

FIG. 2

is a signal flow block diagram representing a plant such as plant

11

of FIG.

1

. Summing junction

111

provides at its output vector {dot over (x)}(t), the time derivative of state vector x(t) that is the sum of three inputs to summing junction

111

: the output vector F(t)x(t) of feedback gain unit

112

, an input signal Γ(t)u(t) from gain unit

116

and an input noise signal G(t)w(t) from gain unit

114

. The input noise, w(t), is generally assumed to be white gaussian noise of zero mean. The state vector time-derivative, {dot over (x)}(t), applied to vector integrator

110

for producing the state vector, x(t), which is applied to vector feedback gain unit

112

and to vector gain unit

113

of measurement system

120

. The output of gain unit

113

, y(t), is combined with a white noise vector, v(t), by vector summing junction

115

for producing output measurement vector z(t). The system satisfies the following differential equations:

{dot over (x)}

(

t

)=

F

(

t

)

x

(

t

)+Γ(

t

)

u

(

t

)+

G

(

t

)

w

(

t

)

y

(

t

)=

H

(

t

)

x

(

t

)

z

(

t

)=

y

(

t

)+

v

(

t

) (1)

where w(t) and v(t) are normal zero mean vectors with covariance matrices Q(t) and R(t), respectively, i.e.

w

(

t

)≈

N

(0,

Q

(

t

) δ(

t

))

v

(

t

)≈

N

(0,

R

(

t

) δ(

t

)),

δ(t) is the Dirac delta function, and u(t) is a yet to be defined input control vector. Also, both noise vectors are usually assumed to be independent, i.e. E{w(t)v(t)}=0.

As shown in

FIG. 3

, the state observer

13

of

FIG. 1

can be implemented as a Kalman filter that has identical elements with those identified as integrator

110

, input gain unit

116

, measurement gain unit

113

, and state vector feedback gain unit

112

, respectively corresponding to elements

130

,

136

,

133

, and

132

in FIG.

3

. The output, z(t), from sensor unit

12

of

FIG. 1

is applied together with {circumflex over (z)}(t), the output of feedback gain unit

133

, to the input of vector adder

139

for forming the difference, (z(t)−{circumflex over (z)}(t)), representing error in the Kalman filter estimate of the vector z(t). The difference is applied to Kalman gain unit

137

(for applying the Kalman gain matrix to the difference) and supplying its output to one input of vector adder

138

. A second input to vector adder

138

is the output of vector gain unit

132

that forms the product F(t){circumflex over (x)}(t), so that the vector adder output represents an estimate, {dot over ({circumflex over (x)})}(t), of the time derivative of the state vector. The Kalman filter signal flow block diagram of

FIG. 3

supports the following vector differential equation:

{dot over ({circumflex over (x)})}

(

t

)=

F

(

t

){circumflex over (x)}(

t

)+Γ(

t

)

u

(

t

)+

K

(

t

)[

z

(

t

)−

H

(

t

){circumflex over (x)}(

t

)]

and

{circumflex over (x)}

(0)=

x

0

. (2)

Equation (2) requires the solution of the following continuous Ricatti equation for P(t), the covariance matrix of the state vector, x(t), in order to solve for the Kalman gain matrix, K(t) in equation (2).

P

(

t

)=

F

(

t

)

P

(

t

)+

P

(

t

)

F

T

(

t

)+

G

(

t

)

Q

(

t

)

G

T

(

t

)−

K

(

t

)

R

(

t

)

K

T

(

t

) (3)

where [.]

T

indicates the matrix transpose operation, P(0)=P

0

, and P(t)=E{(x(t)−{circumflex over (x)}(t))(x(t)−{circumflex over (x)}(t))

T

}. Thus, the covariance matrix P(t) is based on the difference between the actual state represented by vector x(t) and the estimated state represented by vector {circumflex over (x)}(t).

The Kalman gain matrix, K(t), is

K

(

t

)=

P

(

t

)

H

T

(

t

)

R

−1

(

t

) (4)

Equations (2)-(4) specify the Kalman filter observer that is the optimal linear filter for estimating the minimum variance, conditional mean state vector of plant

11

of FIG.

1

. Equations (1) describe the signal model used in constructing the Filter equations (2)-(4).

The control vector, u(t), may be considered to be an exogenous variable whose functional form can be prescribed for altering the dynamic properties of the plant. An optimal control vector obtains when u(t) is chosen to minimize a prescribed performance index, {overscore (J)}, as previously discussed.

A quadratic performance index is commonly used that minimizes the weighted integrated quadratic error and the quadratic value (energy) of the control vector and the weighted quadratic error of the state vector at a designated final time, t

f

, as follows:

\begin{matrix} \overline{J} = E {{x (t_{f})}^{T} - V_{f} x (t_{f}) + \int_{t_{0}}^{t_{f}} [x^{T} (t) V (t) x (t) + u^{T} (t) U (t) u (t)] ⅆ t} & (5) \end{matrix}

where V

f

, V(t), and U(t) are weighting matrices for establishing the relative importance of the three quadratic terms. Minimization of {overscore (J)} is performed subject to the conditions implied by the plant differential equations (1) and a solution for u(t) is obtained in terms of the estimated state vector, {circumflex over (x)}(t) as

u

(

t

)=−

C

(

t

)

{circumflex over (x)}

(

t

) (6)

where controller gain matrix C(t) is computed using the matrix Riccati equation for control and the certainty-equivalence principle. (Reference: Applied Optimal Control, A. E. Bryson and Y. C. Ho, John Wiley & Sons, 1975)

FIG. 4

is a block diagram of an optimal controller

15

controlling plant

11

by using measurements obtained by sensor unit

12

. The optimal controller comprises a Kalman filter

13

, for operating on the output of sensor unit

12

and producing an estimated state vector, and a controller gain matrix unit

14

that operates on the estimated state vector to produce the control vector for controlling the plant.

FIG. 5

is a signal flow block diagram of a discrete time sampled model

20

for a physical plant that is to be controlled. The most obvious difference between the continuous signal model of

FIG. 2

is that integrator

110

has been replaced by time delay unit

200

that introduces a vector delay of T seconds. At time t=kT, the discrete state vector, x

k

, appears at the output of delay unit

200

. Vector adder

201

accepts the output, F

k

x

k

, of matrix gain unit

203

, control vector Γ

k

u

k

from matrix gain unit

206

, and the output, G

k

w

k

, of matrix gain unit

202

and presents as the vector sum, x

k+l

, corresponding to the state vector for t=(k+1)T. The measurement portion of the plant is represented by vector gain matrix unit

204

, that operates on state vector x

k

to produce y

k

, and vector adder

205

that adds white noise vector v

k

to y

k

for producing output measurement vector z

k

. Noise vectors w

k

and v

k

are assumed to be independent discrete processes.

As a result the following set of finite difference equations, that are comparable to equations (1), apply to FIG.

5

:

x

k+1

=F

k

x

k

+Γ

k

u

k

+G

k

w

k

y

k

=H

k

x

k

z

k

=y

k

+v

k

E{v

k

v

1

}=R

k

δ

k1

E{w

k

w

1

}=Q

k

δ

k1

(7)

FIG. 6

is a block diagram of the discrete Kalman filter, analogous to the continuous Kalman filter of

FIG. 3

, that accepts the output measurement vector, z

k

, of the discrete time sampled plant model of FIG.

5

and produces a conditional estimated state vector, {circumflex over (x)}

k|k−l

, at the output of vector time delay unit

200

. The vector, {circumflex over (x)}

k|k−l

, is fedback through vector gain unit

303

(F

k

−K

k

H

k

T

) to vector adder

301

to be added to the output of Kalman matrix gain unit

302

and added to input control vector Γ

k

u

k

from gain unit

206

.

The following finite difference equations apply to the discrete Kalman filter of FIG.

6

.

{circumflex over (x)}

k+1|k

=(

F

k

−K

k

H

k

T

)

{circumflex over (x)}

k|k−1

+Γ

k

u

k

+K

k

z

k

,

where

{circumflex over (x)}

0|−1

=E

{{circumflex over (x)}

0

}=

{overscore (x)}

0

K

k

=F

k

Σ

k|k−1

H

k

[H

k

T

Σ

k|k−1

H

k

+R

k

]

−1

\begin{matrix} \sum_{k + 1 &LeftBracketingBar; k} = F_{k} [\sum_{k &LeftBracketingBar; k - 1} - \sum_{k &LeftBracketingBar; k - 1} {H_{k} (H_{k}^{T} \sum_{k &LeftBracketingBar; k - 1} H_{k} + R_{k})}^{- 1} H_{k}^{T} \sum_{k &LeftBracketingBar; k - 1}] F_{k}^{T} + G_{k} Q_{k} G_{k}^{T} & (8) \end{matrix}

where

Σ

0|−1

=P

(0)=

P

0

,

and

{circumflex over (x)}

k|k

={circumflex over (x)}

k|k−1

+Σ

k|k−1

H

k

(

H

k

T

Σ

k|k−1

H

k

+R

k

)

−1

(

z

k

−H

k

T

{circumflex over (x)}

k|k−1

)

Σ

k|k

=Σ

k|k−1

+Σ

k|k−1

H

k

(

H

k

T

Σ

k|k−1

H

k

+R

k

)

−1

H

k

T

Σ

k|k−1

)

In the above expressions involve conditional vectors, {circumflex over (x)}

k|k−n

, and covariance matrices, Σ

k|k−n

for n=0 or 1, which should be interpreted to mean that they represent predicted values conditioned on the measured observations [z

0

z

1

. . . z

k−n

] as follows:

{circumflex over (x)}

k|k−n

=E{x

k

|z

0

z

1

. . . z

k−n

}

Σ

k|k−n

=E{{circumflex over (x)}

k|k−n

{circumflex over (x)}

k|k−n

T

|z

0

z

1

. . . z

k−n

} (9)

The Kalman filter controller shown in

FIG. 4

would apply to the discrete case described above if the following changes were implemented: replace Kalman filter

13

with the discrete Kalman filter of

FIG. 6

; provide sensor unit

12

with ADC means for periodically sampling and quantizing z(t); and provide controller

14

with a digital-to-analog converter (DAC) output with first order hold output means if an analog input to plant

11

is required. In this manner, the input would be held constant during the interval between successive samples of z(t). The control

14

gain matrix would be calculated by minimizing an appropriate discrete performance index, J(k).

Because the computation of the state vector covariance, Σ, can lead to numerical instability, it is advantageous to deal with the square-root, S, of the covariance, Σ=SS

T

, because the product SS

T

can not lead to a matrix that fails to be nonnegative definite as a result of computational errors. Also, the numerical conditioning of S is generally much better than that of Σ and only requires half as many significant digits for computation. The covariance square root matrix, S

k|k

, can be update using a matrix transformation as follows:

\begin{matrix} [\begin{matrix} S_{k + 1 &LeftBracketingBar; k}^{T} \\ 0 \end{matrix}] = T [\begin{matrix} S_{k &LeftBracketingBar; k}^{T} F_{k}^{T} \\ Q_{k}^{T / 2} G_{k}^{T} \end{matrix}] & (10) \end{matrix}

where S

k

T+1|k

is a n×n upper triangular matrix and T is any orthogonal matrix that makes S

k

T+1|k

upper triangular. Similarly, the both time and covariance updates may be obtained by the following operations:

\begin{matrix} {\hat{x}}_{k &LeftBracketingBar; k} = {\hat{x}}_{k &LeftBracketingBar; k - 1} + {{\tilde{K}}_{k} (R_{k} + H_{k}^{T} \sum_{k &LeftBracketingBar; k - 1} H_{k})}^{- 1 / 2} (z_{k} - H_{k}^{T} x_{k &LeftBracketingBar; k - 1}) & (11) \\ [\begin{matrix} {(R_{k} + H_{k}^{T} \sum_{k &LeftBracketingBar; k - 1} H_{k})}^{T / 2} & {\tilde{K}}_{k} \\ 0 & S_{k &LeftBracketingBar; k}^{T} \end{matrix}] = \tilde{T} [\begin{matrix} R_{k}^{T / 2} & 0 \\ S_{k &LeftBracketingBar; k - 1}^{T} H_{k} & S_{k &LeftBracketingBar; k - 1}^{T} \end{matrix}] \end{matrix}

where {tilde over (T)} is an orthogonal matrix that causes the transformed matrix on the left to be upper triangular (Reference:B. D. O. Anderson and J. Moore, Optimal Filtering, Prentice Hall. 1979 pp. 148-149).

The discrete model described above replaces differential equations needed to describe most practical physical systems that operate in the continuous domain by finite difference equations that may not adequately describe the physical system that is to be controlled. The discrete model is only an approximation to the integration process of the continuous model that may require high sampling rates (and consequently more processing) in order to provide a acceptable approximation to the continuous model. This problem becomes more acute when these techniques are applied to nonlinear time varying system models that more accurately describe practical processing systems. Prior art has attempted to extend the discrete Kalman filter control technique from the linear time-varying type of system to nonlinear time-varying systems by a method known as extended Kalman filtering (EKF).

A rather general nonlinear model for a nonlinear plant is given by the following nonlinear differential equations:

{dot over (x)}=

f

(

x,u

)+

w

z=g

(

x

)+

v

(12)

where u,v,w,x, and z represent the same variables as previously defined but with the dependent time variable, t, suppressed, f(.) is a nonlinear function of both the state vector, x, and the input vector, u, and g(.) is a nonlinear function of the state vector, x. The corresponding nonlinear plant block diagram model,

40

, is shown in

FIG. 7

where input unit

401

accepts input vector u and state variable feedback x and produces the nonlinear output vector f(x,u) to which vector summer

402

adds noise vector w to form the differentiated state variable vector, {dot over (x)}. Vector integrator

403

operates on {dot over (x)} to produce state vector x at the output. State vector x is feedback to input unit

401

and to measurement nonlinear transformation unit

404

that transforms vector x by forming the output vector g(x) to which noise vector v is added for producing the measurement observation vector z.

Because the optimal nonlinear system state vector estimation is generally computationally intractable, common practical approaches approximate equations (12) by using finite difference equations and by adapting the linear methods discussed above after linearizing the above nonlinear plant differential equations (12) along a nominal trajectory, so that the equations take on the following form:

x

k+

1

=A

k

x

k

+B

k

u

k

+w

k

z

k

=C

k

x

k

+v

k

, for k=1,2, . . . (13)

These equations represent a time-varying linear model with index k, and discrete time state vectors. The nominal trajectory for the non-linear equations (12) must be available for computation of the linearized model.

Anderson et al. (op. cit., p. 195) apply Kalman filter methods to obtain a suboptimal estimator for the discrete model given by the following equations:

x

k+

1

=f

k

(

x

k

)+

g

k

(

x

k

)

w

k

z

k

=h

k

(

x

k

)+

v

k

(14)

where the trajectory index and the time sampling index are both k, f

k

(.) and h

k

(.) are nonlinear time dependent operators, g

k

is a time-dependent gain, v

k

and w

k

are mutually independent zero-mean white gaussian processes with covariances R

k

and Q

k

respectively and x

0

=N({overscore (x)}

0

,P

0

).

The nonlinear model is linearized by use of a Taylor series expansion about x={circumflex over (x)}

k|k

, so that

f

k

(

x

k

)≅

f

k

(

{circumflex over (x)}

k|k

)+

F

k

(

x

k

−{circumflex over (x)}

k|k

)

g

k

(

x

k

)≅

g

k

(

{circumflex over (x)}

k|k

)≡

G

k

h

k

(

x

k

)≅

h

k

(

{circumflex over (x)}

k|k−1

)+

H

k

T

(

x

k

−{circumflex over (x)}

k|k

)

where

F

k

=∂f

k

(

x

k

)/∂

x|

x={circumflex over (x)}

k|k

and

H

k

T

=∂h

k

(

x

)/∂

x|

x={circumflex over (x)}

k|k−1

. (15)

Assuming that {circumflex over (x)}

k

and {circumflex over (x)}

k|k−1

are known, the resulting quasi-linear signal model equations may be written so that they are similar in form to the linear equations used previously, as follows:

x

k+1

=F

k

x

k

+G

k

w

k

+χ

k

z

k

=H

k

T

x

k

+v

k

+η

k

(16)

where χ

k

and η

k

are available from estimates as

χ

k

=f

k

(

{circumflex over (x)}

k|k

)−

F

k

{circumflex over (x)}

k|k

η

k

=h

k

(

{circumflex over (x)}

k|k−1

)−

H

k

T

{circumflex over (x)}

k|k−1

(17)

The corresponding EKF equations are as follows:

{circumflex over (x)}

k|k

={circumflex over (x)}

k|k−1

+L

k

[z

k

−h

k

(

{circumflex over (x)}

k|k−1

)]

{circumflex over (x)}

k+1|k

=f

k

(

{circumflex over (x)}

k|k

)

L

k

=S

k|k−1

H

k

Ω

k

−1

W

k

=H

k

T

Σ

k|k−1

H

k

+R

k

Σ

k|k

=Σ

k|k−1

−Σ

k|k−1

H

k

[H

k

T

Σ

k|k−1

H

k

+R

k

]

−1

H

k

T

Σ

k|k−1

Σ

k|k+1

=F

k

Σ

k|k

F

k

T

+G

k

Q

k

G

k

T

where

Σ

0|−1

=P

0

=E

{(

x

0

−{overscore (x)}

0

)(

x

0

−{overscore (x)}

0

)

T

, and

{circumflex over (x)}

0|−1

={overscore (x)}

0

. (18)

FIG. 8

is a block diagram that shows the EKF equations (18).

FIG. 8

is labeled so as to correspond to the variables and matrices of equations (18).

The prior art EKF technology must be selectively and judiciously applied. Otherwise, poor performance can result due to stability of numerical integration, stability of the filtering equations and numerical error build-up. The finite time increments used in difference equation approximations can require fine increments that begin to approach infinitesimal increments. Consequently, the additional computation resulting from these fine-increment difference equations can impose an unacceptable computational burden on the control system processor.

Jazwinski, (A. H. Jazwinski, Stochastic Processes and Filtering Theory, Academic Press, 1970) describe an Extended Kalman Filter for continuos-time system without exogenous inputs u(t). Equations (14) above are replaced by

{dot over (x)}=f

(

x,t

)+

G

(

t

)

w

z=h

(

x,t

)+

v

(14a)

The filtering equations are identical to (18) except that the state estimate time propagation is performed by direct numerical integration given by

\begin{matrix} \hat{x} (t_{k + 1} &LeftBracketingBar; t_{k}) = \hat{x} (t_{k} &LeftBracketingBar; t_{k}) + \int_{t_{k}}^{t_{k + 1}} f (\hat{x} (t &LeftBracketingBar; t_{k}), t) ⅆ t & (18a) \end{matrix}

The use of direct numerical integration of the continuos-time nonlinear system presents several problems. Stability of the numerical integration, stability of the filtering equations and the computational load for real-time use are the main problems.

Instability of numerical integration arises from selection of step size in numerical integration to achieve feasible computational loads. Instability of the filter equations arise from mismatch between the linearized signal model used in the filter update and the implicit signal model in numerical integration. Typical physical models have widely different time scales leading to stiff differential equations (Reference: G. Golub and J. Ortega, Scientific Computing and Differential Equations, Academic Press, 1992). Prior methods using variable step solvers such as Gear's stiff method are used for such systems. Such variable step solvers impose highly variable data-dependent computational load. Highly variable computation load makes these methods inappropriate for real-time use. Fixed step methods such as Runge-Kutta or Euler require very small step size to ensure stability of numerical integration itself. Lack of numerical stability of integration leads to large errors. Numerical stability of the integration is not sufficient to provide stability of the real-time filter.

The present invention, by contrast, imposes fixed computation burden, permits time propagation in a single step that ensures numerical stability of integration as well as that of the filter. Stability of the filter in the present invention is arises from ensuring that

1) the time update for the state estimates and the covariance updates for the filtering equations use consistent time-varying linear system signal models; and

2) the time-varying linear signal model satisfies conditions for observability, controllability through disturbances and positivity of initial state error covariance matrix P

0

.

In the present invention, these conditions are easily satisfied regardless of accuracy of the non-linear system equation (12), (19) with respect to the real system that generates signals used by the filter so that stability is preserved under large model errors. This property is useful in applications of CEKF for adaptive processing, real-time system identification, hypothesis testing, fault detection and diagnosis where large model errors must be tolerated in real-time. Consequently, the resulting state estimates are stable and avoid the unstable consequences of prior-art EKF state estimation methods.

Prior-art formulation (14) and (14a) does not explicitly account for exogenous inputs u(t). In digitally controlled systems, exogenous inputs are typically outputs of zero-order-hold circuits. Exogenous inputs play a critical role in behavior of the controlled systems and have to be accurately represented in the model used in the filtering equations. Present invention addresses exogenous inputs explicitly. It also handles the digitally controlled exogenous inputs effectively.

A sensor processing method for the implementation of extended Kalman filter techniques for continuos-time nonlinear systems should provide the following features:

1) the use of numerical integration in the solution of continuous time nonlinear differential system equations in the state estimate time update that imposes low fixed, deterministic computation load;

2) covariance update and gain calculations that ensure stability of the combined state estimate update and the filter update in spite of mismatch between the system generating the signals and its assumed signal model in the filter;

3) the numerically stable square-root method for updating of the state vector covariance matrix should be used; and

4) reduced probability that minimization of the estimation error metric results in a local minimization rather than the desired global minimum.

SUMMARY OF THE INVENTION

A method for real-time nonlinear system state estimation in which the system is represented by a set of continuous-time nonlinear differential state equations that include a state vector representing the system states that are to be estimated, and a nonlinear system matrix that defines the functional relationship between the input vector, the state vector, and the time-derivative of the state vector. The state vector is estimated from a set of accessible measurements that are dependent on the state variables. The set of measurements, labeled as a measurement vector, are taken at prescribed times and represented by a measurement vector. Both the state vector and the measurement vector are respectively contaminated by a state noise vector and a measurement noise vector. The method includes the following steps:

(a) constructing a set of state equations representing the nonlinear system;

(b) establishing a set of state variable values for a current time state vector, and a set of matrix element values for one each covariance matrix for a current time state vector, for a current time state noise vector, and for a current time output measurement noise vector;

(c) acquiring a current time measurement vector;

(d) updating the state covariance matrices using the current time measurement covariance;

(e) estimating an updated current time state vector from the current time measurement vector by use of a state vector estimating filter that operates on the current time state vector using the current time measurement vector, the covariance matrices for the current time state vector, for the current time state, and for the current time measurement noise vector, the estimated updated state vector representing the state vector at the current measurement time;

(f) projecting the estimated updated state vector forward in time by integrating the system state equations over the prescribed time interval between the current prescribed measurement time and the next prescribed measurement time for obtaining a current state vector;

(g) projecting the updated state vector covariance matrix forward in time by using the results of the system equation integration of step (f); and

(h) at the next measurement time iterate steps (c)-(g).

Integration is performed by using matrix exponential in a single time step in a manner so that the signal models used by the filter state updates and the filter covariance updates are consistent.

Another embodiment includes the use of simulated annealing in the filter equations for reducing the probability of getting trapped in a local minimum of state estimation error metric rather than the global minimum.

Another embodiment includes the use of the above estimation techniques in closed-loop (feedback) control systems.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will be more fully understood from the detailed preferred embodiments of the invention, which, however, should not be taken to limit the invention to the specific embodiment but are for explanation and better understanding only.

FIG. 1

shows a basic state variable control system using an observer for state variable estimation.

FIG. 2

is a state variable signal model for a physical plant.

FIG. 3

is a Kalman filter implementation for estimating the state variables of the physical plant model shown in FIG.

2

.

FIG. 4

is a block diagram of a state variable control system using a Kalman filter as a state observer.

FIG. 5

is a discrete-time plant signal model.

FIG. 6

is a block diagram of a discrete-time Kalman filter suitable for estimating the state variables of the plant model shown in FIG.

5

.

FIG. 7

is a block diagram of a state variable nonlinear continuous-time physical plant model.

FIG. 8

is a discrete-time extended Kalman filter for use in estimating state variables in a nonlinear continuous-time physical plant as in FIG.

7

.

FIG. 9

shows an example of a nonlinear thermal kinetic process plant.

FIG. 10

shows a highly nonlinear heat transfer characteristic of a thermal kinetic system.

FIG. 11

is an example of a biochemical nonlinear batch processing system.

FIG. 12

is a block circuit diagram for a nonlinear system with exogenous inputs.

FIG. 13

shows the waveforms and timing of the CTEKF.

FIG. 14

is a block diagram of a CTEKF continuous-time linearized model.

FIG. 15

is a block diagram of a CTEKF discretized-time and linearized model.

FIG. 16

is a block diagram of a CTEKF.

FIG. 17

is a flow diagram of the CTEKF method.

FIG. 18

shows an example of a local and global minimum.

FIG. 19

is a flow diagram for using simulated annealing with the CTEKF method.

DETAILED DESCRIPTION OF THE PRESENT INVENTION

Computer models of complex physical processes, such as those involving heat, mass, and reacting flows, form the basis for controlling these processes. Traditionally, every effort is made to greatly simplify the model by decoupling and dissociating various aspects of the physical process in order to reduce the number of interacting factors which may complicate estimation of the desired properties. As a result, estimates of system model parameters and states are made using quasi-static tests with the hope of isolating and measuring individual properties of interest, free from unknown effects of other processes and confounding due to memory effects of dynamic transients. The primary deficiency of this approach is that many individual and isolated experiments are needed.

High complexity and lack of predictive accuracy are often cited as problems in using realistic physical models in real-time control and process optimization. However, a controller based on an inaccurate model generally leads to a significant performance penalty. Simultaneous estimation of unknown constant parameters and unknown endogenous variables from measurements is required to improve the models. This problem, known as the system identification problem (Reference Ljung, System Identification, Theory for the User, Prentice Hall, 1987), is a special case of the estimation problem addressed by this invention.

Most methods and software tools based on modem control theory are limited to linear time-invariant models. Simulation models are used for obtaining local linear models for system synthesis and for closed-loop testing and verification of the controller design. The reliability and the complexity of the simulation model refinement and verification task is a major bottleneck in such efforts. Efficient refinement and verification of the system model in the presence of noisy sensor data, which involves nonlinear state estimation and system identification, can greatly reduce these efforts and permit better designs.

The method to be described for estimating nonlinear system states provides an accurate and stable method in the design, test, diagnosis and control phases of these nonlinear systems.

As an example of a non-linear system, consider the thermal kinetic system

80

shown in

FIG. 9

in which a porous part

81

, a carbon preform, is placed in a reactor filled with a reactant bearing carbon species

83

for deposition of carbon in the porous spaces of the preform. Preform

81

is heated resistively by current passing through the preform on conductive leads

84

(or by inductive radio frequency current passing through inductive heating coil

85

). The surface temperature is sensed by a set of pyrometers

86

. The density of the deposits are measured indirectly by means of Eddy current probes

87

that induce a high frequency electrical current in the preform which is monitored by the readout sensors of the Eddy current probes. The determination of the thermal behavior of a preform during the deposition process is critical because poor thermal control leads to uneven densification and poor part strength. In this regard, it is important to determine the nonlinear boiling curve that represents the heat transfer rate between the reactant and the preform. Typically, as the preform surface temperature increases the mechanism of heat transfer from the preform to the liquid reactant varies with the surface temperature as shown in the boiling curve of FIG.

10

. Over the range of surface temperature shown, the boiling curve passes through three distinct modes known as the nucleate regime (A), the unstable transition regime (B), and the stable film regime (C). This is a highly nonlinear function of temperature in which an increase in temperature can cause a decrease in heat transfer rate due to the vapor film caused by boiling between the preform and the liquid reactant. The separation caused by the film requires a substantial increase in temperature in order to maintain a constant heat flux. The transition region is unstable because the associated eigenvalues are positive (stability is normally associated with the negative real part of the eigenvalue that imply exponentially damped response).

In this example, the state vector could include hundreds of surface and internal temperature points, the corresponding densities of the preform, reactant concentrations, and species concentration. Each surface element would therefore have an associated nonlinear boiling heat transfer with parameters that would vary with location and the physical configuration of the process shown in FIG.

9

. Based on a given physical implementation and available empirical model data, a set of nonlinear continuous-time differential equations can be generated that describe the thermal kinetic system of FIG.

9

. These equations are then used to form the model for the system as a basis for making refined estimates of the system state variables.

As another example, consider the cylindrical water-cooled batch bio-chemical reactor system

90

of

FIG. 11

that is used in the manufacture of enzymes and drugs. Reactor

90

includes the reaction vessel

91

wherein the batch reaction takes place, input ports

92

and output ports

93

, agitator mechanism

94

, cooling manifold

95

, and reaction chamber

96

. The state variable vector x has elements {x

i

} that may typically include: cell concentration x

1

, the substrate concentration x

2

, final product concentration x

3

, dissolved oxygen concentration x

4

, temperature of the medium x

5

, and the pH value of the medium x

6

. These state vector elements may be prescribed for various radial and axial locations in the reactor. The real-time measurement vector z has elements {z

i

} that typically may include measurements at various locations in the reactor chamber of dissolved oxygen concentration z

1

, temperature of the medium z

2

, and pH of the medium z

3

. Input vector u has elements {u

i

} that may include measured inputs such as: cell concentration u

1

, feed stream flow rate u

2

, effluent stream flow rate u

3

, and cooling flow rate u

4

. The rates of the nonlinear bio-chemical process can be described by the so-called Michaelis-Menten kinetics or by more detailed structured nonlinear kinetics equations. The heat transfer and mass transfer is described using standard chemical engineering calculations (Reference: Bailey and Ollis, Biochemical Engineering Fundamentals, 2

nd

edition, McGraw Hill, 1987).

While it is possible to draw samples from the batch reactor during progress of the reaction for measuring of cell-mass concentration, substrate concentration, and the final product concentration, it is not practical or cost-effective to do so on-line in real-time. By using a continuous-time extended Kalman filter (CTEKF) to estimate all state variables in real-time, CTEKF can provide a cost-effective means for monitoring and controlling the reaction. All input vector elements (e.g. flow rates of feed streams and effluents, and the cooling flow) can be controlled based on the real-time estimates of the state variables. The CTEKF method can provide real-time state variable estimates that are comparable to those obtained by expensive off-line (non-real-time) analytical instruments.

The present invention is designed to overcome the prior-art deficiencies and problems in estimating states of nonlinear systems. The method, a continuous-time extended Kalman filter (CTEKF) for state estimation, is an extended Kalman filter (EKF) for continuous-time nonlinear systems using discrete time measurements and integration.

The present invention, by contrast, imposes fixed computation burden, permits time propagation in a single step that ensures numerical stability of integration as well as that of the filter. Stability of the filter in the present invention is arises from ensuring that

1) the time update for the state estimates and the covariance updates for the filtering equations use consistent time-varying linear system signal models; and

2) the time-varying linear signal model satisfies conditions for observability, controllability through disturbances and positivity of initial state error covariance matrix P

0

.

In the present invention, these conditions are easily satisfied regardless of accuracy of the non-linear system equation (19), given below, with respect to the real system that generates signals used by the filter so that stability is preserved under large model errors. Consequently, the resulting state estimates are stable and avoid the unstable consequences of prior-art EKF state estimation methods.

FIG. 12

is a general model for a nonlinear system representative of the type of system to which the CTEKF method can be applied. The equations describing the nonlinear system model are

{dot over (x)}=f

(

x,u

)+

w

z=h

(

x,u

)+

v

(19)

where x is the state vector, u the input vector, z the measurement vector,f(x,u) and h(x,u) are nonlinear function of x and u, and w and v are mutually independent white noise vectors with respective covariances R and Q.

A common approach, as previously described with reference to

FIG. 7

, is to linearize and discretize the nonlinear model of eqs. (19) and

FIG. 12

along a nominal trajectory resulting in sequence of linear signal models characterized by eqs. (13)-(17). The linearized EKF filter is given by eqs. (18). This approach can lead to degraded, unstable, performance due to deviations from the nominal trajectory.

To overcome the degraded and unstable performance of the prior-art EKF filter methods, in the CTEKF method, the EKF method is combined with numerical integration in order to overcome the problems experienced with the prior-art EKF methods when applied to continuous-time nonlinear system estimation. The CTEKF method may be summarized as follows:

1) the CTEKF gain and the covariances are computed using the discrete-time system model matrices and the recent measurement;

2) the state vector estimate is updated using the measurement vector;

3) the continuous-time signal model equations are linearized at the current estimated state vector element values;

4) the coefficient matrices obtained from step (3) are used to obtain a discrete-time system model using a matrix exponential;

5) the nonlinear differential system equations are integrated to obtain the next state vector using the matrix exponential;

6) the covariance matrix is updated for time update using results from (4).

For added numerical reliability, the “square-root” approach to implementing the Kalman filter is used. The square-root implementation guaranties non-negativity of the propagated covariance matrices.

Referring back to

FIG. 12

, vector integrator

502

integrates vector {dot over (x)} and produces state vector x at the output. The state vector x in combination with input vector u is operated-on by nonlinear vector input network

501

and produces a nonlinear output vector function, f(x,u) to which white noise vector is added for producing the noisy state vector time derivative. The output, x, of integrator

502

, and the input vector, u, are supplied to nonlinear vector network

503

to provide a noise free measurement vector, y, that is added to noise vector v to produce the available measurement vector z.

The procedure for linearizing the signal model of FIG.

12

and eqs. (19) may be best understood by referring to the signal-time model of

FIG. 13

in which waveform (a) represents a series, {u

k

m

}, of constant values of T duration each of the m

th

element u

m

(t), at the k

th

sampling time. Waveform (b) represents the x

m

(t), of the state vector x. Waveform (c) represents the z

m

(t), of measurement vector z (not necessarily corresponding to x

m

). Waveform (d) indicates the time-interval within each sample-interval, T, during which a conditional estimated state vector update, {circumflex over (x)}

k+1|k

, is made during the k

th

step.

At the beginning of the k

th

step, the values {circumflex over (x)}

k|k−1

, u

k

, and z

k

, and matrices S

k|k−1

, Q

k

, R

k

, and C

k−1

are known. S

k|k−1

, Q

k

, and R

k

are, respectively, the square-root state vector covariance matrix (Σ=SS

T

), the input noise (w) covariance vector, and the measurement noise (v) covariance vector, as previously defined. C

k−1

is the linearized output matrix from the previous step.

Using saved values of matrices, a measurement update is made by use of the following QR factorization (Golub, G. H., Van Loan, C. F., “Matrix Computations”, Johns Hopkins Univ. Press, Baltimore, Md., 1983, 147-153):

\begin{matrix} [\begin{matrix} R_{in}^{T} & K_{k}^{T} \\ 0 & S_{k &LeftBracketingBar; k}^{T} \end{matrix}] = Q [\begin{matrix} R_{k}^{T / 2} & 0 \\ S_{k &LeftBracketingBar; k - 1}^{T} C_{k - 1}^{T} & S_{k &LeftBracketingBar; k - 1}^{T} \end{matrix}] & (20) \end{matrix}

where Q is the QR transformation matrix that is applied to the partitioned matrix shown on the right for producing the upper triangular matrix on the left that contains the required Kalman filter gain, K

k

, and the updated square-root conditional covariance S

k|k

, and T is a matrix transpose operator. State vector {circumflex over (x)}k|k−

1

is updated using the following relationship:

{circumflex over (x)}

k|k

={circumflex over (x)}

k|k−1

+K

k

R

in

−1

(

z

k

−h

(

{circumflex over (x)}

k|k−1

, u

k

)) (21)

In an alternative embodiment, the output equation for z in eqs. (19) is linearized at {circumflex over (x)}

k|k−1

, u

k

to obtain C

k

for use in the place of C

k−1

in eq. (20). It should be noted that only the transformed matrix of eq. (20) needs to be saved; the transformation matrix Q does not have to be explicitly created or stored. This results in significant savings of storage space and computational burden. Also, the solution of the system of linear equations (eq. (21)) involving R

in

is simplified because R

in

is in upper triangular form as a result of the QR transformation and R

in

−1

can be obtained by the well known “back-solve” method. The Householder method for QR factorization is a preferred method because it can be applied in place to the R matrix (Golub and Van Loan, op. cit, 40-41).

Also, because the measurement update (eqs. (20, 21)) and the time update are separate and distinct steps, it is possible to accommodate the omission of a measurement update if the measurement vector data is not available, or if the measurement vector data is grossly unreliable. Consequently, this provides a method for state vector estimation that accommodates measurement sensors with different rates of observation and also provides a robust state vector estimation method that would provide for a graceful (incrementally) degraded performance due to failures causing a lack of measurement observations. In other words, upon lack of measurement data, simply set

{circumflex over (x)}

k|k

={circumflex over (x)}

k|k−1

S

k|k

=S

k|k−1

(22)

and proceed to the next step.

The linearized signal model equations take on the following form:

{dot over (x)}=A

k

x+B

k

u

z

k

=C

k

x+D

k

u,

for

k

=1,2, . . . (23)

where k is the time index, as previously defined.

FIG. 14

is a block diagram representation of the linearized signal model.

Equations (19) are linearized at the values of the vector elements corresponding to the conditional estimated state vector {circumflex over (X)}

k|k

and input vector u

k

as shown in FIG.

14

. The matrix A

k

is constructed by perturbing the elements of the state vector x

k

by a small amount, δ(x

k

), that correspond to each column i, for obtaining a column of coefficients approximating the partial derivative of the i

th

column (vector), A

k

(i)

, with respect to the i

th

element, x

k

(i)

, of state vector x

k

. Thus, from eqs. (19),

A

k

(i)

=[f

(

x

+

, u

k

)−

f

(

x

—

, u

k

)]/δ(

x

k

(i)

)

where

x

±

={circumflex over (x)}

k|k

±δ(

x

k

(i)

)/2 (24)

Similarly, the columns of vectors B

k

,C

k

, and D

k

are obtained as follows:

B

k

(i)

=[f

(

x, u

+

)−

f

(

x, u_)]/δ(

u

k

(i)

)

C

k

(i)

=[h

(

x

+

, u

)−

h

(

x

—

, u

)]/δ(

x

k

(i)

)

D

k

(i)

=[h

(

x, u

+

)−

h

(

x, u_)]/δ(

u

k

(i)

)

where

u

±

=u

k

±δ(

u

k

(i)

)/2 (25)

The perturbation δ(.) is a sufficiently small deviation so that the local gain is well approximated by the approximate partial derivatives A

k

(i)

,B

k

(i)

,C

k

(i)

, and D

k

(i)

. When this procedure is applied to each column of matrices A

k

,B

k

,C

k

, and D

k

, the gain matrices of

FIG. 14

are defined for the interval corresponding to step k.

The next step time update procedure involves integrating the nonlinear equations (26) to obtain {circumflex over (x)}

k+1|k

by using {circumflex over (x)}

k|k

as the initial condition and u

k

as the constant input signal over the duration of the k

th

step as shown in waveform (a) of FIG.

13

.

The simplest method for implementing numerical integration of

{dot over (x)}=f

(

x, u

) (26)

over the k

th

step interval is by the use of Euler's integration algorithm, i.e.

{circumflex over (x)}

(

t

k

+h|t

k

)=

hf

(

{circumflex over (x)}

(

t

k

|t

k

),

u

k

)+

{circumflex over (x)}

(

t

k

|t

k

),

{circumflex over (x)}

(

t

k

|t

k

)=

{circumflex over (x)}

k|k

and

{circumflex over (x)}

(

t

k

+T|t

k

)=

{circumflex over (x)}

k+l|k

(27)

where h is the selected integration step size and {circumflex over (x)}

k|k

is the initial value. Unfortunately, for stiff systems, very small step sizes (h<<T) may be needed in order to have the finite difference equation (27) acceptably approximate integration. This could result in an unacceptable computational burden.

A further consideration is that a mismatch between F

k

used in the filter covariance propagation and the effective F

k

used in the simulation time update can cause stability problems in the resulting time-varying Kalman filter. This is because the Euler's approximate integration in eq. (27) is not the same as the matrix exponential used in computing F

k

covariance update in eq. (33) shown below.

Because it is highly desirable to avoid the mismatch in the transition matrix, F

k

, used in the covariance propagation and the effective transition matrix used in the state variable time update, and to also avoid the computational burden imposed by the use of Euler's approximate integration method, the following development shows how the matrix exponential can be used for both operations.

To propagate state estimates in time, i.e. {circumflex over ({dot over (x)})}(t)=f({circumflex over (x)}(t),u(t)), and {circumflex over (x)}(kT)={circumflex over (x)}

k|k

, a locally linear model is used as follows:

\begin{matrix} {\frac{ⅆ}{ⅆ t} (δ \hat{x} (t)) = A_{k} δ \hat{x} (t) + f ({\hat{x}}_{k &LeftBracketingBar; k}, u_{k}) (\frac{\partial f}{\partial \hat{x}}) &RightBracketingBar;}_{{\hat{x}}_{k &LeftBracketingBar; k}} = A_{k} & (28) \end{matrix}

Exact integration of (28) leads to equation (29) below.

\begin{matrix} \hat{x} (kT + τ) = \hat{x} (kT) + \int_{λ = 0}^{τ} e^{A_{k} (τ - λ)} f ({\hat{x}}_{k &LeftBracketingBar; k}, u_{k}) ⅆ λ & (29) \end{matrix}

By letting f

k

=f({circumflex over (x)}

k|k

,u

k

), eq. (29) becomes

\begin{matrix} \hat{x} ((k + 1) T) = \hat{x} (kT) + \int_{s = 0}^{T} e^{A_{k} (T - s)} f_{k} ⅆ s & (30) \end{matrix}

The integral expression may be readily evaluated by using the following matrix relationship:

\begin{matrix} e^{[\begin{matrix} A_{k} & f_{k} \\ 0 & 0 \end{matrix}] T} = [\begin{matrix} e^{A_{k} T} & \int_{0}^{T} e^{A_{k} (T - s)} f_{k} ⅆ s \\ 0 & I \end{matrix}] = [\begin{matrix} F_{k} & G_{k} \\ 0 & I \end{matrix}] & (31) \end{matrix}

where I is the identity matrix. Equation (31) allows the integral of equation (30) to be evaluated by exponential of the zero-augmented matrix on the left that contains state feedback matrix A

k

and vector f

k

. It should be noted that the matrix exponential in accordance with eq. (31) not only evaluates the integral but also provides the exponentiation of A

k

as well.

Also note that the same matrix, A

k

, appears in the evaluation of the integral. Therefore, the time propagation eqs. (31), (32) use linear discrete-time time-varying signal models consistent with the ones used in the covariance update eq. (20) and below in eq. (33). For example, if the expression for z in eqs. (19) is linear, the signal models are identical. Consequently, stability of the Kalman filter procedure is readily ensured even if the linear discrete-time time-varying signal model is totally inaccurate with respect to the original continuous-time nonlinear system as long as the linear signal model satisfies observability, controllability through disturbances and positivity of the initial covariance. The conditions of observability, controllability through disturbances and positivity is easy to satisfy through proper choice of noise covariances and selection of sensor signals. Also, unlike the Euler or other integration methods (explicit Runge-Kutta) the matrix exponential update method safely “residualizes” fast-states in a stiff system.

Care must be taken in the computation of the matrix exponential e

AT

. The blind use of series expansion can lead to estimation instabilities. In one preferred embodiment, the method involves squaring and scaling is used to ensure numerically reliable results (Petkov, Christov and Konstantinov, Computational Methods for Linear Control Systems, Prentice Hall, 1991).

The updated conditional state vector, {circumflex over (x)}

k+l|k

, can now be obtained by

{circumflex over (x)}

k+1|k

={circumflex over (x)}

k|k

+G

k

(32)

where {circumflex over (x)}

k|k

is defined by eq. (21).

The square-root of the state covariance vector, S

k|k

, is now updated by using another QR factorization:

\begin{matrix} [\begin{matrix} S_{k + 1 &LeftBracketingBar; k}^{T} \\ 0 \end{matrix}] = Q [\begin{matrix} S_{k &LeftBracketingBar; k}^{T} F_{k} \\ Q_{k}^{T / 2} \end{matrix}] & (33) \end{matrix}

Again, the transformation can be performed in place because only the transformed matrix on the left needs to be retained.

FIG. 16

is a block diagram representation of the CTEKF. The block elements have indicia that are identical to the indicia used in

FIG. 15

for those items that are the same. Block element

608

is vector G

k

computed in equation (31). Block element

611

is the nonlinear vector function h(x, u) that maps states and exogenous inputs to output as in equation (19). Block elements

607

-

610

differ from those of FIG.

15

. Element

610

is a unit for taking the difference between the estimated measured output and the actual measured output, (z

k

−{circumflex over (z)}

k

) as implied in eq. (21), which is fed to element

607

, a matrix gain unit based on values generated by means of eq. (20). The output of element

607

is applied to vector adder

609

. The second input to

609

is provided by the output of delay unit

602

′ for providing {circumflex over (x)}

k|k

at the output of adder

609

. The output of adder

609

is combined with G

k

by adder

605

to produce x

k+1

as the input to delay unit

602

′.

FIG. 17

is a flow diagram of method CTEKF 700 that summarizes the method for implementing a CTEKF for estimating the state variables of a nonlinear system of the type shown in FIG.

9

. The method begins at step

700

with a set of initial values: {circumflex over (x)}

0|−1

, u

0

, z

0,

, C

−1

,S

0|−1

, and R

0

that have been defined previously. The iteration index k is set at k=0. At step

701

, a determination is made if a measurement update is to be made, and, if so, the process goes to step

702

where a measurement update is made using QR factorization in accordance with eqs. (20) and (21). Otherwise step

703

is invoked in which the previous values {circumflex over (x)}

k|k−1

and S

k|k−15

are used as the new measurement updated values. At step

704

, a model is created that is linearized about ({circumflex over (x)}

k|k

, u

k

) and matrices A

k

, B

k

, C

k

, and D

k

are computed in accordance with eqs. (23)-(25). At Step

705

time update using matrix exponential is carried out in accordance with eqs. (31) and (32). Steps

704

and

705

are followed by a time update in step

706

in which the state error covariance update is carried out using a square-root state vector covariance matrix as in eq. (33). Index k is incremented in step

707

and the process returns to step

701

.

One difficult problem often encountered in nonlinear recursive state and parameter estimation or parameter tracking is getting stuck in a local minimum that may be far away from the desired optimum global minimum. One method for handling this problem is to make multiple estimates with each estimate initiated with a random perturbation. Simulated annealing is a related method that uses sequences of random perturbations with programmed gain (temperature) schedule that allows estimates to escape local minima. For example, see Rutenbar, R. A., “Simulated Annealing Algorithms: An Overview”, IEEE Circuits and Devices Mag., pp. 19-26, January 1989.

FIG. 18

is a one dimensional example of the principle involved in simulated annealing in which the estimation error cost function is plotted as function of the value of the single state variable x. In seeking the global minimum at x=x

1

, there is a chance that the local minimum at x=x

2

will be selected by the Kalman filter. Because the estimated solution is “dithered” by system and measurement noise about the minimum selected, there is a chance that a noise perturbation will cause an estimate to “jump” out of a local minimum at x=x

2

into the valley of the global minimum at x=x

1

if the noise perturbation is sufficiently large. Simulations of nonlinear systems with local minimums have shown increased covariance Q

k

of the system noise w and/or increased covariance of the initial state vector, P

o

, permit convergence to the global minimum when the Kalman filter is started at a local minimum even when the assumed system signal model does not accurately represent the true system signal model.

In the context of nonlinear recursive state estimation, the process noise covariance Q may be controlled by applying a gain multiplier and a larger initial value which is decreased as a function of time. Also, the initial state vector covariance estimate, P

o

, may be increased by a large factor. The increase in either initial covariance allows a greater initial correction of the state vector estimate than the theoretical stochastically optimal value. However, the Kalman filter Riccati equation update ensures overall stability while permitting more diffusion noise to enter the Kalman filtering equations.

A typical time varying “temperature schedule” gain multiplier has the following form:

ξ

k

=ξ(

kT

)={square root over (

2c

+L /ln(2+

KT

+L ))} (34)

Q

k

=Q

0

(1+ξ

k

) (35)

where c is a constant (typically 0.5<c<1), Q

0

is a positive definite symmetric matrix corresponding to the initial process noise covariance matrix, Q

k

is the process noise covariance matrix at time kT, and In(.) is the natural logarithm.

We define an error metric as the scalar

c

k

=trace(

S

k|k

S

k|k

T

) (36)

where S

k|k

is defined in equation (20).

A multi-path simulated annealing algorithm is used that includes the following steps:

(i) choose N, the number of parallel CTEKF paths, typically between

4

and

10

;

(ii) compute a temperature schedule and the corresponding process noise covariance matrices according to eqs. (34) and (35);

(iii) initialize each of the N CTEKF state estimates using random initial state vectors in some region;

(iv) update the N CTEKF according to eqs. (20)-(25), and (31)-(33);

(v) compute the scalar value c

k

=trace(S

k|k

S

k|k

T

) for each CTEKF and choose the estimate corresponding to the smallest c

k

=trace(S

k|k

S

k|k

T

) as the best estimate and output it as required; and

(vi) identify the CTEKF with the highest value of c

k

=trace(S

k|k

S

k|k

T

) and add to its state estimate a random vector of variance equal to c

k

; increment k the iteration index.

An alternative embodiment skips addition of a random vector in step (vi) above for every m steps (typically m=100).

FIG. 19

is a flow diagram of a simulated annealing procedure (900) adapted to the CTEKF method. The procedure begins at step

901

in which the number of independent simulated annealing procedures, N, using the CTEKF method is specified. In step

902

, a common gain schedule, {ξ(kT)}, and process noise covariance is specified as in eqs. (34) and (35). In step

903

, N different randomized initial state vectors, one for each CTEKF path, are specified. Step

904

involves running N CTEKFs (paths) initialized in step

903

with each CTEKF operating on the same set of input and measurement observations for yielding N separate state variable vector estimates and covariance matrices at each sampling interval of T. In step

905

, compute the scalar value c

k

=trace(S

k|k

S

k|k

T

) for each CITEKF and choose the estimate corresponding to the smallest c

k

=trace(S

k|k

S

k|k

T

) as the best estimate and output it as required. In step

906

, identify the CTEKF with the highest value of c

k

=trace(S

k|k

S

k|k

T

) and add to its state estimate a random vector of variance equal to c

k

. Increment k the iteration index. The process is iterated by returning to step

904

.

A control system using CTEKF can be implemented as shown in

FIG. 1

where sensor unit

12

corresponds to the measurement system producing measurement vector z(t), state observer

13

corresponds to the CTEKF system described above and in

FIGS. 13-17

, and controller unit

14

is implemented by well known optimal control methods that optimize a performance index and use the Certainty Equivalence Principle (Bryson and Ho, op. cit.). The Certainty Equivalence Principle permits use of the estimated state in place of measured states in an optimal control solution.

As will be understood by those skilled in the art, many changes in the methods described above may be made by the skilled practitioner without departing from the spirit and scope of the invention, which should be limited only as set forward in the claims which follow.

Claims

1. A method for estimation of state variables of a physical nonlinear system with physical exogenous inputs that can be modeled by a set of continuous-time nonlinear differential equations, including input signals representing the physical exogenous inputs to the system, and measurement signals representing accessible measurements that are indicative of the state variables to be estimated, the method comprising:a) creating a nonlinear differential equation model of a nonlinear system using initial estimated state variables; b) obtaining the nonlinear system measurement signals in response to the nonlinear system input signals; c) creating an updated nonlinear differential equation model using the nonlinear differential equation model, the nonlinear system input and measurement signals, and a state variable estimation method for refining the initial estimated state variables, the state variable estimation method for producing updated estimated state variables using integration methods for state variable estimation of the nonlinear system with exogenous inputs; d) using a discrete-time covariance matrix update; and e) using a common linear perturbation signal model for estimating the updated state variables and a corresponding updated covariance matrix, wherein the linear perturbation model is a discrete-time time-varying linear system model.
2. The method of claim 1 wherein the square-root of the covariance matrix is propagated.
3. The method of claim 1 further comprising a step for simulated annealing by running multiple estimators and choosing the best estimate based on an error metric.
4. The method of claim 3 wherein an increased process noise covariance in the state variable estimation method is used, the increased process covariance generated in accordance with a prescribed schedule that begins by amplifying the initial process noise covariance.
5. The method of claim 1 wherein the state variable estimation method is based on an extended Kalman filter, modified for state variable time propagation by integration.
6. The method of claim 1 wherein the nonlinear differential equation model is given by {dot over (x)}=f(x, u)+w and z=h(x, u)+v where x is the state {dot over (x)} is a time derivative of the state variable vector x, f(x,u) is a nonlinear function of x and input vector u, z is a measurement vector which is function, h(x,u) of both x and u, and w and v are mutually independent noise vectors and wherein a state variable estimate, {circumflex over (x)}k, for time t=kT is propagated to a state variable estimate, {circumflex over (x)}k+1, for time t=(k+1)T by integrating the nonlinear differential equation model over a time interval of T as follows: x^k+1=x^k+∫s=0T⁢eAk⁢(T-s)⁢fk⁢ ⁢ⅆs,where fk=f({circumflex over (x)}k|k, uk) evaluated at time t=(kT), and Ak is a discrete state feedback matrix, evaluated at time t=kT, from a linearized approximation to f(x,u).
7. The method of claim 6 wherein integrating is performed by evaluating a matrix exponential as follows: e[Akfk00]⁢T=[eAk⁢T∫0T⁢eAk⁡(T-s)⁢fk⁢ ⁢ⅆs0I]=[FkGk0I],so that a time propagated state vector is obtained as follows,{circumflex over (x)}k+1|k={circumflex over (x)}k|k+Gk, and, by QR factorization, an updated conditional square-root covariance, Sk+1|k, for time (k+1) is obtained from [Sk+1⁢&LeftBracketingBar;kT0]=Q⁡[Sk⁢&LeftBracketingBar;kT⁢FkQkT/2]where Sk|k conditional square-root covariance matrix for time k, Qk is the process noise vector covariance, Q is a QR transformation matrix, and T represents a matrix transpose operator thereby using a same computational model for updating the estimated state variable and associated covariance.
8. The method of claim 7 further comprising a step for simulated annealing by running multiple estimators and choosing a best estimate based on an error metric.
9. The method of claim 8 wherein each estimator is initialized by a different random initial state vector.
10. The method of claim 8 wherein the error metric is a scalar value, ck, computed as ck=trace(Sk|kSk|kT).
11. The method of claim 8 wherein an increased process noise covariance is used in the state variable estimation method, the increased process noise covariance generated in accordance with a prescribed schedule that begins by amplifying the initial process noise covariance.
12. The method of claim 11 wherein the prescribed schedule comprises:a) computing a temperature schedule, ξk, that decreases as a function of time index k; b) computing a gain multiplier as (1+ξk); and c) multiplying the initial process noise covariance, Q0, by the gain multiplier for producing a process noise covariance at time index k as Qk=Q0(1+ξk).
13. The method of claim 12 wherein ξk is computed as ξk={square root over (2+L c /ln(2+L +kT))}, where c is a prescribed constant, and ln(.) is a natural logarithm function.
14. The method of claim 1 further comprising:estimating an updated covariance of the updated estimated state variables using a common state transition matrix for estimating the updated estimated state variables and the updated covariance for improved estimation quality; and computing a controller which uses the set of current-step system state variable estimates produce a set of system input control signals for controlling the physical inputs to the system, the controller is obtained by use of a performance index that specifies an objective performance criterion for the system.
15. A state vector estimation method for a real-time nonlinear system with exogenous inputs, the system represented by a set of continuous-time nonlinear differential state equations that include a state vector representing the system state vector that are to be estimated, and a nonlinear system matrix that defines the functional relationship between the input vector, the state vector, and the time-derivative of the state vector, the state vector being estimated from a set of accessible measurements that are representative of the state variables, the set of measurements, labeled as a measurement vector, taken at prescribed times and represented by a measurement vector, both the state vector and the measurement vector respectively contaminated by a state noise vector and a measurement noise vector, the method comprising the following steps:(a) constructing a set of state equations representing the nonlinear system with exogenous inputs; (b) establishing a set of state variable values for a current-time state vector, and a set of matrix element values for one each covariance matrix for a current-time state vector, for a current-time state noise vector, for a current-time input noise vector, and for a current-time output measurement noise vector; (c) acquiring a current-time measurement vector; (d) updating the state covariance matrices using the current-time measurement noise covariance vector; (e) estimating an updated current-time state vector from the current time measurement vector by use of a state vector estimating filter that operates on the current-time state vector using the current-time measurement vector, the covariance matrices for the current time state vector, for the current time state, and for the current-time measurement noise vector, the estimated updated state vector representing the state vector at the measurement current-time; (f) projecting the estimated updated state vector by integrating the system state equations over the prescribed time interval between the prescribed measurement current-time and the next prescribed measurement current-time for obtaining a current state vector using a discrete-time covariance matrix update; (g) projecting the updated state vector covariance matrix using the results of the system equation integration of step (f) using a linear perturbation signal model for estimating the updated state variables and a corresponding updated covariance matrix, wherein the linear perturbation model is a discrete-time time-varying linear system model; and (h) at the next measurement time, iterating steps (c)-(g).
16. The method of claim 15 wherein the square-root of the state covariance matrices are propagated.
17. The method of claim 15 wherein the updating of the state covariance matrix is performed by a matrix transformation that produces a triangular square-root covariance matrix.
18. The method of claim 17 wherein the matrix transformation is a QR factorization.
19. The method of claim 15 wherein the system equation is linearized at the initial value of the state vector, the linearization creating a locally linearized state feedback gain matrix that defines the state vector contribution to the time derivative of the state vector, given the current time input vector and the current-time state estimate.
20. The method of claim 19 wherein the linearization also creates a locally linearized input gain matrix for relating the input vector contribution to the time derivative of the state vector, given the current state and the current input vector.
21. The method of claim 19 wherein the step of updating the state vector by integrating the augmented locally linear state feedback gain matrix is performed by a matrix exponential.
22. The method of claim 15 wherein the steps updating the current-time state vector covariance matrix and the step of estimating an updated current-time state vector is not executed because the current-time measurement is not usable or available, and the current-time projected estimated updated state vector, together with the current-time projected updated state vector covariance matrix, are used as the estimated updated current time state vector and updated current time covariance matrix, respectively.
23. A method for estimation of state variables of a physical nonlinear system, the method being applicable to systems that can be modeled by a set of continuous-time nonlinear differential equations and to systems that can be modeled by discrete-time nonlinear difference equations, the method using measurement signals representing accessible measurements that are indicative of the state variables to be estimated, the method comprising:a) creating an extended Kalman filter (EKF) for estimating system state variables from the measurement signals in a dynamic system; and b) applying simulated annealing by running multiple EKFs and choosing the best estimate based on an error metric, wherein simulated annealing is initiated by a random perturbation.
24. The method of claim 23 wherein an increased process noise covariance is used in the EKF, the increased process noise covariance generated in accordance with a prescribed schedule that begins by amplifying the initial process noise covariance.
25. The method of claim 23 wherein an increased state vector covariance is used in the EKF, the increased state vector covariance generated in accordance with a prescribed schedule that begins by amplifying the initial state vector covariance.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation application of application Ser. No. 08/917,053, filed Aug. 22, 1997, U.S. Pat. No. 5,991,525.

US Referenced Citations (22)

Number	Name	Date
4368510	Anderson	Jan 1983
4577270	Sugano et al.	Mar 1986
4852018	Grossberg et al.	Jul 1989
4881178	Holland et al.	Nov 1989
4974191	Amirghodsi et al.	Nov 1990
5040214	Grossberg et al.	Aug 1991
5060132	Beller et al.	Oct 1991
5191521	Browsilow	Mar 1993
5214715	Carpenter et al.	May 1993
5355305	Seem et al.	Oct 1994
5377307	Hoskins et al.	Dec 1994
5412060	Wulff et al.	May 1995
5453226	Kline et al.	Sep 1995
5464369	Federspiel	Nov 1995
5506794	Lange	Apr 1996
5517594	Shah et al.	May 1996
5740033	Wassick et al.	Apr 1998
5748847	Lo	May 1998
5805447	Teng et al.	Sep 1998
5859773	Keeler et al.	Jan 1999
5963929	Lo	Oct 1999
5991525	Shah et al.	Nov 1999

Foreign Referenced Citations (2)

Number	Date	Country
9817439	Aug 1998	WO
9823889	Nov 1998	WO

Non-Patent Literature Citations (14)

Entry
Rutenbar, Rob A., “Simulated Annealing Algorithms: An Overview,” IEEE Circuits & Devices Magazine, Jan. 1989, vol. 5, No. 1, pp. 19-26.
Anderson, Brian D. O. & Moore, John B., “Optimal Filtering,” Prentice-Hall Information & System Sciences Series, 1997 pp. 1-61, 148-149, 192-197.
N.U. Ahmed, et al., “Modified Extended Kalman Filtering,” IEEE 1994, pp. 1322-1326.
Franklin, Gene F., et al., “Digital Control of Dynamic Systems,” New York: Addison-Wesley, 1992. pp. 506-513.
Monzumder, et al., “A Monitor Wafer Based controller for Semiconductor Processes,” In: IEEE Transactions on Semiconductor Manufacturing. New York: IEEE, 1994, pp. 400-411.
Gyugyi, et al., “Model-Based Control of Rapid Thermal Processing Systems.” In: First IEEE Conference of Control Applications. New York: IEEE, 1992, pp. 374-381.
Boning, D. “Run by Run Control Benchmarking: A White Paper.” Web page, http://www-mtl.mit.edu/rbrBench/whitepaper.html. 1996.
S. Joe Qin & Thomas A. Badgwell, An Overview of Industrial Model Predictive Control Technology, pp. 1-31.
Minh S. Le, Taber H. Smith, Duane S. Boning & Herbert H. Sawin, Run-to-Run Process Control on a Dual-Coil Transformer Coupled Plasma Etcher with Full Wafer Interferometry & Spatially.
Taber H. Smith, Duane S. Boning, Jerry Stefani & Stephanie Watts Butler, “Run by Run Advanced Process control of Metal Sputter Deposition,” 9 pages.
Duane Boning, William Moyne, Taber Smith, James Moyne & Arnon Hurwitz, Practical Issues in Run by Run Process Control, 1995, pp. 1-18.
Scott A Vander Wiel, William T. Tucker, Frederick W. Faltin & Necip Doganaksoy, Algorithmic Statistical Process Control: Concepts & an Application, Aug. 1992, vol. 34, No. 3, pp. 286-297.
Gene F. Franklin, J. David Powell & Michael L. Workman, Digital Control of Dynamic Systems, 1990, pp. 444-515.
Joan L. Mitchell, William B. Pennebaker, Chad E. Fogg, & Didier J. Legall, “MPEG Video Compression Standard,” 1997, pp. 283-311.

Continuations (1)

	Number	Date	Country
Parent	08/917053	Aug 1997	US
Child	09/372396		US

Method for real-time nonlinear system state estimation and control

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

US Classifications

Field of Search

US

International Classifications