The invention relates generally to state estimation after processing measurements of systems characterized by measurement errors and multidimensional parameters, for which the latter are unknown but may vary arbitrarily in time within physical bounds. In a particular aspect, the invention relates to the tracking of moving targets using estimation, which takes into consideration measurement errors and physical bounds or limits on parameters of the target track.
Consider the problem of tracking an airplane whose trajectory in three dimensions is an arbitrary curve with bounded instantaneous turn rate and tangential acceleration. The parameters of this tracking problem are the turn rate ω (which can be related to the curvature of the trajectory) and the tangential acceleration α. These parameters, ω and α, are neither exclusively constant nor strictly white noise stochastic processes, but vary arbitrarily in time within physical bounds. Time dependent, but bounded, parameters ω and α typically represent a maneuvering target as described in Y. Bar-Shalom, X. R. Li, and T. Kirubarajan, Estimation with Applications to Tracking and Navigation: Theory, Algorithms, and Software, New York, N.Y.: John Wiley & Sons, Inc., 2001, and in X. R. Li and V. P. Jilkov, “A Survey of Maneuvering Target Tracking—Part IV: Decision-Based Methods,” Proceedings of SPIE Vol. 4728 (2002), pp. 511–534.
This problem belongs to a more general problem of estimating the state of a system that depends on time dependent parameters with unknown values but with known bounds. In some situations, the Kalman filter solves this problem by including the parameters as part of an augmented state to be estimated, as described in C. Bembenek, T. A. Chmielewski, Jr., and P. R. Kalata, “Observability Conditions for Biased Linear Time Invariant Systems,” Proceedings of the American Control Conference, pp. 1180–1184, Philadelphia, Pa., June 1998, B. Friedland, “Treatment of Bias in Recursive Filtering,” IEEE Transactions on Automatic Control, pp. 359–367, Vol. AC-14, No. 4, August 1969, and D. Haessig and B. Friedland, “Separate-Bias Estimation with Reduced-Order Kalman Filters,” IEEE Transactions on Automatic Control, pp. 983–987, Vol. 43, No. 7, July 1998. Such a filter will be called a “full state” estimator. However, the parameters may vary too erratically to be considered as observables, as noted by G. J. Portmann, J. R. Moore, and W. G. Bath, “Separated Covariance Filtering,” Record of the IEEE 1990 International Radar Conference, 1990, pp. 456–460, or there may be too many parameters to estimate. When the parameters cannot be estimated, filters that do not augment the state vector with these parameters often give better performance. Such a filter will be called a “reduced state” estimator. More generally, a “reduced state” or “reduced order” estimator uses fewer states than would be required to completely specify the dynamics.
The concept of “full state estimation” is fundamentally different from that of “reduced state estimation.” In the full state estimation context, the state estimation technique attempts to learn the unknown parameters (such as the turn rate ω and the tangential acceleration α in the above mentioned airplane example). The reduced state estimator, by contrast, is not designed to perform any learning at all. In the airplane example, the bounded parameters ω and α are expected to change during the learning process, so that, at any given time, a learned parameter, such as ω or α, is likely to bear no relation to the actual parameter at that time.
According to Portmann, Moore, and Bath (supra), a full state estimator assumes “that accelerations last long enough and are constant enough to be observed and estimated.” Li and Jilkov (supra) observe that a full state estimator “suffers from two major deficiencies, which stem from assuming constant input and known onset time.” Except in the case of target maneuvers, target trajectories are very predictable. Since the onset time of a maneuver is not known, maneuvers are difficult to model as stochastic processes. For this reason, full state estimators are rarely used to track maneuvering targets. Kalman filters with white plant noise are currently used as reduced state estimators. Such Kalman filters are not necessarily optimal. Portmann, Moore and Bath could not solve the problem beyond a single parameter in a one-dimensional tracking scenario. They state in their article that their filter “can be modified in a straightforward manner to permit operation in two or three coupled dimensions” and that “The major differences” in one dimension versus multiple dimensions “lie in the special treatment of the cross-gain terms when computing the lags and in the form of the minimization process.” However, their method cannot be generalized beyond one dimension, and no such solution was ever published by them or by anyone else. In particular, their use of absolute values and signs of their single parameter cannot be generalized (see equations (20) and (21) in G. J. Portmann, J. R. Moore, and W. G. Bath, “Separated Covariance Filtering,” Record of the IEEE 1990 International Radar Conference, 1990, pp. 456–460).
Bar-Shalom, Blair, Li, Moore, and Kirubarajan (supra) define a track filter to be consistent if the state errors and innovations (i.e., measurement residuals) satisfy the tenets of Kalman filter theory, namely that the state estimation and innovation covariances correctly characterize the actual errors, and that the innovations are a white stochastic process as additionally set forth in W. D. Blair and Y. Bar-Shalom, “Tracking Maneuvering Targets with Multiple Sensors: Does More Data Always Mean Better Estimates?” IEEE Transactions on Aerospace and Electronic Systems, pp. 450–456, Vol. AES-32, No. 1, January 1996 and J. R. Moore and W. D. Blair, “Practical Aspects of Multisensor Tracking,” in Multitarget-Multisensor Tracking: Applications and Advances, Volume III, Y. Bar-Shalom and William Dale Blair, (ed.) Boston, Mass.: Artech House, 2000, pp. 43–44. Specifically, these three conditions for Kalman filter consistency are as follows:
Another definition of filter consistency from Bar-Shalom, Li, and Kirubarajan is that “A state estimator is consistent if the first and second order moments of its estimation errors are as the theory predicts.” This definition also applies to reduced state estimators, and is satisfied if the RMS state estimation errors lie within the one-sigma error ellipsoid of the state covariance as calculated by the filter. As stated by Bar-Shalom, Li, and Kirubarajan, “Since the filter gain is based on the filter-calculated error covariances, it follows that consistency is necessary for filter optimality: Wrong covariances yield wrong gain. That is why consistency evaluation is vital for verifying a filter design—it amounts to evaluation of estimator optimality.”
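The one-sigma criterion just stated can be illustrated with a short numerical check. The following is a minimal sketch, not part of the cited references: it assumes that a set of estimation error samples (estimate minus truth) is available from simulation, and it reads "RMS errors lie within the one-sigma ellipsoid" as the requirement that the filter covariance minus the sample mean-square error matrix has no negative eigenvalue. The function name `is_consistent` and the example covariance values are hypothetical.

```python
import numpy as np

def is_consistent(errors, S):
    """Check a filter-calculated covariance S against sampled errors.

    errors: (n_runs, n_states) array of estimate-minus-truth samples.
    S:      (n_states, n_states) covariance reported by the filter.
    """
    # Sample mean-square error matrix of the estimation errors.
    mse = errors.T @ errors / len(errors)
    # The error ellipsoid of mse lies inside the one-sigma ellipsoid of S
    # exactly when S - mse is positive semidefinite.
    return bool(np.linalg.eigvalsh(S - mse).min() >= 0.0)

# Hypothetical example: errors drawn with a known covariance.
rng = np.random.default_rng(0)
S_true = np.array([[4.0, 1.0], [1.0, 2.0]])
errors = rng.multivariate_normal(np.zeros(2), S_true, size=5000)

conservative = is_consistent(errors, 1.5 * S_true)  # filter covariance too large
optimistic = is_consistent(errors, 0.2 * S_true)    # filter covariance too small
print(conservative, optimistic)
```

Under this reading, a filter that reports a covariance much smaller than its actual errors fails the check, which is the situation described for a Kalman filter tracking a maneuvering target.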
Consequences of filter inconsistency for tracking of maneuvering targets may be:
According to Moore and Blair (supra), “Track filter consistency is critical for effective fusion of data from multiple sensors with diverse accuracies. Maneuvering targets pose a particularly difficult challenge to achieving track filter consistency.” Blair and Bar-Shalom (supra) have shown an example where a Kalman filter used as an inconsistent reduced state estimator paradoxically yields worse errors with multisensor tracking than with single sensor tracking.
Note that when a filter is used to support a decision process, such as collision avoidance or detection-to-track correlation, the measure of performance is the frequency of false decisions. As stated by Portmann, Moore, and Bath (supra), “At any decision point in time (not necessarily at the time of a measurement), one needs both the best available estimate of object state and a firm confidence interval for this state that allows one to say with specified probability that the object state is in some region about the estimate regardless of whether it is accelerating and regardless of how long it has been accelerating. The confidence interval should be valid for an extreme target acceleration sequence which is based on what is known about target dynamics.”
As a reduced state estimator, the Kalman filter suffers from several difficulties in addition to its inconsistency for supporting decisions as discussed in the above mentioned Portmann, Moore, and Bath references, and in P. Mookerjee and F. Reifler, “Application of Reduced State Estimation to Multisensor Fusion with Out-of-Sequence Measurements,” Proceedings of the 2004 IEEE Radar Conference, Philadelphia, Pa., Apr. 26–29, 2004, pp. 111–116.
x(k+1)=Φx(k)+Γu(x(k),λ) (1)
where x(k) is the state vector at the kth sample time tk for k=0, 1, 2, . . . , and u(x(k),λ) is a system input that is a function of the state vector x(k) and arbitrarily unknown time-varying parameters λ with known bounds. This input function u(x(k),λ) may be nonlinear or linear. Here the matrices Φ and Γ are the system transition and input matrices at time tk. In general, these system matrices represent the relationship between the current state and the previous state of the system. The parameters λ are neither constant nor stochastic processes. These parameters have a known mean value
z(k)=Hx(k)+n(k) (2)
where at time tk, the matrix H is the measurement matrix and n(k) is the kth sample of the measurement noise, whose covariance matrix is N.
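Equations (1) and (2) can be illustrated with a small simulation. The sketch below is an assumption-laden example rather than the model of any cited reference: it takes a one-dimensional target with position and velocity as state, lets the parameter λ be an acceleration that switches arbitrarily between bounded values, and uses the linear input u = λ. The names `a_max` and `lam` and all numerical values are hypothetical.

```python
import numpy as np

T = 1.0                                   # sample interval (assumed)
Phi = np.array([[1.0, T], [0.0, 1.0]])    # system transition matrix
Gamma = np.array([[0.5 * T**2], [T]])     # system input matrix
H = np.array([[1.0, 0.0]])                # position-only measurement matrix
N = np.array([[1.0]])                     # measurement noise covariance
a_max = 2.0                               # assumed bound on the parameter

def lam(k):
    """Bounded, deterministic, time-varying parameter: neither constant nor white noise."""
    if k < 5:
        return a_max
    if k < 10:
        return 0.0
    return -a_max

rng = np.random.default_rng(1)
x = np.array([0.0, 1.0])                  # initial position and velocity
states, measurements = [], []
for k in range(15):
    n = rng.multivariate_normal(np.zeros(1), N)
    states.append(x)
    measurements.append(H @ x + n)             # equation (2)
    x = Phi @ x + Gamma @ np.array([lam(k)])   # equation (1) with u = λ
```

The parameter sequence here is fixed by hand to emphasize that it is bounded but otherwise arbitrary; any other sequence within ±`a_max` would be an equally valid truth model.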
In
Block 214 of the prior art Kalman filter represents the accessing or inputting of system transition matrices Φ, Γ, F, and G, where
From block 214, the logic of the prior art Kalman filter of
From block 216, the logic of the prior art Kalman filter of
x̂(k+1|k)=Φx̂(k|k)+Γu(x̂(k|k),λ) (5)
and its covariance is
S(k+1|k)=FS(k|k)F′+GWG′ (6)
From block 218, the logic of the prior art Kalman filter of
Q=HS(k+1|k)H′+N (7)
The filter gain matrix K is calculated as
K=S(k+1|k)H′Q⁻¹ (8)
and the matrix L is calculated as
L=I−KH (9)
where I is the identity matrix.
From block 222 of
x̂(k+1|k+1)=x̂(k+1|k)+K[z(k+1)−Hx̂(k+1|k)] (10)
S(k+1|k+1)=LS(k+1|k)L′+KNK′ (11)
The prior art Kalman filter uses the white process noise covariance W in (6) and obtains the optimal gain matrix K that minimizes the updated state covariance S(k+1|k+1) in (11).
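The prior art recursion of equations (5) through (11) can be sketched in a few lines. This is an illustrative sketch only: the function name `kalman_step` is hypothetical, the input is passed already evaluated as a vector `u`, and the example assumes a linear position-velocity model in which F = Φ and G = Γ. All numerical values are made up for illustration.

```python
import numpy as np

def kalman_step(x, S, z, Phi, Gamma, u, F, G, W, H, N):
    """One extrapolation and update cycle following equations (5)-(11)."""
    # Extrapolate the state estimate and its covariance, equations (5)-(6).
    x_pred = Phi @ x + Gamma @ u
    S_pred = F @ S @ F.T + G @ W @ G.T
    # Residual covariance, filter gain, and L = I - KH, equations (7)-(9).
    Q = H @ S_pred @ H.T + N
    K = S_pred @ H.T @ np.linalg.inv(Q)
    L = np.eye(len(x)) - K @ H
    # Measurement update of state and covariance, equations (10)-(11).
    x_new = x_pred + K @ (z - H @ x_pred)
    S_new = L @ S_pred @ L.T + K @ N @ K.T
    return x_new, S_new

# Hypothetical one-dimensional example (position and velocity states).
T = 1.0
Phi = np.array([[1.0, T], [0.0, 1.0]])
Gamma = np.array([[0.5 * T**2], [T]])
H = np.array([[1.0, 0.0]])
N = np.array([[1.0]])
W = np.array([[0.1]])            # white plant noise covariance, tuned by hand
x0 = np.array([0.0, 1.0])
S0 = 10.0 * np.eye(2)

x_new, S_new = kalman_step(x0, S0, np.array([1.2]), Phi, Gamma,
                           np.array([0.0]), Phi, Gamma, W, H, N)
```

Note that W appears only as a tuning quantity in the covariance extrapolation; nothing in the recursion ties it to the physical bounds on the parameters.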
Improved or alternative state estimation is desired.
In general, the invention relates to state estimation of a system having multidimensional parameters, which are unknown, time-varying, but bounded, in addition to and distinguished from state variables. The method comprises the steps of:
A method according to an aspect of the invention is for estimating the state of a system having multidimensional parameters in addition to state variables, which parameters are unknown and arbitrarily time-varying, but have known bounds. For example, the turn rate and tangential acceleration of an aircraft are multidimensional arbitrarily time-varying parameters that have known bounds, in addition to the state of the aircraft given by its position and velocity. The state estimates are used to make decisions or to operate a control system or to control a process.
A method according to another aspect of the invention comprises the steps of observing a system having multidimensional parameters in addition to state variables, and measuring aspects of its state in the presence of measurement errors to produce measurements. These measurements are applied to an estimating filter to produce estimates of the true states that cannot be otherwise discerned by an observer. These estimates are used to make decisions or to operate a control system or to control a process.
A method according to another aspect of the invention applies the measurements to an estimating filter that explicitly uses a mean square optimization criterion taking into account measurement errors and maximum excursions of the system parameters to produce estimates of the state of the system. The method then uses these estimates to make decisions or to operate a control system or to control a process.
A method according to another aspect of the invention is for estimating the state of a system having multidimensional parameters, which have known bounded values. The method comprises the steps of measuring aspects of the state of the system to produce measurements, and initializing state estimates x̂(k0|k0) and matrices M(k0|k0), D(k0|k0), where matrix M(j|k) is defined as the covariance of the state estimation errors at time tj due only to the errors in the measurements z(i) for 1≤i≤k and a priori initial information that is independent of the parameter uncertainty, and matrix D(j|k) is defined as the matrix of bias coefficients, which linearly relates state estimation errors to the parameter errors, at time tj (after processing k=0, 1, 2, . . . measurements). Determinations are made of the system matrices Φ and Γ, and of the mean value
A parameter matrix Λ is generated, with Λ representing the physical bounds on parameters that are not state variables of the system. The state estimate x̂(k|k) is extrapolated to x̂(k+1|k) as
x̂(k+1|k)=Φx̂(k|k)+Γu(x̂(k|k),
and the matrices M(k|k), D(k|k), and S(k|k) are extrapolated to M(k+1|k), D(k+1|k) and S(k+1|k), respectively, as
M(k+1|k)=FM(k|k)F′ (15)
D(k+1|k)=FD(k|k)+G (16)
S(k+1|k)=M(k+1|k)+D(k+1|k)ΛD(k+1|k)′ (17)
The noise covariance N is determined. The covariance of the residual Q is calculated as
Q=HS(k+1|k)H′+N (18)
The filter gain matrix K is calculated as
K=S(k+1|k)H′Q⁻¹ (19)
and the matrix L is calculated as
L=I−KH (20)
where I is the identity matrix. At least one aspect z(k+1) of the state of the system is measured. The state estimate x̂(k+1|k+1) is calculated as
x̂(k+1|k+1)=x̂(k+1|k)+K[z(k+1)−Hx̂(k+1|k)] (21)
and the matrices M(k+1|k+1) and D(k+1|k+1) are calculated as
M(k+1|k+1)=LM(k+1|k)L′+KNK′ (22)
and
D(k+1|k+1)=LD(k+1|k) (23)
respectively. The total state covariance is calculated as
S(k+1|k+1)=M(k+1|k+1)+D(k+1|k+1)ΛD(k+1|k+1)′ (24)
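The steps above, through equation (24), can be sketched as a single function. This is a minimal sketch under stated assumptions, not a definitive implementation: the function name `orse_step` is hypothetical, the input is passed already evaluated at the mean parameter value as the vector `u_bar`, and the example assumes a linear position-velocity model with F = Φ, G = Γ, a single acceleration parameter with assumed bound `a_max`, and Λ taken as the square of that bound.

```python
import numpy as np

def orse_step(x, M, D, z, Phi, Gamma, u_bar, F, G, Lam, H, N):
    """One extrapolation and update cycle of the reduced state estimator."""
    # State extrapolation using the input at the mean parameter value.
    x_pred = Phi @ x + Gamma @ u_bar
    M_pred = F @ M @ F.T                          # equation (15)
    D_pred = F @ D + G                            # equation (16)
    S_pred = M_pred + D_pred @ Lam @ D_pred.T     # equation (17)
    Q = H @ S_pred @ H.T + N                      # equation (18)
    K = S_pred @ H.T @ np.linalg.inv(Q)           # equation (19)
    L = np.eye(len(x)) - K @ H                    # equation (20)
    x_new = x_pred + K @ (z - H @ x_pred)         # equation (21)
    M_new = L @ M_pred @ L.T + K @ N @ K.T        # equation (22)
    D_new = L @ D_pred                            # equation (23)
    S_new = M_new + D_new @ Lam @ D_new.T         # equation (24)
    return x_new, M_new, D_new, S_new

# Hypothetical one-dimensional example (position and velocity states).
T = 1.0
Phi = np.array([[1.0, T], [0.0, 1.0]])
Gamma = np.array([[0.5 * T**2], [T]])
H = np.array([[1.0, 0.0]])
N = np.array([[1.0]])
a_max = 2.0                        # assumed physical bound on acceleration
Lam = np.array([[a_max**2]])       # assumed form of the parameter matrix
x0 = np.array([0.0, 1.0])
M0 = 10.0 * np.eye(2)
D0 = np.zeros((2, 1))

x_new, M_new, D_new, S_new = orse_step(x0, M0, D0, np.array([1.2]), Phi, Gamma,
                                       np.array([0.0]), Phi, Gamma, Lam, H, N)
```

No white plant noise covariance appears in this recursion; the parameter bounds enter only through Λ, and the measurement-induced and parameter-induced error components are carried separately in M and D.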
A key difficulty of designing a Kalman filter is that the white plant noise covariance W (also called process noise covariance), which is selected to cope with the reduced state, must be optimized empirically. Empirical optimization is a difficult task in multisensor applications, as indicated by Mookerjee and Reifler, supra. It should be recognized that white noise cannot be used to “model exactly target maneuvers, which are neither zero-mean nor white—they are not even random,” Y. Bar-Shalom and X-R. Li, Multitarget-Multisensor Tracking: Principles and Techniques, Storrs, Conn.: YBS Publishing, 1995, p. 26. For example, in tracking applications, the white plant noise covariance W that gives optimal performance depends not only on the parameter matrix Λ, but also on other variables such as the measurement noise covariance and the data rate. The ratio of W to Λ can be more than two orders of magnitude (a ratio of 100:1). Thus in a Kalman filter W has to be empirically adjusted for optimal performance. Optimizing performance by choice of W in an example given by P. S. Maybeck and M. R. Schore, “Reduced-Order Multiple Model Adaptive Controller for Flexible Spacestructure,” IEEE Transactions on Aerospace and Electronic Systems, pp. 756–767, Vol. 28, No. 3, July 1992, with 6 states in each of the multiple filter models reduced from 24 states in the truth model, would be a daunting empirical task, which is automatically achieved through Λ using the invention.
For motion along a one-dimensional axis, Bar-Shalom, Blackman, Blair, Li, and Kirubarajan have suggested as guidelines that √W be selected as a constant equal to 33%, 50%, or 100% of the maximum acceleration (Bar-Shalom and Li; Bar-Shalom, Li, and Kirubarajan; and S. Blackman, Multiple-Target Tracking with Radar Applications, Norwood, Mass.: Artech House, Inc., 1986). However, simple examples by Mookerjee and Reifler show that the optimal √W can be more than an order of magnitude larger than the maximum acceleration, which is quite different from 33%, 50%, or 100%. In tracking applications, the white plant noise covariance W that gives optimal performance depends not only on the maximum acceleration, but also on other variables such as the measurement noise covariance and the data rate. In multisensor tracking the geometry of the sensors is as important as the maneuvers. The task of finding an optimal W is especially difficult when measurements come from multiple sensors with different measurement characteristics, and according to Moore and Blair usually requires a great deal of empirical simulation. The optimal reduced state estimator according to an aspect of the invention avoids the need for simulation by analytical modeling of the parameter bounds.
Using the physical bounds on the parameters ω (turn rate) and α (tangential acceleration) in the airplane example above, the optimal reduced state estimator is consistent for a maneuver. Li and Jilkov stated “tracking a maneuvering target assuming it is not maneuvering may have a serious consequence (e.g., track loss), while tracking a non-maneuvering target assuming it is maneuvering usually only suffers minor performance degradation”. By explicitly modeling the maximum excursions of ω and α, an optimal reduced state estimator according to an aspect of the invention satisfies this principle enunciated by Li and Jilkov.
In the airplane-tracking example, maximum accelerations produced by the bounds of the parameters ω and α, along the instantaneous normal and tangential airplane axes, bound all physically possible maneuvers. In the filter model of the invention, these maximum accelerations are represented in an equivalent statistical model by a multivariate Gaussian distribution of constant accelerations, whose one-sigma ellipsoid best approximates the maximum accelerations. Among all estimators (including reduced state Kalman filters) with the same reduced states, the optimal reduced state estimator according to an aspect of the invention is defined to have minimal covariance using this filter model. This covariance is the minimal covariance achievable by linearly weighting the predicted states with a new measurement at each successive update of the filter. The optimal reduced state estimator minimizes the mean-square and thereby, the root-mean-square (RMS) estimation errors for the maximum excursions of the parameters in the truth model. Furthermore, since the bounds on the parameters ω and α are included in the covariance that is minimized, the optimal reduced state estimator does not need white plant noise, as is required by Kalman filters, to cope with the reduced state.
The solution of the problem requires a completely different method, which is incorporated in an aspect of the invention. The simplified logic flow chart or diagram 300 of
From block 314, the logic of the invention of
From block 316 of
x̂(k+1|k)=Φx̂(k|k)+Γu(x̂(k|k),
M(k+1|k)=FM(k|k)F′ (28)
D(k+1|k)=FD(k|k)+G (29)
S(k+1|k)=M(k+1|k)+D(k+1|k)ΛD(k+1|k)′ (30)
Thus, another difference between the invention herein and the prior art exemplified in
From block 318 of
Q=HS(k+1|k)H′+N (31)
The filter gain matrix K is calculated as
K=S(k+1|k)H′Q⁻¹ (32)
and the matrix L is calculated as
L=I−KH (33)
where I is the identity matrix.
From block 322 of
The logic flows from block 324 of
x̂(k+1|k+1)=x̂(k+1|k)+K[z(k+1)−Hx̂(k+1|k)] (34)
The matrices M(k+1|k+1) and D(k+1|k+1) are calculated as
M(k+1|k+1)=LM(k+1|k)L′+KNK′ (35)
and
D(k+1|k+1)=LD(k+1|k) (36)
respectively. Finally, the matrix of the total covariance is calculated as
S(k+1|k+1)=M(k+1|k+1)+D(k+1|k+1)ΛD(k+1|k+1)′ (37)
and equation (37) represents a mean-square criterion which may be used for a root-mean-square determination. The above equations (25)–(37) are set forth in
The calculations associated with block 326 of
A salient difference between the prior-art method and that of the invention is the introduction into the equations of a multidimensional state estimation error covariance, denoted above as M(j|k), attributable to measurement error, and of D(j|k)ΛD(j|k)′, representing the physical bounds of the parameters, together with the propagation of certain coefficients (denoted as D(k|k) and D(k+1|k)) rather than the parameter itself, as is done in the 1990 article by Portmann, Moore, and Bath. As mentioned, that parameter cannot be propagated in more than one dimension. The matrix M(j|k) is defined as the covariance of the state estimation errors at time tj due only to the errors in the measurements z(i) for 1≤i≤k and a priori initial information that is independent of the parameter uncertainty. D(j|k) is defined as the matrix of bias coefficients, which linearly relates state estimation errors to the parameter errors, at time tj (after processing k=0, 1, 2, . . . measurements).
Thus, the invention uses a novel mean-square optimization criterion (equation (37)) which explicitly addresses the known physical bounds of the multidimensional parameters, and incorporates analytical modeling of the parameter bounds, whose modeling may be as precise as knowledge of the boundary values permits. The invention provides an exact implementable recursive solution that optimizes the mean-square criterion. The solution according to this aspect of the invention is both consistent and optimal for the criterion. As mentioned above, consistency and optimality were lacking in the prior art, leading to the paradox in which more data gave worse performance (W. D. Blair and Y. Bar-Shalom, “Tracking Maneuvering Targets with Multiple Sensors: Does More Data Always Mean Better Estimates?” IEEE Transactions on Aerospace and Electronic Systems, pp. 450–456, Vol. AES-32, No. 1, January 1996 and J. R. Moore and W. D. Blair, “Practical Aspects of Multisensor Tracking,” in Multitarget-Multisensor Tracking: Applications and Advances, Volume III, Y. Bar-Shalom and William Dale Blair, (ed.) Boston, Mass.: Artech House, 2000, pp. 43–44).
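The optimality of the gain in equation (32) with respect to the criterion of equation (37) can be checked with a short worked derivation. The following is offered as a verification sketch using only equations (30) through (37); the shorthand S⁻ for S(k+1|k) is introduced here for brevity and is not notation from the original text. Substituting (35) and (36) into (37) and collecting terms with (30) gives:

```latex
\begin{aligned}
S(k+1|k+1) &= M(k+1|k+1) + D(k+1|k+1)\,\Lambda\,D(k+1|k+1)' \\
           &= L\,M(k+1|k)\,L' + K N K' + L\,D(k+1|k)\,\Lambda\,D(k+1|k)'\,L' \\
           &= L\,S(k+1|k)\,L' + K N K', \qquad L = I - KH .
\end{aligned}

% Minimizing the trace over K, with S^- = S(k+1|k):
\frac{\partial}{\partial K}\,\operatorname{tr}\!\left[(I-KH)\,S^-\,(I-KH)' + K N K'\right]
  = -2\,(I-KH)\,S^- H' + 2\,K N = 0
\;\Longrightarrow\;
K\left(H S^- H' + N\right) = S^- H'
\;\Longrightarrow\;
K = S^- H'\,Q^{-1}.
```

The total covariance therefore updates in the same algebraic form as equation (11), and the stationary point of its trace is the gain of equation (32) with Q from equation (31). The difference from the prior art lies in how S(k+1|k) is formed, namely from M, D, and Λ rather than from a white plant noise covariance W.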
The solution described above applies to very general linear and nonlinear systems. In general, there are five broad classes of systems to which Kalman filters apply:
The current invention is different from the prior art in at least that it uses the matrix Λ to explicitly include the physical bounds on the uncertain parameters; it separates the state estimation error covariance S(j|k) into components M(j|k) and D(j|k)ΛD(j|k)′, attributable to measurement error and parameter uncertainty, respectively, and separately propagates these components from one time index k to the next time index k+1; and, based on these propagated components, it computes the gain matrix K that weights the measurements to form the state estimates. The gains so computed differ from those of the prior art and provide solutions where none could be found before.