This application is related to my commonly assigned, co-pending application Ser. No. 10/022,497, filed on even date herewith, entitled “Multipath Mitigation Filter”, now allowed, which is hereby incorporated by reference for all purposes as if expressly set forth verbatim herein.
1. Field of the Invention
The present invention pertains to the filtering of sensed signals and, in particular, to a Kalman filter including an adaptive measurement variance estimator.
2. Description of the Related Art
Discrete Kalman filtering is extensively used in modem day digital control and signal processing for a variety of purposes. Among these purposes are to: “smooth” signals, i.e., reduce measurement noise; observe unmeasured states; and/or predict future system states. In a typical discrete Kalman filter, all of the system parameters are specified. These system parameters typically include the:
For instance, consider a system described mathematically as follows using the notations defined in Table 1 below:
x[n]=A[n]x[n−1]+B[n]w[n]+C[n]u[n] (1)
z[n]=H[n]x[n]+v[n] (2)
wherein:
{circumflex over (x)}[n]={circumflex over (x)}[n|n−1]+K[n](z[n]−H[n]{circumflex over (x)}[n|n−1]) (9)
P[n]=(I−K[n]H[n])P[n|n−1]) (10)
Note the calculation of the variance estimation {circumflex over (R)}[n]. Note also that these are but mathematical descriptions of sensed signals, e.g., electrical, magnetic, or optical signals representing tangible real world phenomena or characteristics of such phenomena.
This technique can essentially be broken down into three parts. First, the current state is predicted (Eq. (3), Eq. (4)). Next, the variance of the measured quantity is estimated (Eq. (5), Eq. (6), and Eq. (7)). Then, the current state is updated with the measurement (Eq. (8), Eq. (9), Eq. (10)). This known technique's variance estimate {circumflex over (R)}[n] is often quite large. Also, it is possible for the measurement variance estimate to be zero, which happens in practice, and causes all manner of problems.
The present invention is directed to resolving, or at least reducing, one or all of the problems mentioned above.
The invention is a Kalman filter with an adaptive measurement variance estimator.
In a first aspect, the invention includes a signal filtering mechanism, comprising a Kalman filter and a variance estimator. The Kalman filter is capable of receiving an input signal, a measured quantity signal, and a variance estimate, and outputting a state estimate signal. The variance estimator is capable of estimating the variance of the measured quantity signal and generating the variance estimate signal for use in filtering the input signal and the measured quantity signal, wherein estimating the variance of the measured quantity signal includes determining a smoothed estimate of the measured quantity's variance from the measured quantity signal.
In a second aspect, the invention includes a method for estimating the current state of a discrete, vector-state, scalar-measurement system. The method begins by estimating the variance of a measured quantity for use in filtering an input quantity and the measured quantity. The estimation includes determining a smoothed estimate of the instantaneous prediction error's variance. The method also filters the input quantity and the measured quantity through a Kalman filter using the estimated variance of the measured quantity signal.
In other manifestations, the invention includes an electronic computing device programmed to perform such a method, a transmission medium over which such a method may be performed, or a program storage medium encoded with instruction that, when executed by a computer, perform a method for estimating the current state of a discrete, vector-state, scalar-measurement system. The method includes, inter alia, determining a smoothed estimate of the instantaneous prediction error's variance.
The invention may be understood by reference to the following description taken in conjunction with the accompanying drawings, in which like reference numerals identify like elements, and in which:
While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof have been shown by way of example in the drawings and are herein described in detail. It should be understood, however, that the description herein of specific embodiments is not intended to limit the invention to the particular forms disclosed, but on the contrary, the intention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the invention as defined by the appended claims.
Illustrative embodiments of the invention are described below. In the interest of clarity, not all features of an actual implementation are described in this specification. It will of course be appreciated that in the development of any such actual embodiment, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which will vary from one implementation to another. Moreover, it will be appreciated that such a development effort, even if complex and time-consuming, would be a routine undertaking for those of ordinary skill in the art having the benefit of this disclosure.
Turning now to the drawings,
For a continuous-time system the typical notation is x(t). Here, the invention is in reference to a discrete-time system with a (generally) uniform sampling rate Δt, denoted x[n], where n is an integer denoting the samples of x(t). That is, x[n]=x(nΔt) under ideal, uniform sampling. With this notation, a discrete, vector-state, scalar-measurement system model is denoted
x[n]=A[n]x[n−1]+B[n]w[n]+C[n]u[n] (11)
z[n]=H[n]x[n]+v[n] (12)
where w[n] and v[n] are mutually independent, white, noise processes with discrete-time variance functions Q[n]δ[n] and R[n]δ[n], respectively. Eq. (11) is the state model and Eq. (12) is the measurement model for the system.
Note that the present invention employs the same starting point as does the known filtering technique discussed above. The measurements z[n] and input u[n] are Kalman filtered via the sequence of equations
{circumflex over (x)}[n|n−1]=A[n]{circumflex over (x)}[n−1]+C[n]u[n] (13)
P[n|n−1]=A[n]P[n−1]A[n]T+B[n]Q[n]B[n]T (14)
{circumflex over (x)}[n]={circumflex over (x)}[n|n−1]+K[n](z[n]−H[n]{circumflex over (x)}[n|n−1]) (16)
P[n]=(I−K[n]H[n])P[n|n−1] (17)
to arrive at an estimate {circumflex over (x)}[n] of the true state x[n]. Eq. (13) is the prediction of x[n] and Eq. (14) is the associated covariance matrix P[n] prediction. Eqs. (15)–(17) update the state prediction {circumflex over (x)}[n|n−1] and the predicted covariance matrix P[n|n−1] with the measurement z[n].
However, the variance estimation of the measured quantity signal z[n] is performed differently than in the known technique discussed above. The estimation of R[n] is accomplished by filtering the squared instantaneous prediction error e2[n] where
e2[n]=(z[n]−H[n]{circumflex over (x)}[n|n−1])2 (18)
via Eq. (19):
{circumflex over (σ)}e2[n]={circumflex over (σ)}e2[n−1]+H[n]G[n](e2[n]−{circumflex over (σ)}e2[n−1]) (19)
which gives a smoothed estimate of the error variance, {circumflex over (σ)}e2[n]. A suitable choice, but not the only choice, for G[n] is K[n−1]. This implementation uses a matrix of all ones, written [1], for G[1] since K[0] is undefined. Solve
{circumflex over (R)}[n]2(2a2[n]−1)+{circumflex over (R)}[n][{circumflex over (σ)}e2[n](1−3a2[n])−2q[n]+{circumflex over (σ)}e2[n](q[n]+{circumflex over (σ)}e2[n]a2[n])=0 (20)
with the quadratic equation where
q[n]=(H[n]B[n])Q[n](H[n]B[n])T (22)
for the estimate {circumflex over (R)}[n] of R[n], which is used in the Kalman filter update cycle. Eqs. (18) thru (22) should be computed between the Kalman filter prediction and update; that is, between Eq. (14) and Eq. (15) as is shown in the flowchart of
The derivation of the determination for {circumflex over (R)}[n] used above shall now be presented. Without loss of generality, the discrete-time index notation [n] is suppressed except where explicitly required. It is Hx[n|n−1] that is interesting and its variance HP[n|n−1]HT. As a Kalman filter converges, P[n]≈P[n−1]. Letting n→∞, equality holds, that is, P[∞]=P[n]=P[n−1]. P[∞] can be found by solving the Ricatti equation. Likewise, as n→∞, σe2[∞]≡var(e[∞])=HP[∞]HT+R[∞]. Assume that {circumflex over (R)}[∞]=R[∞], that is, the estimator converges to the true value. From the Ricatti equation,
For convenience, make the approximation
where H−1=(HHT)−1 HT is the Moores-Penrose inverse of H. The reason for performing this approximation is so that the substitution
HP[∞]HT=σe2[∞]−R[∞] (25)
can be made to eliminate P[∞]. Combining (21)–(25) and performing some algebra yields
R2[∞](2a[n]2−1)+R[∞][σe2[∞](1−3a[n]2)−2q[n]]+σe2[∞](q[n]+σe2[∞]a[n]2)=0 (26)
Typically, a Kalman filter converges, or “stabilizes,” after a dozen or so measurements. After this point, it is also expected that the smoothed, squared, instantaneous error {circumflex over (σ)}e2[n] has converged to approximately σe2[∞]. This suggests using Eq. (26) at sample time n instead of ∞ which leads to the above invention. The approximation in Eq. (24) changes the left term of the right-hand side of Eq. (23). Consequently, adjustment of q, generally upward, may improve performance.
For a more concrete example of this first embodiment, consider a simple two-dimensional state system where x1[n] is position, x2[n] is velocity, and w[n] is acceleration. The position state x[n] is measured. The matrices of interest are then
H[n]=[1 0] (29)
where Δt is the time step and σw2 is the acceleration variance. Consequently, a[n]=1 and q[n]=σw2Δt3/3. Note, however, it has been experimentally found that q[n]=σw2Δt provides better results. Eq. (20) then becomes
{circumflex over (R)}[n]2−2{circumflex over (R)}[n][{circumflex over (σ)}e2[n]+q[n]]+{circumflex over (σ)}e2[n](q[n]+{circumflex over (σ)}e2[n])=0 (30)
which is easily solved with the quadratic equation for the positive root
{circumflex over (R)}[n]=(q[n]+{circumflex over (σ)}e2[n])+√{square root over (q[n](q[n]+{circumflex over (σ)}e2[n]))} (31)
This value is then used in the estimation, updates, and determinations discussed above.
Initial operation of the Kalman filter should not employ the solution of Eq. (20) until the filter has stabilized, or “converged.” Up to that point, a fixed, reasonable, a priori estimate of the input measurement variance R[n] should be used in place of {circumflex over (R)}[n]. The Kalman filter should otherwise be initialized with standard methods such as a priori estimates x0 and P0 for initializing x[n] and P[n], respectively. The conditions defining “convergence” or “stabilization” are well known to those skilled in the art. In one particular embodiment, the initialization period is chosen to be a number of iterations (e.g., 10–20) experimentally determined as a point after which the Kalman filter has converged. This initialization takes approximately 10 seconds in this particular embodiment. However, the initialization period and its determination will be implementation specific.
Thus, one difference between the known technique discussed above and the present invention lies in the estimation {circumflex over (R)}[n] of the variance of the measured quantity.
However, in the filtering technique discussed above relative to
x[n]=Ax[n−1]+Bw[n] (32)
z[n]=Hx[n]+v[n] (33)
wherein w[n] and v[n] are independent white Gaussian noises with covariance function matrices Qδ[n] and Rδ[n], respectively. The measurements z[n] can be Kalman filtered via the sequence of equations similar to those set forth above. However, a prediction {circumflex over (x)}[n|n−1] for the current state x[n] is determined using only the previous state vector value {circumflex over (x)}[n−1] (i.e., the state at the time of the previous sampling):
{circumflex over (x)}[n]|n−1]=A{circumflex over (x)}[n−1] (34)
Eq. (14)–Eq. (22) can then be applied in the manner discussed above for the general case. Again, in determining {circumflex over (R)}[n] for the next Kalman filter cycle in the implementation of
In a second embodiment alternative to that discussed relative to
e2[n]=(z[n]−H[n]{circumflex over (x)}[n])2 (35)
The advantage is the error variance is lower after the filtering update. This changes the equation for {circumflex over (R)}[n] to
which can be solved using the quartic or other polynomial root solving procedure. A special case of interest is when a=1. This reduces the quartic to a quadratic that is more easily solved yielding
{circumflex over (R)}[n]={circumflex over (σ)}e2[n]+√{square root over (q[n]{circumflex over (σ)}e2[n])} (37)
The disadvantage of this scheme is that the estimate {circumflex over (R)}[n] of R[n] to the Kalman filter equations is delayed one iteration. Thus, Eq. (15) is replaced with
and the variance estimation and Kalman filter “update” steps are temporally swapped.
For one implementation of this second embodiment, consider the special case of a=1. The required square root is typically an expensive operation. The square root can be eliminated by filtering the absolute error
|e[n]|=|z[n]−H[n]{circumflex over (x)}[n]| (39)
via
{circumflex over (σ)}e[n]={circumflex over (σ)}e[n−1]+H[n]G[n](S[n]|e[n]|−{circumflex over (σ)}e[n−1]) (40)
instead of the squared error. The constant S[n] is a scale factor that accounts for the difference between using the absolute versus the squared error. For Gaussian noise, a typical model, S[n]=√{square root over (π/2)}≈1.25. Then Eq. (37) becomes
{circumflex over (R)}[n]=({circumflex over (σ)}e[n])2+{circumflex over (σ)}e[n]√{square root over (q[n])} (41)
Typically, the square root of q[n] is a constant independent of n and only need be computed once, or is otherwise inexpensive to compute. This one implementation of the second embodiment follows the
The derivation of the determination for {circumflex over (R)} used in this second embodiment shall now be presented. In this variation of the second embodiment, the alternate scheme the squared error is e2[n]=(z[n]−H[n]{circumflex over (x)}[n])2. The procedure is modified by eliminating P[∞] with the appropriate substitution found from the expected value of e2[n],
yielding
which can be solved using a quartic or other root solving procedure. In the modified implementation of this embodiment, the constant S[n] in the modified alternate scheme is found via
where E{ } is the expectation operator.
Returning to
Now, extrapolating from the first and second embodiments illustrated in
The variance estimate {circumflex over (R)}[n] for the measured quantity z[n] is then determined, as set forth in the box 815. This estimation is performed using the method 400 in
This concludes the detailed description. Note that some portions of the detailed descriptions herein are presented in terms of a software implemented process involving symbolic representations of operations on data bits within a memory in a computing system or a computing device. These descriptions and representations are the means used by those in the art to most effectively convey the substance of their work to others skilled in the art. The process and operation require physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical, magnetic, or optical signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.
It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantifies. Unless specifically stated or otherwise as may be apparent, throughout the present disclosure, these descriptions refer to the action and processes of an electronic device, that manipulates and transforms data represented as physical (electronic, magnetic, or optical) quantities within some electronic device's storage into other data similarly represented as physical quantities within the storage, or in transmission or display devices. Exemplary of the terms denoting such a description are, without limitation, the terms “processing,” “computing,” “calculating,” “determining,” “displaying,” and the like.
Furthermore, the invention has numerous applications in a wide variety of fields. One such application is disclosed in my commonly assigned, co-pending application Ser. No. 10/022,497, filed on even date herewith, entitled “Multipath Mitigation Filter”, now allowed, incorporated by reference above. Multipath is a condition wherein received electromagnetic radiation may have propagated over multiple paths, which introduces error or uncertainty in measurements derived from those signals. For instance, a radar signal received after being echoed from an object may have traveled directly and/or indirectly from the object to the receiver. The present invention can be used to mitigate uncertainty or error arising from this phenomenon. Still other applications may become apparent to those skilled in the art having the benefit of this disclosure. For example, the invention may also be used in an image processing application. In this case, x- and y- coordinates are used instead of time and spatial invariance is used instead of time-invariance.
Finally, the particular embodiments disclosed above are illustrative only, as the invention may be modified and practiced in different but equivalent manners apparent to those skilled in the art having the benefit of the teachings herein. Furthermore, no limitations are intended to the details of construction or design herein shown, other than as described in the claims below. It is therefore evident that the particular embodiments disclosed above may be altered or modified and all such variations are considered within the scope and spirit of the invention. Accordingly, the protection sought herein is as set forth in the claims below.
Number | Name | Date | Kind |
---|---|---|---|
5640429 | Michels et al. | Jun 1997 | A |
5867411 | Kumar | Feb 1999 | A |
6463411 | Li et al. | Oct 2002 | B1 |
6693979 | Kumar | Feb 2004 | B1 |
20030171830 | Thomson et al. | Sep 2003 | A1 |
Number | Date | Country | |
---|---|---|---|
20030115232 A1 | Jun 2003 | US |