The invention pertains to a method and system for calibrating coefficients of an observer of a variable state xk of a physical system. The invention also pertains to a recording medium to implement this method.
An observer estimates the state variable xk from measurements yk of physical quantities or magnitudes of this system where the index k identifies the instant of measurement.
There are known ways of calibrating the coefficients of an observer by successive experiments with different sets of coefficients until one set of coefficients gives the desired operation by this observer. Typically, the behavior desired for the observer is:
This calibration by experiment is part of a heuristic approach. The calibration is often lengthy, and it is difficult to be certain that the best possible calibration of the observer has been obtained.
To resolve this problem, other alternative methods have been proposed. For example, the patent application CA 2 226 282 describes a method for calibrating coefficients of a Kalman filter by means of a neural network. However, these alternative methods are often as complex to implement as the method that uses experimentation.
The invention is therefore aimed at proposing a method of calibration that is simpler to implement.
Thus, an object of the invention is a method for calibrating coefficients of an observer comprising:
where:
The above method is used to determine the coefficients enabling an efficient functioning of the observer in a simple way and without recourse to different experiments. Furthermore, the coefficients thus determined make it possible to ensure that the criterion f(p) has been minimized.
Finally, the use of the set Δ takes account of the fact that there is an uncertainty on the estimation {circumflex over (z)}k. Taking account of this uncertainty when determining the vector p that minimizes the criterion f(p) gives better calibration than if this uncertainty were not taken into account. In particular, aberrant approaches which could minimize the criterion f(p) are eliminated.
The embodiments of this calibration method may include one or more of the following characteristics:
These embodiments of the calibration method furthermore have the following advantages:
An object of the invention is also an information-recording medium comprising instructions to implement the above method when the instructions are executed by a programmable electronic computer.
An object of the invention is also a system for calibrating coefficients of an observer of a state variable xk of a physical system from measurements yk of physical quantities of this system where the index k identifies the instant of measurement, the system comprising:
where:
The invention shall be understood more clearly from the following description, given purely by way of a non-exhaustive example and made with reference to the appended drawings of which:
Here below in this description, the characteristics and functions well known to those skilled in the art shall not be described in greater detail.
Here, a trajectory designates the variations in time of a variable. The variable that varies in time may be any unspecified variable. In particular, this variable is not necessarily a position in space. For example, it may a temperature or any other physical quantity which varies over time.
For example, the object 4 is a probe or a catheter introduced into a human body. The object 4 is mobile in the referential system 8.
The referential system 8 is a fixed referential system having three orthonormal axes X, Y and Z.
The localization of the object 4 in the referential system 8 consists in finding its xp, yp, zp position and its θx, θy and θz orientation. The angles θx, θy and θz represent the orientation of the object 4 respectively in relation to the axes X, Y and Z.
To localize the object 4, the system 6 herein comprises magnetic field sources and magnetic field sensors some of which are linked to the referential system 8 while others are fixed without any degree of freedom to the object 4 to be localized. Here, the magnetic field sensor 10 is linked without any degree of freedom to the object 4. The sensor 10 is for example a triaxial sensor, i.e. a sensor capable of measuring the projection of the magnetic field on three non-colinear measurement axes. Such a sensor measures the direction of the magnetic field. More generally, this sensor also measures the amplitude of the magnetic field.
For example, the measurement axes are mutually orthogonal. These axes are fixedly linked to the object 4.
This sensor 10 is connected by means of a flexible wire link to a processing unit 12.
The unit 12 is also connected to three sources 14 to 16 of the magnetic field. These sources are for example magnetic field triaxial sources. A magnetic field triaxial source emits a magnetic field, along three mutually non-colinear emission axes. For example, such a source is formed by several aligned magnetic field uniaxial emitters, respectively on each of the emission axes of the source. The uniaxial emitter mainly emits the magnetic field along a single axis. For example, it is a coil whose turns are wound about a same emission axis. In this case, the emission axis coincides with the winding axis of the turns.
Here, the sources 14 to 16 are identical to each other. These sources are fixed in the referential system 8.
The processing unit 12 powers the sources 14 to 16 with alternating current to generate the magnetic field measured by the sensor 10. The unit 12 also acquires the measurements of the magnetic field made by the sensor 10.
The unit 12 is equipped with an observer 18 which localizes the object 4 in the referential system 8 from the measurements acquired. Here, the observer 18 determines the position and orientation of the object 4 by resolving a system of equations. This system of equations is obtained by modeling the magnetic interactions between the sources 14 to 16 and the sensor 10. In this system of equations, the position xp, yp and zp and the orientations θx, θy and θz of the object 4 are the unknown quantities while the values of the other parameters are obtained from measurements made by the sensor 10. Further information on such systems of equations may for example be found in the patent application EP 1 502 544.
Here, the observer 18 is a Kalman filter.
This observer 18 builds an estimation {circumflex over (x)}k of a state variable xk from a measurement yk. The variable xk is a vector containing the position xp, yp, zp of the object 4 and its orientation θx, θy and θz in the orthonormal referential system 8, the measurement yk is a measurement vector comprising the measurements made along the three measurement axes of the sensor 10 at an instant k.
The equations of the Kalman filter of the observer 18 are the following:
where:
The matrix Fk is for example obtained by modelizing the magnetic interactions between the sources 14 to 16 and the sensor 10 on the basis of the laws of electromagnetism.
The gain Kk+1 of the Kalman filter is given by the following relationship:
K
k+1
=P
k+1/k
C
T
k+1
[C
k+1
P
k+1/k
C
T
k+1
+R
k+1]−1 (3)
where Rk+1 is the matrix of covariance of the noise in the measurement yk.
The given equations of the Kalman filter are classic and are therefore not described in greater detail herein.
The use of a Kalman filter gives a precise estimation of the position and orientation of the object 4. However, before this, the matrices Rk and Qk have to be calibrated, i.e. the values of their coefficients have to be determined. To simplify the description, it is assumed here that the values of these coefficients are constant. The matrices Rk and Qk are therefore also denoted as R and Q here below.
Here pR and pQ denote the set of coefficients respectively of the matrices R and Q to be determined.
Once the Kalman filter has been calibrated, these sets PR and PQ are registered in a memory 20 connected to the unit 12 so that they can be used by the observer 18 to estimate the position of the object 4.
To this end, the system 30 is equipped with one or more sensors 32 which measure a variable zk. This variable zk is a variable whose value is a function of the variable xk. More specifically, the variable zk is connected to the variable xk by a known function φ. This function φ may be a linear function or on the contrary a non-linear function. For example, the sensor 32 measures the acceleration along the X, Y and Z axes of the object 4. In this case, the sensor 32 is formed by several accelerometers.
In another embodiment, the sensor 32 can directly measure the position of the object 4 with sensors other than those used to carry out the measurement yk. For example, the sensor 32 may, to this end, have several cameras or other sensors and other magnetic field sources to measure the position of the object 4. In this particular case, the function φ is the identity function.
The system 30 also has a computer 34 capable of acquiring the measurements of the sensor 32. This computer 34 is also capable of calibrating the observer automatically by determining the set of coefficients PR and PQ from the measurements made by the sensor 32.
In the particular case shown in
The computer 34 is made with a programmable electronic computer capable of executing the instructions recorded in an information-recording medium. Here, the computer 34 is connected to a memory 36 and this memory 36 contains the instructions needed to execute the method of
The working of the calibration system 30 shall now be described in greater detail with reference to the method of
Initially, at a step 40, the relationship φ which connects the variable zk to the variable xk is set up. For example, to this end, the laws of physics are used.
Then, at a step 42, the number of coefficients to be used to calibrate the observer 18 is determined. To this end, knowledge on the physical working of the system 6 is used. Furthermore, this step applies the principle of parsimony according to which the number of coefficients to be calibrated is limited to the greatest possible extent.
For example, in the particular case described herein, it is assumed that the measurements made on each of the measurement axes are independent of one another. Thus, the matrix R is taken to be a diagonal matrix. Similarly, it is assumed that the errors in the state equation are independent. Thus, the matrix Q is taken to be a diagonal matrix. They therefore have the following form:
R=diag(p1R, . . . ,plR),piR≧0,i=1:l (7)
Q=diag(p1Q, . . . ,pnQ),piQ≧0,i=1:n (8)
where:
Furthermore, the matrices R and Q are symmetrical matrices defined as being positive, i.e. all matrices of which all the Eigen values are positive. To ensure this condition, the matrices R and Q can be written as noted here below:
R=XTX and Q=YTY, where the matrices X and Y are diagonal matrices. It can be noted that when the matrices R and Q are any unspecified matrices, the matrices X and Y are higher triangular matrices.
At the end of the step 42, with the assumptions made, the vector p of coefficients to be determined is defined by the following relationship:
p=[p
T
R
,p
T
Q]T (9)
At a step 44, the criterion to be optimized to determine the vector p is chosen. Here, this vector is given by the following relationship:
where:
The function φ which appears in the criterion (10) is the same as the one introduced here above. This function φ is used to obtain the estimation {circumflex over (z)}k of the variable zk from the estimation {circumflex over (x)}k as indicated in the following relationship:
{circumflex over (z)}
k=φ({circumflex over (x)}k,p) (11)
It will be noted that the function φ also depends on the vector p since the estimation {circumflex over (x)}k which constitutes the argument of the function φ is itself a function of the vector p. For this reason, in the above relationship, the estimation {circumflex over (x)}k and the vector p are indicated as the argument of the function φ.
At a step 46, the set Δ of constraints to be met by the vector p is defined. This set Δ comprises at least one constraint which dictates that estimation {circumflex over (z)}k and the variable zk should be proximate to each other. More specifically, this constraint herein dictates that the trajectory of the variable zk should be included, at least on an average, in a corridor 48 of uncertainty (
This corridor 48 is demarcated by two boundaries Lmin and Lmax respectively situated on each side of the trajectory of estimation {circumflex over (z)}k. The corridor of uncertainty represents the fact that the estimation {circumflex over (z)}k is only known with an uncertainty attached to it. This uncertainty comes from the fact that there is uncertainty in the estimation {circumflex over (x)}k used to build the estimation {circumflex over (z)}k. Thus, to calibrate the observer 18 properly, it is necessary that, for the majority of the instants k, the variable zk should be included in the corridor 48.
In the particular case shown in
Preferably, the distances a and b are a function of the uncertainty in the estimation {circumflex over (z)}k. For example, the distances a and b are all the greater as the mean square deviation σ{circumflex over (z)}
The threshold S1 is for example defined by the following relationship:
S
1
=√{square root over (N)}ασ
{circumflex over (z)}
(12)
where:
For each vector p potentially minimizing the criterion (10), it is possible to determine the mean square deviation σ{circumflex over (z)}
At the step 46, other constraints of the set Δ may be defined. For example, on the basis of the physical sense of some of the coefficients of the sets pR and pQ, it is possible to set limits on these coefficients. For example, if pieces of information on the measurement noise of the sensor 10 are known, it is possible on the basis on these pieces of information, to limit the values that can be taken by certain coefficients of the set pR and pQ.
Once the set Δ has been defined, in a step 50, the sensor 32 measures the variable zk at N different instants. The number N of instants is typically greater than the number of coefficients to be determined contained in the vector p.
Once the variables zk have been measured, they are acquired by the computer 34. Then, a step 52, the different variables and coefficients needed to minimize the criterion (10) are initialized. For example, the initial matrix P0/0 of estimation of the covariance in the error of the estimation {circumflex over (x)}0 is defined by the following relationship:
P
0/0
=λI,λ>>1
where:
The constant λ is adjusted according to the confidence placed in the initialization of {circumflex over (x)}0.
The choice of a constant λ far greater than one means that the uncertainty in the initial estimation {circumflex over (x)}0 is very great.
Here, the initial estimation {circumflex over (x)}0 and the initial value of the coefficients of the vector p are drawn randomly.
Then, a step 54 is performed for determining the vector p which minimizes the criterion (10). In this step, the values of the vector p are made to vary until a vector p is found that minimizes the criterion (10) and at the same time meets the constraints of the set α. This step is performed by means of a tool of optimization under constraints. Such tools of optimization under constraints are known. For example such a tool especially is the “Fmincon” function available in the “MATLAB®” development environment, inter alia for digital computation.
Once the vector p which minimizes the criterion (10) has been determined, at the step 56 the values of the coefficients of this vector p are used to calibrate the coefficients of the matrices R and Q of the observer 18. For example, during this step, the values of these coefficients are copied and recorded in the memory 20.
Then, at a step 58, the localizing system 6 uses the coefficients determined by the system 30 to determine the position and orientation of the object 4 from the measurements made by the sensor 10.
Many other embodiments are possible. For example, for a same physical system and for a same observer, it is possible to make different choices on the number of coefficients of this observer that have to be determined in order to calibrate it. For example, the number of coefficients may be even further reduced by requiring that the different coefficients of the diagonal matrix R or Q should all be proportional to a same coefficient. Conversely, the method described here above can also be applied to the case where the matrices Q and R are not diagonal matrices but only positively defined symmetrical matrices. This assumption on the contrary increases the number of coefficients to be determined by the system 30.
In the criterion (10), the Euclidian norm can be replaced by another norm.
Constraints other than those described here above can be used. For example, it is possible to require that the variable zk should be systematically or routinely included within the corridor of uncertainty 48 at each instant k. This constraint is stronger than the one described with reference to
Different sets of constraints can be used at different instants. For example if, at certain instants belonging to a set L, the uncertainty in the measurement of the variable zk is very low, it is then possible to use a greater constraint at these instants. For example if the instant k belongs to the set L, then it is possible to require that the variable zk should be routinely included in a more limited corridor of uncertainty. For the instants that do not belong to the set L, the constraint used will be weaker. For example, it will require solely that, on an average, the variable zk should be included in the corridor of uncertainty 48. The corridor of uncertainty used to define the constraint when the instant k belongs to the set L is not necessarily the same as the one used to define the constraint when the instant k does not belong to this set L.
Additional constraints other than the ones requiring that the variable zk should be in the corridor of uncertainty of the trajectory of the estimation {circumflex over (z)}k may be used. For example, it is possible to define constraints that dictate a low variation of estimation {circumflex over (z)}k between two successive instants.
Choices other than the ones described above are possible for initializing the optimization of the criterion (10). For example, if the initial value of the variable x0 is known, this value is used to initialize the variable, x0 and the coefficients of the matrix P0 are set as a function of the trust in this value.
The uncertainty on the estimation {circumflex over (z)}k may be obtained by means other than the observations of the observer. For example, this uncertainty may be determined experimentally or on the basis of systems of equations other than those defining the observer 18.
The measured variable zk and the measurement yk may coincide. Thus it is not necessary to resort to an additional sensor such as the sensor 32.
The calibration method described here above also applies to the Extended Kalman Filter as well as to the Unscented Kalman Filter (UKF) or any other recursive filter having parameters to be set. More generally, the calibration method described here above can be used to calibrate any observer of a state variable whose coefficients have to be determined beforehand. Thus, the coefficients liable to be determined by means of the above method are not limited to those of a covariance matrix.
The invention has been described in the context of a given application. However, it is not limited to this type of application. Other applications may be taken into account, such as for example applications for obtaining measurements in the probing of seabeds for which the physical system makes it possible to have measurements representing surface/seabed distances or the depths of seas.
Number | Date | Country | Kind |
---|---|---|---|
FR1052261 | Mar 2010 | FR | national |