METHOD AND APPARATUS FOR OBJECT LOCALIZATION

FIELD

The present disclosure generally relates to a method and an apparatus for object localization.

BACKGROUND

Wireless object localization may utilize GPS (Global Positioning System) to track an object outdoors. However, according to GPS-based object localization techniques, it is difficult to track the object when the object to be tracked is located indoors or located within an environment in which communication with a GPS satellite is difficult, such as inside of a building or the like in a surrounding interfere with the communication. Hence, in situations where the GPS-based object localization techniques become inefficient or ineffective, alternative object localization techniques need to be considered. Two categories of alternative object localization techniques are known.

A first category of the alternative object localization techniques relies on video-based or video-processing solutions. Approaches making use of a unique camera may perform the object localization by exploiting redundancy of information between successive images picked up by the same 2D (Two-Dimensional) or 3D (Three-Dimensional) camera, as in the case of SLAM (Simultaneous Localization And Mapping), for example, and/or by the tracking feature points. A probalistic model of a scene as well as the object in the scene tracked by the object localization of the camera can be updated along the process. Some basic geometry information related to the scene may also be integrated into an algorithm of the model. However, such approaches are computationally expansive, are more adapted to slow motions, and are dependent on the camera precision, such as camera resolution, as well as on the environment in which the object localization is performed.

Other video-based or video-processing solutions take advantage of a multiple RGB (Red, Green, Blue) or infra-red camera, as well as some trackers to infer the position of the object that is tracked. For example, the trackers may be infra-red markers on the object to be tracked, or SIFT (Scale-Invariant Feature Transform) points computed on the images picked up by the camera, for example. A model of the camera, representing a relationship between the 3D object position and its position in the 2D images, as well as complex calibrations requiring human interaction, are required to perform the object localization. Moreover, although some of these video-based or video-processing solutions may exhibit interesting performances outdoors, such solutions may remain expansive and may not always be robust depending on illumination conditions, for example.

On the other hand, a second category of the alternative object localization techniques relies on wave-based solutions, including GPS, radio, laser, infra-red, ultra-sound, or the like. Approaches making use of the waves may perform the object localization using a network of distance sensors which are able to compute their distance to the object to be tracked, in line of sight, or even with obstacles interposed between the distance sensor and the object to be tracked, using Ultra Wide Band-based technologies, for example. When the network is calibrated such that the positions of all distance sensors are known relative to a given reference frame, schemes based on one or a plurality of triangulations may enable the position of the object to be tracked relative to the reference frame with a certain accuracy. In this case, each distance sensor needs to be able to communicate its distance measurement to a server which performs a 3D position estimation.

To evaluate the distance to the tracked object, the distance sensors generally make use of the measurement of a ToF (Time-of-Flight) of an electromagnetic wave emitted towards the object to be tracked. Two sub-strategies may be considered, where a first sub-strategy measures the ToF of the wave after its reflection on the object to be tracked using ultra-sound or laser technologies, for example, while a second sub-strategy directly measures the ToF between the distance sensor and the object to be tracked. In this case, the object needs to be equipped with a dedicated device, synchronized with an emitter, as in solutions provided by semiconductor chips for object localization manufactured by BeSpoon SAS or Decawave Ltd. for example.

Estimating the distance between two distance sensors may be quite straightforward on the basis of the latter technologies described above, however, estimating the 3D position of the object to be tracked from the network of distance sensors may be not as easy, especially when the estimated distance between two distance sensors includes noise.

Studies have been made on auto-calibration of sensor nodes for 2D tracking, and mainly for indoors, due to applicative context and GPS availability. Trilaterations are the main tools for auto-calibration sometimes known as SLAT (Simultaneous Localization and Tracking), as explained in Jorge Guevara et al., “Auto-localization algorithm for local positioning systems”, Ad Hoc Networks, Vol. 10, pp. 1090-1100, 2012.

Computations for the auto-calibration may be performed with additional requirements of a moving node inside the network, as proposed in Rainer Mautz et al., “A Robust Indoor Positioning and Auto-Localisation Algorithm”, Journal of Global Positioning Systems, Vol. 6, No. 1, pp. 38-46, 2012. Some assumptions on this motion or on the geometry of the set of moving nodes are then used in a Kalman filter-based or extended Kalman filter-based strategy to infer anchor positions. However, no strategy has been proposed to optimize the positioning of the anchors in 3D space for an optimal tracking set-up, as may be seen from C. Priyadharsini et al., “Survey on Auto-Calibration of Beacon Nodes in Indoor Location Computing”, International Journal of Emerging Technology and Advanced Engineering, Vol. 2, Issue 11, pp. 170-175, 2012 (for a review).

In the second category of the alternative object localization techniques, the number of anchors needs to be three (3) or greater, and preferably four (4) or greater in order to increase the accuracy of the object localization. However, if the 3D position of the object to be tracked were estimated based on distances from four (4) or more anchors using a system of linear equations (or linear system), the system of linear equations would become over-determined and no solution could be obtained. Consequently, it is difficult to increase the accuracy of the object localization.

SUMMARY

According to an aspect of the present disclosure, a method for calibrating positions of anchors in a reference frame of a network, wherein the network includes distance sensors forming the anchors that are fixed in respective positions and a distance sensor forming a tag associated with an object of which three-dimensional position in the reference frame of the network is estimated may include performing an auto-calibration process including a constrained optimization in which 3D coordinates of the anchors in the reference frame of the network are estimated so that an associated, estimated Gram matrix fits a Gram matrix measured from the anchors, wherein the reference frame of the network is formed based on positions of N anchors, where N is four or greater.

According to another aspect of the present disclosure, an apparatus for calibrating positions of anchors in a reference frame of a network, wherein the network includes distance sensors forming the anchors that are fixed in respective positions and a distance sensor forming a tag associated with an object of which three-dimensional position in the reference frame of the network is estimated may include a processor configured to perform an auto-calibration process including a constrained optimization in which 3D coordinates of the anchors in the reference frame of the network are estimated so that an associated, estimated Gram matrix fits a Gram matrix measured from the anchors, wherein the reference frame of the network is formed based on positions of N anchors, where N is four or greater.

According to still another aspect of the present disclosure, a computer-readable storage medium having stored therein a program which, when executed by a computer, causes the computer to perform a process for calibrating positions of anchors in a reference frame of a network, wherein the network includes distance sensors forming the anchors that are fixed in respective positions and a distance sensor forming a tag associated with an object of which three-dimensional position in the reference frame of the network is estimated, the process including performing an auto-calibration process including a constrained optimization in which 3D coordinates of the anchors in the reference frame of the network are estimated so that an associated, estimated Gram matrix fits a Gram matrix measured from the anchors, wherein the reference frame of the network is formed based on positions of N anchors, where N is four or greater.

The object and advantages of the present disclosure will be realized and attained by means of the elements and combinations particularly pointed out in the claims.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention, as claimed.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram illustrating an example of a network of distance sensors in a first embodiment;

FIG. 2 is a block diagram illustrating an example of a distance sensor in the first embodiment;

FIG. 3 is a flow chart for explaining a process of the first embodiment;

FIG. 4 is a flow chart for explaining a process of a second embodiment; and

FIG. 5 is a flow chart for explaining a process of a third embodiment.

DESCRIPTION OF EMBODIMENTS

Embodiments of the present invention will be described with reference to the accompanying drawings.

A description will now be given of a method, an apparatus, and a computer-readable storage medium, in each embodiment according to the present invention.

In the second category of the alternative object localization techniques described above, one embodiment estimates a 3D position of an object to be tracked based on distances from four (4) or more anchors using a system of linear equations (or linear system). A network of distance sensors may be categorized into two kinds of distance sensors, namely, a distance sensor forming an anchor that is fixed in a position during operation thereof, and a distance sensor forming a tag attached to or embedded in the object to be tracked. The object to be tracked may be a mobile apparatus, such as a mobile phone, a tablet device, a drone, or the like. Hence, the tag is movable in position during operation thereof. Each distance sensor may have a known configuration capable of measuring or evaluating its distance from another distance sensor which may be the anchor or the tag with a certain degree of accuracy. As an example, the distance sensor may measure its distance from the other distance sensor by detecting an RSSI (Received Signal Strength Indicator) from signals received from the other distance sensor. Accordingly, each distance sensor may have a communicating function capable of communicating with other distance sensors, such that the distance sensors in the network are communicable with each other.

The object may be tracked using a strategy similar to that used by the auto-calibration described above to infer the anchor positions, such as the Kalman filter-based or extended Kalman filter-based strategy, and the Bayesian approaches, for example.

FIG. 1 is a diagram illustrating an example of a network 1 of distance sensors in a first embodiment. As an example, the network 1 illustrated in FIG. 1 includes ten (10) anchors A₀through A₉, and a tag T. The anchors A₀through A₉, and the tag T are indicated by circular symbols in FIG. 1. Of course, the number of anchors is not limited to ten (10), and N anchors may be provided in the network 1, where N is a natural number greater than or equal to four (4). Further, more than one (1) tag T may be movable within the network 1, however, it may be readily understood that processes described hereinafter with respect to one (1) tag T may be performed similarly for other tags movable within the network 1. For the sake of convenience, the circular symbol is hatched for the user-defined origin to be described below, the circular symbol is shaded for the user-defined reference to be described below, and the circular symbol is black for the target T.

In this embodiment, a framework can be provided to estimate the 3D position of the object to be tracked, inside the network of communicable distance sensors which are calibrated, and take advantage of all of the information of the network of distance sensors in order to robustify the estimation of the 3D position of the object to be tracked.

In this embodiment, a 3D object localization process to estimate the 3D position of the object to be tracked may use a calibrated network and a weighted least squares regression strategy.

The distance sensors in the network 1 illustrated FIG. 1 are categorized into the anchors A₀through A₉and the tag T. However, the distance sensors forming the anchors A₀through A₉and the tag T may have the same configuration, that is, the configuration illustrated in FIG. 2, for example. FIG. 2 is a block diagram illustrating an example of the distance sensor in the first embodiment. As illustrated in FIG. 2, a distance sensor 2 includes a transmitter 21, a receiver 22, a CPU (Central Processing Unit) 23, and a storage unit 24 that are connected via a bus 27. The connection of the transmitter 21, the receiver 22, the CPU 23, and the storage unit 24 is not limited to the connection illustrated in FIG. 2 using the bus 27, and other connections may be employed.

The distance sensor 2 may further include a keyboard 25 and a display unit 26 that are connected to the bus 27. When the object to be tracked is the drone, for example, the keyboard 25 and the display unit 26 may be omitted. On the other hand, when the object to be tracked is the mobile phone or the tablet device, for example, the keyboard 25 and the display unit 26 may be provided. The keyboard 25 is an example of an input device from which instructions, data, or the like may be input to the CPU 23. The display unit 26 may display messages for an operator or a user of the distance sensor 2, and results of processes, such as a tag localization process, for example. Of course, the keyboard 25 and the display unit 26 may be integrated into a single unit such as a touchscreen panel, for example.

The transmitter 21 may have a known configuration including a function to transmit signals by wireless communication. The receiver 22 may have a known configuration including a function to receive signals by wireless communication. The transmitter 21 and the receiver 22 may use mutually different antennas (not illustrated), or use a single common antenna (not illustrated). The CPU 23 is an example of a processor or computer that controls operations of the distance sensor 2. The storage unit 24 may store programs to be executed by the CPU 23, intermediate data of computations and processes performed by the CPU 23, parameters used by the computations and processes performed by the CPU 23, and results of the computations and processes performed by the CPU 23, including a distance of the distance sensor 2 from another distance sensor 2 within the network 1. The storage unit 24 may be formed by any suitable device capable of electronically storing signals (or data), such as a semiconductor memory device, and a drive unit installed with a recording medium. The storage unit 24 may form an example of a non-transitory computer-readable storage medium which stores the program to be executed by the CPU 23. In an example, the computer program is downloadable from a communication network such as an internet.

Of course, a configuration of the distance sensors 2 forming the anchors A₀through A₉, and a configuration of the distance sensor 2 forming the tag T may be mutually different, as long as each distance sensor 2 is communicable with other distance sensors 2 and is able to estimate its distance from another distance sensor 2 with a certain degree of accuracy.

Next, a description will be given of the 3D object localization process in a first embodiment, to estimate the tag position, that is, the position of the object to be tracked, in a reference frame described hereunder.

[Reference Frame Determination]

The reference frame may be determined by a reference frame determination process as follows for a case in which N anchors (A_i)_0≦i<Nare set up in the network 1. An orthonormal reference frame (A_i₀,{right arrow over (m₁)},{right arrow over (m₂)},{right arrow over (m₃)}) may be determined based on anchor positions where the object localization process is to be performed. To solve this problem, amongst all of the anchors (A_i)_0≦i<N, it is assumed for the sake of convenience that a selection has been made to identify:

- (a1) an anchor identifier I₀of the anchor A_I0=A₇, satisfying 0≦I₀<N, associated to an origin of the reference frame;
- (a2) an anchor identifier I₁of the anchor A_I1=A₅, satisfying 0≦I₁<N, so that

$\frac{\overset{}{A_{I_{0}} A_{I_{1}}}}{ \overset{}{A_{I_{0}} A_{I_{1}}} } = \overset{}{m_{1}};$

- (a3) an anchor identifier I₂of the anchor A_I2=A₆, satisfying 0≦I₂<N, so that 0<{right arrow over (A_I₀A_I₂)}.{right arrow over (m₂)} and {right arrow over (A_I₀A_I₂)}.{right arrow over (m₃)}=0; and
- (a4) an anchor identifier I₃of the anchor A_I3=A₂, satisfying 0≦I₃<N, so that 0<{right arrow over (A_I₀A_I₃)}.{right arrow over (m₃)}.

In other words, after the anchors (A_i)_0≦i<Nare set up in the network 1 within a space in which a tag T is movable, in a more or less controlled manner, the selection has been made to provide, amongst all of the anchors (A_i)_0≦i<N:

- (b1) the anchor A_I0to define the origin of the reference frame;
- (b2) the anchor A_I1to define a first axis of the reference frame;
- (b3) the anchor A_I2with a positive value along {right arrow over (m₂)} to define a plane P of a reference elevation (for example, an elevation 0) as well as a direction of a second axis perpendicular to the first axis; and
- (b4) the anchor A_I3with a positive elevation to define a direction of a third axis perpendicular to the plane P of the first and second axes.

The selection may be made manually by a user, or may be made automatically by the CPU 23. In the latter case, the reference frame that is automatically determined may be less convenient or intuitive for the user when compared to that determined manually in the first case, depending on the situation.

Coordinates of the anchors (A_i)_0≦i<Nand the tag T in the reference frame can be denoted by

${\begin{matrix} x_{i}^{1} \\ x_{i}^{2} \\ x_{i}^{3} \end{matrix}}_{0 \leq i < N} and {\begin{matrix} x_{T}^{1} \\ x_{T}^{2} \\ x_{T}^{3} \end{matrix}},$

respectively.

[Tag Localization]

The tag T inside the network 2 of the anchors (A_i)_0≦i<Nis localized by the tag localization process on the basis of its distance to each of the different anchors (A_i)_0≦i<N.

The distance of the tag T to each of the different anchors (A_i)_0≦i<Nmay be estimated with a measurement error. This measurement error may be modeled as centered noise with an associated variance σ².

To estimate the 3D position of the tag T and thus the 3D position of the object to be tracked, it may be shown on one hand that

{right arrow over (A_I₀A_l)}.{right arrow over (A_I₀T)}=½(∥{right arrow over (A_I₀A_l)}∥²+∥{right arrow over (A₁₀T)}∥²−∥{right arrow over (A_lT)}∥²)

and on the other hand that

{right arrow over (A_l₀A_l)}.{right arrow over (A_I₀T)}=Σ_k=1³x_i^kx_T^k,

where the anchor positions

${\begin{matrix} x_{i}^{1} \\ x_{i}^{2} \\ x_{i}^{3} \end{matrix}}_{0 \leq i < N}$

are known and the distances from the anchors (A_i)_0≦i<Nto the tag T are measurable.

Consequently, the 3D position of the tag T may be estimated from a solution to the following matricial equation

$A {\begin{matrix} x_{T}^{1} \\ x_{T}^{2} \\ x_{T}^{3} \end{matrix}} =,$

where:

- (c1) A is a (N−1)×3 matrix in which each line of the matrix indicates the 3D coordinates of one anchor of the set of anchors

${(A_{i})}_{\underset{i \neq I_{0}}{0 \leq i < N}}$

(that is, all anchors except the anchor associated to the origin of the reference frame with the identifier I₀); and

- (c2) is a column vector of dimension (N−1) composed of vertical concatenation of the associated estimation of ½(∥{right arrow over (A_I₀A_t)}∥²+∥{right arrow over (A_l₀T)}∥²−∥{right arrow over (A_lT)}∥²) for

${\begin{matrix} 0 \leq i < N \\ i \neq I_{0} \end{matrix}$

from the network 1 of the distance sensors 2.

It should be noted that if a distance ∥{right arrow over (A_I₀A_l)}∥ is already known from the auto-calibration, Monte-Carlo iterations or the like may be performed to robustify the estimation to obtain the 3D position of the tag T.

It may be observed from the above equations that when the number of anchors becomes greater than or equal to four (4), which is preferable from the view point of increasing the accuracy of the object localization, the system of linear equations (or linear system) becomes over-determined.

In order to solve this over-determined system, a least squares estimation represented by the following matricial equation may be solved to estimate the 3D position of the tag T.

$= {(A^{T} A)}^{- 1} A^{T}$

However, under the assumption of a Gaussian measurement noise to estimate the distance between two distance sensors 2 of the network 1, one can show that a (N−1)×(N−1) covariance matrix E of the measurement error associated to a scalar quantity of the column vector custom-character may be estimated by

cov( custom-character )=∥{right arrow over (A_I₀T)}∥²σ²+¾σ⁴

and

var( custom-character )=(μ{right arrow over (A_I₀T)}∥²+∥{right arrow over (A_lT)}∥²)σ²+ 3/2σ⁴.

Consequently, a more robust estimation of the 3D position of the tag T may be performed by making use of a weighted least squares estimation represented by the following matricial equation

$= {(A^{T} Σ^{- 1} A)}^{- 1} A^{T} Σ^{- 1} .$

FIG. 3 is a flow chart for explaining a process of the first embodiment. The process illustrated in FIG. 3 may be executed by the CPU 23 illustrated in FIG. 2. In step ST31 illustrated in FIG. 3, the CPU 23 performs the reference frame determination process described above. In step ST32, the CPU 23 performs the tag localization process described above, and the process ends. In step ST32, in the case where the object to be tracked may be a mobile phone or a tablet device having the display unit 26, the CPU 23 may display information on the estimated 3D position of the tag T on the display unit 26.

According to this embodiment, in a case in which the 3D position of the object to be tracked is estimated based on the distances from four (4) or more anchors using the system of linear equations (or linear system), it is possible to avoid the system of linear equations from becoming over-determined, and positively obtain a solution for the estimation of the 3D position of the object to be tracked. In addition, it is possible to improve the accuracy of the object localization compared to conventional methods, by employing the weighted least squares regression strategy in the calibrated network.

Even in a case in which the distance sensors have a raw accuracy on the order of 10 cm to 15 cm, for example, the 3D position of the object to be tracked can be estimated with an accuracy on the order of 3 mm, for example, where the accuracy indicates a maximum amount of error from the actual 3D position of the object.

The object localization of this embodiment may be applied to various consumer and professional applications, including object (user, drone, mobile phone, etc.) positioning indoors or inside a stadium, for example, indoor object tracking with obstacles, or the like. In addition, the object localization of this embodiment may be designed to be compatible with various communication networks of distance sensors. Further, the object localization of this embodiment may be designed to be compatible with low SNRs (Signal-to-Noise Ratios) of the distance measurements.

Next, a description will be given of a second embodiment. In this embodiment, the network of distance sensors may be the same as the network 1 illustrated in FIG. 1 for the first embodiment.

In this embodiment, a framework can be provided to estimate the 3D position of different distance sensors in the network to optimize anchor positions for an optimal tracking of the object inside the network.

More particularly, this embodiment optimizes the anchor positions where the anchors are to be set up inside a user constrained environment, depending on the user environment and constraints, in order to accurately estimate the 3D position of the object to be tracked.

[Anchor Position Optimization]

A strategy is provided to optimize the initial positioning of the anchors by an anchor position optimization process in order to make the estimation of the 3D position of the tag T, that is, the 3D position of the object to be tracked, as robust as possible.

The positions of the anchors in the network 1 within the space in which the tag T is movable, has an influence on the estimation of the 3D position of the tag T. Hence, this embodiment optimizes the positions of the anchors, or the anchor positions.

By taking into consideration the matricial equation

$= {(A^{T} Σ^{- 1} A)}^{- 1} A^{T} Σ^{- 1}$

used in the first embodiment described above to estimate the 3D position of the tag T, it is possible to estimate a 3×3 covariance matrix custom-character of the error of estimation which can be expressed by the following equation.

custom-character =(A^TΣ⁻¹A)⁻¹

In order to optimize the anchor positions, a variance of the error of estimation is minimized, where a mean value of the variance is proportional to trace( custom-character ) and depends on the anchor positions contained in the A matrix. However, =(T) depends also on the 3D position of the tag T since Σ=×(T).

When a 3D domain in which the tag T is to be subjected to the object localization is denoted by Ω, a mean optimal location (or position) of the anchor can be defined in order to evaluate the 3D position of the tag T on the domain Ω as a solution to the following optimization problem

A*=argmin(J(A))

subject to

${\begin{matrix} k (A) = 0 \\ l (A) > 0 \end{matrix}$

where

- (d1) A* is a optimal matrix containing in its line the optimal locations of the anchors;
- (d2) J(A) is an objective function defined by

J(A)=trace(∫_Ω(A^TΣ(T)⁻¹A)⁻¹dT);

and

- (d3) k and l respectively are context-specific constraints of equalities and inequalities where the user is to incorporate the environmental constraints of the user's anchor positioning, including fixed anchors, anchors on a specific surface or border, or the like.

This optimization problem is difficult to solve when a large number of local minima are included, because the integral term may not be easily tractable. Given a specific initial location (or position) provided by the user, an optimization algorithm such as IPO (Interior Point Optimizer) already introduced before may be used to find the optimal solution close to this initial positioning. The integral term may be estimated by Monte-Carlo-based strategies, for example.

When the user does not provide the initial positioning of the anchors, a metaheuristic of the optimization, including evolutionary algorithms, for example, may be used to find a first raw good candidate which is then provided to a classical optimization algorithm.

FIG. 4 is a flow chart for explaining a process of the second embodiment. The process illustrated in FIG. 4 may be executed by the CPU 23 illustrated in FIG. 2. In step ST41 illustrated in FIG. 4, the CPU 23 performs the anchor position optimization process described above, and the process ends. In step ST41, in the case where the object to be tracked may be a mobile phone or a tablet device having the display unit 26, the CPU 23 may display information on the optimized anchor positions on the display unit 26 in order to urge the operator or the user to set up the anchors at the optimized anchor positions, preferably before the process of the first embodiment illustrated in FIG. 3 is started.

According to this embodiment, it is possible to further improve the accuracy of the object localization by the optimized anchor positions, in addition to obtaining the advantageous features of the first embodiment described above. Of course, this embodiment may be suitably combined with the first embodiment described above.

Next, a description will be given of a third embodiment. In this embodiment, the network of distance sensors may be the same as the network 1 illustrated in FIG. 1 for the first embodiment.

In this embodiment, a framework can be provided to estimate the 3D position of the different distance sensors in the network by avoiding manual calibration and thus introducing auto-calibration.

More particularly, this embodiment performs a user-adaptive auto-calibration on the network 1 of distance sensors 2, including a geometric-based initialization, and an optimization-strategy to affine and robustify initial values of the calibration.

In this embodiment, the reference frame is automatically built from a set of anchors and user settings, as well as a constrained-optimization based method to automatically determine the 3D coordinates of the anchors in the reference frame, to perform the so-called auto-calibration.

[Auto-Calibration]

The reference frame may be user-defined, in a manner similar to that of the first embodiment described above.

As in the case of the first embodiment described above, the 3D coordinates of the anchors (A_i)_0≦i<Nand the tag T in the reference frame can be denoted by

${\begin{matrix} x_{i}^{1} \\ x_{i}^{2} \\ x_{i}^{3} \end{matrix}}_{0 \leq i < N} and {\begin{matrix} x_{T}^{1} \\ x_{T}^{2} \\ x_{T}^{3} \end{matrix}},$

respectively. In this embodiment, an auto-calibration process automatically estimates 3D coordinates

${\begin{matrix} x_{i}^{1} \\ x_{i}^{2} \\ x_{i}^{3} \end{matrix}}_{0 \leq i < N}$

of the anchors in the reference frame. The approach used to estimate the 3D coordinates

${\begin{matrix} x_{i}^{1} \\ x_{i}^{2} \\ x_{i}^{3} \end{matrix}}_{0 \leq i < N}$

of the anchors in the reference frame does not require any active interaction with the user and relies only on the good estimation of a matrix G=

${(G_{ij})}_{0 \leq \begin{matrix} i \\ j \end{matrix} < N}$

or scalar products called a Gram matrix. Each component of this Gram matrix is a scalar product defined by

G_ij={right arrow over (A_l₀A_l)}.{right arrow over (A_I₀A_j)},

and it may be easily shown that

G
_ij={right arrow over (A_i₀A_l)}.{right arrow over (A_I₀A_j)}=½(∥{right arrow over (A_I₀A_l)}∥²+∥{right arrow over (A_I₀A_j)}∥²−∥{right arrow over (A_lA_j)}∥²).

Such a scalar quantity of the Gram matrix may thus be estimated from by the distance sensors 2, because the distance between the anchors is known.

On the other hand, given the notation introduced before, each component of the Gram matrix can be represented by

G_ij={right arrow over (A_I₀A_l)}.{right arrow over (A_I₀A_j)}=Σ_k−1³x_i^kx_j^k.

The auto-calibration process of this embodiment is thus a constrained optimization in which the 3D coordinates of the anchors (A_i)_0≦i<Nin the reference frame are estimated so that the associated, estimated Gram matrix fits the Gram matrix measured from the distance sensors.

[Constrained Optimization Formulation]

The constrained optimization performed by the auto-calibration process may be formulated by

X*=argmin F(X)

subject to

${\begin{matrix}  (X) = 0 \\ h (X) > 0 \end{matrix}$

where:

- (e1) X is a column vector of dimension 3×N composed of vertical concatenation of the 3D coordinates

${\begin{matrix} x_{i}^{1} \\ x_{i}^{2} \\ x_{i}^{3} \end{matrix}}_{0 \leq i < N}$

of the anchors in the reference frame;

- (e2) X* is a column vector of dimension 3×N composed of vertical concatenation of optimal values of the 3D coordinates

${\begin{matrix} x_{i}^{1} \\ x_{i}^{2} \\ x_{i}^{3} \end{matrix}}_{0 \leq i < N}$

of the anchors;

- (e3) F is a scalar objective function defined by

F(X)=Σ_i=0^N-1Σ_j=0^j≦i[(Σ_k=1³x_i^kx_j^k)−Ĝ_ij]²

which computes an L2-Norm of a difference between the estimated and the measured scalar products of the Gram matrix and takes into account a symmetry of the problem;

- (e4) g is an equality constraint defined by

$ (X) = [\begin{matrix} x_{I_{0}}^{1} \\ x_{I_{0}}^{2} \\ x_{I_{0}}^{3} \\ x_{I_{1}}^{2} \\ x_{I_{1}}^{3} \\ x_{I_{2}}^{3} \end{matrix}]$

which implements the equalities implied by the selection described in the first embodiment associated with the user-defined reference frame; and

- (e5) h is an inequality constraint defined by

$h (X) = [\begin{matrix} x_{I_{1}}^{1} \\ x_{I_{2}}^{2} \\ x_{I_{3}}^{3} \end{matrix}]$

which implements the inequalities implied by the selection described in the first embodiment associated with the user-defined reference frame.

Because Ĝ is only a noisy measurement of the Gram matrix, the above defined cost function can make use of all the distance information at the same time to robustify a final estimation of the anchors. Moreover, since the auto-calibration process does not suppose strong duration constraints, Monte-Carlo iterations can also be performed to more accurately estimate Ĝ_ijand obtain better final results.

Gradient or Jacobian matrices as well as Hessian matrices of F, g, and h are analytically tractable and may be given to any constraint optimization algorithm such as the IPO (Interior Point Optimizer), for example. However, any suitable numeric approximation of these quantities may also be used to compute the optimal selection, such as finite difference approximation for the gradient or Jacobian matrices, or BFGS (Broyden-Fletcher-Goldfarb-Shanno) approximation for the Hessian matrices, for example.

To find the optimal solution, a good initial value is preferably provided to the optimization algorithm. Such a good initial value may be difficult to provide, however, this embodiment employs a quite straightforward geometric approach.

[Geometric Initialization]

To estimate an initial solution for the 3D position of the anchors, a progressive strategy based on the selection described in the first embodiment associated with the user-defined reference frame is employed, where each component of the anchors is estimated iteratively, starting from the first one (along {right arrow over (m₁)}) to the last one (along {right arrow over (m₃)}).

First, the estimation of (x_i¹)_0≦i<Nis performed. In this case, an estimation of such a component may be obtained from

$x_{i}^{1} = \vec{A_{I_{0}} A_{ι}} \cdot \vec{m_{1}} = \vec{A_{I_{0}} A_{ι}} \cdot \frac{\vec{A_{I_{0}} A_{I_{1}}}}{ \vec{A_{I_{0}} A_{I_{1}}} } ≃ \frac{{\hat{G}}_{I_{1} i}}{\sqrt{{\hat{G}}_{I_{1} I_{1}}}}$

that is, from

${}^{0}x_{i}^{1} = \frac{{\hat{G}}_{I_{1} i}}{\sqrt{{\hat{G}}_{I_{1} I_{1}}}} .$

Second, the second components (x_i²)_0≦i<Nare considered. Given the assumptions made on A_I₂in the user-defined reference frame, it may be similarly shown that

x
_I
₂
²=√{square root over (∥{right arrow over (A_I₀A_I₂)}∥²−x_I₂¹²)}≈√{square root over ({circumflex over (G)}_I₁_I₂−⁰x_I₂¹²)}=⁰x_I₂².

Then, the following equation

$\vec{m_{2}} = \frac{\vec{A_{I_{0}} A_{I_{2}}}}{x_{I_{2}}^{2}} - \frac{x_{I_{2}}^{1}}{x_{I_{2}}^{2}} \vec{m_{1}}$

stands, and consequently, ∀i≠I₂,

$\begin{matrix} x_{i}^{2} = \vec{A_{I_{0}} A_{ι}} \cdot \vec{m_{2}} = \vec{A_{I_{0}} A_{ι}} \cdot \frac{\vec{A_{I_{0}} A_{I_{2}}}}{x_{I_{2}}^{2}} - \frac{x_{I_{2}}^{1}}{x_{I_{2}}^{2}} \\ \frac{{\hat{G}}_{I_{1} I_{2}}}{{}^{0}x_{I_{2}}^{2} \sqrt{{\hat{G}}_{I_{1} I_{1}}}} \frac{{\hat{G}}_{I_{1} i}}{\sqrt{{\hat{G}}_{I_{1} I_{1}}}} \\ = \frac{{\hat{G}}_{I_{1} I_{1}} {\hat{G}}_{I_{2} i} - {\hat{G}}_{I_{1} I_{2}} {\hat{G}}_{I_{1} i}}{G_{I_{1} I_{1}} {}^{0}x_{I_{2}}^{2}} \end{matrix} \vec{A_{I_{0}} A_{ι}} \cdot \vec{m_{1}} ≃ \frac{{\hat{G}}_{I_{2} i}}{{}^{0}x_{I_{2}}^{2}} -$

that is, ⁰x_I₂²=√{square root over (Ĝ_I₁_I₂−⁰x_I₂¹²)} and ∀i≠I₂,

${}^{0}x_{i}^{2} = \frac{{\hat{G}}_{I_{1} I_{1}} {\hat{G}}_{I_{2} i} - {\hat{G}}_{I_{1} I_{2}} {\hat{G}}_{I_{1} i}}{{\hat{G}}_{I_{1} I_{1}} {}^{0}x_{I_{2}}^{2}}$

stand.

Third, the third components (x_i³)_0≦i<Nare considered. Because the first and second components become known from the above, the following equation

x
_i
³=±√{square root over (∥{right arrow over (A_I₀A_l)}∥²−(x_i¹²+x_i²²))}≈±√{square root over ({circumflex over (G)}_ii−(⁰x_i¹²+⁰x_i²²))}

can be obtained, where the sign of the latter, second quantity may be obtained by analyzing the sign of the following scalar product

[{right arrow over (A_I₀A_l)}−(x_i¹{right arrow over (m₁)}+x_i²{right arrow over (m₂)})].{right arrow over (A_I₀A_I₃)}={right arrow over (A_I₀A_l)}.{right arrow over (A_I₀A_I₃)}−(x_i¹x_I₃¹+x_i²x_I₃²)≈Ĝ_I₃_i−(⁰x_i^{1 0}x_I₃¹+⁰x_i^{2 0}x_I₃²)

that is,

⁰
x
_i
³=Sign(Ĝ_I₃_i−(⁰x_i^{1 0}x_i₃¹+⁰x_i²⁰x^x_i₃²))√{square root over (Ĝ_ii−(⁰x_i¹²+⁰x_i²²).)}

An initialization vector X⁰, that is, the column vector of dimension 3×N composed of the vertical concatenation of the initial values

${\begin{matrix} {}^{0}x_{i}^{1} \\ {}^{0}x_{i}^{2} \\ {}^{0}x_{i}^{3} \end{matrix}}_{0 \leq i < N},$

is thus provided to the optimization algorithm. These initial values are sensitive to the noise on the measurement of the distances, however, these initial values are good initial candidates for the anchor position optimization.

FIG. 5 is a flow chart for explaining a process of the third embodiment. The process illustrated in FIG. 5 may be executed by the CPU 23 illustrated in FIG. 2. In step ST51 illustrated in FIG. 5, the CPU 23 performs the auto-calibration process described above, and the process ends. In step ST51, the CPU 23 may transmit via the transmitter 21, information on the 3D coordinates of the anchors in the reference frame, obtained by the auto-calibration, to the anchor or the tag T that performs the process of the first embodiment illustrated in FIG. 3. In a case in which the tag localization process of step ST32 illustrated in FIG. 3 and the auto-calibration process of step ST51 are performed by the same distance sensor 2, for example, the CPU 23 of this distance sensor 2 may obtain the information on the 3D coordinates of the anchors in the reference frame, obtained by the auto-calibration, when performing the tag localization.

According to this embodiment, it is possible to further improve the accuracy of the object localization by the completely passive auto-calibration process, in addition to obtaining the advantageous features of the first embodiment described above. Of course, this embodiment may be suitably combined with the first embodiment described above, or with both the first and second embodiments described above.

In each of the embodiments described above, the 3D object localization may be performed by the distance sensor forming the tag. However, a part of the 3D object localization may be performed by the distance sensor forming at least one of the anchors, in order to suitably distribute a load of the 3D object localization amongst the distance sensors forming the tag and the anchors. In addition, a part of the 3D object localization may be performed by the distance sensor forming one of the anchors acting as a representative anchor. When a part of the 3D object location is performed by the distance sensor forming the anchor, the anchor may measure the distance from the tag and transmit the measured distance to the tag, for example. In addition, the representative anchor may measure the distance from the tag, receive from other anchors the respective measured distances from the tag, and transmit the distance measured by the representative anchor and the distances measured by the other anchors to the tag for use in estimating the 3D position of the tag. Further, the representative anchor may determine a suitable coordinate system of a closed internal space in which the tag is movable. Accordingly, the object localization may be performed by the processor of the tag, or by a processor including a processor of the distance sensor forming at least one of the anchors and a processor of the distance sensor forming the tag.

Further, the functions of the representative anchor described above, for example, may be performed by a center apparatus that is communicable with the anchors and the tags. Preferably, the center apparatus performs at least a part of the 3D object localization that may put a relatively large load on the anchors and/or the tags.

Although the embodiments are numbered with, for example, “first,” “second,” or “third,” the ordinal numbers do not imply priorities of the embodiments. Many other variations and modifications will be apparent to those skilled in the art.

All examples and conditional language recited herein are intended for pedagogical purposes to aid the reader in understanding the invention and the concepts contributed by the inventor to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although the embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the scope of the invention.

METHOD AND APPARATUS FOR OBJECT LOCALIZATION

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)