Tracking a plane (e.g. floor) with a single camera is a problem solved by a projective transformation. A projective transformation maps lines to lines (but does not necessarily preserve parallelism). Any plane projective transformation can be expressed by an invertible 3×3 matrix in homogeneous coordinates; conversely, any invertible 3×3 matrix defines a projective transformation of the plane. Projective transformations (if not affine) are not defined on all of the plane, but only on the complement of a line (the missing line is “mapped to infinity”). A projective transformation has eight degrees of freedom (8 DOF), and is not a linear transformation and thus, difficult to deal with.
A transformation that preserves lines and parallelism (maps parallel lines to parallel lines) is an affine transformation. An affine transformation has six degrees of freedom.
According to aspects of the present invention there is provided a method of plane transformation comprising: capturing by a first camera a reference frame of a given plane from a first angle; capturing by a second camera a destination frame of said given plane from a second angle different than said first angle; defining coordinates of matching points in said reference frame and said destination frame; using said first and second angles to calculate first and second respective rotation transformations to a simulated plane parallel to said given plane; applying an affine transformation between said reference frame coordinate on said simulated plane and said destination frame coordinate on said simulated plane; and applying a projective transformation on said simulated plane destination frame coordinate to calculate said destination frame coordinate.
The angles may be provided by at least one inertial measurement unit attached to said camera.
Calculating first and second rotation transformations to a simulated plane may comprise calculating Euler angles.
Calculating first and second respective rotation transformations to a simulated plane parallel to said given plane comprises, for each said defined coordinates: applying a first projective transformation on said reference frame coordinate to calculate said reference frame coordinate on said simulated plane; and applying a second projective transformation on said destination frame coordinate to calculate said destination frame coordinate on said simulated plane.
The first camera and the second camera may be the same camera.
For better understanding of the invention and to show how the same may be carried into effect, reference will now be made, purely by way of example, to the accompanying drawings.
With specific reference now to the drawings in detail, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of the preferred embodiments of the present invention only, and are presented in the cause of providing what is believed to be the most useful and readily understood description of the principles and conceptual aspects of the invention. In this regard, no attempt is made to show structural details of the invention in more detail than is necessary for a fundamental understanding of the invention, the description taken with the drawings making apparent to those skilled in the art how the several forms of the invention may be embodied in practice. In the accompanying drawings:
The following description is presented to enable one of ordinary skill in the art to make and use the invention as provided in the context of a particular application and its requirements. Various modifications to the described embodiments will be apparent to those with skill in the art, and the general principles defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the particular embodiments shown and described, but is to be accorded the widest scope consistent with the principles and novel features herein disclosed. In other instances, well-known methods, procedures, and components have not been described in detail so as not to obscure the present invention.
In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. However, it will be understood by those skilled in the art that the present invention may be practiced without these specific details.
The present invention provides a method of linearizing the problem of plane tracking, by reducing the problem to an affine problem. The method uses data from at least one camera and at least one inertial measurement unit such as a gyro device mounted in the same device as the camera, e.g. a smart phone.
The process starts with capturing a plane (e.g. floor) from two different angles.
The process then comprises two stages:
In this stage the inputs to the algorithm are a set of matching points and three rolling angles of the device(s) from the two scenes (frames) captured from different angles. We will refer to the two scenes as reference and destination scenes.
We use feature matching and tracking techniques such as SIFT, SURF, KAZE, Optic Flow etc. in order to match features of the two scenes, resulting in N tracking points, where N can be a large number (e.g. several hundred points).
Unlike prior art techniques that use the RANSAC (Random Sample Consensus) algorithm iteratively to select the best four points that define the plane out of the N tracking points, as shown in the flowchart of
The preprocessing flowchart 200 is shown in
In steps 210 and 240 the reference frame and the destination frame are provided respectively.
In steps 220 and 250 the 3D orientations of both frames are calculated by calculating Euler angles (for example, using gyro, acceletometer or other inertial measurement units attached to the capturing device or devices).
In steps 230 and 260 reference points are extracted (using various methods of feature extraction, such as SIFT, SURF, KAZE) from both frames.
In step 270 the extracted points from both frames are matched (using various methods of feature matching such as FLANN matcher).
In step 280 N resulting tracking points are determined.
(X,Y)=InvRotDst(AffineTrns(RotRef(x,y)))
The foregoing description of the embodiments of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. It should be appreciated by persons skilled in the art that many modifications, variations, substitutions, changes, and equivalents are possible in light of the above teaching. It is, therefore, to be understood that the appended claims are intended to cover all such modifications and changes as fall within the true spirit of the invention.
This patent application claims priority from and is related to U.S. Provisional Patent Application Ser. No. 62/272,717, filed Dec. 30, 2015, this U.S. Provisional Patent Application incorporated by reference in its entirety herein.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2016/058014 | 12/27/2016 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62272717 | Dec 2015 | US |