Electronic sensors and signal processing devices are used to determine the movement and position of other objects or of the electronic sensor. To determine the movement and position, the electronic sensor captures different scans of a real-world scene, where the scans contain common features. The processing device then compares the captured scans to determine the movement that occurred between the capture of the different scans. In certain applications, like safety-critical applications, constraints require that the process of determining and evaluating the movement of electronic sensors or other objects be theoretically justified and assessable.
For the reasons stated above and for other reasons stated below which will become apparent to those skilled in the art upon reading and understanding the specification, there is a need in the art for improved systems and methods for theoretically justified transformation hypotheses.
The Embodiments of the present invention provide systems and methods for evaluating plane similarity and will be understood by reading and studying the following specification.
Systems and methods for determining plane similarity are provided. In one embodiment a system comprises a sensor configured to acquire a plurality of frames of data, and a processing unit coupled to the sensor, the processing unit configured to process the plurality of frames of data. The processing unit is further configured to store the plurality of frames of data on at least one memory device, read a first frame of data from the plurality of frames stored on the at least one memory device, and read a second frame of data from the plurality of frames stored on the at least one memory device. Additionally, the processing unit is configured to extract a first plane from the first frame of data, extract a second plane from the second frame of data, and calculate a divergence to measure a similarity between the first plane and the second plane.
Understanding that the drawings depict only exemplary embodiments and are not therefore to be considered limiting in scope, the exemplary embodiments will be described with additional specificity and detail through the use of the accompanying drawings, in which:
In accordance with common practice, the various described features are not drawn to scale but are drawn to emphasize specific features relevant to the exemplary embodiments.
In the following detailed description, references are made to the accompanying drawings that form a part hereof, and in which is shown, by way of illustration, specific illustrative embodiments. However, it is to be understood that other embodiments may be utilized and that logical, mechanical, and electrical changes may be made. Furthermore, the method presented in the drawing figures and the specification is not to be construed as limiting the order in which the individual acts may be performed. The following detailed description is, therefore, not to be taken in a limiting sense.
Embodiments of the present disclosure provide systems and methods for using divergence to evaluate plane similarity. Divergence is a statistical function that measures the distance between two different probability distributions. As planes in two separate frames are represented as probabilistic distributions of points, a divergence calculation determines the statistical distance between the two planes. The statistical distance resulting from the divergence calculation measures the similarity of the two planes and if the resulting divergence is low enough, the planes can be said to represent the same plane. Further, using divergence to compare probability distributions is theoretically justified and assessable. Therefore, divergence comparisons can be used when system constraints require that design considerations be theoretically justified and assessable, like in safety critical systems.
In alternate embodiments, sensor 110 captures either two or three dimensional data frames. For example, in one embodiment where sensor 110 captures two dimensional frames, sensor 110 comprises a camera. In a another embodiment, where sensor 110 captures three dimensional frames, it comprises a 3D scanning or flash LiDAR camera (for example, such as the Swissranger SR-3100 manufactured by Mesa Imaging AG), or a 3D scanning rotating LiDAR (such as HDL-64E LiDAR manufactured by the Velodyne corporation, for example). In other embodiments, sensor 110 comprises devices such as, but not limited to, a radar sensor (such as a millimeter wave radar or weather radar, for example), a combination of radar and an electro-optical camera, or other combinations of sensors. In other embodiments, a three dimensional point-cloud is generated from data gathered over time or along a motion trajectory using sensors such as, but not limited to, sonar sensors, laser or radar altimeters, or sensors used for surveying methods, bathymetry, radar topography, structure from motion methods or interferometery.
In one embodiment, in operation, as sensor 110 captures data, the captured data are transmitted to processing unit 115 and stored on memory device 120. In one embodiment, processing unit 115 is a programmable device that processes the data as instructed by instructions stored on memory device 120.
Memory device 120 is an electronic hardware device for storing machine readable data and instructions. In one embodiment, memory device 120 stores data received in frames captured by sensor 110 into a frame point storage 122. In one embodiment, memory device 120 stores data in frame point storage 122 in a form that associates particular data with a particular frame. For example, a 3-D point cloud describing a first frame is stored in frame points A and a 3-D point cloud describing a second frame is stored in frame points B.
Processing unit 115 also stores processed information into memory device 120. For example, in one embodiment, processing unit 115 extracts planes from captured data stored in frame point storage 122. As with the frame point storage 122, in one embodiment, processing unit 115 stores extracted planes in plane storage 124 such that memory device 120 associates the extracted planes with a frame of data acquired by sensor 110. For example, processing unit 115 stores planes extracted from a frame A in plane set A and stores planes extracted from a frame B in plane set B. Other information derived directly or indirectly from data are also stored in memory device 120, such as one or more transformations (which describe differences between two frames of data captured by sensor 110 at different moments in time) are stored in transformation storage 132.
Processing instructions stored on memory device 120 direct processing unit 115 to process data received from sensor 110 and memory device 120 using divergence to evaluate plane similarity. For example, plane extracting instructions 126 direct processing device 115 to extract planes from a frame of data and store the planes in a plane set in plane storage 124. Plane merging instructions 128 instruct processing device 115 to identify merge able planes in a frame of data and merge the identified planes together. Plane matching instructions 130 direct processing device 115 to identify matching planes in different frames of data and use the matched planes to evaluate and test a transformation hypothesis. Each of these three functions using divergence to evaluate plane similarity are described in greater detail below.
Plane extraction to evaluate plane similarity using divergence is performed on sets of data captured by sensor 110. For this example, it is assumed that sensor 110 is a device that provides three-dimensional data points in a frame (such as a LIDAR, for example). However, in alternative embodiments, three-dimensional data is gathered through means other than sensors that provide three-dimensional data points. For example, three-dimensional data is acquired by combining (fusing) data from multiple two-dimensional sensors, such as multiple cameras.
Processing unit 115 identifies planar features from each frame of data provided by sensor 110. A planar feature is feature extracted from the data that has the appearance of a geometric plane and is defined by a set of points. In other words, if all points in a set x satisfy the geometric equation of a plane within some small margin or threshold t, as shown in the following equation, then the set of points x defines a planar feature.
|n′x−d|≦t,
where n is the normal vector of the planar feature and d is the orthogonal distance from the planar feature to the origin.
The normal vector n and the orthogonal distance d are estimates calculated from the set of points that make up the planar feature. The points in the set of points which define a planar feature are said to be inliers or inlier data points because they are statistically consistent with points that would likely be found within the plane. Similarly, the points which do not meet the above criteria for a given planar feature are referred to as outliers because they are statistically not consistent with points that would likely be found within the plane. The planar features typically correspond to real world planes that form objects captured within the frame. For example, planar features often correspond to surfaces of objects such as a top of a table, a side of a box, a wall of a room or building, etc.
Embodiments of the present invention are not limited to data represented in rectangular coordinates. In alternate embodiments, other alternate coordinate systems are used. For example, in one embodiment, sensor 110 provides range data in spherical coordinates (i.e. a horizontal angle, a vertical angle, and a distance) rather than rectangular coordinates. In one embodiment, processing unit 115 converts the spherical coordinates to Cartesian coordinates. In other embodiments, processing unit 115 performs its function using the particular alternate coordinate system directly. For example, although the calculations described below are based on Cartesian coordinate space, one of ordinary skill in the art would appreciate that they could be redrafted to apply to any other particular coordinate space.
Further, other mathematical representations of the input data would be applicable to the extraction of features other than Cartesian planes, such as various two-dimensional manifold shapes. Thus, the description provided herein in terms of Cartesian coordinate space is provided by way of example and is not to be taken as limiting the scope of the present application.
It should also be noted that the calculations presented herein can be modified to extract other geometrical two-dimensional manifolds which can be expressed, for example, by the mathematical equation f(x, θ)≦t, where x denotes points, θ denotes a set of geometrical parameters that can be estimated, and f is a function. In the case of planar features described herein, the equation is expressed as f(x, θ)=|n′x−d|, and θ={n, d}. For an exemplary case of spherical features, the equation can be expressed as f(x, θ)=|(x−a)′(x−a)−r2 and θ={a, r}. Thus, the mathematical equation f(x, θ)≦t, is the generalized equation for determining the point-to-feature distance of each point, where each point whose point-to-feature distance is less than or equal to the threshold is identified as an inlier data point of that feature. It is to be understood that the description below regarding planar features can also be adapted to such other features, either in addition to, or rather than, planar features.
The method proceeds to 204 with dividing the data into a plurality of cells. That is, the processing time of the plane extraction can be improved by dividing the data into cells and processing the cells in parallel with separate processors. This is optional and for other embodiments, method 200 proceeds from 202 to 206 and the entire data-set of data captured by the sensor is processed as a single cell.
At block 206, the method proceeds with generating a plurality of hypothetical planes based on the data. Each hypothetical plane is characterized by its geometrical parameters, particularly by the normal vector n and the orthogonal distance d of the respective plane. Exemplary methods of generating each of the plurality of hypothetical planes are described in more detail in
At block 208, the method proceeds with selecting a representative hypothetical plane from the plurality of hypothetical planes. In particular, the selected representative hypothetical plane is a plane which provides a value of a quality function that is closer to a desired value than the non-selected hypothetical planes. The quality function for selecting the representative hypothetical plane is defined as a function whose value assesses the quality regarding how well a feature matches desired characteristics, although an explicit comparison with a desired value is not required.
For example, in one embodiment, a quality function of the representative hypothetical plane is a function of the number of inliers that define representative hypothetical plane. Hence, a desired value for the quality function could be defined as the largest number of inliers available from the calculated hypothetical planes rather than any specific value. Alternatively, the desired value for such a quality function could be any constant value. For this example, the hypothetical plane selected as representative hypothetical plane is the plane from the data-frame that has a quality function the greatest number of inliers provides a value of a quality function that is closest to the desired value and is selected as representative of a plane of an object in the scene. Thus, in some embodiments, the quality function is characterized as a local maximization/minimization problem for the respective cell. In such embodiments, the desired value is the local maximum/minimum value provided by the hypothetical planes.
It is to be understood that other quality functions can be used in other embodiments. Other exemplary quality functions include, but are not limited to, a function of the variance of inliers-to-plane distance and a function of the plane point density. For example, the function of the plane point density can be expressed as −|iv-plane point density|, where iv is a desired value and the plane point density is the number of inliers divided by the estimated plane size. In such embodiments, an explicit comparison is made through the quality function. Thus, variations of different quality functions can be implemented in different embodiments.
Having a hypothetical plane with parameters n (the normal vector) and d (the orthogonal distance), the number of inliers is calculated as the number of points x within the cell (or within the entire point cloud, if no cell splitting was performed), that satisfy |n′x−d|≦t described above, for a given (e.g. user-specified) margin or threshold t. In one embodiment, the margin is selected to be t=0.1 m. The equation dist=|n′x−d|, described above determines the point-to-plane distance for each point. If the point-to-plane distance for a given point is less than the threshold value, 0.1 m in this example, then that point is included in the set of inlier data points x.
At block 210, the method proceeds with refining the selected plane to improve the estimate of the planar feature. This is optional and the process may in other embodiments proceed from block 208 to block 212. Exemplary methods of refining the selected plane are described in more detail with respect to
At block 212, the method proceeds with computing a parametric description of the extracted plane. For further processing in some applications, such as matching planar features between scenes, it might not be feasible to use the inliers of the plane directly due to computational and memory demands. Hence, in some implementations, the selected plane is described by the normal vector and orthogonal distance, in addition to other parameters such as the mean point (also referred to as centroid), to reduce the data size. For the mean point, all the points on the plane are used to compute the arithmetic mean. Also, since the inliers are a set of points, plane extracting instructions 126 instruct processing unit 115 to use a 3-D covariance matrix of the inlier points and additional parameters such as the number of inlier points to describe the plane. Plane extraction instructions 126 direct processing unit 115 to calculate the mean point (centroid) of a plane and the covariance matrix and store the information describing the detected plane in a plane set on plane storage 124.
To calculate the centroid ĉi and the covariance matrix Pi, the inlier points in the plane are given by a set of three dimensional points. For example, each plane Πi is described as a set of points as shown by the following equation:
Πi={πi,j}j=1n
Plane extraction instructions 126 direct processing unit 115 to compute the centroid ĉi and covariance matrix Pi for each plane in the NA planes stored in a plane set. The centroid ĉi and covariance matrix Pi, for each plane in plane set, are described by the following equations:
where ni is the number of points in the plane Πi.
Alternatively, other estimators are used to describe a plane. For example, the covariance is computed by the formula
It is to be understood that blocks 206 to 212 can be repeated to select a plurality of planes. For example, in some embodiments, blocks 206 to 212 are repeated until all the points in the point cloud or respective cell are determined to be part of a plane or until a maximum defined number of iterations have been performed. In some embodiments, each iteration is performed on the set of data points reduced by the number of data points already identified as inliers of a plane in previous iterations. In other embodiments, the input points are used in the successive iterations and additional logic is used to ensure the solutions' uniqueness, such as, but not limited to, filtration of the hypotheses generated in the block 206 to be non-repeating. In addition, whereas in this example, only one plane is selected at block 208, in other implementations multiple hypothetical planes may be selected. For example, each hypothetical plane having more than a threshold number of inliers is selected in some embodiments. Similarly, in some embodiments, each hypothetical plane having less than a threshold number of inliers is discarded. In some embodiments, the threshold varies with plane parameters. For example, in some embodiments, a smaller threshold is used for a plane having a centroid farther from the coordinates' origin than for a plane closer to the coordinates' origin.
In method 300 above, a plane has to cover a substantial part of the cell to get a reasonably high probability of the three randomly selected points belonging to the same plane. However, the method 350 assumes that a plane is at least a partially continuous structure and if a randomly selected point belongs to the plane, its small neighborhood belongs there, too. The method 350 begins at block 352 with selecting a random point within a cell. At block 354 method 350 proceeds with selecting all points in a small neighborhood of the selected point, including the selected point. In one embodiment, for Velodyne LiDAR, 6 adjacent lasers and 1.2° azimuth span for the neighborhood is used. In other embodiments, different neighborhood sizes are used.
At block 356, method 350 determines whether there are enough points within the neighborhood. In particular, in this embodiment, method 350 determines if the number of points is greater or equal to a given threshold. The given threshold is set to at least 3, since 3 points are needed to define a plane. In one embodiment, half of the expected points within the neighborhood are used as the threshold. The number of expected points can be determined from sensor characteristics. If the number of points is below the threshold the plane hypothesis is considered to be invalid at block 362 and another plane hypothesis may be generated.
If the number of points is greater than or equal to the threshold, method 350 continues at block 358 with estimating the normal vector and the orthogonal distance of the hypothetical plane from all the points within the selected neighborhood. As described above, any commonly known mathematical estimator, such as a least squares estimator, can be used to estimate the normal vector and the orthogonal distance of the hypothetical plane.
Method 350 proceeds at block 360 with checking the planarity of the selected points within the selected neighborhood. In particular, the normal vector n and the orthogonal distance d, estimated at block 358, are used to determine how many points x within the neighborhood of points used to estimate n and d satisfy the plane constraint |n′x−d|≦t. In one embodiment, the threshold is set to t=0.05 m. In one embodiment, 80% of points x within the neighborhood need to satisfy the constraint, otherwise the hypothetical plane is declared to be invalid. This is optional and for other embodiments, method 350 proceeds from 358 to 364 as the inlier data points are determined without first checking the planarity of points within a neighborhood.
At block 364, method 350 proceeds with determining the inlier data points based on the point-to-plane distance of each data point. The point-to-plane distance for each point is calculated using the calculated normal vector and orthogonal distance. If the point-to-plane distance is smaller than or equal to a threshold, the respective data point is identified as an inlier.
Method 400 proceeds at block 404 with computing a new set of inliers that includes all the points x within a cell that satisfy the equation |n′x−d|≦t. At block 406, method 400 proceeds with determining if the number of inliers increased. For example, in some embodiments, method 400 is repeated until no new inliers are found. If the number of inliers increased, method 400 proceeds at block 408 with determining whether a maximum number of iterations has been reached. In one embodiment, only a single iteration through the cycle is used. In other embodiments, higher numbers of iterations are used. If the maximum number of iterations has not been reached, the method continues at block 402. If the maximum number of iterations has been reached, method 400 ends at block 410. In yet another embodiment, a different metric than a number of inliers is calculated and the refinement is repeated until there is no improvement of the respective metric.
When L iterations of method 550 have been performed, the method 500 proceeds at block 504 with selecting from L sets of inliers the set with highest number of inliers. In one embodiment, when multiple sets have the same highest number of inliers, the first set with the highest number of inliers is selected. In other embodiments, different selection criterions are employed.
Method 500 proceeds at block 506 with again estimating the normal vector and orthogonal distance from the selected highest-number-of-inliers set. After updating the normal vector and the orthogonal distance, the final inliers are determined as a set of points x within the cell, which satisfy the equation |n′x−d|≦t described above, at block 508, where the method 500 ends.
In one embodiment, plane matching using divergence is used to evaluate plane similarity between sets of planes. In one embodiment, plane matching instructions 130 direct processing unit 115 to compare planes extracted from different frames of data. Sensor 110 captures a first frame of a real-world scene 105. Either at a subsequent point in time or from a different location, sensor 110 captures a second frame of the world scene 105. Because sensor 110 might have moved between the capture of the first frame and the second frame, the respective frames are assumed to represent different views of real world scene 105.
As explained above, in system 100, processing unit 115 identifies planes contained within a frame. In one embodiment, those planes are stored as a plane sets in plane storage 124 in terms of a centroid and a covariance matrix. In other embodiments, other plane parameters can be stored in plane storage 124 instead of centroid and covariance matrix. For example, but not limited to, plane storage 124 stores normal vectors, orthogonal distances, and the like. When, for example, the centroid ĉiA and covariance matrix PiA for each plane in the NA planes stored in plane set A, is estimated by the following equations:
where niA is the number of points in the plane ΠiA, i=1, . . . , NA.
Further, the centroid ĉkB and covariance matrix PkB, for each plane in plane set B, is estimated by the following equations:
where nkB is the number of points in the plane ΠkB, k=1, . . . , NB.
In one embodiment, the three dimensional points that comprise a plane are assumed to follow a multivariate Gaussian distribution. The multivariate Gaussian distribution describes a set of correlated real-valued random variables each of which is clustered around the centroid. While the points can follow other probability distributions, the true distribution of the points is unknown and the Gaussian distribution has the highest entropy for a particular centroid and covariance matrix. However, in other embodiments, other probability distributions are used to describe the three dimensional points that comprise a plane. Modeling the three dimensional points as a realization of a random variable described by a probability distribution facilitates the computation of divergence values. Divergence, as used herein, is a function that measures the distance between two probability distributions. Divergence values are calculated using divergence measures such as Kullback-Leibler divergence, Jenson-Shanon divergence, Bhattacharyya distance, and Hellinger distance. Divergence values are also calculated using mutual information, where mutual information is a measurement of the mutual dependence of two variables, correlation, and the like.
When planes are identified in a plane set in terms of a centroid and covariance matrix, plane matching instructions 130 direct processing unit 115 to identify planes that exist in different plane sets. Plane matching instructions 130 also instruct processing unit 115 to estimate a transformation that describes the difference between the position of two different frames. Plane matching instructions 130 instruct processing unit 115 to identify planes that are found in both plane set A and plane set B by calculating the divergence between the planes in both plane sets. Through finding the minimal divergences between the planes of plane set A and the planes of plane set B, using the transformation evaluation instructions 134, processing unit 115 generates a transformation hypothesis that describes the movement of sensor 110 between the capture of a first frame and a second frame. To generate the transformation hypothesis, processing unit 115 applies an initial hypothesis to the planes in one of plane set A or plane set B, where the initial hypothesis attempts to describe the differences between a first frame and a second frame. In some implementations, the transformation hypothesis includes a translation vector and a rotation matrix that are represented as constants. Alternatively, the values used for the translation vector and the rotation matrix are uncertain and the uncertainty is represented by probability distributions. When the transformation hypothesis is applied to the planes in one of plane set A and plane set B, plane matching instructions 130 direct processing unit 115 to calculate a divergence value for combinations of planes in plane set A with planes in plane set B. The combination of planes in plane set A and planes in plane set B that yields the lowest divergence calculation is assumed to be the best matching plane combination. In one embodiment, transformation hypotheses are stored in transformation storage 132.
When processing unit 115 finds a combination of planes that yields the lowest divergence values, the transformation hypothesis is evaluated. In one embodiment, plane matching instructions 130 direct processing unit 115 to combine the results of the divergence calculations for the different matched planes identified in plane set A and plane set B. The combined result is then compared against a threshold or other divergence calculation to evaluate the quality of the hypothesis. When a transformation hypothesis is found that meets a predefined hypothesis criteria, the hypothesis is stored as a final transformation hypothesis. The use of divergence to perform plane matching is described in greater detail below in regards to
Process 600 begins at block 610 by applying a transformation to all of the planes in plane set A or a subset of planes in plane set A. The planes in plane set A are defined by a centroids ĉA and covariance matrices PA. To apply the transformation, process 600 uses a translation vector t and a rotation matrix R. In one embodiment, process 600 determines new centroids ĉA,R, and covariance matrices PA,RT for planes in plane set A as defined by the following equations:
ĉ
A,RT
=R(ĉA−t)
P
A,RT
=RP
A
R
T
In other embodiments, other equations can be used. The values for translation vector t and a rotation matrix R are generally constant, however, they may also be random variables with a mean and covariance matrix.
Process 600 proceeds at block 620 by calculating the divergences between the allowable combinations of planes in plane set B and planes in transformed plane set A. Assuming that the distribution of the points in each plane conforms to a Gaussian distribution, the distribution of a transformed plane in plane set A is defined by the following equation:
P
A,RT
=N(ĉA,RT,PA,RT), which appears like the following when expanded:
where x represents a point, which belongs to the transformed plane in plane set A. Likewise, each plane in plane set B is also defined by a centroid ĉB and a covariance matrix PB. Further, a plane in plane set B is also described by a probability distribution as shown by the following equation:
p
B
=N(ĉB,PB), which appears like the following when expanded:
where x represents a point, which belongs to the plane in plane set B.
The transformed distributions associated with plane set A and the distributions associated with plane set B are compared against one another using a Kullback-Leibler divergence to determine the similarity between two planes. The Kullback-Leibler divergence is calculated according to the following equation:
As the Kullback-Leibler divergence is not symmetric, the reverse divergence for the two planes is calculated according to the following equation:
and then averaged as follows:
Process 600 performs the same calculation for each allowable combination of planes from plane set A and plane set B to acquire a D value for each allowable combination. In other embodiments, alternative measures to Kullback-Leibler are used to compare probability distributions.
The above equations for calculating divergence between planes compare the planes in all three axes of freedom in three-dimensional space. In alternate implementations, the divergence computation compares the planes in restricted directions (axes of freedom). For example, in one implementation, the divergence calculation compares the densities of the planes in the direction of an average normal vector. The D value for calculating the divergence in the direction of an average normal vector is given by the following equation:
The value rA,RT is given by the equation rA,RT=N(nTĉA,RT,nTPA,RTn) and the value rB is given by the equation rB=N(nTĉB,nTPBn), where n is a normalized average vector of normal vectors of particular planes calculated as
In another implementation, the divergence compares the densities in the directions of normal vectors. In this implementation, the D value is represented by the following equation:
The values rAB, rAA, rBA, and rBB are defined by the following equations:
r
AB
=N(nBTĉA,RT,nBTPA,RTnB)
r
AA
=N(nA,RTTĉA,RT,nA,RTTPA,RTnA,RT),
r
BA
=N(nA,RTTĉB,nA,RTTPBnA,RT), and
r
BB
=N(nBTĉB,nBTPBnB).
The value nA,RT is a normal vector of the rotated and translated plane from plane set A. The value nB is a normal vector of a plane from plane set B.
In a further implementation, the divergence compares the densities in the direction of the smallest eigenvector. In this implementation, the D value is represented by the following equation:
The value rA,RTev is given by the equation rA,RTev=N(nBiTĉA,RT,nBiTPA,RTnBi) and the value rBev is given by the equation rBev=N(nBiTĉB(i),nBiTPB(i,i)nBi) where i is the index of the smallest eigenvalue of PB and nBi is its corresponding eigenvector. Note that the previously mentioned possibilities for computation of the D value are examples and not a complete list of all possibilities.
Process 600 proceeds at block 630 by identifying the combination of planes that yielded the lowest D values for the plane combinations. The planes that yield the lowest D values are considered to be the most similar planes. Further when the D values are identified, process 600 proceeds at block 635 by comparing the D values against a threshold T as shown by the following equation:
D≦T.
When a D value is less than or equal to the threshold T, a plane from plane set A is considered to match a plane from plane set B and the transformation hypothesis used to calculate the D values along with the resultant D values are stored in a memory device (shown at 650). When the D value is greater than the threshold T, a new transformation is identified and process 600 proceeds at block 640 by identifying a different transformation hypothesis and iteratively recommencing at block 610.
Further, when the D values are calculated, process 700 proceeds to block 730 by determining a quality measure for the transformation hypothesis. Process 700 calculates the quality measure by compiling all the D values stored in the memory to make a unitary quality measure. The unitary measure of the D values is formed by summing, or weighted summing, the D values together, finding the average or weighted average of the D values, multiplying or weighted multiplying of the D values together, and the like. In some embodiments, when weights are used for calculation of the D values, weights are set a priori by a user. Alternatively, the weights are determined by properties of each pair of planes. For example, properties of planes that are used to determine weights include plane orientation in space, plane size, plane smoothness, and the like. The quality measure is compared against a quality measure threshold value to determine the sufficiency of the transformation hypothesis. When the quality measure is calculated and the quality measure indicates that the transformation hypothesis was not of sufficient quality, process 700 directs process 600 to recommence to find a new transformation hypothesis. When the quality measure is sufficient, process 700 proceeds to block 740 by storing the quality measure and transformation hypothesis on at least one memory device such as memory device 120. The transformation hypothesis is then used as a final transformation hypothesis.
The methods above produce an output in the form of a hypothesis stored on memory device 120. The hypothesis defines a transformation that represents an estimation of the differences between a first frame and a second frame. In some implementations, the transformation is used to transform planes appearing in a first frame into planes appearing in a second frame, or match planes appearing in the first frame with corresponding planes in the second frame. In other words, a plane identified in the first can be identified and located in the second frame using the transformation hypothesis.
In one embodiment, plane merging using divergence is used to evaluate two planes which might represent a single physical surface in a real world scene. In some embodiments, plane merging instructions 128 in
In certain embodiments, the three dimensional points that comprise a plane are assumed to follow a multivariate Gaussian distribution. In other embodiments, other probability distributions are used. Because the points follow a probability distribution, processing unit 115, executing plane merging instructions 128, uses divergence values to compare the centroid and covariance matrix of a plane with the centroid and covariance matrix of another plane in a plane set. In comparing the distributions of two different planes from the same plane set, processing unit 115 determines the similarity between two different planes. If the divergence value between two different planes is below a certain threshold, processing unit 115 merges the planes together. Processing unit 115 calculates the divergence using similarity measures like Kullback-Leibler divergence, Jenson-Shanon divergence, Bhattacharyya distance, Hellinger distance, mutual information, correlation, and the like. A further description of the execution of plane merging instructions 128 is found below.
As illustrated in
In one embodiment, processing unit 115, executing plane merging instructions 128, estimates the plane area using an envelope model. The envelope constructed around the plane is in fact an ashlar with four narrow sides orthogonal to the plane. The remaining two sides (the largest ones) are parallel with the plane. The area of either largest side of the envelope is used as a measurement of the area of the plane it represents. The two largest sides are parallel and both their shape and their area are identical. In other embodiments, other estimates of the plane area might be used.
In one embodiment, an envelope for a plane is constructed as follows based on determining a covariance matrix P for the three-dimensional points set corresponding to the detected plane. A plane is specified by its centroid ĉ, points covariance matrix P, normal vector n and the orthogonal distance d. Having a plane consisting of N plane points (sensor returns) xi, the following relations hold:
the envelope is then constructed from the principal components of the plane as follows:
From the covariance matrix P, one can get the eigenvalues λ1, λ2 and λ3 and corresponding eigenvectors v1, v2 and v3, where λ1≧λ2≧λ3. The eigenvalues are variances in directions of eigenvectors (the principal components). Eigenvectors are orthogonal and both eigenvectors and eigenvalues depend on the orientation and size of the plane point-cloud in a 3-dimensional space. Moreover, since the point cloud is a plane, v3≈n. Points in the v1 and v2 directions are spread rather uniformly while they are Gaussian in v3 direction. In one embodiment, before constructing the envelope, λ1 and λ2 are each multiplied by (0.9*√3)2 to compensate for different spread of points in these directions. This correction ensures that the envelope is supposed to contain around 90% of plane points.
Because the eigenvalues are variances in the main directions, one can therefore take their square roots to get standard deviations. In one embodiment, the square root of the smallest eigenvalue (λ3) is used to obtain a measurement of a plane's thickness. In other embodiments, other estimates of the plane's thickness might be used. The square roots of the remaining two eigenvalues (λ1 and λ2) are used to model the plane as a rectangle. Having unit-length eigenvectors v1 and v2, four corners of the rectangle that models the plane are then given by c±√{square root over (λ1)}v1±√{square root over (λ2)}v2. The area of such rectangle is given as 2*√{square root over (λ1)}*2*√{square root over (λ2)}, which is an estimate of the plane size. The above description is one way to estimate the plane area and is not meant to be limiting. Other means for defining area are available to those of ordinary skill in the art and can be used to differentiate larger detected planes from smaller planes from the data set. Processing unit 115, executing plane merging instructions 128, constructs a rectangular parallelepiped (or an ashlar) envelope around the point cloud representing the plane. Since eight vertices of such ashlar are given as c±√{square root over (λ1)}v1±√{square root over (λ2)}v2±√{square root over (λ3)}v3, our representation is equivalent to taking the largest side from the ashlar (which has 6 sides—3 pairs, each pair consisting of sides of exactly same shapes) and using it to estimate the plane area.
If the plane area estimate is used only for sorting purposes, in some embodiments, multiplicative constants can be omitted. For example, the estimate √{square root over (λ1)}*√{square root over (λ2)} could be used instead of 2*√{square root over (λ1)}*2*√{square root over (λ2)} and the multiplication by (0.9*√3)2 as mentioned above can be skipped.
Once an area for each of the detected planes is calculated, processing unit 115 sorts the planes in the list of detected planes 810 in descending order. This ordering is performed because the largest detected planes are the most likely to be real and distinct planes rather than false positives, and thus are the most stable and provide the most accurate estimates. As mentioned above, in some embodiments, other sorting criteria might be used than the plane area.
For each plane in the detected plane list 810, the point prediction estimator calculates a number of predicted points that can be expected to form the plane for a given plane's parameters. This can alternately be performed either before or after list of detected planes 810 is sorted by area. Point prediction is sensor specific. That is, for a plane of a given size, location, and orientation, one can expect there to be “N” number of point returns on that plane when the resolution of the sensor is known. For example, for the Velodyne LiDAR, lasers are distributed in specific horizontal and vertical angular resolutions. The number of laser points returned for a given plane depends on the distance of the plane (not the orthogonal distance), the angle under which it is seen from the point of view of the LiDAR, and the size of the plane.
In one embodiment, one can use spherical angles to predict the number of sensor returns. There are many algorithms for spherical angle computation known to persons skilled in the art. Having the spherical angle, the number of returns can be predicted when the angular resolution of the sensor is known.
In the explanations that follows, the sensor used to obtain data is the Velodyne LiDAR HDL-64E. One of ordinary skill in the art after reading this specification would appreciate that description provided below is readily adapted to other sensors.
Since the performance of the spherical angle predictor is not always sufficient, in another embodiment, a model-based estimator might be used. First, the plane is again modeled by the rectangular model described above. In this case, include all multiplication constants when constructing the rectangular model. So, the rectangular model vertices are given by c±√{square root over (λ1)}v2±√{square root over (λ2)}v2 where both λ1 and λ2 are before multiplied by (0.9*√3)2 as described above. The challenge is that the rectangular model will not likely be orthogonal to the direction of view, but it is typically skewed in various directions.
We proceed constructing the number of points (sensor returns) prediction model the following way. Recalling that a plane is represented by its equation n′x=d and its centroid c, projection axes for a Velodyne LiDAR, uV; vV; wV are computed the following way:
1.
2. vV is given by the following conditions v′VuV=0: and v′V[0 0 1]=0. This specifies a line. When the norm is 1, two solutions emerge differing by sign either of which can be picked. The solution is found as
This formula does not provide a unique solution in a special case, when
In such a case, any unit-length vector orthogonal to [0 0 1] can be picked as vV, for example [1 0 0].
3. wV=uV×vV
To estimate lengths of intersection of the rectangle envelope and both horizontal and vertical projection planes, define the horizontal projection plane as w′Vx=0 and the vertical projection plane as v′Vx=0, x being an arbitrary point in three-dimensional space. Looking at the horizontal case first, compute the direction vector dh of the intersection from the following conditions: dh′n=0 (dh belongs to the plane), dh′wV=0 (dh belongs to the horizontal projection plane), and dh′vV=1 (dh is not orthogonal to vV). The vertical direction vector dv is derived the same way. Therefore we get:
dv and dh are further normalized, since they are not unit-length by default.
Denoting the vectors representing sides of the plane-rectangle as a,b (with their norm being equal to rectangle sides), also denote
Starting with dh, compute intersections with two infinite-length bands, one formed by ‘b’ sides of the rectangle and the second one formed by ‘a’ sides. The minimum of those two is the intersection with the rectangle envelope. Therefore, for the length of horizontal intersection ih, we have
Similarly, for the vertical intersection,
Having iv, ih, compensate for skewness of the plane, taking iv·|d′v wV| and ih·|d′h vV| instead. Since compensated iv·|d′v wV| and ih·|d′h VV| are evaluated on a vector orthogonal to c, use trigonometric functions to obtain αh and αv using:
The final estimate of points on the plane for Velodyne LiDAR HDL-64E is given as:
Accordingly, the estimator 815 updates the list of detected planes 810 to include a predicted point estimate for each plane in the list, as shown at 820.
The process 800 proceeds to a filter algorithm, illustrated at 825, which removes suspected false planes from the list of detected planes 810, based on area and predicted point estimates, to arrive at a list of planes that are candidates for merger 840. Filter algorithm 825 begins at 831 with discarding any plane(x) from the list of detected planes 810 that contains fewer laser returns in either the vertical or horizontal direction than a predetermined minimum point criteria. For example, in one embodiment, filter 825 discards any plane whose envelope contains less than 4 laser returns in a vertical direction, or less than 7 laser returns in a horizontal direction. Next, filter 825 proceeds to block 832 applying a second criteria and compares the number of actual laser return points received within the envelope against the number of predicted points estimated for that plane by point prediction estimator 815 (shown at 832). For example, in one embodiment, filter algorithm 825 proceeds by discarding any plane(x) from the list of detected planes 810 where the ratio of the number of predicted points to the number of actual points is greater than or equal to a discard criteria (for example, ≧8). Planes from the list of detected planes 810 that emerge from the filter 825 form the list of candidates for merger 840. Note that the remaining planes in the list of candidates for merger 840 remain sorted according to the used sorting criteria.
In one embodiment, an optional third criteria is applied after the second criteria. Filter algorithm 825 proceeds at block 833 by setting aside any remaining planes in the list of detected planes 810 where the ratio of the number of predicted points to number of actual points is greater than or equal to a “set aside” criteria (for example ≧5). Planes that meet this set aside criteria will be removed from the list of detected planes 810 but not discarded. Instead, set aside planes are placed into a separate list of “set-aside” planes 845 which will be separately considered for merging as described in more detail below. Accordingly, for embodiments that apply this optional third criterion, planes from the list of detected planes 810 that are not discarded or set-aside emerge from the filter 825 as the list of candidates for merger 840.
In one embodiment, a process for building a list of merged planes is described in
Primary merge algorithm 910 begins by seeding a list of merged planes 990. At block 920, primary merge algorithm 910 seeds list of merged planes 990 by selecting the largest plane from the list of candidates for merger 840 and moving it into list of merged planes. Since the list of candidates for merger 840 is sorted by area in descending order, the largest plane will be the first plane from that list. In other embodiments, when different sorting criterion is used, primary merge algorithm selects a plane other than the largest plane on the list by selecting the plane according to the sorting criterion that was used to organize the planes.
In each iteration, primary merge algorithm 910 proceeds at block 930 by taking the first (the largest, since the list is ordered by size in descending order) plane from the list of candidates for merger 840 (shown at 930) and removes it from the list. Primary merge algorithm 910 then continues at block 940 by sequentially examining all planes that are already in the list of merged planes 990. For each pair of planes formed by the plane taken from the list of candidates for merger and by a plane from list of merged planes 990, primary merge algorithm 910 proceeds at 950 by calculating the mathematical divergence between the planes. In one embodiment, two planes are considered similar if the divergence between the two planes is less than or equal to a predetermined divergence threshold.
In this embodiment, as explained above, a plane is described by a probability distribution with both a centroid ĉ and a covariance matrix P. Therefore, the first plane taken from the list of planes is described by centroid ĉ1 and covariance matrix P1 and the second plane taken from the list of merged planes is described by centroid ĉ2 and covariance matrix P2. As the planes are defined by a centroid and a covariance matrix, the distribution of the points characterizing the planes is assumed to be in a Gaussian distribution as shown by the following equations:
p
1
=N(ĉ1,P1) for the first plane; and
p
2
=N(ĉ2,P2) for the second plane.
In other embodiments, other probability distributions are used. Further, when other probability distributions are used, the distributions will use parameters that characterize the plane that may include parameters other than the centroid and covariance matrix.
Merge Algorithm 910 evaluates the similarity of the first plane and second plane by calculating the divergence between the first and second planes. In some implementations, the primary merge algorithm 910 calculates the divergence using Kullback-Leibler divergence. When Kullback-Leibler divergence is used, algorithm 910 calculates the divergence according to the following equation:
As the Kullback-Leibler divergence is not symmetric, the reversed divergence is also calculated according to the following equation:
The results of the divergence calculations are averaged together to calculate a divergence value for the first and second planes according to the following equation:
Primary merge algorithm 910 proceeds at block 955 by comparing the divergence value D against a threshold value T. If D≦T, then the first and second planes are considered similar.
If a pair of planes is not similar, then primary merge algorithm 910 continues at block 965 by returning to block 940 and sequentially examining other planes in the list of merged planes and proceeds to the consideration of the next pair. When the divergence between two planes is below a threshold value, primary merge algorithm 910 proceeds at block 960 with creating a hypothetical merged plane.
Primary merge algorithm 910 creates a hypothetical merged plane from two planes where the divergence between the two planes is less than a threshold value, and by mathematical computation determines a probabilistic representation that includes, but is not limited to, the hypothetical merged plane's centroid, normal vector, plane thickness, covariance matrix, and the like.
In this implementation, the mathematical computations determine a new merged centroid and covariance matrix based on the points in the first and second planes. First plane includes n1 points, where Π1={π1,j}j=1n
In another embodiment, the merged plane parameters are estimated from the parameters, such as ĉM and PM, of the two planes being merged without using the original points of the two planes being merged.
When the hypothetical merged plane has been created, the merge algorithm 910 computes divergences between both original planes and the merged plane and proceeds at 968. At 968, merge algorithm 910 compares both divergences against a threshold value. In certain embodiments, the threshold value used at 968 is smaller than the threshold value used at 955. When both the calculated divergences are less than the threshold value, primary merge algorithm 910 proceeds at 970 by replacing the first plane from the pair of compared planes in the list of merged planes with the hypothetical merged plane. Primary merge algorithm 910 then returns to block 930 to select a different non-merged plane from the list of candidates for merger 840. When the calculated divergence is greater than the threshold, merge algorithm 910 proceeds at block 965 by determining whether there are other planes yet to be examined in list of merged planes 990. If there are other planes, primary merge algorithm 910 leaves the first plane in the list of merged planes 990 and the iterative algorithm continues by checking whether there is still at least one plane that has to be processed at step 985, returning to step 930 and picking another plane from the list of candidates for merger 840. When a given plane from the list of candidates for merger 840 is tested against every plane in the merged plane list and the divergence of each test is greater than the respective threshold, then primary merge algorithm 980 at block 980 proceeds by adding the given plane to the list of merged planes 990 as a distinct plane, and removes the given plane from the list of candidates for merger 840. Such a plane is added to the list of merged planes because it may represent an independent distinct plane in the real world scene rather than a fragment of another plane already in the merged plane list 990. Merge algorithm 910 proceeds at block 985 by continuing until all the planes from the list of candidates for merger 840 are either merged into a plane in the merged plane list 990, or added to the merged plane list 990 as a distinct plane.
In one embodiment, assuming that no optional list of “set-aside” planes was generated, then the output from the primary merge algorithm 910 represents the final list of planes. In one alternate embodiment, to arrive at a final list of planes, those planes from the merged plane list that have an area less than or equal to a minimum area threshold (such as 0.1 m2, for example) are discarded. The final list of planes may then optionally be sorted by area.
In one embodiment, where the optional list of “set-aside” planes was generated, a secondary merge algorithm 1010 is applied to determine whether any of the “set-aside” planes can be merged with any plane from the list of merged planes 990 generated by the primary merge algorithm 910. This process is described in
Secondary merge algorithm 1010 attempts to merge planes from the list of set-aside planes 845 with planes in the list of merged planes 990. Planes in the list of set-aside planes that are not “loosely similar” to any plane in the list of merged planes are discarded.
In each iteration, the secondary merge algorithm 1010 begins at block 1020 by taking the first plane from the list of set-aside planes 845 and removing it from list 845. Secondary merge algorithm 1010 then continues at block 1030 by sequentially examining the planes that are already in the list of merged planes 990. Secondary merge algorithm 1010 then proceeds at block 1040 by calculating the divergence for a pair of planes formed by the plane taken from the list of set-aside planes and by a plane from the list of merged planes 990.
Secondary merge algorithm 1010 proceeds at block 1045, if the calculated divergence for the pair of planes exceeds a predefined threshold, the pair of planes is not similar and secondary merge algorithm 1010 proceeds to block 1055 and determines if there are other planes available for comparison in list of merged planes 990. When there are still planes available for comparison in list of merged planes 990, secondary merge algorithm returns to block 1030 by sequentially examining other planes in the list of merged planes. When the calculated divergence is below the predefined threshold, the algorithm 1010 proceeds at block 1050 by creating a hypothetical merged plane and, by mathematical computation, determining the centroid and covariance matrix, in conjunction with other characteristic parameters, of the hypothetical merged plane.
The secondary merge algorithm 1010 then calculates divergences between both original planes and the hypothetical merged plane. Secondary merge algorithm 1010 proceeds at block 1058 by checking whether both divergences are below a given threshold. In certain embodiments, the threshold at 1058 is smaller than the threshold used at block 1045. If both divergences are less than the threshold, the secondary merge algorithm 1010 continues at 1060 by replacing the first plane from the pair of planes with the hypothetical merged plane in the list of merged planes 990 and the method returns to block 1020, where another candidate for merger is picked from the list of set-aside planes, 845. If either of the calculated diverges is greater than the threshold tested at 1058, secondary merge algorithm 1010 checks, at 1055, whether there are still planes in list of merged planes 990 to be compared to the selected plane. If there are further planes in list of merged planes 990 to compare against the selected plane, secondary merge algorithm 1010 proceeds by returning to 1030. When a second plane, from the pair, picked from the list of set-aside planes is tested against every plane in the merged plane list and the calculated divergence values fail to be below the predefined threshold, the second plane is discarded. Secondary merge algorithm 1010 proceeds at 1075 by iteratively processing the list of set aside planes 845 until every plane in that list is either merged into the list of merged planes 990, or discarded.
For this embodiment, the resulting list of merged planes 990 that is output from the secondary merge algorithm 1010 represents the final list of planes. In another alternate embodiment, to arrive at a final list of planes, those planes emerging from secondary algorithm 1010 that have an area less than or equal to a minimum area threshold (such as 0.1 m2, for example) are discarded. The final list of planes may then be optionally sorted by area or other desired criteria.
The method 1100 proceeds to block 1115 with estimating a number of predicted points expected to form each plane based on its area and orientation, and based on resolution characteristics of the sensor. That is, for a given sensor, a given number of return points can be estimated for a plane of a given size and relative orientation with respect to the sensor. One means for calculating predicted point is provided above.
The method 1100 proceeds to block 1120 with generating a list of detected planes that includes, but is not limited to, the area of each plane, and the number of predicted points expected to form the plane. The planes in list of detected planes are ordered by plane area in descending order, or other desired criteria set by a user, as described above.
The method 1100 proceeds to block 1125 with filtering the list of detected planes to produce a list of candidates for merger, where filtering the list of detected planes discards any plane not satisfying an actual points received criterion and discards any plane not satisfying a primary predicted-points to actual-points ratio criterion. In one embodiment, filtering the list of detected planes further identifies a list of set-aside planes that satisfy the primary predicted-points to actual-points ratio criterion but do not satisfy a secondary predicted-points to actual-points ratio. These planes are set-aside for later processing to see if they can be merged with planes formed by the first merging algorithm. If they cannot be merged, they are discarded. Planes included in the list of set-aside planes are not also included in the list of candidates for merger.
The method 1100 proceeds to block 1130 with applying a primary merge algorithm to the list of candidates for merger, wherein the primary merge algorithm iteratively produces a list of merged planes by calculating the divergence between planes forming a hypothetical merged plane, wherein the hypothetical merged planes each comprise a first plane from the list of merged planes and a second plane from the list of candidates for merger. As discussed above, if all calculated divergences between planes forming the hypothetical merged plane and divergences between the hypothetical merged plane and original planes are below respective predefined thresholds, the primary merge algorithm replaces the first plane in the list of merged planes with the hypothetical merged plane, and removes the second plane from the list of candidates for merger. When at least one calculated divergence is not below a predefined threshold, the plane picked from the list of candidates for merger to the list of merged planes as a distinct plane.
In one embodiment, when the list of set-aside planes is optionally generated at 1125, the method 1100 further optionally includes applying a secondary merge algorithm using the list of candidates for merger and the list of set-aside planes. The secondary merge algorithm tests hypothetical merged planes that each comprises of a first plane from the list of merged planes and a second plane from the list of set-aside planes by comparing the divergence between the first plane and the second plane and among both the first and the second plane and the hypothetical merged plane against a predefined respective thresholds. When all the divergence values are below given respective thresholds, the planes are retained and the hypothetical merged plane replaces the plane from the list of merged planes while the plane from the list of set-aside planes is discarded. In certain embodiments, all planes from the list of set-aside planes that fail to have a all divergence values less than the respective threshold are discarded.
The method 1100 proceeds to block 1140 with outputting a final list of planes based on the output of the primary merge algorithm. In one embodiment, prior to outputting the final list of planes, the list is filtered to remove any plane that has an area not satisfying a minimum area threshold (such as 0.1 m2, for example). In one embodiment, the final list of planes is sorted by area. In other embodiments, different sorting criteria is used to sort the final list of planes. In one embodiment, the final list of planes is stored to a physical data storage device such as, but not limited to a drive or memory.
The method described above thus can be viewed as performing two separate tasks. One removes false positives which are planes discarded because they are defined by only a small number of points compared to the number of points we would expect. The other performs the merging of planes. The two tasks can operate independently and in alternate embodiments, either can be skipped. For example, the primary merge algorithm in block 1130 can, in one embodiment, operate on a list of detected planes that has not been filtered based on predicted point estimates.
In one embodiment, the transformation hypothesis is applied to the practical field of self navigating vehicles.
In certain embodiments, sensor 1210 transmits captured data to processing device 1212, where upon processing 1212 device stores the data in a frame points storage 1222 on data storage device 1220. Data storage device 1220 also stores computer instructions that direct processing unit 1215 to calculate a transformation hypothesis from the data stored in frame point storage 1222. Data storage device 1220 stores a plane extracting instructions 1226, a plane merging instructions 1228, a plane matching instructions 1230, and transformation evaluation instructions 1234. Plane extracting instructions 1226 direct processing unit 1215 to extract probability distributions representing identified planes from frame points stored in frame points storage 1222 and store them in plane storage 1224. Plane merging instructions 1228 direct processing unit 1215 to iteratively compare the identified planes in a frame of data and merge frames that are similar to one another as explained above in relation to primary and secondary merge algorithms 910 and 1010. Plane matching instructions 1230 instruct processing unit 1215 to compare planes in different frames of data, using the transformation evaluation instructions 1234, calculate a transformation hypothesis based on the divergence between planes in the frames, and evaluate the quality of the transformation hypothesis as described in relation to processes 600 and 700 in
In one embodiment, in operation, sensor 1210 captures a first frame of a real world scene 1205. Vehicle 1200 subsequently travels to a second location and sensor 1210 captures a second frame of real-world scene 1205. In one implementation, vehicle 1200 has at least approximate knowledge of its own coordinates with respect to a first frame of the real world scene 1205 as it captures the first frame of data. From the first and second frames of data, processing unit 1215 calculates and stores, on a data storage device 1220, a transformation hypothesis. In one embodiment, vehicle 1200 then determines coordinates for its new position by applying the transformation hypothesis stored on data storage device 1220 to its coordinates in the navigation frame. The difference in coordinates is also used to determine vehicle parameters such as, but not limited to, vehicle 1200's velocity (when time between data capture is known), heading, and orientation (i.e., yaw, pitch, and roll). In another embodiment, vehicle 1200 applies the transformation hypothesis to known obstacles previously identified in the first frame to estimate the relative position of those objects at its new location, even when those objects do not appear in the second frame. As this suggests, it is not necessary for the two frames used for determining the hypotheses to be sequentially taken. frames taken minutes, hours, days or years apart are also processed against current frames as long as the frames contain at least overlapping data associated with a relatively static scene 1205. Further, it is not necessary for the data to be captured by the same sensor 1210. Data captured from multiple sensors are used as long as they implement the same transformation when capturing the scene into a projection. Also, the two projections for which the plane matching is desired do not have to be captured by the same sensor at two times, but equivalently by two or more devices at the same time or some may be generated from a priori known data.
In another embodiment, in operation, sensor 1210 captures a frame of data associated with a real world scene 1205. Using a priori given map of planes, the processor 1215 calculates the divergence between planes in the frame of data and planes in the map. The hypothesis then defines the position of the vehicle 1200 in the navigation reference frame aligned with the map.
In another embodiment, the transformation hypothesis is readily applied to the field of automated map building using vehicle 1200 or to obtain the matching planes for triangulation or reprojection purposes, such as for 3D stereoscopic reprojections. For example, with alternate embodiments of the present invention, static planes identified in one data frame are correlated to similar planes identified in a second data frame in order to combine the two frames into a third frame that preserves information regarding the relative position of objects in the two frames. By repeating this process, as vehicle 1200 travels, a map is developed and saved into memory 1220 that can serve various purposes, it might be used, for example, to identify pathways that can be traversed without hindrance from obstacles or it might serve for navigation of other vehicles, etc. Similarly, in other embodiments, the processing unit 1215 applies plane matching using divergence processes to create a mosaic frame in memory 1220 from separate captured frames, by overlapping correlating planes from a first and second captured frames.
The method 1300 proceeds at 1310 with applying a transformation hypothesis to a first plane in the first plane set. Method 1300 proceeds at 1312 with calculating a divergence value between the transformed first plane and a second plane in the second plane set. The method proceeds at 1314 with writing the divergence value to the at least one memory device. In at least one embodiment, the divergence is used to further calculate a transformation hypothesis. Alternatively, the divergence is used to evaluate the quality of the transformation hypothesis.
Several means of hardware are available to implement the systems and methods of the current invention as discussed in this specification. These means of hardware include, but are not limited to, digital computer systems, microprocessors, general purpose computers, programmable controllers and field programmable gate arrays. Therefore other embodiments of the present invention are program instructions resident on computer readable storage media which when implemented by such devices, enable them to implement embodiments of the present invention. Computer readable media include any form of physical computer data storage hardware, including but not limited to punch cards, magnetic disk or tape, any optical data storage system, flash read only memory (ROM), non-volatile ROM, programmable ROM (PROM), erasable-programmable ROM (E-PROM), random access memory (RAM), or any other form of permanent, semi-permanent, or temporary memory storage system or device. Program instructions and code include, but are not limited to computer-executable instructions executed by computer system processors and hardware description languages such as Very High Speed Integrated Circuit (VHSIC) Hardware Description Language (VHDL).
Although specific embodiments have been illustrated and described herein, it will be appreciated by those of ordinary skill in the art that any arrangement, which is calculated to achieve the same purpose, may be substituted for the specific embodiments shown. Further, elements of the various embodiments described above can be combined to form yet other embodiments. Therefore, it is manifestly intended that this invention be limited only by the claims and the equivalents thereof.
This application is related to co-pending U.S. patent application Ser. No. 12/775,865 (applicant docket number H0024871) entitled “SYSTEM AND METHOD FOR EXTRACTION OF FEATURES FROM A 3-D POINT CLOUD” filed on May 7, 2010, herein incorporated in its entirety by reference and referred to herein as the '865 application. This application is related to co-pending U.S. patent application Ser. No. 12/436,224 (applicant docket number H0020938) entitled “SYSTEMS AND METHODS FOR EXTRACTING PLANAR FEATURES, MATCHING THE PLANAR FEATURES, AND ESTIMATING MOTION FROM THE PLANAR FEATURES” filed on May 6, 2009, herein incorporated in its entirety by reference and referred to herein as the '224 application. This application is related to co-pending U.S. patent application Ser. No. 12/644,559 (applicant docket number H0023848) entitled “SYSTEMS AND METHODS FOR MATCHING SCENES USING MUTUAL RELATIONS BETWEEN FEATURES” filed on Dec. 22, 2009, herein incorporated in its entirety by reference and referred to herein as the '559 application. This application is related to co-pending U.S. patent application Ser. No. 12/846,265 (applicant docket number H0027096) entitled “SYSTEMS AND METHODS FOR PROCESSING EXTRACTED PLANE FEATURES” filed on Jul. 29, 2010, herein incorporated in its entirety by reference and referred to herein as the '265 application.