Embodiments discussed herein regard devices, systems, and methods for merging three-dimensional (3D) point clouds. Error of the merged 3D point cloud can be bounded by errors of the individual 3D point clouds.
Discussed herein are methods, systems, and devices for registering a first 3D point cloud (or a portion thereof) to one or more other 3D point clouds (or a portion thereof) to generate a merged 3D point cloud. One or more the first and second 3D point clouds can include an associated error. The associated error can be propagated to the merged 3D point cloud. The error of the 3D point cloud can be used in a downstream application. Example applications include targeting and mensuration. A targeteer (one who performs targeting) can benefit from the error to better inform their targeting location choice. A mensuration of an object can benefit from the error as well.
The merged 3D point clouds can include error that is better than either of the first and second 3D point clouds individually. For example, if the first 3D point cloud includes a lower error (relative to the second 3D point cloud) in the x and y directions and the second 3D point cloud includes a lower error (relative to the first 3D point cloud) in the z direction, the merged 3D point cloud can include error bounded by the first 3D point cloud in the x and y directions and by the second 3D point cloud in the z direction. The merged point cloud can thus inherit the better of the errors between the first and second point clouds for a specified parameter.
Reference will now be made to the FIGS. to describe the methods, systems, and devices for merging 3D point clouds with error propagation.
The first 3D point set 104 or the second 3D point set 106 can include a point cloud, a 3D surface, or the like. The first 3D point set 104 and the second 3D point set 106 can include (x, y, z) data for respective geographic regions. The geographic regions of the first 3D point set 104 and the second 3D point set 106 at least partially overlap. One or more of the first point set 104 and the second point set 106 can include intensity data. Intensity data can include one or more intensity values, such as red, green, blue, yellow, black, white, gray, infrared, thermal, or the like. One or more of the first point set 104 and the second the point set 106 can include error data. The error data is illustrated as being a separate input in
The tie points 108 can associate respective points between the first 3D point set 104 and the second 3D point set 106. The tie points 108 can indicate a first point (x1, y1, z1) in the first 3D point set 104, a second point (x2, y2, z2) in the second 3D point set 106 or an error associated with the tie point 108 (shown as separate input tie point error 110). The tie point error 110 can indicate how confident one is that the first and second points correspond to the same geographic location. The tie point error 110 can include nine entries indicating a covariance (variance or cross-covariance) between three variables. The three variables can be error in the respective directions (x, y, z). A covariance matrix representation of the tie point error 110 is provided as
where the diagonal terms are respective variances in the given directions, and the off-diagonal terms are covariances between the directions.
The first or second point set error 112 can sometimes be improved, such as to be more rigorous. Sometimes, the first or second point set error 112 can be in a form that is not digestible by the bundle adjustment operation 118. The point set error 112 can be conditioned by a condition point set error operation 114 to generate an error matrix 116. The condition point set error operation 114 can include generating a covariance matrix 116 of error parameters of the first 3D point set 104 or the second 3D point set 106. The error parameters can include seven parameters. Three of the parameters can include translation in x, y, and z, respectively. Three of the parameters can be for rotation in x, y, and z (roll, pitch, and yaw), respectively. One of the parameters can be for a scale factor between the first 3D point set 104 and the second 3D point set 106. An example of the matrix 116 produced by the condition point set error operation 114 is provided as
where
The bundle adjustment operation 118 can receive the tie points 108, tie point error 110, first 3D point set 104, second 3D point set 106, and the error matrix 116 at input. The bundle adjustment operation 118 can produce a merged 3D point set 120 and a merged 3D point set error 122 as output. The bundle adjustment operation 118 can use a least squares estimator (LSE) for registration of the first 3D point set 104 and the second 3D point set 106. The operation 118 is easily extendable to merging more than two 3D data sets even though the description regards only two 3D data sets at times. The bundle adjustment operation 118 can use one or more photogrammetric techniques. The bundle adjustment operation 118 can include outlier rejection. The bundle adjustment operation 118 can determine error model parameters for the 3D data sets. Application of the error model parameters to the first 3D point set 104 and the second 3D point set 106, results in the relative alignment (registration) of the first 3D point set 104 and the second 3D point set 106.
This section establishes some preliminary notational conventions and symbol definitions used for developing the formulations for the bundle adjustment operation 118. The bundle adjustment operation 118 can include identifying a ground point that reduces a discrepancy between the ground point and corresponding points in respective images, and then adjusting points in the 3D data sets in a manner that reduces the discrepancy. The term “3D data set” is sometimes referred to as an “image”. For convenience, example sizes of vectors and matrices are indicated below the symbol. Thus, the symbol
denotes a matrix A with N rows and M columns. Column vectors from R3 thus have the annotation 3×1. Components of a vector V are written as
If the vector includes diacritical marks or distinguishing embellishments, these are transferred to the components, as in
Equation modeling of the relationship between points in one 3D space to corresponding points in another 3D space is now described. A common reference space is established across all of the images. The reference space can be constructed to accommodate a simultaneous adjustment of more than two images. Correspondences can be formed between points in the reference space and the measured conjugate point locations in each image. The observation equation can be represented as Equation 1:
where {circumflex over (V)} is a reference-space 3D coordinate, {tilde over (V)} is the observation of {circumflex over (V)} in an image and the orientation and offset relationship between reference space and image space is taken from the orientation matrix T and offset vector
where the symbols “c” and “s” denote trigonometric cosine and sine functions, respectively. The quantities
refer to rotation angles (roll, pitch and yaw) about an image's x, y, and z axes respectively. The scalar s represents an isometric scale correction factor (nominally zero). The above form is conducive to modeling a simultaneous least squares adjustment of all images' offsets and orientations, provided that estimates of reference space coordinates for all conjugate image observations vectors are available. This form is more suitable and flexible than explicitly holding a single image as a reference for at least one of several reasons: (1) there are reference space ground coordinates that permit the potential use of ground control points, whose a priori covariances are relatively small (e.g., they carry high weighting in the solution); (2) the above formulation is suitable for a simultaneous adjustment for data that includes small or minimal overlap (mosaics), as well as, many images collected over the same area (stares) or any combination in between; and (3) allowing all images to adjust simultaneously provides improved geolocation accuracy of the post-adjusted and merged images. parameters.
The symbol {circumflex over (V)} will be referred to as a ground point (akin to tie point ground locations and ground control point locations in a classical photogrammetric image adjustment). The symbol {tilde over (V)} will be referred to as a ground point observation (akin to image tie point observation locations in a classical photogrammetric image adjustment).
Unlike the classical photogrammetric treatment, {circumflex over (V)} and {tilde over (V)} are both “on the ground” in the sense that they both represent ground coordinates in 3D (in the classical imagery case, the observations are in image space and are thus 2D coordinates). Further, the point may very well not be “on the ground” but could be on a building rooftop, treetop canopy, etc. However, the terminology “ground point” and “ground point observation” will be used.
If j is taken to be the index of an arbitrary ground point and i to be the index of an arbitrary image, the observation equation (Equation 1) can be written as Equation 3
{tilde over (V)}ij=(1+si)Ti({circumflex over (V)}j−
where {circumflex over (V)}j≡[{circumflex over (x)}j ŷj {circumflex over (z)}j]T as the jth ground point,
is the orientation matrix between image i and the reference space frame and where si is image i scale correction factor. Thus, {tilde over (V)}ij is the coordinate of the ith image's observation of ground point j.
If a particular ground point is found in two or more images, it can serve as a point which ties the images together (one of the tie points 108). These are generically referred to as tie points. A single tie point is often referred to as a collection of image observations (with coordinates) of the same point on the ground along with the corresponding ground point (with coordinates).
Since the observations over many images i are the measurements containing error, the true ground point coordinates are generally unknown. To facilitate this, an initial estimate of the ground point location can be computed. The initial estimate is provided as Equation 5 as the average of the 3D observations
The ground points themselves are treated as derived (but unconstrained) observations and allowed to adjust in performance of the operation 118. There can be an observation of interest whose true ground coordinates are well known. These are classically called ground control points (or GCPs). Since this development can accommodate both GCPs and tie points, the more general terms of “ground point” and “ground point observation” are sometimes used (as contrasted with “tie point ground coordinate” and “tie point observation”).
The bundle adjustment operation 118 can operate on two or more images taken over a same area (with observations for tie points, sometimes called a stare scenario); two or more images taken in strips (forming a mosaic of data, with 2-way, 3-way, or m-way observations in strip overlap regions); tie points in which the corresponding ground points may appear in two or more images, incorporation of GCPs for features in imagery, providing an absolute registration; accommodation of a full covariance for tie point observations. This is conducive for tie point correlation techniques which are highly asymmetrical (e.g., as long as the asymmetry can be characterized as a measurement covariance).
The relationship between ground point coordinate estimates {circumflex over (V)}j and the corresponding image observations {tilde over (V)}ij can be understood as a stare scenario between three misregistered images.
For the development of the LSE formulation (and associated preprocessing) that can be performed by the bundle adjustment operation 118, more definitions are provided in Table 1.
Ground point observations can be indexed by ground point j and image i (as in {tilde over (V)}ij) or by linear indexing, b (as in {tilde over (V)}b). Use of the subscripting depends upon the context. In the former, it is of interest to characterize the fact that a particular ground point j appears on a particular image i. In the latter, it is of interest to enumerate all observations independent of to which image or to which ground point they refer.
Since some 3D point set data is presented in a “world” space coordinate system (e.g., Universal Transverse Mercator (UTM) map projection) and since the observation Equation 3 is image dependent, some coordinate frame definitions and transformations can aid understanding.
If it is assumed that ground point observation locations are specified in world coordinates, it is of interest to transform the ground point observation locations to be “image” relative. Further, it can be of interest to obtain the ground locations and image offsets themselves to be relative to a “local” reference coordinate frame.
A motivation for a local reference coordinate frame can be to remove large values from the coordinates. For example, UTM coordinates can typically be in the hundreds of thousands of meters. This makes interpretation of the coordinates more difficult, for example, when examining a report of updated coordinate locations. A motivation for an image-relative coordinate frame can be so that the interpretation of the orientation angles comprising the Ti matrices can be relative to the center of the data set. This is contrasted with the origin of rotation being far removed from the data set (e.g., coincident with the local reference frame origin in the mosaic scenario).
In both cases, the transformations between coordinate frames simply involve a 3D translation. The mnemonics W, R and I are used to denote the “world”, “reference” and “image” coordinate frames, respectively. To facilitate the transformations, the following convention is established. A superscript on a vector denotes the coordinate frame to which it is referred. Thus {tilde over (V)}ijW corresponds to the world space coordinates of a particular tie point observation, while {tilde over (V)}ijR and {tilde over (V)}ijI represent the same tie point observation but referred to the reference frame and image frame, respectively.
Following the above convention, the symbol can VAB represent “the location of the origin of frame A coordinatized in frame B”. Thus, VRV can represent the location of the reference frame in the world coordinate system (e.g., UTM coordinates of the origin of the reference frame). The relationship between an arbitrary vector VR coordinatized in the reference frame and the same vector VW coordinatized in the world frame can be represented by Equation 6
VW=VR+VRW Equation 6
The reference frame can be established as an average of all of the world-space coordinates of tie points. This offset (denoted VRW) can be determined using Equation 7
For simplicity, it can be assumed that the reference frame origin, referred to by the world frame, can be computed by a process external to the bundle adjustment operation 118 (e.g., by the process that assembles the tie points 108 for use in the bundle adjustment operation 118).
The image frame (e.g., a frame defined on a per-image basis) can be the world coordinates of the center of an image. Under the assumption that there are bounding coordinates in the image data (specifying the min and max extents of the data in world-frame X, Y and Z), the center of the data can thus be taken to be the respective averages of the min and max extents. Since this image frame refers to world space, the computed offset is denoted VIW. If bounding coordinates are not available, value for VIW is taken as the average of the tie point locations over the specific image i, as described in Equation 8
The image frame offset in reference space coordinates is taken to be the initial value for
Since the tie point observation values can be input in world coordinates and since the observation equation domain assumes reference frame coordinates, some preprocessing of the input data can help make it consistent with that assumed by the observation equation (Equations 1 or 3). The tie point observation coordinates can be converted from world space to reference space. This can be performed for each observation per Equation 10.
{tilde over (V)}ijR={tilde over (V)}ijW−VRW Equation 10
Next, since the true ground point coordinates used in Equation 3 can be unknown, they can be estimated. The ground point coordinates can be assumed to be coordinatized in the reference frame. The initial estimated values for the ground coordinates of each tie point can be computed as an average of the ground point observations over all images in which it appears as described by Equation 11
Since the true locations of the tie point ground coordinates can be treated as unknown, the a priori covariance can reflect this by treating the errors in the ground coordinates as numerically unconstrained (in units of meters squared) as described by Equation 12
{umlaut over (Σ)}j≡diag([101210121012]) Equation 12
The tie point observation coordinates for use in the observation equation can be converted to image-relative coordinates using Equation 13.
{tilde over (V)}uj={tilde over (V)}ijR−
Next, a least squares formulation and solution are discussed. Since the observation equation, Equation 1 or 3, is non-linear in the orientation angles that form Ti, the least squares problem becomes a non-linear least squares problem. Equation 3 can be linearized. Solving the linearized equation can be a multidimensional root finding problem (in which the root is the vector of solution parameters).
For simplification in notation of the linearization, consider a fixed image and a fixed ground point. Let the unknown error model parameters (offset, orientation, and scale correction) be represented by Equation 14:
The observation equation for ground point observation V can be written as Equation 15
{tilde over (V)}=(1+s)T({circumflex over (V)}−
where T is the true image orientation matrix,
If one wishes to include the ground point coordinates {circumflex over (V)} as additional observations, the solution for X and {circumflex over (V)} can be cast as a root solving problem based on Equation 16
F(X;{circumflex over (V)})=0 Equation 16
where
F(X;{circumflex over (V)})={tilde over (V)}−(1+s)T({circumflex over (V)}−
In vector form, the function, F, can be represented by Equation 18
The function F can be approximated using a first-order Taylor series expansion of F about initial estimates X(0) and {circumflex over (V)}(0) as in Equation 19
where X(0) is an initial approximation of X, {circumflex over (V)}(0) is an initial approximation of {circumflex over (V)}, the Jacobians
are the partial derivatives of F evaluated at X(0) and {circumflex over (V)}(0) respectively, {dot over (Δ)}≡[Δ
The Jacobians can be written as Equations 20 and 21
Note that the dot symbols are merely notations, following the classical photogrammetric equivalent, and do not intrinsically indicate “rates,” as is sometimes denoted in other classical physics contexts.
In matrix notation, Equation 19 can be written as
Since the problem is nonlinear, the estimation of the parameter vector can be iterated (via a multi-dimensional extension of the Newton-Raphson method for root finding, or other technique). The solution can include relinearization at each iteration. The relinearization can be performed about the most recent estimate of the parameter vector. The linearized form of Equation 22 at iteration (p) can be represented as in Equation 24.
where X(p) is the pth iteration estimate of the parameter vector X, {circumflex over (V)}(p) is the pth iteration estimate of {circumflex over (V)},
is the Jacobian of F with respect to X evaluated at X(p),
is the Jacobian of F with respect to {circumflex over (V)} evaluated at {circumflex over (V)}(p)), {dot over (Δ)} is a vector of corrections to X for the pth iteration, and {umlaut over (Δ)} is a vector of corrections to {circumflex over (V)} for the pth iteration.
With each iteration, the parameter and ground point vectors can be updated with the most recent correction as in Equations 25 and 26.
X(p)=X(p-1)+{dot over (Δ)} Equation 25
{circumflex over (V)}(p)={circumflex over (V)}(p-1)+{umlaut over (Δ)} Equation 26
For the initial iteration, initial values for X(0) and {circumflex over (V)}(0) can be estimated as discussed previously. The system represented by Equation 24 is now linear in {dot over (Δ)} and {umlaut over (Δ)}. A linear solver can be used to solve for the parameters.
For a particular image i and a particular ground point j, Equation 23 can be written as Equation 27
The discrepancy vector for the pth iteration is thus be represented as in Equation 28
εij(p)=−F(X(p);{circumflex over (V)}(p)) Equation 28
and thus
εij(p)=−[{tilde over (V)}ij−(1+si)Ti({circumflex over (V)}j−
To accommodate a simultaneous solution of all images and ground points, Equation 27 can be extended as
Equation 30 can be re-written as Equation 31
BΔ=E Equation 31
then the normal equation matrix can be represented as Equation 32 or Equation 33
(BTB)Δ=BTE Equation 32
ZΔ=H Equation 33
It can be less efficient to form B as in Equation 30, for one or more of the following reasons: (1) B is very sparse; (2) the quantities {dot over (B)}ij and {umlaut over (B)}ij are nonzero if and only if ground point j is observed on image i. For this reason, the classical development of the normal matrix BTB and right-hand side vector BTE uses summations over the appropriate indexing. These summations are provided in the normal matrix partitioning below.
The foregoing equations form a foundation for the present problem that is sufficient for development of the normal equations, examination of the normal matrix structure and formulation of the normal equation solution.
The normal equation can be written as in Equation 34
ZΔ=H Equation 34
The matrices can be partitioned as in Equations 35-37
The quantities {dot over (K)}, {umlaut over (K)}, Ċ and {umlaut over (C)} are described in more details elsewhere herein.
Combining Equations 35, 36 and 37 yields Equation 38
The matrix Z can thus be represented as Equation 39
The matrix {dot over (N)} can be written as Equation 40
and analogously {dot over (W)} can be written as Equation 41
The block entries of {dot over (N)}i can be defined as in Equation 42
The subscripts ij on the {dot over (B)}ij matrices indicate that they are a function of image i and ground point j.
The matrix N can be expanded as in Equation 43
{dot over (W)} can be expanded as in Equation 44:
The block entries of Equation 43 can be defined as in Equation 45
The matrix
The block entries of N from Equation 45 can be defined as in Equation 47
In a similar development the right hand side matrix H from Equation 34 can be expanded as in Equation 48
The subblocks of H can be defined as in Equations 49 and 50
with the discrepancy vector εij defined as in Equation 29 and with
{umlaut over (C)}(p)=Ċ(p-1)−Ċ(0) Equation 51
{umlaut over (C)}(p)={umlaut over (C)}(p-1)−{umlaut over (C)}(0) Equation 52
The values for Ċ(0) and {umlaut over (C)}(0) are the initial parameter values. The initial values for the translation parameters portion of Ċ(0) can be taken to be the
The initial values of {umlaut over (C)}(0) can be taken to be the values of the ground point coordinates {circumflex over (V)}j(0) as computed in accord with Equation 11.
The solution to the normal equation matrix on iteration (p) can be determined as in Equation 53
Δ(p)=Z−1H Equation 53
At each iteration, the parameters can be updated via Equations 51 and Equation 52 and the normal matrix can be formed and solved again. The process can continue until the solution converges. Examples of the convergence criterion can be discussed in the following section.
Since the solution is iterated, a convergence criterion can be established. An example of a convergence criterion is to compute the root-mean-square (RMS) of the residuals as in Equation 54
The value in the denominator of Equation 54 represents the number of degrees of freedom (e.g., the number of observation equations minus the number of estimated parameters).
Since typically q>>7m Equation 54 can be estimated as in Equation 55
The condition q>>7m can be guaranteed with sufficient redundancy of ground point observations as compared with the number of images (e.g., enough tie points are measured between the images so that the aforementioned condition is satisfied).
Convergence happens when the residuals settle to the same values on consecutive iterations. The convergence criterion can be
|R(p)−R(p-1)|<δ Equation 56
where δ is a prescribed tolerance.
A rigorous formulation for the standard error of unit weight (to be used in error propagation discussed elsewhere) is provided in Equation 57
where ndof is the number of degrees of freedom—the number of observation equations minus the number of error model solution parameters:
ndof=q−7m Equation 58
Since blundered points can be effectively removed from the solution via deweighting, the number of observations remaining effectively doesn't include the blunders. To be strictly correct, the value for q in Equation 58 can be the number of non-blundered observations.
The full form of the matrix Equation 34 can be reduced under the assumption that the errors in the ground point locations are uncorrelated. Under this assumption, the error covariance matrix of the ground point locations {umlaut over (Σ)} becomes a block-diagonal matrix of 3×3 matrix blocks. Since it is a sparse matrix, its inverse is easily computed by inverting the 3×3 diagonal blocks. The development in this section reformulates the normal equations taking advantage of this. The result is a reduced normal equation matrix in which the size of the normal matrix is 6m×6m instead of (6m+3n)×(6m+3n). This gives the obvious advantage that the size of the normal matrix is much smaller and remains invariant with the number of ground points.
The reduced system formation is sometimes referred to as a “ground point folding,” since the ground point portion of the reduced normal matrix is incorporated into the image portion. The development of the reduced normal equation begins with the original normal equation from Equation 34 and repeated as Equation 59
ZΔ=H Equation 59
To facilitate ground point folding into a reduced normal equation matrix, Equation 59 can be re-written as Equation 60
where
Ż={dot over (N)}+{dot over (W)} Equation 61
{umlaut over (Z)}={umlaut over (N)}+{umlaut over (W)} Equation 63
{dot over (H)}={dot over (K)}−{dot over (W)}Ċ Equation 64
{umlaut over (H)}={umlaut over (K)}−{umlaut over (W)}{umlaut over (C)} Equation 65
Suppose a matrix system ZΔ=H is partitioned into blocks of the appropriate sizes as
where the matrices A and D are both square.
Further, assume that matrix D is non-singular and can be represented as a sparse block diagonal matrix. Then
[A−BD−1C][a]=[c−BD−1d] Equation 67
Applying Equation 67 to Equation 59 provides the reduced normal matrix equation
[Ż−
The reduced normal equation matrix can be written as in Equation 69
M{dot over (Δ)}=C Equation 69
where M≡[Ż−
Next it is of interest to examine the form of the components of the reduced system for an efficient implementation. Let {circumflex over (Z)}≡
By extension
The blocks of {circumflex over (Z)} in Equation 71 can be the equivalent
The assumption that errors in the a priori ground points are uncorrelated yields Equation 72
where {umlaut over (Σ)}j−1 is the inverse of the a priori covariance matrix for ground point j. Thus
The general row and column term for {circumflex over (Z)} can then be given by
and, by the definition of
The matrix M can thus be written as in Equation 75
The reduced matrix M can be formed by first storing the diagonal entries of Ż and then subtracting the summed entries of the subtrahend in Equation 75 (namely the {umlaut over (Z)}r,c defined in Equation 74).
Since the subblocks of the subtrahend are merely summations over the ground point indexes, j, the matrix, M, can be built by iterating over the ground points (assuming the minuend of Equation 75 on-diagonals were formed in advance) and subtracting out the contributions for a particular ground point in the appropriate place within M.
The constant column vector C can be formed similarly with some of the same matrices:
After the matrices M and C are built, the solution vector for the adjustable parameters from the reduced system can be computed as
{dot over (Δ)}=M−1C Equation 77
The solution vector can be decomposed into per-image-adjustable vectors {dot over (Δ)}i for each image i as in Equation 78:
After the solution vector {dot over (Δ)} for the image-adjustable parameters is obtained, the solution vector {umlaut over (Δ)} for corrections to the ground point positions can be extracted (or “unfolded”) from the reduced system. To formulate the extraction, Equation 67 can be used to obtain Equation 79
represents the correction vector for the ground points then
where {dot over (Δ)}i is the adjustable parameter correction vector for image i. Thus
where Ij is as defined as the index set of images upon which ground point j is an observation.
This section provides formulations for extraction of a posteriori error covariances for ground points. If a priori sensor model error estimates are available (and reliable), the errors may be propagated to the space of the registration error models. In this case, the error propagation is a rigorous predicted error for the accuracy of the a posteriori ground point locations.
The a posteriori error covariances of the image parameters are the appropriate subblocks of the inverse of the reduced normal matrix M−1 from Equation 69 (after application of the variance of unit weight, as described at the end of this section). For the full normal matrix solution, the a posteriori error covariance can be the inverse of the normal matrix, Z−1, times the variance of unit weight. For the reduced system, however, the a posteriori error covariances of the ground points can be extracted from M−1 by unfolding. To facilitate this, the full normal matrix can be written as
Denote the inverse matrix blocks as
Note that, {dot over (Σ)} and {umlaut over (Σ)} as defined are distinctly different from those defined in previous sections. (The symbols in the present section are a posteriori covariances and those in previous sections are a priori covariances). However, this subtle distinction is not problematic if the appropriate context is adhered.
The a posteriori covariance between ground points r and c can be represented as block element
With n as the number or ground points and m as the number of images,
The rth row of {umlaut over (Σ)} involves only {umlaut over (Z)}r−1 of the first {umlaut over (Z)}−1 matrix in term two of Equation 86. Similarly, the cth column of {umlaut over (Σ)} involves only {umlaut over (Z)}c−1 of the second {umlaut over (Z)}−1 matrix in term two. Thus
{umlaut over (Σ)}r,c=δ(r,c){umlaut over (Z)}c−1+{umlaut over (Z)}r−1
where the delta function can be
Now the form of the (r, c) block of
The rth row of G involves only the rth row of
Now
Thus
Tr,c=Σt∈I
where Ij is the index set of images upon which ground point j is an observation. Substituting Equation 91 into Equation 87 yields
{umlaut over (Σ)}r,c=δ(r,c){umlaut over (Z)}c−1+{umlaut over (Z)}r−1[Σt∈I
The a posteriori covariance is usually defined by scaling the inverse of the normal matrix by an estimate of the variance of unit weight. An estimate of the variance of unit weight is denoted as [σ(p)]2 and is provided in Equation 57. Thus, the above formulation can be used, but instead defining
For a full normal matrix solution, Z−1 is readily available, thus the a posteriori covariance of the error model parameters and ground points can be the right hand side of Equation 93.
The right hand summand of Equation 92 includes the factor [σ(p))]2 since it includes {dot over (Σ)}st. However, the left hand summand does not include the factor. This can be compensated for by a modified form of Equation 92
{umlaut over (Σ)}r,c=[σ(p)]2δ(r,c){umlaut over (Z)}c−1+{umlaut over (Z)}r−1[Σt∈I
If the standard error of unit weight σ(p) is deemed to be unreliable (e.g., is much greater than unity) this may be an indicator of improper (or incorrect) a priori error covariance in the process. One can still, however, be able to provide a reliable error estimate from the least squares process by simply forcing the standard error to one (e.g., by setting σ(p)←1.0 in Equations 93 and 94.
The operation 302 can include setting the solution vector X and the correction vector ΔX to the zero vector1: 1 The solution vector X is set to the fixed-point location for the linearization. If an a priori estimate is available, it is used here in place of the zero vector.
The operation 304 can include computing the discrepancy vector for each observation as provided in Equation 29. The operation 306 can include building the normal equations matrices and solving for the correction vector as provided in Equation 53. The operation 308 can include updating the parameter vector for the current iteration as provided in Equations 51 and 52. Details of the operation 308 for unfolding of the ground points for the folded normal equation solution is provided via pseudocode below.
The operation 310 can include computing the residuals (final discrepancies) as provided in Equation 29. The operation 312 can include computing a standard error of unit weight via Equation 57. Note that the standard error can the square root of the left hand side of the Equation 57 (e.g., σ(p)=√{square root over ([σ(p)]2))}.
If the delta between the current and previous standard error of unit weight is less than the convergence criterion in absolute value, the solution nominally converged. To accommodate blunder rejection, the convergence criterion check can be augmented with a check to see if the blunder weights should be used in continuation of the solution (“useBW”, indicating to use “blunder-checking weighting”). If convergence occurs and useBW is true, this is an indicator to perform blunder checking, and this time using a normalized residual computation in order to check for blunders on the next iteration.
If useBW is true, blunders can be computed. If there are blunders remaining, the blunder “cycle” number is incremented and the process is repeated with the correction vector reset to a priori values (e.g., go to operation 302). If there are no blunders remaining, a check can be performed to see if the number of post convergence blunder cycles can be set to zero. This check can be performed to effectively force one more solution after all blunders have been eliminated.
If useBW is false and it is currently a first iteration of a blundering solution, useBW can be set to true. This has the effect of forcing the normalized residual blunder iteration for determining the blunders on subsequent iterations. In this case, a solution has converged but normalized blunder residuals have not been computed. Setting useBW to true can forces this to happen on the next solution iteration. The solution can be iterated by going to the operation 306. If there are no more blunders and the number of blunders is not zero, this indicates the “non-blunder iteration” solution has converged.
The operation 118 can include providing a report that includes an iteration number, current correction vector ΔX, current iteration estimates of parameters and ground points (e.g., as computed in equations 51 and 52), standard error of unit weight (e.g., as provided in Equation 55). The operation 118 can include a check for non-convergence by examining the current iteration number with a maximum number of iterations, M. If the number of iterations exceeds the maximum, stop the iteration process. The solution did not converge. An exception can be raised and the operation 118 can be complete.
The following is a pseudocode outline for the operation 118 for computing the full normal equations solution. This first pseudocode does not include ground point folding.
The names in {braces} allude to method (e.g., function) names in a software implementation. Also, within the development below, ground point indexes, ground point observation indexes and image indexes are assumed to be zero-relative. For efficiency in the implementation, the following elements can be cached in a per-image workspace object, which is updated with each iteration:
The pseudocode begins by setting the non-linear least squares iteration index (p) to zero.
ij: Sum {dot over (B)}ijT{tilde over (W)}ij{umlaut over (B)}ij into Z.block(i, m + j)
What follows is pseudocode for the operation 118 for building the reduced normal equations system, computing corrections to ground point positions and performing error propagation via extraction of data from the reduced normal matrix. This portion of the pseudocode includes ground point coordinate folding.
As in the full normal solution provided in the previous pseudocode, the same per-image elements are cached in a workspace object and updated with each iteration. The algorithm for the reduced solution can be broken into two major portions: priming and folding. Priming involves storing of weights and the contributions along the diagonal of the full normal equation matrix (and corresponding data for the right hand column vector H). This corresponds to the Ż portion of Z. Thus, priming involved formation of the minuends of Equation 75 and Equation 76. Folding can include incorporation of the subtrahends of the aforementioned Equations.
To provide an efficient implementation, a ground point workspace can be created. The workspace can include the following elements: {umlaut over (Z)}j, {umlaut over (H)}j, {umlaut over (Z)}j−1. These things are indexed by ground point for the ground point workspace. The technique can begin by setting the non-linear least squares iteration index (p) to zero.
After the solution vector {dot over (Δ)} is obtained, unfolding the ground point corrections is a matter of employing Equation 80, replicated here for reference:
The full 3n×3n ground error covariance matrix
may be obtained by invoking the method for r∈{1, 2, . . . , n} and for c∈{r, r+1, . . . , n}. Note that the indexing for c starts with r since the full ground covariance matrix is symmetric (i.e., build the upper triangle of
and “reflect about the diagonal” to obtain the lower symmetric portion).
What follows regards how to perform operation 314. The operation 314 proceeds given the outputs of the LSE techniques discussed. The compensation applies the inverse of the observation equation, accommodating the various relative frame offsets to arrive at compensated world space coordinates from misregistered world space coordinates
VregW=Compensatei(VmisregW) Equation 106
The motivation for providing the inputs and outputs in world space coordinates can be that is the native space of the inputs and desired space of the outputs for each element of each image's point cloud.
For an arbitrary misregistered vector VmisregW on image i, the compensation formula can be performed as in Equation 107
where Ti is constructed from the solution vector θi(p) and the other symbols in Equation 107 are defined elsewhere. Note that the values for −
The example computer system 400 includes a processor 402 (e.g., a central processing unit (CPU), a graphics processing unit (GPU) or both), a main memory 404 and a static memory 406, which communicate with each other via a bus 408. The computer system 400 may further include a video display unit 410 (e.g., a liquid crystal display (LCD) or a cathode ray tube (CRT)). The computer system 400 also includes an alphanumeric input device 412 (e.g., a keyboard), a user interface (UI) navigation device 414 (e.g., a mouse), a mass storage unit 416, a signal generation device 418 (e.g., a speaker), a network interface device 420, and a radio 430 such as Bluetooth, WWAN, WLAN, and NFC, permitting the application of security controls on such protocols.
The mass storage unit 416 includes a machine-readable medium 422 on which is stored one or more sets of instructions and data structures (e.g., software) 424 embodying or utilized by any one or more of the methodologies or functions described herein. The instructions 424 may also reside, completely or at least partially, within the main memory 404 and/or within the processor 402 during execution thereof by the computer system 400, the main memory 404 and the processor 402 also constituting machine-readable media.
While the machine-readable medium 422 is shown in an example embodiment to be a single medium, the term “machine-readable medium” may include a single medium or multiple media (e.g., a centralized or distributed database, and/or associated caches and servers) that store the one or more instructions or data structures. The term “machine-readable medium” shall also be taken to include any tangible medium that is capable of storing, encoding or carrying instructions for execution by the machine and that cause the machine to perform any one or more of the methodologies of the present invention, or that is capable of storing, encoding or carrying data structures utilized by or associated with such instructions. The term “machine-readable medium” shall accordingly be taken to include, but not be limited to, solid-state memories, and optical and magnetic media. Specific examples of machine-readable media include non-volatile memory, including by way of example semiconductor memory devices, e.g., Erasable Programmable Read-Only Memory (EPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks.
The instructions 424 may further be transmitted or received over a communications network 426 using a transmission medium. The instructions 424 may be transmitted using the network interface device 420 and any one of a number of well-known transfer protocols (e.g., HTTP). Examples of communication networks include a local area network (“LAN”), a wide area network (“WAN”), the Internet, mobile telephone networks, Plain Old Telephone (POTS) networks, and wireless data networks (e.g., WiFi and WiMax networks). The term “transmission medium” shall be taken to include any intangible medium that is capable of storing, encoding or carrying instructions for execution by the machine, and includes digital or analog communications signals or other intangible media to facilitate communication of such software.
The subject matter can be understood by way of some examples.
Example 1 can include a method for registering a first three-dimensional (3D) image to a second 3D image with error propagation, the method comprising reducing a sum aggregate of discrepancies between respective tie points and associated 3D points in the first and the second 3D images; adjusting 3D error models of the first and second 3D images based on the reduced discrepancies to generate registered 3D images, and propagating an error of the first or second 3D images to the registered 3D image to generate error of the registered 3D images.
In Example 2, Example 1 can further include conditioning the error of the first and second 3D images before propagating the error.
In Example 3, Example 2 can further include, wherein the conditioned error includes errors in translation in x, errors in translation in y, errors in translation in z, errors in roll, errors in yaw, errors in pitch, and errors in scale between the first and second images.
In Example 4, at least one of Examples 1-3 can further include, wherein reducing the sum aggregate of discrepancies includes using a least squares estimator between the tie points and the associated 3D points in the first and second 3D images.
In Example 5, at least one of Examples 1-4 can further include, wherein the tie points include respective tie point errors and reducing the sum aggregate of discrepancies is further determined based on the tie point errors.
In Example 6, at least one of Examples 1-5 can further include, wherein the error of the registered 3D images is an improvement over the initial errors of the first 3D image and the second 3D image.
In Example 7, at least one of Examples 1-6 can further include, wherein reducing the discrepancies include folding ground points associated with the tie points into a representation of the first and second 3D images.
In Example 8, Example 7 can further include unfolding corrections to the ground points from the representation.
In Example 9, at least one of Examples 1-8 can further include, wherein the first 3D image and the second 3D image are views of first and second geographical regions, respectively, that overlap.
Example 10 can include a non-transitory machine-readable medium including instructions that, when executed by a machine, cause a machine to perform operations for registering a first three-dimensional (3D) image to a second 3D image with error propagation, the operations comprising reducing a sum aggregate of discrepancies between respective tie points and associated 3D points in the first and the second 3D images, adjusting 3D error models of the first and second 3D images based on the reduced discrepancies to generate registered 3D images, and propagating an error of the first or second 3D images to the registered 3D image to generate error of the registered 3D images.
In Example 11, Example 10 can further include, wherein the operations further comprise conditioning the error of the first and second 3D images before propagating the error.
In Example 12, Example 11 can further include, wherein the conditioned error includes errors in translation in x, errors in translation in y, errors in translation in z, errors in roll, errors in yaw, errors in pitch, and errors in scale between the first and second images.
In Example 13, at least one of Examples 10-12 can further include, wherein reducing the sum aggregate of discrepancies includes using a least squares estimator between the tie points and the associated 3D points in the first and second 3D images.
In Example 14, at least one of Examples 10-13 can further include, wherein the tie points include respective tie point errors and reducing the sum aggregate of discrepancies is further determined based on the tie point errors.
In Example 15, at least one of Examples 10-14 can further include, wherein the error of the registered 3D images is an improvement over the initial errors of the first 3D image and the second 3D image.
Example 16 can include a memory including first and second three-dimensional (3D) images of first and second geographical regions stored thereon, and processing circuitry coupled to the memory, the processing circuitry configured to reduce a sum aggregate of discrepancies between respective tie points and associated 3D points in the first and the second 3D images, adjust 3D error models of the first and second 3D images based on the reduced discrepancies to generate registered 3D images, and propagate an error of the first or second 3D images to the registered 3D image to generate error of the registered 3D images.
In Example 17, Example 16 can further include, wherein the processing circuitry is further configured to reduce the discrepancies by folding ground points associated with the tie points into a representation of the first and second 3D images.
In Example 18, Example 17 can further include, wherein the processing circuitry is further configured to unfold corrections to the ground points from the representation.
In Example 19, at least one of Examples 16-18 can further include, wherein the first 3D image and the second 3D image are views of first and second geographical regions, respectively, that overlap.
In Example 20, at least one of Examples 16-19 can further include, wherein the processing circuitry is further configured to condition the error of the first and second 3D images before propagating the error and wherein the conditioned error includes errors in translation in x, errors in translation in y, errors in translation in z, errors in roll, errors in yaw, errors in pitch, and errors in scale between the first and second images.
Although an embodiment has been described with reference to specific example embodiments, it will be evident that various modifications and changes may be made to these embodiments without departing from the broader spirit and scope of the invention. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense. The accompanying drawings that form a part hereof, show by way of illustration, and not of limitation, specific embodiments in which the subject matter may be practiced. The embodiments illustrated are described in sufficient detail to enable those skilled in the art to practice the teachings disclosed herein. Other embodiments may be utilized and derived therefrom, such that structural and logical substitutions and changes may be made without departing from the scope of this disclosure. This Detailed Description, therefore, is not to be taken in a limiting sense, and the scope of various embodiments is defined only by the appended claims, along with the full range of equivalents to which such claims are entitled.
This invention was made with Government support under government contract HM047618C0039. The Government has certain rights in this invention.
Number | Name | Date | Kind |
---|---|---|---|
6757445 | Knopp | Jun 2004 | B1 |
7310440 | Dolloff | Dec 2007 | B1 |
9245201 | Jin et al. | Jan 2016 | B1 |
9947128 | Korb | Apr 2018 | B2 |
20030202701 | Schuler | Oct 2003 | A1 |
20090175498 | Kochi | Jul 2009 | A1 |
20140334686 | LaFarelle | Nov 2014 | A1 |
20150371431 | Korb | Dec 2015 | A1 |
20160314593 | Metzler | Oct 2016 | A1 |
20210019937 | Gallaway | Jan 2021 | A1 |
20210097280 | Sharp, III | Apr 2021 | A1 |
20210256722 | Staab | Aug 2021 | A1 |
Number | Date | Country |
---|---|---|
201839722 | Nov 2018 | TW |
201915953 | Apr 2019 | TW |
201928883 | Jul 2019 | TW |
Entry |
---|
“International Application Serial No. PCT/US2021/019397, International Search Report dated May 14, 2021”, 5 pgs. |
“International Application Serial No. PCT/US2021/019397, Written Opinion dated May 14, 2021”, 6 pgs. |
Blasone, Giacomo, et al., “Monitoring sediment source areas in a debris-flow catchment using terrestrial laser scanning”, Catena, Elsevier, Amsterdam, NL, vol. 123, (Jul. 28, 2014), 23-36. |
Pritt, Mark D, et al., “Georegistration of motion imagery with error propagation”, Full Motion Video (FMV) Workflows And Technologies For Intelligence, Surveillance, And Reconnaissance (ISR) And Situational Awareness, SPIE, (May 11, 2012). |
Wang, Tiesheng, et al., “Research on registration method and precision in terrestrial 3D laser scanning”, Proceedings of SPIE, IEEE, US, vol. 9808, (Dec. 9, 2015), 980800-980800. |
Yu-Bin, Liang, et al., “Semiautomatic Registration of Terrestrial LaserScanning Data Using PerspectiveIntensity Images”, IEEE Geoscience And Remote Sensing Letters, IEEE Service Center, New York, NY, US, vol. 14, No. 1, (Jan. 1, 2017), 28-32. |
“Taiwanese Application Serial No. 110105974, First Office Action dated Dec. 30, 2021”, with English translation, 20 pgs. |
Number | Date | Country | |
---|---|---|---|
20210271854 A1 | Sep 2021 | US |