1. Field of the Invention
The present disclosure relates to image processing, and more particularly to a system and method for registering objects.
2. Description of Related Art
Referring to the process of aligning data from liquid chromatography and mass spectrometry (LC-MS) and 2D electrophoretic gel (2DE) experiments, in 2DE, proteins form a set of “spots” on a gel. Software is used to identify and quantify the spots, giving each spot a characteristic mass, isoelectric point, and intensity. This is analogous to the mass-charge ratio, retention time, and intensity values found in LC-MS maps. Popular packages for comparing the sets of spots between 2DE gels include Flicker, CAROL, Delta2D (a product of Decodon GmbH), and Melanie. The exact comparison mechanism differs in each package, but these software packages are designed for pairwise comparisons of gel images. Typically, the user is expected to compare a new gel to a reference database of 2DE gels.
There has been related algorithmic work conducted in the two-dimensional case, where the bounded error model yields rectangles, which may be used in building rectangle overlap graphs.
The clique building approach in two dimensions requires that all maximal cliques of a rectangle overlap graph be found. This problem has been addressed by a number of authors in the statistical estimation literature, where finding all maximal cliques in a rectangle overlap graph is a subproblem for maximum likelihood estimation with respect to bivariate interval censored data. The algorithms from the literature fall into two categories, those that merely describe the rectangular regions of mutual overlap defined by the maximal cliques (type I), and those that explicitly compute each rectangle's membership in the maximal cliques (type II).
An exemplary type I algorithm finds the rectangular regions defined by the maximal cliques in $O(n^2)$ time. This result is one in a line of proposed solutions, where others have implemented a type I algorithm in $O(n^3)$ time and $O(n^5)$ time, and a type II algorithm in $O(n^5)$ time.
General clustering algorithms are usually not suited to the bounded error model of data alignment. Some require prior knowledge of the total number of objects, which is not available to us. Others require various other parameters whose selection is less obvious than the error bounds derived from the sensors.
Therefore, a need exists for a system and method for registering objects.
According to an embodiment of the present disclosure, a computer-implemented method for registering objects of interest across a plurality of data acquisition types includes providing image data including the objects of interest corresponding to the plurality of data acquisition types, providing a plurality of constraints on groups which may be determined for the objects of interest, determining a set of possible groupings of the objects of interest according to the plurality of constraints, searching the set of possible groupings for groupings of the objects of interest according to an optimization function, and storing the groupings of the objects of interest to a computer-readable media.
The plurality of constraints are error bounds on sensor data corresponding to a detection of each of the objects of interest. Determining the set of possible groupings of the objects of interest is performed according to a bounded error model of the error bounds corresponding to the objects of interest. Searching determines which grouping from the set of possible groupings best satisfies the optimization function.
Providing the plurality of constraints on groups comprises converting a plurality of features of the image into boxes in d-dimensional space, wherein d is greater than 2, and wherein the plurality of constraints are implemented as a box for each feature, the box representing error bounds on sensor data corresponding to a detection of the features. Determining the set of possible groupings of the objects of interest according to the plurality of constraints comprises determining a set of mutually-intersecting boxes.
According to an embodiment of the present disclosure, a computer-implemented method for registering objects of interest across a plurality of data acquisition types includes inputting image data including features, the image data including inputs corresponding to the plurality of data acquisition types, providing a plurality of constraints on groups which may be determined for the features, determining a set of possible groupings of the features according to the plurality of constraints, searching the set of possible groupings for groupings of objects of interest according to an optimization function, wherein the set of possible groupings includes groupings of the objects of interest and groupings of features that do not correspond to the objects of interest, and storing the groupings of the objects of interest to a computer-readable media.
Providing the plurality of constraints on groups comprises converting the features into boxes in d-dimensional space, wherein d is greater than 2, and wherein the plurality of constraints are implemented as a box for each feature, the box representing error bounds on sensor data corresponding to a detection of the features. Determining the set of possible groupings of the features is performed according to a set of mutually-intersecting boxes of the features.
Searching determines and removes groupings violating transitivity.
According to an embodiment of the present disclosure, a program storage device is provided readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for registering objects of interest across a plurality of data acquisition types. The method steps include providing image data including the objects of interest corresponding to the plurality of data acquisition types, providing a plurality of constraints on groups which may be determined for the objects of interest, determining a set of possible groupings of the objects of interest according to the plurality of constraints, searching the set of possible groupings for groupings of the objects of interest according to an optimization function, and storing the groupings of the objects of interest to a computer-readable media.
Preferred embodiments of the present invention will be described below in more detail, with reference to the accompanying drawings:
According to an embodiment of the present disclosure, a method registers objects of interest across multiple data acquisitions. The method has features of a clustering algorithm and is specialized for data that fits a certain model of uncertainty. Constraints are placed on the way in which objects can be grouped. The method finds a set of possible groupings, which obey the constraints, and searches only over the restricted set of solutions.
According to an embodiment of the present disclosure, a model assumes that the attributes of the objects are being measured by imperfect sensors making stochastic errors, and that the objects are static and otherwise anonymous. Each attribute is represented as a real number with an error bar placed around it. Objects can be considered identical if and only if their attributes are approximately the same, as defined by the intersection of respective error bars.
Under this model, an acquisition is a set of sensor measurements on a set of objects. The observations from the acquisitions are organized into sets corresponding to the same object under the simplifying assumption that two objects that may be identical based on sensor observations are identical in the absence of ambiguity or other evidence to the contrary.
According to an embodiment of the present disclosure, a system and/or method includes components including a module/method to efficiently find all maximal sets of potentially identical objects; a framework to resolve the ambiguous cases that arise in these sets, such as when an object appears in more than one set; and a module/method to heuristically help the user select the error bounds under certain assumptions.
The method implements a paradigm for registering objects of interest across multiple data acquisitions. The paradigm is a clustering method specialized to a data model with certain underlying assumptions.
In general, clustering algorithms partition a set of objects into groups based on a notion of similarity. These groups are supposed to contain objects that are “closely related”, a phrase whose formal definition varies depending on the method. The clustering algorithm produces these groups by an implicit or explicit optimization method. Further, according to an embodiment of the present disclosure, hard constraints are placed on the way in which objects can be grouped. These constraints are designed so that the method can efficiently describe all possible solutions that obey the constraints. An optimization method is used to select a best solution. The number of possible solutions under the constraints is typically smaller than all possible partitions of the data, making the method fast for many large problems and allowing more costly optimization schemes for the constrained optimization phase of the method.
Hence, systems and methods are described in terms of experiments, which measure properties of anonymous objects, for determining, based solely on the measurements, which objects are identical across the set of experiments.
The Model
In the model, an object refers to a physical object to be observed in some experiments. Let $U = \{O_1, O_2, \ldots, O_M\}$ denote the universe of objects of interest in the experiments. The experiments have the effect of observing the objects through a set of $d$ sensors.
It is assumed that the experiments are processed independently through an imperfect method to detect the objects. Let $\mathcal{E} = \{E_1, E_2, \ldots, E_K\}$ be the set of $K$ experiments. Each experiment $E_k$ is itself a set of features, denoted $f_k(i)$ where $1 \le i \le |E_k|$, that have been detected and are believed to correspond to objects. However, this correspondence is not necessarily one-to-one. Some features may be spurious (false positives) and do not correspond to real objects; in other cases no feature is detected corresponding to a particular object in an experiment. This may either be due to a false negative, or the object being absent in that particular experiment. In the case where the feature $f_k(i)$ does correspond to an object $O_m$, we define $\pi(f_k(i)) = \{O_m\}$; otherwise, $\pi(f_k(i)) = \emptyset$.
Further, the features are modeled as being subject to stochastic noise. Each feature $f_k(i)$ is represented as a vector in $\mathbb{R}^d$, whose entries correspond to readings from the $d$ sensors. The reading from sensor $j$ is denoted $f_k(i)[j]$, and is assumed to be subject to both the inherent inaccuracy of the sensor and the imprecision of the feature construction method.
Instead of modeling this inaccuracy directly, for example, by imposing some distribution upon it, the following hard constraint is imposed. Let $\epsilon_j$ be an error bound on the features in dimension $j$ that maps a real-valued reading to an interval of real numbers. More specifically, $\epsilon_j$ maps a reading $f[j]$ of a feature $f$ to an interval $[\epsilon_j^l(f[j]), \epsilon_j^r(f[j])]$. Typically, $f[j] \in \epsilon_j(f[j])$, although this is not technically needed.
Axiom 1 (Bounded Measurement Error). For all features $f_1, f_2 \in \bigcup_{k=1}^{K} E_k$ such that $\pi(f_1) \cap \pi(f_2) \ne \emptyset$, and for all $j$, $1 \le j \le d$, we can select $\epsilon_j$ such that $\epsilon_j(f_1[j]) \cap \epsilon_j(f_2[j]) \ne \emptyset$.
The error bounds form intervals on the real number line that contain each sensor measurement. For two features to be observations of the same object, the intervals associated with each corresponding sensor measurement must intersect. Two features which satisfy these constraints are said to be compatible.
Example. Suppose that a device measures the mass of a molecule with an accuracy of ±1%. In one experiment, the molecule is observed to have a mass of 1000 AMU. Its actual mass must then lie in the interval [990.1, 1010.1], since 990.1+0.01(990.1)≈1000 and 1010.1−0.01(1010.1)≈1000.
Now suppose that a molecule with a mass of 998 were observed in a subsequent experiment. Under the same model, the true mass of the second molecule would lie in the interval [988.1, 1008.1]. The intersection of the intervals for the two observations is [990.1, 1008.1], meaning that a molecule in this mass range could explain both observations and hence the features are compatible.
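The arithmetic of this example can be reproduced with a short sketch. It assumes, as the example does, that the error is a fixed fraction of the true value; the function names `error_interval` and `intersect` are hypothetical and chosen only for illustration.

```python
from typing import Optional, Tuple

Interval = Tuple[float, float]

def error_interval(reading: float, rel_accuracy: float = 0.01) -> Interval:
    """True values x that could produce `reading` when the error is at most rel_accuracy * x."""
    low = reading / (1.0 + rel_accuracy)   # low + rel_accuracy * low == reading
    high = reading / (1.0 - rel_accuracy)  # high - rel_accuracy * high == reading
    return (low, high)

def intersect(a: Interval, b: Interval) -> Optional[Interval]:
    """Intersection of two closed intervals, or None when they are disjoint."""
    low, high = max(a[0], b[0]), min(a[1], b[1])
    return (low, high) if low <= high else None

first = error_interval(1000.0)     # roughly (990.1, 1010.1)
second = error_interval(998.0)     # roughly (988.1, 1008.1)
overlap = intersect(first, second) # non-empty, so the two features are compatible
```

With several sensors, two features would be compatible only if this intersection is non-empty for every sensor.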
Note that this example demonstrates a feature of the model: the objects and features are anonymous; there is no way to know for certain, within the context of the model, whether two features are truly observations of the same entity. Instead, the model simply provides a definition under which two features are potentially the same. Attempting to determine which features are the same given the features' anonymity requires solving a potentially complex optimization problem. However, assuming that compatibility implies identity allows for substantially reducing the search space for this optimization problem.
There are cases in which features are compatible under the model and yet may not be considered identical. One involves violations of transitivity, a property that must hold for an identity relationship. If features f1 and f2 are compatible and features f2 and f3 are compatible, then there are two possibilities: either f1 and f3 are compatible, in which case all three features may be considered identical, or at least one of π(f1)∩π(f2)=Ø and π(f2)∩π(f3)=Ø holds. Another case in which compatible features may not be identical occurs when features f1 and f2 are compatible, but both f1 and f2 derive from the same experiment Ek. How this case is handled is application-dependent; the experiment may or may not be able to make multiple observations of the same object.
Approach
Based on the model above, it is assumed that a set of K experiments are given, each being a set of features. Each feature, in turn, is a vector of d numbers. Given this, the set of all features may be partitioned into subsets that correspond to the same underlying object. No interest is taken in those features that do not correspond to objects, as long as they do not end up in sets with features that do correspond to objects.
Referring to
Applications
In summary, the model and approach (described subsequently) apply at least to problems with the following features:
It is worth pointing out some common tasks which do not fit the model without some modification or preprocessing. These include data in which the sensor error is biased (for example, due to miscalibration of an instrument), data in which the objects' properties change from experiment to experiment (for example, tracking moving objects; the position is changing), data in which the feature attributes are not metric, and data in which the feature attributes are not independent and/or must be considered in combination when determining similarity.
A Fast Method for Finding All Maximal Mutually-Intersecting Sets of Boxes in d-Dimensional Space
This section addresses the problem of finding all maximal sets of mutually compatible features. Assume that the error function $\epsilon_j$, $1 \le j \le d$, has been chosen for every sensor, so that the features can be converted into boxes in $d$-dimensional space. Also, note that if a feature is missing a particular sensor observation, it can be assigned an interval which spans the space of observations.
Problem Statement
Let $\beta$ be a set of $n$ iso-boxes in $\mathbb{R}^d$. Each box $B_i \in \beta$ can be represented as a set of $d$ non-empty intervals, each denoted $X_j(B_i)$, $1 \le j \le d$, and called the extent of $B_i$ in dimension $j$. We adopt the convention that each interval is closed on the left and right, and the notation $X_j(B_i) = [x_j^l(B_i), x_j^r(B_i)]$. When a discussion centers on a particular box, the $B_i$ may be omitted, with $X_j$ denoting the extent in dimension $j$ and $[x_j^l, x_j^r]$ the corresponding interval.
For clarity of presentation, we impose the condition that there be a unique ordering of all interval end points in each dimension. In other words, for each dimension $j$, there cannot be distinct boxes $B$ and $B'$ such that $x_j^l(B) = x_j^l(B')$, $x_j^r(B) = x_j^r(B')$, or $x_j^l(B) = x_j^r(B')$. This condition is no burden in practice, because a consistent scheme of handling ties can easily be devised.
The term clique is used to refer to a set of mutually intersecting boxes. That is, if $C = \{B_1, B_2, \ldots, B_m\}$ is a clique, then $\forall j$, $1 \le j \le d$, $\bigcap_{i=1}^{m} X_j(B_i) \ne \emptyset$. The implication is that each clique has an area of intersection in $\mathbb{R}^d$ which is itself a box. The box is possibly degenerate in one or more dimensions, but this has no practical effect on our method. We denote the area of intersection for clique $C$ as box $A_C$ and borrow corresponding notation to say that the extent of $A_C$ in dimension $j$ is $X_j(A_C)$, where
$X_j(A_C) = \bigcap_{B \in C} X_j(B)$.
It is also worth noting that for each dimension $j$, $x_j^l(A_C) = \max_{B \in C} x_j^l(B)$
and $x_j^r(A_C) = \min_{B \in C} x_j^r(B)$.
A clique $C$ is maximal if and only if there does not exist a box $B \in \beta - C$ such that $C \cup \{B\}$ is a clique. Given the set $\beta$, it is possible to explicitly find all maximal cliques occurring in $\beta$.
Solution
Let G(β) be an undirected graph such that there is a vertex corresponding to each box in β and an edge between every pair of intersecting boxes. Such a graph is called the box intersection graph, and there is an obvious correspondence between the maximal cliques in this graph and the maximal cliques defined in our problem statement. However, no attempt is made to explicitly create G(β) and pursue a graph-theoretic approach to finding the cliques; instead, an approach based on computational geometry is used.
For all $d > 1$, the slice operator on box $B$ at $x$, $S_d(B, x)$, is defined as the projection of $B$ into $\mathbb{R}^{d-1}$ obtained by dropping $X_d$ if $x \in X_d$, or $\emptyset$ otherwise. More formally, let $B'$ be a box in $\mathbb{R}^{d-1}$ where $\forall j$, $1 \le j \le d-1$, $X_j(B') = X_j(B)$. Let $x \in \mathbb{R}$, and define $S_d(B, x) = B'$ if $x \in X_d$, or $\emptyset$ otherwise.
The slice set of box $B_i$, $S_i^d$, may be defined as follows:
$S_i^d = \{S_d(B_j, x_d^r(B_i)) : S_d(B_j, x_d^r(B_i)) \ne \emptyset, 1 \le j \le n\}$
Informally, $S_i^d$ is the set of boxes in $\beta$ which intersect the hyperplane in $\mathbb{R}^d$ normal to dimension $d$ at $x_d^r(B_i)$, projected down onto that hyperplane. The effect of the projection is to eliminate dimension $d$.
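The slice operator and slice set can be illustrated with a minimal sketch. It assumes that a box is stored as a tuple of closed (left, right) intervals, one per dimension, with the last entry playing the role of dimension d; the names `slice_box` and `slice_set` are hypothetical. The slice set is returned as a mapping from box index to projected box so that each projection stays associated with its original box, which is an illustrative choice rather than something the text prescribes.

```python
from typing import Dict, Optional, Sequence, Tuple

Interval = Tuple[float, float]
Box = Tuple[Interval, ...]  # one closed interval per dimension

def slice_box(box: Box, x: float) -> Optional[Box]:
    """Slice operator: drop the last dimension if x lies in its extent, else None."""
    low, high = box[-1]
    return box[:-1] if low <= x <= high else None

def slice_set(boxes: Sequence[Box], i: int) -> Dict[int, Box]:
    """Slice set of box i: every box cut by the hyperplane at the right end of box i."""
    x = boxes[i][-1][1]  # right end point of box i in the last dimension
    result: Dict[int, Box] = {}
    for j, box in enumerate(boxes):
        projected = slice_box(box, x)
        if projected is not None:
            result[j] = projected
    return result

boxes = [((0, 4), (0, 2)), ((1, 5), (1, 3)), ((2, 6), (4, 5))]
print(slice_set(boxes, 0))
```

In this usage example the hyperplane sits at 2 in the second dimension, so only the first two boxes survive and each is reduced to its extent in the first dimension.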
The slice set concept and a small set of lemmas are used to propose a recursive method for finding the maximal cliques of $\beta$; the recursion proceeds on the number of dimensions, and the base case is reached when $d=1$ (or, optionally, when $d=2$). Later, a direct, efficient method is given for the case where $d=1$, and another is referenced for the case where $d=2$.
Lemma 1. Let $C$ be a maximal clique of $G(\beta)$. Then $C$ is a maximal clique of $G(S_i^d)$ for some $B_i \in C$.
Proof: Let $B_i \in C$ be the box with minimum $x_d^r(B_i)$. Since $C$ is a clique, it must be the case that for all $B \in C$, $x_d^l(B) < x_d^r(B_i)$. Furthermore, by the choice of $B_i$, $x_d^r(B) \ge x_d^r(B_i)$. Therefore, all elements of $C$ occur in $S_i^d$. It is easy to see that by the definition of a clique, all elements of a clique in $\mathbb{R}^d$ must form a clique in their first $d-1$ dimensions; hence, the elements of $C$ form a clique in $S_i^d$. It remains to show that $C$ is maximal with respect to $S_i^d$. If there were some other box $B'$ that were in $S_i^d$ and could be added to $C$, then this box would also intersect all boxes of $C$ in dimension $d$ at $x_d^r(B_i)$, and hence $C$ would not be maximal in $G(\beta)$.
For convenience, the set of maximal cliques of $G(\beta)$ is denoted $\mathcal{C}$, and the set of maximal cliques in $G(S_i^d)$ that contain $B_i$ is denoted $\mathcal{C}_i^d$. The consequence of Lemma 1, stated succinctly as $\mathcal{C} \subset \bigcup_{i=1}^{n} \mathcal{C}_i^d$, shows how to proceed toward finding the maximal cliques of $\beta$:
Step 1. If $d=1$, calculate the maximal cliques of $\beta$ directly.
Step 2. Otherwise, calculate each $S_i^d$ and recursively find the corresponding $\mathcal{C}_i^d$.
Step 3. Filter out those elements of $\mathcal{C}_i^d$ which are not maximal with respect to $G(\beta)$.
Referring to Step 1, an exemplary algorithm is given below with respect to finding cliques in one dimension. Since each $S_i^d$ is simply a set of boxes in $\mathbb{R}^{d-1}$, Step 2 is a straightforward recursive usage of the algorithm. The subtle catch is that only those cliques containing $B_i$ are retained for Step 3. Conceptually this can be accomplished by simple post-processing of the result of the recursive application, although an implementation that does not construct cliques that do not contain $B_i$ in the first place will be more efficient.
The remainder of this section focuses on Step 3. Step 3 depends on the construction and processing of the slice sets in a particular order. Let $P_d$ be the set of all interval end points in dimension $d$; that is, $P_d = \bigcup_{i=1}^{n} \{x_d^l(B_i), x_d^r(B_i)\}$. Let $\vec{P}_d$ be a vector of length $2n$ containing the elements of $P_d$ sorted in increasing order, recalling that for simplicity we assume that all elements of $P_d$ are unique.
Let L be a data structure representing a set of boxes. The data structure must support fast insertion and deletion of elements, and enumeration of all elements of the set in O(|L|) time. Examples of such a data structure would be a balanced binary tree or hash table that uses the index i of each Bi as a key.
The slice sets are enumerated by considering each member $x$ of $\vec{P}_d$ in increasing order. There are two cases for each $x$: either $x = x_d^l(B_i)$ or $x = x_d^r(B_i)$ for some $B_i \in \beta$, meaning that either $x$ is the start of $B_i$ in a left-to-right sweep of dimension $d$, or the end. Suppose $x$ is the start of $B_i$. In this case, $B_i$ is inserted into $L$. If $x$ is the end of $B_i$, then $L$ contains exactly those boxes in $S_i^d$. $S_i^d$ is extracted, $B_i$ is removed from $L$, and $S_i^d$ is recursively processed to generate $\mathcal{C}_i^d$. The following lemma demonstrates why it is useful to generate and process the slice sets in this order.
Lemma 2. Let $C' \in \mathcal{C}_j^d$ be a maximal clique of $G(S_j^d)$ that is not maximal with respect to $G(\beta)$. Then there exists a clique $C \in \mathcal{C}_i^d$ with $x_d^r(B_i) < x_d^r(B_j)$ such that $C' \subset C$.
Proof: By Lemma 1, a maximal clique $C$ of $G(\beta)$ that contains $C'$ must be contained in some $\mathcal{C}_i^d$. Suppose that $x_d^r(B_i) > x_d^r(B_j)$. By definition $B_j \in C'$, but $B_j \notin C$ because $S_d(B_j, x_d^r(B_i))$ must be $\emptyset$. This contradicts $C' \subset C$. Hence, it must be the case that $x_d^r(B_i) < x_d^r(B_j)$.
Thus, as long as the cliques are considered in increasing order of $x_d^r$, it can be guaranteed that all cliques found in the slice sets that are not maximal with respect to $G(\beta)$ will be observed after their containing maximal clique. Testing for clique containment is accomplished via the computational geometry result of the next lemma.
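The recursion, the sweep over end points, and the filtering step can be put together in a compact sketch. Several choices here are assumptions made for illustration and are not taken from the text: boxes are passed as a dictionary from an integer identifier to a tuple of closed intervals, ties are broken by processing left end points before right end points, cliques that do not contain the ending box are discarded by post-processing, and containment is checked with a direct subset test on identifiers instead of a geometric test. The name `maximal_cliques` is hypothetical.

```python
from typing import Dict, FrozenSet, List, Tuple

Interval = Tuple[float, float]
Box = Tuple[Interval, ...]  # one closed interval per dimension

def maximal_cliques(boxes: Dict[int, Box]) -> List[FrozenSet[int]]:
    """Maximal sets of mutually intersecting boxes, by recursion on the dimension."""
    if not boxes:
        return []
    d = len(next(iter(boxes.values())))
    events = []
    for key, box in boxes.items():
        events.append((box[d - 1][0], 0, key))  # 0 marks a left end point
        events.append((box[d - 1][1], 1, key))  # 1 marks a right end point
    events.sort()  # at equal coordinates, left end points come first

    active = set()                    # the set L of boxes currently cut by the sweep
    found: List[FrozenSet[int]] = []
    prev_kind = 1                     # convention: the sweep starts "after a right end"
    for _, kind, key in events:
        if kind == 0:
            active.add(key)
        else:
            if d == 1:
                # Base case: a right end point directly after a left end point
                # means the active set is a maximal clique of intervals.
                if prev_kind == 0:
                    found.append(frozenset(active))
            else:
                # The active boxes form the slice set; recurse on d - 1 dimensions.
                sliced = {k: boxes[k][:-1] for k in active}
                for cand in maximal_cliques(sliced):
                    # Keep cliques containing the ending box that are not
                    # contained in a clique reported earlier in the sweep.
                    if key in cand and not any(cand <= c for c in found):
                        found.append(cand)
            active.discard(key)
        prev_kind = kind
    return found

example = {
    0: ((0, 2), (0, 2)),
    1: ((1, 3), (1, 3)),
    2: ((2.5, 4), (0.5, 2.5)),
    3: ((10, 11), (10, 11)),
}
print(maximal_cliques(example))
```

In the usage example, boxes 0 and 1 overlap, boxes 1 and 2 overlap, boxes 0 and 2 do not, and box 3 is isolated, so three maximal cliques are reported. The subset test is quadratic in the number of reported cliques and is used only to keep the sketch short.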
Lemma 3. Let $C \in \mathcal{C}$ be a maximal clique of $G(\beta)$. The clique $C' \subset C$ if and only if $A_C \subset A_{C'}$.
Proof: Suppose first that $C' \subset C$. It follows immediately from the comments in Section 2.1 on areas of intersection that $A_C \subset A_{C'}$. Therefore, the centroid of $A_C$ is contained in $A_{C'}$.
Conversely, suppose that $A_C \subset A_{C'}$ and let $x$ be an arbitrary point such that $x \in A_C$. This implies $x \in A_{C'}$, so all of the rectangles of $C'$ must also contain $x$. Hence all rectangles in $C$ and $C'$ share a common point of intersection, so the set $C'' = C \cup C'$ is a clique. Since $C$ is maximal, this means that $C'' \subset C$, and hence $C' \subset C$.
Thus, in order to test if $C'$ is a sub-clique of a previously-observed clique $C$, it can be tested to see if $A_C \subset A_{C'}$. Another scheme may be considered: an arbitrary point $x$ can be selected from each clique $C$ as it is output (in practice, the centroid
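The containment test of Lemma 3 can be sketched as follows. The sketch assumes boxes are tuples of closed intervals and that the second argument really is a maximal clique, since the lemma only holds in that case; the names `intersection_area`, `box_contains`, and `is_subclique` are hypothetical.

```python
from typing import Sequence, Tuple

Interval = Tuple[float, float]
Box = Tuple[Interval, ...]  # one closed interval per dimension

def intersection_area(clique: Sequence[Box]) -> Box:
    """Area of intersection of a clique: max of left ends, min of right ends per dimension."""
    dims = range(len(clique[0]))
    return tuple(
        (max(b[j][0] for b in clique), min(b[j][1] for b in clique)) for j in dims
    )

def box_contains(outer: Box, inner: Box) -> bool:
    """True when `inner` lies inside `outer` in every dimension."""
    return all(o[0] <= i[0] and i[1] <= o[1] for o, i in zip(outer, inner))

def is_subclique(candidate: Sequence[Box], maximal_clique: Sequence[Box]) -> bool:
    """Lemma 3 test: candidate is a sub-clique iff its area contains the maximal clique's area."""
    return box_contains(intersection_area(candidate), intersection_area(maximal_clique))

b0 = ((0, 2), (0, 2))
b1 = ((1, 3), (1, 3))
b2 = ((2.5, 4), (0.5, 2.5))
print(is_subclique([b1], [b0, b1]))  # b1 alone is a sub-clique of {b0, b1}
print(is_subclique([b2], [b0, b1]))  # b2 is not
```

The test needs only the two areas of intersection, so it costs O(d) per comparison once the areas are stored alongside each reported clique.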
Finding Cliques in One Dimension
The base case for the d-dimensional problem is achieved when d=1, although there is a direct solution for d=2 that may perform better than the recursive algorithm when d=2.
An algorithm for the one-dimensional case is summarized here for completeness. Let $\mathcal{I}$ be a set of $n$ intervals in $\mathbb{R}$. For each $I_i \in \mathcal{I}$, let $I_i = [x^l(I_i), x^r(I_i)]$. As before, for simplicity of presentation we assume all of the interval end points are unique. Let $P$ be the set of all interval end points; that is, $P = \bigcup_{i=1}^{n} \{x^l(I_i), x^r(I_i)\}$. Let $\vec{P}$ be a vector of length $2n$ containing the elements of $P$ sorted in increasing order. Let $p(i) = l$ if $\vec{P}[i] = x^l(I_j)$ for some $j$; otherwise $p(i) = r$, because it must be the case that $\vec{P}[i] = x^r(I_j)$ for some $j$.
Let $S_i$ denote the set of intervals containing the point $\vec{P}[i]$. For completeness, define $p(0) = r$.
Theorem 1. $S_i$ is a maximal clique of intervals if and only if $p(i) = r$ and $p(i-1) = l$.
A sweepline procedure built around Theorem 1 appears in
function find_cliques_1d(ia: Array of Interval): List of Clique
Pseudo-code for one-dimensional algorithm (see
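Because the referenced listing is not reproduced here, the following is a minimal sketch of a sweepline built on Theorem 1, not the original pseudo-code. It assumes intervals are given as (left, right) pairs, that cliques are reported as sets of interval indices, and that ties are broken by placing left end points before right end points, which is one possible tie-handling scheme.

```python
from typing import List, Sequence, Set, Tuple

def find_cliques_1d(intervals: Sequence[Tuple[float, float]]) -> List[Set[int]]:
    """Maximal cliques of closed intervals, reported as sets of interval indices."""
    events = []
    for idx, (left, right) in enumerate(intervals):
        events.append((left, "l", idx))
        events.append((right, "r", idx))
    events.sort()  # "l" sorts before "r", so starts precede ends at equal coordinates

    active: Set[int] = set()   # intervals containing the current sweep position
    cliques: List[Set[int]] = []
    prev = "r"                 # p(0) = r by convention
    for _, kind, idx in events:
        if kind == "l":
            active.add(idx)
        else:
            # Theorem 1: a right end point directly preceded by a left end point
            # means the intervals currently active form a maximal clique.
            if prev == "l":
                cliques.append(set(active))
            active.remove(idx)
        prev = kind
    return cliques

print(find_cliques_1d([(0, 10), (1, 2), (3, 4)]))
```

In the usage example the long interval overlaps each short one, but the two short intervals do not overlap each other, so two maximal cliques are reported. Sorting dominates the running time, aside from the cost of copying out each clique.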
The Constrained Optimization Problem
General Framework
Recall that the goal of our method is to take a collection of features identified in a set of experiments and partition the collection into sets of features representing the same object. The method of Section 2 does not directly accomplish this goal. How close the set of cliques is to the final solution depends on the nature of the data and the error bounds.
Recall that $\mathcal{E}$ represents the set of experiments $E_k$, and that each $E_k \in \mathcal{E}$ is a set of features. Let $F$ denote the set of features across all experiments; that is, $F = \bigcup_{k=1}^{K} E_k$.
Furthermore, the method is only concerned with “true” features $f$ such that $\pi(f) \ne \emptyset$. This subset of $F$ is denoted $F_\pi$. An identity relationship partitions $F_\pi$ into equivalence classes $\Pi_1, \Pi_2, \ldots, \Pi_R$. The method is interested in finding the partition that satisfies the relationship $f_1, f_2 \in \Pi_r$ if and only if $\pi(f_1) = \pi(f_2)$.
Since the features are anonymous, $\pi(f)$ is a hidden variable. Assume access to properties $\pi'(f)$ of each feature, and that a function $\varphi(f, f')$ is constructed that approximates $\Pr[\pi(f) = \pi(f') \mid \pi'(f), \pi'(f')]$. The function $\varphi$ may rely on the same data used to derive the boxes, and/or may depend on other information. Given such a function, find a partition $\Pi$ of $F_\pi$ that maximizes
$\varphi$ may be expensive to determine, and the search space of all possible partitions is large. Both the search space and the number of times $\varphi$ is evaluated are restricted by using the constraints imposed by the set of maximal cliques $\mathcal{C}$ found by the method of the previous section.
Theorem 2. The partition $\Pi$ of $F_\pi$ which maximizes (1) satisfies the property that for all $\Pi_r \in \Pi$, $\Pi_r \subset C$ for some maximal clique $C \in \mathcal{C}$.
Proof: Suppose that $f, f' \in \Pi_r$. Then, by Axiom 1, the boxes corresponding to these features intersect; since this holds for all pairs of features in $\Pi_r$, the boxes derived from these features must form a clique in $G(\beta)$. Since $\mathcal{C}$ is the set of all maximal cliques in $G(\beta)$, the clique induced by $\Pi_r$ must be a subset of one of these cliques.
Note that $\mathcal{C}$ is not a partition of $F$ only because some features appear in more than one member of $\mathcal{C}$; each feature is guaranteed to participate in at least one clique. Hence, it is possible to transform $\mathcal{C}$ into the optimal partition $\Pi$ by performing a series of two operations:
Assignment: Any feature which appears in multiple cliques must be assigned to a single clique and removed from the others.
Partition: Under φ, some features may not be likely to be identical, despite being placed in the same clique. Hence, cliques may be partitioned.
A number of standard combinatorial optimization methods can be used to search for the optimal partition, beginning with C and using the operations above to generate potential solutions.
The Parsimony Restriction
Suppose that φ relies upon the same sensor data used to generate β. In that case, there is no reason to partition a clique, since the sensor data indicates that the members may be identical and there is no external reason to believe that they are not. This notion is captured in the Principle of Parsimony, also known as Occam's Razor:
Axiom 2 (Principle of Parsimony). One should not increase, beyond what is necessary, the number of entities required to explain anything.
Hence, in situations where it is impossible (or rare) for information utilized by φ to contradict the compatibility of features, the optimization method can ignore the Partition operation and focus on optimizing via Assignment, which is more efficient. In this case, it is only necessary to compute φ(f,f′) if f and f′ appear in the same clique, and at least one of them appears in multiple cliques.
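The saving under the parsimony restriction can be made concrete with a small sketch that lists the feature pairs for which φ would still have to be evaluated. It assumes cliques are given as sets of sortable feature identifiers; the function name `pairs_needing_phi` is hypothetical.

```python
from collections import Counter
from itertools import combinations
from typing import Hashable, Iterable, Set, Tuple

def pairs_needing_phi(cliques: Iterable[Set[Hashable]]) -> Set[Tuple[Hashable, Hashable]]:
    """Pairs sharing a clique in which at least one feature belongs to several cliques."""
    cliques = list(cliques)
    membership = Counter(f for clique in cliques for f in clique)
    pairs = set()
    for clique in cliques:
        for f, g in combinations(sorted(clique), 2):
            if membership[f] > 1 or membership[g] > 1:
                pairs.add((f, g))
    return pairs

cliques = [{"a", "b", "c"}, {"c", "d"}, {"e", "f"}]
print(sorted(pairs_needing_phi(cliques)))
```

In the usage example only feature "c" is ambiguous, so φ is needed for the three pairs involving "c" and for none of the remaining pairs.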
Estimating the Error Bounds
The error function εj for each sensor should be known a priori from knowledge of the sensor, internal calibration, and/or external calibration. However, this is not always the case for a variety of reasons. In cases where the functions are not known ahead of time or where we wish to confirm our prior knowledge, we can attempt to estimate them from the data. A heuristic according to an embodiment of the present disclosure is based on making some assumptions about the error bounds and the nature of what the box overlap graph “should” look like with a good choice of error bounds.
The following assumptions are made about the functions:
Furthermore assume that many objects are observed in all experiments. Hence, it can be expected that a good choice of θ would lead to a large number of cliques whose size is K, and that these cliques would usually be vertex-disjoint from the other cliques in the graph. If these assumptions are valid for a particular data set, then we can choose the vector θ that induces a set of boxes β and overlap graph G(β) where the number of connected components in G(β) that are complete subgraphs of size K is maximized.
Let $\Omega$ represent the universe of all possible choices of $\theta$. Since $\theta \in \mathbb{R}^d$, $\Omega$ may initially appear to be infinite. However, since the metric depends on the finite number of possible configurations of $G(\beta)$, $\Omega$ can be thought of as a finite set of vectors whose values induce the different configurations.
Let $f$ and $f'$ be features, and let $\hat{\theta}(f, f')$ be such that
In other words, $\hat{\theta}_j(f, f')$ represents the smallest value of $\theta_j$ such that $f$ and $f'$ are compatible. Now consider features $f''$ and $f'''$. We say that $\hat{\theta}(f, f') \le \hat{\theta}(f'', f''')$ if and only if $\hat{\theta}_j(f, f') \le \hat{\theta}_j(f'', f''')$ for all $j$, $1 \le j \le d$. Note that this relationship implies that under parameters $\hat{\theta}(f'', f''')$, $f$ and $f'$ are also compatible.
Let $\hat{\Omega}$ be the set $\{\hat{\theta}(f, f') : f, f' \in F\}$. Now define the $\oplus$ operator such that $\theta \oplus \theta'$ is a $d$-dimensional vector where element $j$, $1 \le j \le d$, is defined as $\max\{\theta_j, \theta'_j\}$. Let $\Omega$ be the closure of $\hat{\Omega}$ under $\oplus$.
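The ⊕ operator and the closure it generates can be sketched directly from the definition. The sketch assumes parameter vectors are plain tuples of numbers; the names `oplus` and `closure` are hypothetical.

```python
from typing import Iterable, Set, Tuple

Vector = Tuple[float, ...]

def oplus(a: Vector, b: Vector) -> Vector:
    """Element-wise maximum of two parameter vectors."""
    return tuple(max(x, y) for x, y in zip(a, b))

def closure(vectors: Iterable[Vector]) -> Set[Vector]:
    """Smallest set containing `vectors` that is closed under oplus."""
    closed = set(vectors)
    frontier = list(closed)
    while frontier:
        v = frontier.pop()
        # Combine v with everything known so far; new results are queued in turn.
        for w in list(closed):
            u = oplus(v, w)
            if u not in closed:
                closed.add(u)
                frontier.append(u)
    return closed

print(sorted(closure({(1, 4), (3, 2)})))
```

In the usage example the closure adds the single vector (3, 4). Every coordinate of a vector in the closure already appears in that coordinate of some input vector, because taking maxima never creates a new value.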
Theorem 3. Let $\beta'$ be the set of boxes derived from $F$ by $\theta'$ and $G(\beta')$ be the overlap graph derived from $\beta'$. There exists a $\theta \in \Omega$ which derives a set of boxes $\beta$ from $F$ such that $G(\beta)$ is isomorphic to $G(\beta')$.
Proof: It can be shown how to select $\theta$ to produce the isomorphism. Match the vertices derived from the same element of $F$ under $\theta$ and $\theta'$, so that it is only necessary to show how to choose $\theta \in \Omega$ to achieve the same edge structure.
Let $\hat{\Omega}' = \{\hat{\theta}(f, f') : \{f, f'\} \in G_E(\beta')\}$, where $G_E$ denotes the edge set of $G$. Define $\theta$ such that for all $j$, $1 \le j \le d$, $\theta_j = \max_{\hat{\theta} \in \hat{\Omega}'} \hat{\theta}_j$. Clearly for all $\hat{\theta} \in \hat{\Omega}'$, $\hat{\theta} \le \theta$. This implies that for all $e \in G_E(\beta')$, $e \in G_E(\beta)$.
Now suppose $e = (f, f')$ is an edge in $G_E(\beta)$. Consider an arbitrary $j$, $1 \le j \le d$. For this $j$, $\hat{\theta}_j(f, f') \le \theta_j$. By the way $\theta$ was chosen, this means that there was an edge $e' = (f'', f''')$ in $G_E(\beta')$ such that $\hat{\theta}_j(f'', f''') = \theta_j$. Since $e'$ is in $G(\beta')$, it also follows that $\hat{\theta}_j(f'', f''') \le \theta'_j$. Thus, $\hat{\theta}_j(f, f') \le \theta_j \le \theta'_j$ for all $j$, and so $e \in G_E(\beta')$.
A Simple Exact Method
Let $\theta^*$ be a vector such that for all $j$, $1 \le j \le d$, $\theta^*_j = \theta_j$ for some $\theta \in \hat{\Omega}$. Let $\Omega^*$ be the set of all such vectors $\theta^*$ given $\hat{\Omega}$.
Theorem 4. $\Omega \subset \Omega^*$.
Proof. Let the set $D_j(\Omega) = \{\theta_j : \theta \in \Omega\}$. In other words, $D_j$ is the set of all values appearing in dimension $j$ of the vectors in $\Omega$. Recall that $\Omega$ is the closure of $\hat{\Omega}$ under $\oplus$. It can be claimed that $D_j(\Omega) = D_j(\hat{\Omega})$. Let $\theta \in \Omega - \hat{\Omega}$. Therefore $\theta = \hat{\theta}^{(1)} \oplus \hat{\theta}^{(2)} \oplus \cdots \oplus \hat{\theta}^{(M)}$, where each $\hat{\theta}^{(m)} \in \hat{\Omega}$. However, this means that $\theta_j$ is the result of successively taking the maximum of each $\hat{\theta}_j^{(m)}$ and $\hat{\theta}_j^{(m+1)}$, so therefore $\theta_j = \hat{\theta}_j^{(m)}$ for some $m$. Thus, $\theta_j \in D_j$. Since $\Omega^*$ can equivalently be defined as $D_1 \times D_2 \times \cdots \times D_d$, $\Omega \subset \Omega^*$.
The enumeration of the elements of $\Omega^*$ includes constructing the set $D_j$ for each dimension $j$ and using loops to enumerate the elements of $\Omega^* = D_1 \times D_2 \times \cdots \times D_d$. For each $\theta \in \Omega^*$, construct the corresponding $\beta$ and $G(\beta)$ and find the complete connected components of $G(\beta)$.
The enumeration process can be accelerated by noting that if $\theta \le \theta'$ and $G(\beta)$ and $G(\beta')$ are the respective graphs induced by $\theta$ and $\theta'$, then $G_E(\beta) \subset G_E(\beta')$. The standard UNION-FIND data structure can be used to identify the connected components of $G(\beta)$ as edges are added by values of $\theta$ in increasing order under the $\le$ relation. Finding an optimal decomposition of $\Omega^*$ into increasing sequences of parameters is an interesting problem; a simple (but sub-optimal) solution is to sort the $D_j$ sets prior to iteration, thereby forming runs of increasing subsequences.
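One way to maintain the optimization criterion incrementally is sketched below: a union-find structure that also tracks the size and edge count of each component, so that the number of connected components that are complete subgraphs of size K can be updated as each edge is added. The class name `CliqueCountingUnionFind` and its interface are hypothetical, and the sketch assumes each edge is added at most once.

```python
class CliqueCountingUnionFind:
    """Union-find that counts components that are complete subgraphs of size k."""

    def __init__(self, n: int, k: int) -> None:
        self.parent = list(range(n))
        self.size = [1] * n    # vertices per component, valid at roots
        self.edges = [0] * n   # edges per component, valid at roots
        self.k = k
        self.complete_k = n if k == 1 else 0  # singletons are complete of size 1

    def find(self, v: int) -> int:
        while self.parent[v] != v:
            self.parent[v] = self.parent[self.parent[v]]  # path halving
            v = self.parent[v]
        return v

    def _is_target(self, root: int) -> bool:
        s = self.size[root]
        return s == self.k and self.edges[root] == s * (s - 1) // 2

    def add_edge(self, u: int, v: int) -> None:
        """Add an edge not added before and update the running count."""
        ru, rv = self.find(u), self.find(v)
        self.complete_k -= self._is_target(ru)
        if ru != rv:
            self.complete_k -= self._is_target(rv)
            if self.size[ru] < self.size[rv]:
                ru, rv = rv, ru
            self.parent[rv] = ru           # union by size
            self.size[ru] += self.size[rv]
            self.edges[ru] += self.edges[rv]
        self.edges[ru] += 1
        self.complete_k += self._is_target(ru)

uf = CliqueCountingUnionFind(n=6, k=3)
for edge in [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5)]:
    uf.add_edge(*edge)
print(uf.complete_k)  # two triangles, so two complete components of size 3
```

A component of s vertices is complete exactly when it holds s(s−1)/2 edges, which is why tracking the edge count per component is sufficient. Adding a further edge that joins the two triangles would drop the count back to zero.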
A Faster Heuristic Method
Since the size of $\Omega^*$ is $O(n^{2d})$, a faster heuristic method is needed for most applications. A steepest descent method, in which the state $\theta$ has successors $\{\theta' \mid \theta \le \theta'\}$, is recommended here. This makes state transitions efficient by using the UNION-FIND data structure to quickly update the optimization criterion. Various heuristics might be used to choose the initial state, including states suggested by prior knowledge or expectation, or states determined by the distributions of the values in the $D_j$ sets.
It is to be understood that the present invention may be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof. In one embodiment, the present invention may be implemented in software as an application program tangibly embodied on a program storage device. The application program may be uploaded to, and executed by, a machine comprising any suitable architecture.
Referring to
The computer platform 301 also includes an operating system and microinstruction code. The various processes and functions described herein may either be part of the microinstruction code or part of the application program (or a combination thereof), which is executed via the operating system. In addition, various other peripheral devices may be connected to the computer platform such as an additional data storage device and a printing device.
It is to be further understood that, because some of the constituent system components and method steps depicted in the accompanying figures may be implemented in software, the actual connections between the system components (or the process steps) may differ depending upon the manner in which the present invention is programmed. Given the teachings of the present disclosure provided herein, one of ordinary skill in the related art will be able to contemplate these and similar implementations or configurations.
Having described embodiments for a system and method for registering objects, it is noted that modifications and variations can be made by persons skilled in the art in light of the above teachings. It is therefore to be understood that changes may be made in embodiments of the present disclosure that are within the scope and spirit thereof.
This application claims the benefit of Provisional Application No. 60/712,962 filed on Aug. 31, 2005 in the United States Patent and Trademark Office, the contents of which are herein incorporated by reference in their entirety.