The present invention concerns method for elastic registration between a two-dimensional digital image and a slice of a three-dimensional volume with overlapping content.
The field of the invention is the field of two-dimensional (2D) to three dimensional (3D) image registration/fusion either in the case of same modalities or different ones.
The 2D-3D images registration is an important problem in computer vision, and finds applications in multiple medical procedures, as radiotherapy planning and treatment verification, spinal surgery, hip replacement, neuro-interventions and aortic stenting.
Image guided surgeries, laparoscopic or endoscopic, also benefit from the 2D-3D image registration where imaging acquisition devices are associated with the surgical devices
Applicative fields of the invention include image guided adaptive radiotherapy planning and treatment, image-guided biopsy as breast under ultrasound guidance or prostate biopsies using MRI images and transrectal ultrasound (TRUS) achieve a higher detection of cancer. Another application domain is image-guided surgery, as laparoscopic or endoscopic, in particular neurosurgery which benefits from 2D-3D image fusion for evaluation of intracranial tumours and vascular pathologies.
In many medical imaging applications, it is particularly useful to achieve a registration/mapping between a two-dimensional image of a partial view of an object of interest that is captured during an operation, also called intra-operative capture, whereas a three-dimensional volume where the same object of interest is present partially or completely acquired at a pre-operative stage. It should be noted that the object of interest could exhibit shape changes during surgery (tissue shift, respiratory motion, cardiac heart beating). Such registration is useful to guide surgeons during operation through the mapping of visual observations at surgery to the 3D pre-operative high resolution annotated data.
A particular problem in medical applications is that tissue shift, as well as breathing and heart motion during surgery, cause elastic deformations of the organs and make the registration of a 2D image acquired during operation with a 3D volume previously computed a particularly challenging problem.
More generally, medical applications that can benefit from an elastic 2D to 3D registration, with possibly multi-modal acquired images, are application fields for the invention.
Several methods for achieving registrations between pre-operative 3D images and intra-operative 2D images are known, but they require either a learning stage or to pre-process input data. The quality of the final result is highly dependent on the learning stage.
Many known methods are either dedicated to specific organs (for example the ones requiring manual segmentation of the organ of interest that perform registration by segmentation) or could address the task for specific image modalities and therefore cannot be re-used in other applications.
An aim of the present invention is to provide a 2D-to-3D elastic/deformable registration/fusion method that overcomes the cited drawbacks.
To this end, the invention concerns, according to a first aspect, a method for elastic registration of a two-dimensional source digital image of an object of interest with a slice of a three dimensional target volume of the object of interest, the method comprising defining a Markov Random Field framework comprising at least one undirected pairwise graph superimposed on the two-dimensional image domain comprising at least a set of regular vertices and at least a set of edges, defining at least a grid of control points, each control point corresponding to a vertex of the set of vertices, and a neighborhood system of edges associated with vertices.
The method of the invention comprises:
defining a set of multi-dimensional labels of a discrete space to associate to each control point, wherein a spatially displaced control point is obtained by assigning a given label to said control point, a control point displacement defining a transformation adapted to obtain a plane slice of the target volume and an in-plane deformation transformation of the source digital image,
obtaining a set of labels defining a set of displaced control points, by minimizing an energy target function, the set of displaced control points defining a plane slice of the target volume that elastically matches the two-dimensional source digital image according to the energy target function.
The energy target function being expressed as a sum of terms including at least one first order term computed as a sum, over the control points, of unary potentials depending on the vertex multi-dimensional label, and/or at least one second order term computed as a sum, over the pairs of control points according to a second order neighborhood system, of pairwise potentials depending on pairs of multi-dimensional labels of two connected vertices, and/or at least one higher order term, over higher order cliques of control points according to a higher order neighborhood system, of higher order potentials depending on a number of labels depending on the order of said cliques.
Advantageously, the invention provides an elastic registration method using Markov Random Fields that is able to determine simultaneously the plane and the in-plane dense deformation, which can be adapted to different image modalities.
According to an embodiment, each control point is assigned a five-dimensional label of a sampled discrete five-dimensional label space, each label comprising three parameters defining a displacement vector in a Cartesian coordinate system and two parameters defining a vector over a unit sphere, and wherein the in-plane deformation transformation of the source digital image and the plane transformation are determined simultaneously by said five-dimensional labels.
According to another embodiment, the undirected pairwise graph comprises two sets of vertices and three sets of edges, the two sets of vertices being regular and of same cardinality, a first set of vertices being labeled with labels from a first two-dimensional label space modeling in-plane deformations and a second set of vertices being labeled with labels from a second three-dimensional label space modeling the plane slice position, every control point having two associated vertices, a first vertex from the first set of vertices and a second vertex from the second set of vertices.
According to yet another embodiment, the undirected pairwise graph comprises a set of vertices, each vertex having an associated three dimensional label, and wherein two sets of edges are defined, a first set of edges composed of triplets of neighboring vertices of triangular shape, and a second set of edges composed of triplets of horizontal or vertical collinear neighboring vertices.
According to an embodiment, the obtaining of a set of displaced control points comprises a master-slave architecture:
communicating to each slave a rigid transformation, each slave generating a plane slice from the target volume corresponding to said rigid transformation,
obtaining an in-plane deformation transformation that minimizes a local energy criterion computed between the source image and the extracted plane slice, and
memorizing an associated local energy term that minimizes the local energy.
According to a second aspect, the invention concerns a device for elastic registration of a two-dimensional source digital image of an object of interest with a slice of a three dimensional target volume of the object of interest, comprising a computing unit adapted to implement a method comprising defining a Markov Random Field framework comprising at least one undirected pairwise graph superimposed on the two-dimensional image domain comprising at least a set of regular vertices and at least a set of edges, defining at least a grid of control points, each control point corresponding to a vertex of the set of vertices, and a neighborhood system of edges associated with vertices.
The device comprises:
a defining unit for defining a set of multi-dimensional labels of a discrete space to associate to each control point, wherein a spatially displaced control point is obtained by assigning a given label to said control point, a control point displacement defining a transformation adapted to obtain a plane slice of the target volume and an in-plane deformation transformation of the source digital image,
an obtaining unit for obtaining a set of labels defining a set of displaced control points, by minimizing an energy target function, the set of displaced control points defining a plane slice of the target volume that elastically matches the two-dimensional source digital image according to the energy target function,
the energy target function being expressed as a sum of terms including:
The advantages of the device of the invention are analogous to the advantages of the method of the invention briefly introduced above.
According to a third aspect, the invention concerns a non-transitory program storage device readable by a processing system, tangibly embodying a program of instructions executed by the processing system to perform the method steps for elastic registration of a two-dimensional source digital image of an object of interest with a slice of a three dimensional target volume of the object of interest as briefly described above.
The advantages of the non-transitory program storage device of the invention are analogous to the advantages of the method of the invention briefly introduced above.
The invention will be better understood in the light of the detailed description and accompanying drawings listed below, which are only exemplary and by no way limiting:
The invention is described in the context of medical imaging, wherein a two-dimensional (2D) image of a partial view of an object of interest is captured during an operational process, whereas a 3D volume of the same object of interest is computed in a pre-operational stage. For example, the 2D image is a Magnetic Resonance Image (MRI) or an Ultrasound image, whereas the 3D volume is obtained by computed tomography (CT) or MRI.
The invention method for elastic registration of a two-dimensional source digital image of an object of interest with a slice of a three dimensional target volume of the object of interest, the method comprising defining a Markov Random Field framework comprising an undirected pairwise graph superimposed on the two-dimensional image domain comprising at least a set of regular vertices and at least a set of edges, defining at least a grid of control points, each control point corresponding to a vertex of the set of vertices, and a neighborhood system of edges associated with vertices.
The invention solves an optimization problem, that can be generally defined by the following energy target function:
Where V is the set of vertices defined, E1 is a second order neighborhood system associated to edges relating two vertices, E2 is a higher order neighborhood system associated to cliques Ci of vertices of higher order, for example third order or more, and the number of edges in a clique Ci is designated by |Ci|
The parameters a,b,c are real positive weights which are selected according to the embodiment of the invention, as explained in detail hereafter.
The potential functions gp(xp) and fp
In a general view, the energy target function comprises at least one first order term computed as a sum, over the control points, of unary potentials depending on the vertices labels, and/or at least one second order term computed as a sum, over the pairs of control points according to a given second order neighborhood system, of pairwise potentials depending on of the assigned labels of two connected vertices, and/or at least one higher order term, over higher order cliques of control points according to a higher order neighborhood system, of higher order potentials depending on a number of labels depending on the order of said cliques.
Several different embodiments of the invention will be described hereafter.
A deformation grid of the control points of regular grid G is defined using the coordinate transformation:
Where x is a point in the source image, p is a position vector of a control point, dp is a displacement vector of control point p. A label up is associated with each control point p and displacement dp. The function η( ) is a weighting function for the displacement vector which determines the interpolation strategy.
According to a first embodiment of the invention, a plurality of rigid transformations Ai defined in a three dimensional cartesian space by 6 coordinates Ai=(tx(i),ty(i),tz(i),rx(i),ry(i),rz(i)) where the first three parameters tx,ty,tz are parameters defining a translation and the following three rx,ry,rz are parameters defining a rotation, the following steps are implemented according the first embodiment of the invention.
An MRF (Markov Random Field) energy target function to be minimized by T is defined by the following terms:
Where E is a set of connected control points according to a neighborhood system, |G| is the number of points of G, |E| is the number of point of the neighborhood system and t is an iteration counter.
The first term of the energy function (3)
is a first order term formed by unary potentials that encode data terms (similarity criterion), and the second term
is a second order regularization term, containing binary potentials that are used to impose smoothness on the deformation field.
It should be noted that in present embodiment arbitrary similarity metric can be used to compare the source and the target image, which is a slice plane of the target volume, as well as arbitrary smoothness constraints imposing different degrees of smoothness on the deformation field.
Advantageously, the registration method casts the MRF optimization problem to a linear programming one and uses the fast primal-dual algorithm to determine a coordinate transformation for in-plane deformation that minimizes the energy function as defined above. In order to take advantage of modern parallel architectures, the global transformation problem is decoupled from the local deformation and the considered optimization problem is solved through parallel creation of multiple instances of the global transformation and the optimization of the associated energy corresponding to the local transformation.
The subsequent steps run in parallel through a master-slave architecture. The master maintains two variables, one that corresponds to the minimal energy Emin initially set to infinity and one that corresponds to the optimal transformation, initially set to the identity transformation. The master in a first step 20 generates m-possible planes Ai=(tx(i),ty(i),tz(i),rx(i),ry(i),rz(i)).
Each plane is communicated (step 22) to one of the slaves, or multiple planes are communicated to the same slave.
Each slave, for each plane or slice Ji according to transformation Ai is extracted from the target volume J.
A 2D-2D registration using Markov Random Fields between the source image I and the target image Ji is applied in step 24. In this step, a regular deformation grid of control points is defined on the source image I, providing a sparse representation of a deformation field. The regular grid is deformed under the influence of image similarity criteria and regularization constraints to obtain an in-plane deformation of the source image I in order to get the best match with the target image Ji.
In a preferred embodiment, the method applied for the 2D-2D registrations is the method described in U.S. Pat. No. 8,126,291, the contents of which are herein incorporated by reference in their entirety.
An MRF energy target function to be minimized by in-plane transformation T is defined by the following terms:
Where E is a set of connected control points according to a neighborhood system, |G| is the number of points of G, |E| is the number of point of the neighborhood system and t is an iteration counter.
The first term of the energy function (4)
is formed by unary potentials that encode data terms (similarity criterion), and the second term
is a regularization term, containing binary potentials that are used to impose smoothness on the deformation field.
It should be noted that in present embodiment arbitrary similarity metric can be used to compare the source and the target image as well as arbitrary smoothness constraints imposing different degrees of smoothness on the deformation field.
The registration step 24 provides an energy term Ei associated to the transformation. This energy is communicated to the master (step 26) that compares it with the existing minimum energy (step 28). If the energy is lower than the existing minimum energy then the minimum Emin is set to Ei and the optimal transformation to the one corresponding to the minimum energy.
In this embodiment, each control point can be considered as having an associated label lp composed of the label up previously defined for the energy minimization in order to obtain the in-plane transformation and a plane index i designating the plane transformation among the plurality of plane transformations considered. The overall process allows determining the label lp for each control point for selecting the in-plane and the rigid plane transformation minimizing a global energy criterion defined by the energy target function.
Preferably, m is comprised between 100 and 10000.
Advantageously, the method is parallelizable both in CPU and GPU, the steps 24 and 26 being naturally carried out in parallel.
Therefore, the first embodiment of the invention has the advantage of computational efficiency.
According to a second embodiment of the invention, a direct elastic 2D-3D registration is applied, based on Markov Random Fields.
Given a 2D source image I and a 3D target volume J, the objective is to determine a 2D-2D in-plane local deformation TD and a bi-dimensional slice of the volume J, defined by a plane {circumflex over (π)}(J), which minimize the following objective function, also called I energy target function:
E=M((I·TD(x),π[J](x))+R(TD,π) (5)
Therefore, the in-plane deformation and the plane transformation to be found are defined as:
where x represents the pixels of the source image I, M is the data term and R is the regularization term.
The data term M measures the matching quality between the in-plane deformed 2D source image and the slice π[J] of the 3D volume J. The regularization term R imposes certain constraints on the final solution, that can be used to render the problem well posed primarily acting on the geometric properties of the deformation field.
The plane {circumflex over (π)}[J] that minimizes the equation (6) indicates the slice of the 3D volume J that is more similar to the deformed source image. The deformation field {circumflex over (T)}D represents the in-plane transformation that minimizes the objective function. Such a plane/deformation can be represented in various ways. One can consider 3D displacements of three points (that define a plane but require at least three points) or adopt an over-parameterized model that associates a plane normal and a plane displacement at each control point endowed with additional geometrical constraints to impose consistency on the plane selection and the in-plane deformation.
In the second embodiment of the invention, an undirected pairwise graph G=V,E is super-imposed image in the 2D domain, with a set of vertices V and a set of cliques or edges E, as schematically shown in
The vertices V, forming a regular lattice, are interpreted as control points of a bi-dimensional quasi-planar grid that models at the same time the in-plane deformation and the current position of the 2D image in the 3D volume, therefore the position of the slice of the 3D volume matching the source image I.
where x is a point in the source image, p is a position vector of a control point, dp is a displacement vector of control point p. A label up is associated with each control point p and displacement dp. The function η( ) is a weighting function for the displacement vector which determines the interpolation strategy.
The grid represents a Free Form Deformation (FFD) model where each control point has only local influence on the underlying image deformation.
The set of edges E is formed, in this embodiment, by a regular grid of 4-neighbor connections as illustrated schematically on
The vertices of the set V move by assigning them different labels ui of a label space L until an optimal position according to a given criterion is found.
Preferably, the criterion is the minimization of an energy target term, therefore the optimal position in the label space is the position that minimizes the energy target term defined.
In order to deform the pairwise graph G, the label space L including all possible values that a vertex may take, is first defined.
Preferably, in this second embodiment, the label space L consists of 5-tuples u with the following structure:
u=(dx,dy,dz,φ,θ) (8)
where the first three parameters (dx,dy,dz) define a displacement vector di in the Cartesian coordinate system illustrated in
Given a control point pi=(pxi,pyi,pzi) defining the initial position of a vertex and assigning label ui=(dxi,dyi,dzi,φi,θi) to this control point, a displaced control point p′i defining a position of vertex in the target volume is obtained:
p′
i=(pxi+dxi,pyi+dyi,pzi+dzi) (9)
Additionally, a plane πi containing the displaced control point p′i and whose unit normal vector is Ni of coordinates (φi,θi) is obtained, as schematically illustrated in
In
The MRF energy target function to be optimized is formed as a summation of a first order data term and a second order regularization term.
In this second embodiment, the data term is formed by unary potentials g={gi( )}, each vertex having an associate unary potential gi, and the regularization terms is function of pairwise potentials f={fi( )} associated with the edges of graph G.
The unary potentials are typically used for encoding the similarity score between the deformed plane and the target volume, whereas the pairwise potentials act as regularizers and play an important role in obtaining consistent geometrically and high quality results.
The minimization energy problem in the context of discrete MRF is defined as:
Where ui,ujεL are the labels associated to the vertices vi,vjεV
Advantageously, the formulation of the unary potential is independent on a similarity measure implemented. Therefore, any intensity based metric δ capable of measuring the distance between two images may be applied, for example a sum of absolute differences, mutual information, normalized cross correlation resulting on a modular with respect to the image modalities to be registered approach.
The unary potentials are calculated for each control point pi, using its associated plane πi[J] of the target 3D volume J and the source image I.
In a preferred embodiment, a patch Ωi, centered in pi, of plane πi[J], is extracted from the 3D target volume J, so that a metric δ can be calculated between the intensity values of the patch Ωi and the corresponding area of the source image. If pixel-wise comparisons are considered in the metric then the similarity criterion can be expressed by the following formula:
Where x designates the spatial coordinates of pixels belonging to patch Ωi.
It should be noted that the patch orientation is determined in such a way that it is consistent with the one of the corresponding patch in the source image I.
The patch Ωi is preferably of regular geometric shape, for example a square area centered on control point pi. Patches corresponding to adjacent control points can either overlap or not overlap depending on the degree of smoothness that is expected with respect to the estimated transformation. Overlapping patches will tend to smooth further the deformation compared to the non-overlapping ones.
In an embodiment, particularly adapted in the case where two images of the same modality are compared, the metric δ is the sum of absolute differences:
In multimodal scenarios, where different modalities are compared, for example CT and ultrasound images, statistical similarity measures such as Mutual Information (MI) are preferably used. Indeed, in these scenarios, the corresponding objects do not have the same intensities in two images.
Mutual information is defined using joint intensity distribution P(i,j) and the marginal intensity distribution P(i) and P(j) of the images. These local distributions are determined empirically using the observed samples at the source image and the target patch:
More generally, the framework described can be generalized using any similarity measure defined over two bi-dimensional images.
The regularization terms, also called pairwise potentials as mentioned above, encode the regularization of the displacement field.
In a preferred variant of this second embodiment, the pairwise potentials are defined by two terms, the first regularization term F1 controlling the grid deformation assuming that the grid is a plane, the second regularization term F2 maintaining a plane structure of the grid after applying displacements to the control points.
The first and second terms are weighted by a coefficient α as indicated by the following equation:
f
ij(ui,uj)=αF1(ui,uj)+(1−α)F2(ui,uj) (14)
The in-plane grid deformation is controlled using a distance preserving approach, so as to preserve the original distance between the control points of the grid, which are assumed to be co-planar.
Preferably, the distance metric is an Euclidean distance between control points, designed by ∥ ∥.
For example, the first term F1 that regularizes the in-plane deformation is given by:
Where p0,i and P0,j are the original positions of the control points in the source image I. The distance between the original positions of the control points is dependent on the considered grid resolution, and varies from one level to the next in a multiresolution approach as explained in detail hereafter.
The second regularization term F2 penalizes the average distance between a control points and the plane corresponding to a neighboring control point. The aim is to maintain a quasi-planar structure of the grid.
Given that the distance between a point and a plane is zero when the point is in the plane, the second regularization term F2 is minimum when both control points are in the same plane.
The distance between a point p=(px,py,pz) and a plane π defined by normal vector N=(nx,ny,nz) and point q=(qx,qy,qz) is calculated as:
The second regularization term F2, that is symmetric, is then defined by the average of Dπj(pi+di) and Dπi(pj+dj):
F
2(ui,uj)=½(Dπ
The normal vectors in the label space are expressed using spherical coordinates with a fixed ratio r=1. The following representation equivalence applies between Cartesian coordinates (nx,ny,nz) and spherical coordinates (r,φ,θ):
n
x
=r sin(θ)cos(φ),ny=r sin(θ)sin(φ),nz=r cos(θ) (18)
Preferably, the method called FAST-PD, described in the U.S. Pat. No. 8,131,069, included herein by reference, is used for determining the optimal labels according to the energy target function of formula indicated by equation (10) above.
In an implementation of this second embodiment, in order to reduce the computational time to convergence, a pyramidal approach is applied. In this approach, several grid resolution levels are used, from a coarse to a fine spacing between control points. For each grid resolution, a fixed number of iterations of the registration algorithm are performed, choosing the optimal five dimensional label for each control point of each grid resolution level and updating the control points position with this information from one grid resolution level to the next one.
For each resolution level, during the iterations over one grid resolution, the size of the displacement vectors and angles that form the label space is reduced so as to improve the search space sampling. The transformation obtained after several optimization steps at a given resolution level is used to initialize the position of a finer regular grid at the next level.
For example, from one resolution grid to the next finer one, the grid size multiplied by 2 by introducing a new control point at equidistance from two adjacent ones.
Preferably, the number of grid resolution levels varies between 3 to 5.
Preferably, the maximum displacement of a control point is restricted to a fraction of the grid spacing, for example to 0.4 times the grid spacing. Advantageously, this constraint guarantees a diffeomorphic deformation field.
According to a third embodiment of the invention, the label space is decoupled in order to decrease the computational cost. The graph-structure is decoupled in two different (identical) ones, the first acting on the in plane deformation and the second acting on the plane selection. Advantageously, such a decoupling allows considering big displacements and small deformations in the search space.
In this third embodiment, every vertex of the previous embodiment is decoupled in two different connected vertices, one of them modeling the in-plane deformation and the other one the position of the plane slice in the target volume J. This is analogous on creating two separated graphs of the same size and topology with different types of vertices, taking values in two different label spaces, one for the in-plane deformation and another one for the plane position, which can be sampled with different precisions.
According to this embodiment, two identical graphs are defined and connected by a pairwise edge for all corresponding vertices, having associated pairwise potentials in order to encode the matching similarity measure.
As shown in
The vertices VI and VP form respective regular lattices noted 70 and 72 in
This modeling is equivalent to a single grid modeling, as already described with respect to the second embodiment of the invention, every control point pk of the grid 74 having two associated vertices, vkIεVI and vkPεVP.
Two label spaces are defined, a first label space LI associated with the vertices in VI and a second label space LP associated with the vertices in VP.
The first label space LI is a bi-dimensional space that models in-plane deformation using displacement vectors lI=(dx,dy).
The second label space LP is tridimensional and indicates the plane in which the corresponding control point is located, comprising labels lP=(φ,θ,λ) where φ and θ are angles defining a vector N over a unit sphere, expressed using spherical coordinates while λ corresponds to the displacement applied to the control point p along the normal to the plane defined using the (φ,θ) unit sphere plane parameterization. Together with parameter λ they define the position of the plane associated with a given control point, as shown in
Considering the analogy with the second embodiment, every control point pk is associated with two vertices vkIεVI and vkPεVP and their respective labels, lkIεLI and lkPεLP. A plane πk with normal vector N contains the displaced control point pk+lkI. The parameter λ indicates the magnitude of the translation applied to the plane πk in the direction given by vector N in order to place the plane in its final position, as schematically shown in
The energy target that guides the optimization process is defined over the pairwise terms as follows:
Where ei,jI acts as an in-plane regularizer, associated with edges in EI, ei,jP acts as plane regularizer, associated with edges in EP, and ei,jD are data terms associated with edges in ED, liI and liP are labels respectively from the first label space LI and the second label space LP.
The data term is defined for every control point pk using the information provided by the two associated graph vertices, and is encoded in pairwise terms ei,jD.
As proposed in the second embodiment, in a preferred implementation, a patch Ωk of the plane πk centered on the displaced control point defined by label liI and normal to the vector defined by the label liP is extracted from the 3D volume J, and a similarity measure δ is calculated between the image signal of the patch Ωk and the corresponding area over the source image I, as schematically shown in
In an embodiment, particularly adapted in the case where two images of the same modality are compared, the metric δ is the sum of absolute differences.
Alternatively, in multimodal scenarios, the similarity measure could be any information theoretic measure like for example Mutual Information.
Two regularization terms, similar to the ones defined in the second embodiment, are defined.
The first regularization term controls the in-plane deformation and is defined between vertices included in VI.
In a distance preserving approach, the first regularization term is defined as:
Where po,i and po,j are the original positions of the control points.
The second regularization term penalizes the distance between the control points and the plane corresponding to the neighboring ones.
The distance between a point p=(px,py,pz) and a plane π defined by normal vector N=(nx,ny,nz) and point q=(qx,qy,qz) is calculated as:
The second regularization term is then defined as:
e
i,j
P(liP,ljP)=½(Dπ
Where p′i and p′j are the positions after applying label liP to pi and label ljP to pj respectively.
Preferably, the method called FAST-PD, described in the U.S. Pat. No. 8,131,069 is used for determining the optimal labels according to the energy formula above.
In an implementation of this second embodiment, in order to reduce the computational time to convergence, a pyramidal approach is applied. In this approach, several grid resolution levels are used, from a coarse to a fine spacing between control points. For each grid resolution, a fixed number of iterations of the registration algorithm are performed, choosing the optimal five dimensional label for each control point of each grid resolution level and updating the control points position with this information from one grid resolution level to the next one.
For each resolution level, during the iterations over one grid resolution, the size of the displacement vectors and angles that form the label space is reduced so as to improve the search space sampling. The transformation obtained after several optimization steps at a given resolution level is used to initialize the position of a finer regular grid at the next level.
For example, from one resolution grid to the next finer one, the grid step multiplied by 2 by introducing a new control point at equidistance from two adjacent ones.
Preferably, the number of grid resolution levels varies between 3 to 5.
Preferably, the maximum displacement of a control point is restricted to a fraction of the grid spacing, for example to 0.4 times the grid spacing. Advantageously, this constraint guarantees a diffeomorphic deformation field.
According to a fourth embodiment, more accurate and more complex energy terms are formulated using higher order neighborhood systems or cliques. The advantage of such an approach is that it provides a framework for exact and precise estimation of the in-plane and out-plane transformation using a low dimensional label space (translations in the 3 dominant directions) and a graph structure of a small number of nodes.
In this fourth embodiment, a new formulation of the 2D-3D non-rigid registration problem consists of an undirected graph GH=V,E super-imposed on the 2D image domain with a set of vertices V and a set of cliques E=ED∪ER. The vertices are interpreted as control points of a bi-dimensional quasi-planar grid that models at the same time the in-plane deformations and the current position of the 2D image into the 3D volume for obtaining a plane slice of the target volume. In order to represent the in-plane deformations, the grid is interpreted as a Free Form Deformation (FFD) where each control point has only a local influence on the deformation.
Vertices V are labeled with three dimensional vectors li belonging to a label three dimensional label space LH, indicating the position of the control point in a 3D Cartesian coordinate system.
Two types of cliques are defined, as schematically illustrated in
The cliques in ED are triplets of neighboring vertices {Vi,Vj,Vk} of triangular shape, as illustrated in
Advantageously, using third-order triangular cliques for data terms allows the estimation of the exact position of the plane slice, without explicit transformation.
The cliques in ER are horizontal or vertical collinear third-order cliques, as schematically shown in
Advantageously, using triplets of collinear points for regularization allows the encoding a smoothness prior based on the discrete approximation of the second-order derivatives using only the vertices position.
The control points, corresponding to vertices of the grid, are displaced by assigning them different labels until an optimal position is found, according to an optimality criterion defined by a high order energy target function.
Preferably, a simple three-dimensional labeling space LH, sampled as shown in
The energy target function to be minimized is expressed as:
Where data terms Dijk are associated with triangular triplets (i,j,k)εED, regularization terms Rijk are associated with collinear horizontal and vertical triplets (i,j,k)εER, li is a label associated to a displacement vector (dx,dy,dz) assigned to vertex i, and β is a parameter weighting the contribution of the data and regularization terms.
The data terms Dijk are defined over a triangular cliques d=(i,j,k) using the source image I and the corresponding plane πd(J) extracted from the target volume J defined by the control points of the clique. The formulation of the data term is independent of the similarity measure δ. The data term associated with clique d is defined as:
where xεΩd, Ωd corresponds to the triangular area defined by the control points of clique d over the plane πd(J), after applying the labels (li,lj,lk) to the vertices.
In an embodiment, particularly adapted in the case where two images of the same modality are compared, the metric δ is the sum of absolute differences.
Alternatively, in multimodal scenarios, information theoretical measures like for example Mutual Information can be used.
More generally, the data term being defined independently of the similarity measure, any appropriate similarity measure can be chosen according to the application envisaged.
The regularization terms Rijk impose the planar and regular structure of the grid.
A regularization term Rijk is defined for every clique of three collinear vertices, in horizontal and vertical directions. Further, extra collinear cliques are defined for vertices that are collinear but not contiguous, in order to propagate the regularization so that the planar structure is conserved. The regularization term is formed of two parts, a first one that insures the plane structure of the grid and a second one that regularizes the in-plane deformations.
Planar consistency is enforced through a discrete approximation of the second-order derivatives of the grid.
Given three contiguous control points (pi,pj,pk) and their corresponding labels (li,lj,lk), the following first part term of the regularization term is defined:
Where ∥ ∥ represents the Euclidean norm and Δ is the distance between the control points (pi,pj,pk) of the clique.
In order to achieve in-plane deformation smoothness, a distance preserving approach is applied.
The ratio ψij of the Euclidean distance between the displaced control points pi+li and pj+lj and the original distance between their initial positions po,i and po,j is defined as:
The term Ψij is defined so as to be symmetric with respect to the displacement of the points and to penalize with the same cost when displaced control points are closer or more distant, that is to say it is not dependent on the direction of displacement:
Ψij=(1−ψij)2+(1−ψij−1)2 (28)
For a given clique d=(i,j,k), the following second part term of the regularization term is defined:
Finally, the regularization term is preferably defined as:
R
ijk(li,lj,lk)=(1−γ)RijkA(li,lj,lk)+γRijkB(li,lj,lk) (30)
where γ is a weighting factor, comprised between 0 and 1, used to calibrate the regularization term.
The objective function defined in formula (24) proposed in this fourth embodiment is to be optimized, e.g. minimized.
According to a first variant, a mapping to a factor graph is used for optimization.
A factor graph, as known in information theory, is a bipartite graph that factorizes a given energy target function, expressing which variables are arguments of which local functions.
In this implementation, a bipartite graph GH′=V′,F′,E′ is defined, where V′ is the set of variable vertices formed by the vertices of GH, F′ is the set of all the factors fεF, where every factor f is associated to one clique from the high order graph GH, and the set E′⊂V′×F′ defines the relation between the vertices and the factors.
The previous three-dimensional label space is used for labeling graph vertices.
Every factor f has an associated function φf:V′3→ that can correspond to a data or regularization term from equations (25), (26), (29) respectively.
The energy target function of the discrete labeling problem in the context of factor graphs is given by:
Where x corresponds to a given labeling of the graph G′H and vi,vj,vk are the variables in the neighborhood of factor f.
The optimal labeling {circumflex over (ε)} is the labeling with associated minimal energy:
Different algorithms can be used to optimize this model.
In this embodiment, dual decomposition based methods may be applied. Dual decomposition consists in decomposition the graph into smaller trees that are sub-problems simpler to optimize, and combining the solutions found for the sub-problems to find a final labeling of the graph.
Each of the sub-problems is called a slave. The task of combining the solutions is performed by a master, handling conflicting suggestions from slaves that assign different labels to shared vertices. The master applies n iterative processes based on projected subgradient method, which encourages slaves to agree on a common label for shared vertices. The slaves are selected so as to cover at least once every vertex and edge of the initial graph.
MRF optimization based on dual decomposition is known in the art, and has in particular been described in the article “MRF Energy minimization and beyond via dual decomposition”, by N. Komodakis et al, published in IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 531-552, 2011.
In this alternative embodiment, two types of slaves are proposed, depending whether they are associated with cliques in ED (data term) or cliques in EH (regularization term).
In an exemplary embodiment illustrated in
As in the previous embodiments, advantageously, in a practical implementation, in order to further improve the computational time, a pyramidal approach is applied. In this approach, several grid resolution levels are used, from a coarse to a fine spacing between control points.
Preferably, the maximum displacement of a control point is restricted to a fraction of the grid spacing, for example to 0.4 times the grid spacing. Advantageously, this constraint guarantees a diffeomorphic deformation field.
In a general view, the registration method of the invention, as exemplified by the embodiments described above comprises:
It is to be understood that embodiments of the present invention can be implemented in various forms of programmable hardware, software, firmware, special purpose processes, or a combination thereof.
In one embodiment, the present invention can be implemented in software as an application program tangible embodied on a computer readable program storage device. The application program can be uploaded to, and executed by, a machine comprising any suitable architecture.
In another embodiment, the present invention can be implemented in a hardware associated with a processing unit (scanner, ultrasound acquisition device, graphics processing units, FPGAs, etc.)
Referring now to
The support circuits can include circuits such as cache, power supplies, clock circuits, and a communication bus. The memory 104 can include random access memory (RAM), read only memory (ROM), remote access hardware drive (conventional hard drive or cloud storage) accessible through direct connection, wired wireless ethernet or a combination thereof. The present invention can be implemented as a routine 114 that is stored in memory 104 and executed by the CPU 102 or by GPU 108 to process the signal from the source image 116. As such, the computer system 100 is a general purpose computer system that becomes a specific purpose processing system when executing the routine 114 of the present invention.
The processing system 100 also includes an operating system and micro instruction code. The various processes and functions described herein can either be part of the micro instruction code or part of the application program (or combination thereof) which is executed via the operating system. In addition, various other peripheral devices can be connected to the computer platform such as an additional data storage device and a printing device.
In a variant, a routine 114 that is stored in memory 104 and executed by the CPU 102 or by GPU 108 to process the signal from the source image 116 is part of a medical processing programmed device, for example an echography device.
It is to be further understood that, because some of the constituent system components and method steps depicted in the accompanying figures can be implemented in software, the actual connections between the systems components (or the process steps) may differ depending upon the manner in which the present invention is programmed.
Given the teachings of the present invention provided herein, one of ordinary skill in the related art will be able to contemplate these and similar implementations or configurations of the present invention.
While the present invention has been described in detail with reference to a preferred embodiment, those skilled in the art will appreciate that various modifications and substitutions can be made thereto without departing from the spirit and scope of the invention as set forth in the appended claims.
Number | Date | Country | |
---|---|---|---|
61749503 | Jan 2013 | US |