1. Technical Field
The present invention relates to image segmentation, and more particularly to a system and method for sparse volume segmentation for 3D scans.
2. Discussion of Related Art
As the scanners resolution gets better and better, the slice thickness gets thinner and thinner, and the number of slices for a given organ increases. Therefore, it takes longer and longer to process the data. If this trend goes on, computers capacity (memory+CPU) will not be sufficient to process the whole data, no matter how powerful the segmentation method is.
Therefore; a need exists for sparse volume segmentation for 3D scans.
According to an embodiment of the present disclosure, a computer readable medium is provided embodying instructions executable by a processor to perform a method for sparse volume segmentation for 3D scan of a target. The method including learning prior knowledge, providing volume data comprising the target, selecting a plurality of key contours of the volume data, building a 3D spare model of the volume data given the plurality of key contours, segmenting the volume data using the 3D sparse model, and outputting a representation of the volume data based on the segmentation using the 3D sparse model.
According to an embodiment of the present disclosure, a computer readable medium is provided embodying instructions executable by a processor to perform a method for sparse volume segmentation for 3D scan of a target. The method includes providing a 3D sparse model of a volume data including the target comprising key indices, key contours at the key indices, an interpolation operator, and an interpolated volume, setting a globally register of the volume data to the 3D sparse model in a model space, segmenting the plurality of key contours at the key indices, inferring a segmentation of a portion of the volume data from the plurality of key contours, and outputting a representation of the volume including the plurality of key contours and the portion of the volume data inferred from the plurality of key contours.
According to an embodiment of the present disclosure, a computer system includes a processor, and a computer readable medium embodying instructions executable by the processor to perform a method for sparse volume segmentation for 3D scan of a target, the method comprising, learning prior knowledge, providing volume data comprising the target, selecting a plurality of 2D key contours of the volume data, building a 3D spare model of the volume data comprising the plurality of 2D key contours, segmenting volume data using the 3D sparse model, wherein a segmentation of a portion of the volume data is inferred from the 3D sparse model, and outputting to a memory device a representation of the volume based on the segmentation using the 3D sparse model.
Preferred embodiments of the present invention will be described below in more detail, with reference to the accompanying drawings:
Volume segmentation is a relatively slow process and, in certain circumstances, the enormous amount of prior knowledge available is underused. The technique presented in this disclosure allows the use of prior knowledge to build a 3D statistical model that is used to infer the whole volume from a set of critical key contours. These features are then segmented from a dataset, and the rest of the volume is interpolated using a linear regression on the statistical model. The resulting process is more efficient than standard segmentation since most of the workload is concentrated on the critical points, but also more robust, since the interpolated volume is consistent with the prior knowledge statistics. It is applicable to any 3D organ or volume, from any modality.
The present method aims at exploiting prior learning to focus the segmentation workload on critical contours, inferring the rest of the volume and then adjusting locally the segmentation. The overall, method is not only faster than global segmentation (for identical segmentation method), but more robust to local minima, as it uses prior knowledge to constrain the volume shape.
Referring to
The method includes finding the contours of an input image 101 that are most relevant to infer the whole volume, for example, see
Given N−1 training volumes and 1 testing volume, and given a set of key contour indices C1, the interpolation operator H is computed from the training set {Vk}k=1 . . . N-1, and tested on the remaining volume VN, so that
H=arg minH {Σk=1 . . . N-1∥Vk−H(Clk)∥2} (1)
and
Test measure=∥H(Clk)−Vn∥2. (2)
The test provides the quality measurement for the given set of indices. Using a discreet optimization algorithm (such as Genetic Algorithm), the set of indices I that minimizes the test measure in equation (2) is determined. One of ordinary skill in the art would recognize, in view of the present disclosure, that other methods may be used for determining key contour indices, for example, see equations (7) and (8).
Once the key indices have been determined, the 3D Sparse Model 102 is built to include the key indices I, the key contours at the key indices, CI, the interpolation operator H, and the interpolated volume Vinterp=H(CI).
Given the 3D sparse model, an organ may be segmented 103 and a segmentation of the volume may be output 104 to a display, memory device, etc. Referring to
Referring now to an exemplary implementation of segmenting a liver, according to an embodiment of the present disclosure, a statistical analysis of the data is combined with a reconstruction model from sparse information: only the most reliable information in the image is used, and the rest of the liver's shape is inferred from the model and the sparse observation. The resulting process is more efficient than standard segmentation since most of the workload is concentrated on the critical points, but also more robust, since the interpolated volume is consistent with the prior knowledge statistics. The experimental results on liver datasets prove the sparse information model has the same potential as PCA, if not better, to represent the shape of the liver. Furthermore, the performance assessment from measurement statistics on the liver's volume, distance between reconstructed surfaces and ground truth, and inter-observer variability demonstrates the liver is efficiently segmented using sparse information.
For the choice of sparse information 101, consider a shape x and its partition into m elements x=(x1, . . . , xm) (see
∀kε[1, m], xk=ρ(x,k) (3)
In the remaining of the disclosure, this continuous parameterization is assumed when not specified. The approach recovers a small, e.g., minimal, description length set of |B| sub-elements B={xt
∀kε[1,m], φ(xt
Considering now a training set of P exemplars X=[x1, x2, . . . , xP] registered in a reference space Ωr 100; toward optimal reconstruction of the training set from the basis B, the distance between the reconstruction and the existing samples is minimized. To this end, let a metric ψ:[Ωr×Ωr]→ measures the distance between two sub-elements. Then, assuming the number of components of the training set is fixed, such reconstruction minimizes
Such an approach is purely geometric and does not account for the image support of each sub-element.
Turing now to image support; Recall that the sub-elements of a given exemplar xp have some underlying image support noted w=(w1p, . . . , wmp). The optimum basis B include elements that are confidently extracted from the data; therefore, the basis minimizes
where g is a monotonically decreasing function, and Tθ−1(xt
The selection of key contours 101 is robust to parameter variability. Considering a slight variation on the selection of the basis, noted δxt, for the interpolation precision of the model not to be significantly affected,
that is reformulated in terms of a cost by defining a smoothness function η( ), like the error-two norm,
Evar(B,φ)=η(∇BEint(B,φ)) (8)
Such a penalty term introduces robustness in the basis selection step, as well to the reconstruction process. Now, one integrates these three constraints into a single cost function: E(B,φ)=Eint(B,φ)+αEsup(B)+βEvar(B,φ) where α and β are problem specific normalizing constants (results have shown little sensibility to small variations of α and β). The cost function E is minimized with respect to the interpolation function φ and the basis B. Such a process cannot be described in a general fashion, but a gradient descent is an excellent choice when considering linear interpolation models, while more advanced non-linear optimization methods like neural networks can be considered for non-linear cases. Last, but not least the residual cost that characterizes the sparse information model is used to determine the best number K of key components that optimizes the Minimum Description Length. In order to demonstrate the efficiency of such a model for volumetric organ segmentation, the particular case of liver segmentation in CT images is considered. The same approach is easily adapted to any other organ, in any dimension.
Referring to the sparse knowledge-based segmentation 103; knowledge-based segmentation is a dominant approach to organ extraction from 3D images. The sparse model is built by selecting a minimal set B of 2D contours (represented in an explicit or an implicit fashion) along with an interpolation function φ to reconstruct the whole 3D surface in the reference space Ωr. During the segmentation, the global transformation Tθ that relates the reconstructed model to the observation volume is to be determined, along with the set B of 2D contours that fits the observation.
Referring to the model construction 102; the experiment is conducted on segmentation for medical imaging for the case of liver in Computed Tomography (CT). The training set is represented by exemplars x by 3D distance maps to the closed surface Γ defined by the liver's edge C in the volumetric data:
Such a selection is motivated from its implicit nature, as well as the ability to introduce surface-based as well as area based criteria in the segmentation process. Classic explicit parameterizations like triangulated surfaces, or other form of parametric snakes can also be considered.
The acquisition process guides the choice for the definition of the sub-elements: since the image volume is reconstructed slice by slice, with maximum resolution in the slice plane, the axis of projection vi
In order to determine the best possible interpolation class, different models for φ have been tested. Generalized linear interpolation for each slice i has been determined to be a good compromise between complexity and interpolation quality. The solution (2D contour) at each slice xi is reconstructed using a particular linear combination Hi of the key contours xt
Eint is a quadratic function with global minimum, and since the reference space Ωr is a continuous space, the minimization of Eint benefits from the large literature on quadratic functions minimization.
The image support wi at slice i is defined by the Kullback-Leibler distance between the pixels intensity distributions inside and outside the 2D contour and the a priori learned histograms. Knowing a priori the normalized histogram hin (resp. hout) of the pixels intensity inside (resp. outside) the liver, and computing the pixels intensity distribution pin and pout inside and outside of the reconstructed shape on the key slices,
The key contours are chosen 101 so as to minimize the impact of variations in their position, and of errors in the contours extraction in the key slices. Since a continuous interpolation of the 2D contours is introduced in equation (3), the impact of an infinitesimal change dc in the slice index may be written as the squared magnitude of the gradient of xt
In order to determine the number K, the indices of the key contours t1, . . . , tk as well as the interpolation operator H, a gradient descent optimization method is used and combined with the Schwarz Bayesian criterion to determine the optimum cardinality of the basis. After registering the volumes with m=100 slices, the optimum number of key slices is determined, in this example 5 key slices are used. The selected key slices form the 3D sparse model 102.
For model-based segmentation 103, with sparse model 102 in hand, the volumetric segmentation includes the segmentation of the shape at key slices, where the whole 3D segmentation problem is reduced to a small set of parallel 2D contours to be segmented at specific locations. Therefore, one needs to optimize an image-based cost function with respect to both the set of key contours B=xt
To this end, the cost function includes the intensity-based likelihood of each pixel, assuming that normalized histograms inside (hin) and outside (hout) the liver are available (if not, one recovers them on-the-fly). Then, the posterior likelihood of the partition with respect to the two classes is maximized to obtain the key contours B and the transformation Tθ:
where H(xt
Experimental validation of methods described herein have been performed. Turing first to the dimensionality reduction using a sparse information model, before proving sparse information models are efficiently used to segment an organ in volumetric data, one needs to quantify the error introduced by the sparse models dimension reduction and compare it with common techniques such as PCA. The volumetric data is acquired on Sensation 16 CT scanners, with an average resolution of 1 mm in axial plane and 3 mm along the longitudinal axis. 31 volumes (different oncology patients, with or without pathologies such as tumors) are used in our experiments on a leave-one-out basis: 30 volumes are used to build the models (sparse and PCA) and the last one is used for testing.
Table (1) summarizes the error introduced by dimensionality reduction for PCA (30 modes), linear interpolation and Sparse Information Model with 5 slices. This error measure is defined as the symmetric difference between the two volumes V1 and V2:
The results demonstrate that the sparse information model with 5 key elements provides the same reconstruction quality than linear PCA with 30 modes of variation. However, the PCA results have a large variance because diseased organs are poorly represented by a Gaussian model in the linear. PCA space. Nevertheless, a larger study with different pathologies could demonstrate kernel PCA best represents the shapes.
A second portion of the experimentation demonstrates that sparse information models can efficiently be used for segmentation. For that purpose, it is assumed an expert (i.e. either a human expert, or an expert system such as the ones described in the literature) roughly initializes the rigid transformation and the key contours. When no user interaction is available, a preprocessing step, such as exhaustive search or coarse-to-fine search, is to be developed. In the case of PCA, the segmentation problem is solved by minimizing the cost function resulting from the intensity-based likelihood of each pixel in the volumetric image:
Equation (15) is minimized in the PCA's parametric space, where the shapes' distribution is modeled using kernels. The kernels are justified by the poor modeling of the samples distribution by a Gaussian. For the PCA segmentation, all the m slices of the volume are used, whereas the Sparse Information Model only segments the K slices determined during the model construction (see equation (13)).
Table (2) summarizes the symmetric difference (see equation (14)) between ground truth and the segmented liver obtained using the sparse information model and PCA. Neighboring structures of similar intensities juxtapose the liver in a way that PCA estimates as a shape variation. On the contrary, the Sparse Model ignores the regions with low support, and reconstructs the information in these regions based on other visual clues elsewhere in the image. For information, the inter-observer symmetric difference in table (2) indicates the symmetric difference between livers segmented by different experts using the same semi-automatic tool. The results seem to demonstrate sparse information models outperform active shape models. Nevertheless, it must be underlined that the training and evaluation datasets are different. Furthermore, the shape model is built from smoothed surface meshes, while the training shapes used here are represented by distance functions (see equation (9)) and are not smoothed. However, as one suspects, Sparse Information Models are sensitive to initialization. To quantify this, two different Sparse Segmentations were performed by segmenting by hand the key slices in the datasets, and comparing the reconstruction results with the ground truth. The difference in quality (symmetric difference with ground truth) between the different reconstructions ranges from 0.02% to 6.73%. Moreover, this variance is not correlated to the IOV (correlation coefficient of 0.47); otherwise stated, a volume with high inter-observer variability may be segmented by the SIM in a way that is robust to initialization, and reciprocal may be true. Indeed, the IOV depends on the whole organ's structure while the SIM's quality only depends on the key slices. Furthermore, the maximum quality difference of 6.73% is below the maximum IOV symmetric difference (7.83% in table (2)).
Herein, a family of dimension reduction techniques have been described based on intelligent selection of key sub-elements with respect to reconstruction quality, image support and variability of these key sub-elements. It is demonstrated that sparse information models can be used for dimensionality purposes, and can efficiently be integrated into a segmentation framework in the context of volumetric organ segmentation. This technique has been applied to the problem of liver segmentation in volumetric images with successful results compared to common dimensionality reduction techniques based on linear projections and kernel distributions. On top of interpolation and segmentation quality, this method is also very fast since only the most important and most reliable information is processed for the reconstruction of the whole information. However, a statistical shape model may not be sufficient to represent the exact shape of the liver; in a post-processing step, a local optimization—using active contours for instance—may be needed for better results. This local optimization would not be computed from sparse information. Further work will investigate the use of non-linear models for the interpolation function, as well as a subsequent refinement that will locally adjust the reconstruction from the model to the actual image information by taking into account the confidence in the reconstruction. More advanced prior models using axial coronal and sagittal sparse information would be an extension of these methods, as it would diminish the quality difference between two differently initialized segmentations. Further, methods described herein may be used for feature extraction, classification and content-based image indexing and retrieval.
It is to be understood that the present invention may be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof. In one embodiment, the present invention may be implemented in software as an application program tangibly embodied on a program storage device. The application program may be uploaded to, and executed by, a machine comprising any suitable architecture.
Referring to
The computer platform 501 also includes an operating system and micro instruction code. The various processes and functions described herein may either be part of the micro instruction code or part of the application program (or a combination thereof) which is executed via the operating system. In addition, various other peripheral devices may be connected to the computer platform such as an additional data storage device and a printing device.
It is to be further understood that, because some of the constituent system components and method steps depicted in the accompanying figures may be implemented in software, the actual connections between the system components (or the process steps) may differ depending upon the manner in which the present invention is programmed. Given the teachings of the present invention provided herein, one of ordinary skill in the related art will be able to contemplate these and similar implementations or configurations of the present invention.
Having described embodiments for a system and method for sparse volume segmentation for 3D scans, it is noted that modifications and variations can be made by persons skilled in the art in light of the above teachings. It is therefore to be understood that changes may be made in the particular embodiments of the invention disclosed which are within the scope and spirit of the invention as defined by the appended claims. Having thus described the invention with the details and particularity required by the patent laws, what is claimed and desired protected by Letters Patent is set forth in the appended claims.
This application claims priority to U.S. Provisional Application Ser. No. 60/812,373, filed on Jul. 9, 2006, which is herein incorporated by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
7450746 | Yang et al. | Nov 2008 | B2 |
7620226 | Unal et al. | Nov 2009 | B2 |
7646902 | Chan et al. | Jan 2010 | B2 |
7764817 | Georgescu et al. | Jul 2010 | B2 |
7787683 | Khamene et al. | Aug 2010 | B2 |
7873185 | Cremers | Jan 2011 | B2 |
7916919 | Zheng et al. | Mar 2011 | B2 |
20030020714 | Kaus et al. | Jan 2003 | A1 |
20040068187 | Krause et al. | Apr 2004 | A1 |
20050232485 | Brown et al. | Oct 2005 | A1 |
20060039600 | Solem et al. | Feb 2006 | A1 |
20060147114 | Kaus et al. | Jul 2006 | A1 |
20070014462 | Rousson et al. | Jan 2007 | A1 |
20070091085 | Wang et al. | Apr 2007 | A1 |
20070098221 | Florin et al. | May 2007 | A1 |
20070275647 | Eger | Nov 2007 | A1 |
20080002870 | Farag et al. | Jan 2008 | A1 |
20080025592 | Jerebko et al. | Jan 2008 | A1 |
20080123914 | De Bliek et al. | May 2008 | A1 |
20080161687 | Suri et al. | Jul 2008 | A1 |
20080180448 | Anguelov et al. | Jul 2008 | A1 |
20080292194 | Schmidt et al. | Nov 2008 | A1 |
20080294401 | Tsin et al. | Nov 2008 | A1 |
20090030657 | Berg et al. | Jan 2009 | A1 |
20090052756 | Saddi et al. | Feb 2009 | A1 |
20090136103 | Sonka et al. | May 2009 | A1 |
20090154785 | Lynch et al. | Jun 2009 | A1 |
20090161926 | Florin et al. | Jun 2009 | A1 |
20090190815 | Dam et al. | Jul 2009 | A1 |
20100074499 | Wels et al. | Mar 2010 | A1 |
20100134487 | Lai et al. | Jun 2010 | A1 |
20100176952 | Bajcsy et al. | Jul 2010 | A1 |
20100329529 | Feldman et al. | Dec 2010 | A1 |
20110038516 | Koehler et al. | Feb 2011 | A1 |
20110052028 | Shreiber | Mar 2011 | A1 |
Number | Date | Country | |
---|---|---|---|
20110123095 A1 | May 2011 | US |
Number | Date | Country | |
---|---|---|---|
60812373 | Jun 2006 | US |