This is a U.S. non-provisional application of U.S. provisional patent application Ser. No. 60/795,660, filed Apr. 27, 2006, by Rousson et al., the entirety of which application is incorporated herein by reference.
The present invention relates to a method for automatic segmentation of digital image data.
Automatic image segmentation is a well-recognized problem medical imaging applications that is being addressed in many different ways. Atlas-based segmentation is one known solution which treats segmentation as a registration problem by elastically matching a pre-segmented atlas to the target image. Examples of such atlas-based segmentations are described by D. L. Collins, C. J. Holes, T. M. Peters, and A. C. Evans, Automatic 3-D model-based Neuroanatomical Segmentation, H
Another known segmentation method is active shape and appearance models described by T. F. Cootes, D. Cooper, C. J. Taylor, and J. Graham, Active Shape Models—Their Training and Application, C
Thus, there is a continuing need for an improved method of automatically segmenting images in medical imaging applications.
According to an embodiment, described is a general framework to improve the performance of existing algorithms for segmenting structures in 3-D digital image data such as commonly encountered in medical imaging. This improvement incorporates inter-structure spatial dependencies among structures to enhance segmentation algorithms. Structures are ranked according to their dependencies, segmentation of several structures are carried out according to the resulting hierarchy that improves each individual segmentation and provides automatic initializations. The ordering of the structures according to their dependencies can be predetermined off-line and stored in the image processing system and utilized by the image processing system during segmentation.
According to another embodiment, disclosed is a program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform the method steps for segmenting structures in 3-D digital image data described above. The benefit of the segmentation framework and its methods presented herein is that it enhances the segmentation performance of existing automatic segmentation algorithms by combining the segmentation algorithms into a systematic framework to produce more accurate automatic segmentations of multiple structures in an image.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
All drawings are schematic illustrations and the structures rendered therein are not intended to be in scale. It should be understood that the invention is not limited to the precise arrangements and instrumentalities shown, but is limited only by the scope of the claims.
Different anatomical structures often have strong spatial dependency among each other. This spatial dependency is usually present in a hierarchical manner, i.e., the shape and pose variations of one structure is fully or partially bounded by those of other more stable structures. This type of spatial dependency is referred to herein as “ordered spatial dependency” because of its ordered nature. Radiologists routinely rely on ordered spatial dependency to help them locate and identify structures that have large variations in shape, pose, and appearance by searching its presence relative to other structures that are much easier to identify. The novel image segmentation framework disclosed herein improves segmentation algorithms by taking advantage of this inter-structure ordered spatial dependency in an explicit manner by proposing a novel general image segmentation framework.
The ordered spatial dependency of all possible target structures are determined from pre-segmented training images and stored in the medical imaging system. The pre-segmented training images are preferably created from manually-segmented images. This data bank of ordered spatial dependencies serve as a learned model of the target structures and applied to improve both the performance and robustness of the particular segmentation algorithm utilized.
A key benefit of the segmentation framework is that it can be integrated with any existing segmentation algorithms. Many powerful and effective segmentation algorithms such as seeded region growing (described, for example, in R. Adams and L. Bischof, Seeded Region Growing, IEEE T. PAMI, 16(16):641-647, 1994), watershed (described, for example, in L. Vincent and P. Soille, Watersheds in Digital Spaces: An Efficient Algorithm Based on Immersion Simulations, IEEE T. PAMI, 13(6):583-598, 1991), active contours (described, for example, in M. Kass, A. Witkin, and D. Terzopoulos, Snakes: Active Contour Models, I
Because structures often have a strong spatial dependency, the segmentation framework disclosed herein involves defining a new spatial prior for the structure of interest based on neighboring structures. This dependency is introduced by registering (or warping) the structure of interest to a common reference coordinate system based on neighboring structures. This can be achieved by computing the elastic matching of the neighboring structures from one image to a reference one, and then applying the same elastic matching to the structure of interest. This modeling can be implemented for each structure, based on the structures already segmented. This leads us to the definition of a hierarchical segmentation framework.
The novel segmentation framework has two important contributions: 1) the explicit modeling and utilization of ordered spatial dependency for segmentation; and 2) the estimation of the optimal segmentation sequence for segmenting multiple structures. Unlike the known atlas-based segmentation methods that treat segmentation as a registration problem by elastically matching a pre-segmented atlas to the target image, the segmentation framework according to an embodiment uses elastic matching to enforce the spatial dependency and restricts the plausible segmentation space rather than using it to obtain the final segmentation. In other words, the segmentation framework models the relative locations of the structures between one another. The actual segmentation of each structure is then performed using a pre-selected segmentation algorithm.
[Modeling Ordered Spatial Dependency]
The segmentation framework and its utility in segmenting multiple structures from MR brain images will be described. Let {S1, . . . , SN} be the set of N target structures (classes) to be segmented. Here, SN denotes N structures in one given subject (i.e. a patient). We assume a dataset of M annotated images to be available for each of the N structures: {sij; i=1, . . . , N, j=1, . . . , M}. Given a manual segmentation s (e.g. delineation by a doctor) of the target structure Sε{S1, . . . , SN}, according to the segmentation framework, a smooth approximation of the conditional probability of an image location x (i.e. a voxel x) being inside the structure s is determined to be:
ps(x|s)∝exp(Hε(φ(x))−1
where φ is the distance transform of s and Hε is a regularized Heaviside function with ε controlling the level of smoothness. This conditional probability of the voxel x is determined for each manual segmentation s. This distribution gives high probability to voxels inside s and low probability to the ones outside, and the smoothness of the transition is related to the distance to the interface. The conditional probability of a voxel x belonging to the background of s as p
p
Now, to combine all the annotated instances of a target structure Si to define the spatial prior probability of the target structure Si, the manual segmentations, the annotated images, are placed in a common reference. At this point, the ordered spatial dependency is considered, i.e., Si's dependency on known neighboring structures. The principle is to align all the manual segmentations sij of the target structure Si to a common coordinate system using the known neighboring structures as reference anchor structures. This is done by estimating a warping between each instance of the reference anchor structure(s) to a common reference. The warping is a non-rigid transformation and normalizes the shape(s) of the reference anchor structure(s) to a common reference. For this estimation, each structure instance is represented by a level set function that serves as input of an image-based registration. Such estimation method is described by C. Chefd'hotel in Geometric Methods in Computer Vision and Image Processing: Contributions and Applications, PhD thesis, Ecole Normale Suprieure de Cachan, April 2005. When several anchor structures are available, they are merged as a single multi-component one. This allows to constrain even more the deformation field in between the structures and further enhances the accuracy of the resulting segmentation.
Next, these warpings are applied to the corresponding structures sij. Let {tilde over (s)}ij be the output representing the segmentation transformed by the warping ψij and {tilde over (φ)}ij be its level set representation, then, the spatial prior probability of the target structure Si and its background
A priori, it is not necessarily known which of the known structures can serve as the anchor structures because it is not known which of the known structures the target structure Si is spatially dependent on. These spatial dependencies are determined by defining the spatial probability according to a subset of other structures {Sk, k≠i}. Let Vi represent all possible subsets of {Sk, k≠i}, viεVi, the segmented structures, and vij as the corresponding annotated structures in the training image j. With these notations, all structures sij are registered to a reference image by estimating the warpings that align vij to a reference set vir. Therefore, for each choice of subset vi, we end up with different registrations and hence, a different spatial prior probability for the target structure Si. Since this probability is subject to the selected subset vi, the notations psi(x|vi) and p
[Integrating Spatial Prior in the Segmentation Process]
Next, this spatial prior is incorporated in a general segmentation framework and the complete framework obtained when a level set based approach is considered will be described in detail. Given an image I and a set of segmented structures v, another structure s of the class S is to be segmented in the image I. The class S is any one of the classes Si considered previously. The statistical formulation of this segmentation problem using a maximum a posteriori estimation involves maximizing the posterior conditional distribution p(s|I,v), or the probability that s is in the image I and in the set of segmented structures v. Making the assumption that I and v are not correlated, the optimal structure is the one maximizing
p(s|I,v)=p(s|I)p(s|v).
The first term can be expressed with any statistically defined segmentation algorithm whereas the second term allows integration of the spatial prior learned above. To incorporate this prior knowledge, an assumption is made that the prior probabilities of the locations x are independent and identically distributed. This allows incorporation of the spatial prior probability term introduced above:
where sin and sout are respectively the domain of image inside and outside the structure s. At this point, the formulation is very general and it does not dependent on any particular segmentation algorithm. Hence, any appropriate segmentation algorithms such as graph-cuts (see Y. Boykov, O. Veksler, and R. Zabih, Fast Approximate Energy Minimization via Graph Cuts, IEEE T. PAMI, 23(11):1222-1239, 2001; see also, H. Lombaert, Y. Sun, L. Grady and C. Xu, A Multilevel Banded Graph Cuts Method for Fast Image Segmentation, In Proc. ICCV, pages 259-265, 2005), and level set based surface evolutions (see M. Kass, A. Witkin, and D. Terzopoulos, Snakes: Active Contour Models, I
In the following, an embodiment of the segmentation framework system is developed using the level set based surface evolutions but as mentioned before, the segmentation framework of the invention can be used in conjunction with any available segmentation algorithms. In the level set algorithm, the target structure of interest is represented as the zero crossing of an embedding function
φ:Ω→R:s={xεΩ|φ(x)=0}. Hence, the problem of finding the surface s becomes the one of finding a real function φ that maximizes: p(s|I,v)→p(φ|I,v). Equivalently, the optimal solution can be obtained from the minimization of the energy:
E(φ)=−log p(φ|I,v)=−log p(φ|I)−log p(φ|v)
In this example, the surface evolutions segmentation algorithm of T. F. Chan and L. A. Vese (see Active Contours Without Edges, IEEE T. IP, 10(2):266-277, 2001)) is used to define the first term with a region-based criteria and a regularity constraint. To use the spatial prior (the ordered spatial dependency), the anchor structures from the current image are first registered to the reference anchor structures used for modeling the ordered spatial dependency. Let ψ be the obtained warping, the whole energy is then expressed as follows:
E(φ)=−∫Ω(Hφlog pin(I(x))+(1−Hφ)log pout(I(x))+v|∇Hφ|)dx−λ∫Ω(H100 log ps(ψ(x)|v)+(1−Hφ)log p
where pin and pout are the intensity distributions inside and outside the structures. They can be estimated on-line (i.e. contemporaneously) or a priori from the learning set. We minimize this energy using a gradient descent which is obtained by computing the corresponding Euler-Lagrange equation drives to the following curve evolution:
where {tilde over (φ)}cj stands for the warpings estimated during the modeling phase for the current shape. The segmentation of s is obtained by evolving φ according to this equation until convergence (the initialization is discussed in the next paragraph). For an efficient implementation, the image
can be estimated off-line, and then it can be warped to the current image domain using ψ.
[Estimation of the Optimal Segmentation Sequence]
To learn the subset vi that would optimally segment Si, the segmentation is applied on a second set of annotated images. For each vi, one can measure the quality of the segmentation according to a chosen similarity measure Z between the automatic and “true” segmentation. Assuming that, if Sj depends on Sk, Sk cannot depend on Sj, the objective is to estimate the optimal sequence for segmenting the structures such that structures classified higher can be used to segment lower-classified ones. Once all the segmentations are obtained for a given sequence, the overall quality of the segmentation of the target structures in each of the sequences is measured by comparing the results with the manual segmentation according to a similarity measure Z. Dice coefficient can be used for the similarity measure Z which is defined as twice the Jaccard's Coefficent. This process is repeated for each combination of possible sequence. Then by evaluating/comparing the similarity measures for each sequence, the sequence with the best similarity measure is determined as the optimal sequence is determined. The best similarity measure means that the segmentation is closest to the manual segmentation in the annotated training images. The optimal sequence is given by:
where O is set of all permissible sequences, and ŝij(O) is the segmentation obtained automatically in the image j for the structure Si using the sequence O. In general, if the number of structures to segment is relatively small (<10) all combinations can be tested fairly quickly. If the first structure is fixed, the number of combinations for N structures is equal to (N−1). Even though, this number gets high for N=10, this can be an off-line process and the user can introduce heuristics to reduce the number of possible sequences.
[Hierarchical Segmentation]
Once the optimal sequence is determined, the selected segmentation algorithm is processed for each structure successively by starting with the first structure in the ordering to the last structure. To initiate this process, the first structure necessarily must be segmented without a spatial prior. In most medical images, this can be done easily by fixing the first structure to a generally stable structure, such as, the envelope of the body. For example, when looking to segment structures in the subjects head, the skull will serve well as the first structure. Thus, the segmentation algorithm is run for each structure in the ordering starting with the fixed first structure. Then, to segment each structure, the associated level set also need to be initialized. One solution is to place seeds inside each structure. This would provide a good initialization but requires a user interaction. However, the spatial prior can be used to automate these initializations by selecting the voxels with a prior probability superior to a threshold τ. More precisely, the initial level set φi0 used to extract the structure Si is set as follows:
Once initialized with the spatial prior, the level set is projected to a signed distance functions. This is repeated after each iteration of the level set evolution. Using this technique, the segmentation of all N structures can be obtained automatically. Only the weights ε, v, λ and τ must be set before starting the process.
Referring to
The first step is determining the optimal segmentation sequence. In this example, looking at the brain image shown in
Column (a)—sequence (3,1,0,2)
Column (b)—sequence (3,0,1,2)
Column (c)—sequence (3,0,2,1)
Column (d)—sequence (3,1,2,0)
Column (e)—sequence (3,2,0,1)
Column (f)—sequence (3,2,1,0)
where, 0=lateral ventricle, 1=caudate nucleus, 2=thalamus, and 3=skull.
Once the optimal spatial ordering is determined, the approach was validated using a leave-one-out strategy on the thirteen (13) available training images. A few results are presented in
Therefore, the novel image segmentation framework presented herein is one that learns the ordered spatial dependency among structures to be segmented and applies it in a hierarchical manner to both provide automatic initializations and improve a particular segmentation algorithm's performance. The efficacy of the segmentation framework is demonstrated by applying it to the MR brain image segmentation with level set based surface evolutions algorithm as its segmentation algorithm. The benefit of the segmentation framework presented herein is that it boosts the segmentation performance by combining existing segmentation algorithms into a systematic framework.
The invention described herein can be automated by, for example, tangibly embodying a program of instructions upon a storage media, readable by a machine/system capable of executing the instructions. A general purpose computer is an example of such a machine/system. The computational algorithms discussed herein necessary for manipulation and processing of the digitized image data and the program instructions to execute those algorithms as described herein to conduct automatic segmentation according to the invention can be embodied as a program of executable instructions upon a storage media. Such instructions can be read by the general purpose computer and loaded on to the computer's temporary memory devices (e.g. ROM) or its permanent memory devices (e.g. hard drives), for example, and then a processor on the computer can execute the instructions to perform segmentation on digitized image data. The digitized image data can be provided to the computer from the imaging devices/systems such as CAT systems, MR systems etc. via one of the data input ports provided on the computer as will be obvious to one of ordinary skill in the art. Examples of the storage media are well know in the art and would include such devices as, a readable or writable CD, flash memory chips (e.g. thumb drives), various magnetic storage media, etc.
The essential features of the invention having been disclosed, further variations will now become apparent to persons skilled in the art. All such variations are considered to be within the scope of the appended claims. Reference should be made to the appended claims, rather than the foregoing specification, as indicating the true scope of the subject invention.
Number | Name | Date | Kind |
---|---|---|---|
7095890 | Paragios et al. | Aug 2006 | B2 |
7177471 | Paraglos et al. | Feb 2007 | B2 |
7200269 | Paragios et al. | Apr 2007 | B2 |
7424153 | Paragios et al. | Sep 2008 | B2 |
Number | Date | Country | |
---|---|---|---|
20070253611 A1 | Nov 2007 | US |
Number | Date | Country | |
---|---|---|---|
60795660 | Apr 2006 | US |