AUTOMATED SEGMENTATION OF IMAGE STRUCTURES

Information

  • Patent Application
  • 20080031521
  • Publication Number
    20080031521
  • Date Filed
    February 28, 2007
  • Date Published
    February 07, 2008
Abstract
Methods and systems for segmenting images, wherein the image pixels are categorized into a plurality of subsets using one or more indexes, then a log-likelihood function of one or more of the indexes is determined, and one or more maps are generated based on the determination of the log-likelihood function of one or more of the indexes.
Description

DRAWINGS

These and other features, aspects, and advantages of the present invention will become better understood when the following detailed description is read with reference to the accompanying drawings in which like characters represent like parts throughout the drawings, wherein:



FIG. 1 illustrates eigenvalues and intensity when used in a spherical coordinate system;



FIG. 2a is an image of a retina used to illustrate one of the examples.

FIG. 2b illustrates the segmented foreground pixels based on the shape index and normalized-curvature index for the image shown in FIG. 2a.

FIG. 2c illustrates the segmented foreground pixels based on the shape index and intensity for the image shown in FIG. 2a.

FIG. 2d illustrates the segmented foreground pixels based on the intensity and normalized-curvature index for the image shown in FIG. 2a.

FIG. 2e illustrates an estimated probability map for the image shown in FIG. 2a.

FIG. 2f illustrates probability values greater than 0.5, indicating the pixels more likely to be vessels than background, for the image shown in FIG. 2a.



FIGS. 3a-3f illustrate the estimated class conditional distribution and log-likelihood functions of the retina image shown in FIG. 2a: a) illustrates the distribution functions of the intensity, c) illustrates the normalized-curvature index, e) illustrates the shape index. For FIGS. 3a, 3c and 3e, the distributions of foreground, background and all pixels are plotted with dotted, dashed, and solid lines, respectively. FIG. 3b illustrates the estimated log-likelihood functions based on the intensity, FIG. 3d illustrates the normalized-curvature index, and FIG. 3f illustrates the shape index.



FIG. 4a is the image of the retina shown in FIG. 2a, shown again for comparison to FIGS. 4b-4d.

FIG. 4b illustrates segmented pixels that have an intensity value above a threshold, T, for the image shown in FIG. 4a.

FIG. 4c illustrates segmented pixels when the threshold, T, is decreased by 5% for the image shown in FIG. 4a.

FIG. 4d illustrates segmented pixels when the threshold, T, is increased by 5% for the image shown in FIG. 4a.



FIG. 5a illustrates an image of a membrane marker and estimated foreground subsets (white color) and background subsets (black color) based on two of the features used in this example.

FIG. 5b illustrates the segmented foreground pixels based on the shape index and normalized-curvature index for the image shown in FIG. 5a.

FIG. 5c illustrates the segmented foreground pixels based on the shape index and intensity for the image shown in FIG. 5a.

FIG. 5d illustrates the segmented foreground pixels based on the intensity and normalized-curvature index for the image shown in FIG. 5a. The gray color shows the indeterminate pixels that are not included in either foreground or background subsets.

FIG. 5e illustrates the estimated probability map for the image shown in FIG. 5a.



FIG. 5f illustrates the probability values greater than 0.5, indicating the pixels more likely to be membrane than background, for the image shown in FIG. 5a.



FIGS. 6a-6f illustrate the estimated class conditional distribution and log-likelihood functions of the membrane image shown in FIG. 5a: a) illustrates the distribution functions of the intensity, c) illustrates the normalized-curvature index, e) illustrates the shape index. For FIGS. 6a, 6c and 6e, the distributions of foreground, background and all pixels are plotted with dotted, dashed, and solid lines, respectively. FIG. 6b illustrates the estimated log-likelihood functions based on the intensity, FIG. 6d illustrates the normalized-curvature index, and FIG. 6f illustrates the shape index.



FIG. 7a illustrates an image of a nuclei marker and estimated foreground subsets (white color) and background subsets (black color) based on two of the features used in this example.

FIG. 7b illustrates the segmented foreground pixels based on the shape index and normalized-curvature index for the image shown in FIG. 7a.

FIG. 7c illustrates the segmented foreground pixels based on the shape index and intensity for the image shown in FIG. 7a.

FIG. 7d illustrates the segmented foreground pixels based on the intensity and normalized-curvature index for the image shown in FIG. 7a. The gray color shows the indeterminate pixels that are not included in either foreground or background subsets.

FIG. 7e illustrates the estimated probability map from the empirical log-likelihood function for the image shown in FIG. 7a.

FIG. 7f illustrates the probability map from the parametric log-likelihood function for the image shown in FIG. 7a.



FIGS. 8a-8f illustrate the estimated class conditional distribution and log-likelihood functions of the nuclei image shown in FIG. 7a: a) illustrates the distribution functions of the intensity, c) illustrates the normalized-curvature index, e) illustrates the shape index. For FIGS. 8a, 8c and 8e, the distributions of foreground, background and all pixels are plotted with dotted, dashed, and solid lines, respectively. FIG. 8b illustrates the estimated log-likelihood functions based on the intensity, FIG. 8d illustrates the normalized-curvature index, and FIG. 8f illustrates the empirical and the model-based log-likelihood functions of the shape index, which are represented with solid and dashed lines, respectively.



FIG. 9a illustrates an example of raw image intensities for membrane, nuclei and c-Met markers.

FIG. 9b illustrates the detected compartments for the membrane, epithelial nuclei, stromal nuclei and cytoplasm for the image shown in FIG. 9a.

FIG. 10a illustrates an example of raw image intensities for a retinal image.

FIG. 10b illustrates the detected vasculature network for the image shown in FIG. 10a.



FIG. 11 is an embodiment of the system.





DETAILED DESCRIPTION

The quantitation of biomarkers can be accomplished without making a definite decision for each pixel, by instead computing the likelihood that a pixel belongs to a region. For example, instead of identifying membrane pixels, the likelihood of a pixel being a membrane can be computed, which is essentially the probability of a pixel being a membrane. Such probability maps can be computed using the intensity and geometry information provided by each channel. A likelihood function estimator that calculates the probability maps of membrane and nuclei structures in images is presented. Starting from known initial geometric constraints, the algorithm iteratively estimates empirical likelihood functions of curvature-based and intensity-based features. The distribution functions are learned from the data. This differs from existing parametric approaches because it can handle arbitrary mixtures of blob-like and ridge-like structures. In applications such as tissue imaging, a nuclei image of an epithelial tissue comprises both ridge-like and blob-like structures. A network of membrane structures in tissue images is another example, where the intersection of ridges can form structures that are partially blobs. Accurate segmentation of membrane and nuclei structures forms the basis for higher-level scoring and statistical analysis applications. For example, the distribution of a target protein on each of the segmented compartments can be quantified to reveal protein-specific pathways. The pathway can then be related to clinical outcomes.


Retina images are used to illustrate this example embodiment, and are used only to illustrate one or more of the steps of the methods and systems described. Although the steps of the methods are illustrated in this example in connection with the elongated vascular structures of the retina, the steps are equally applicable to other tissues and biological structures.


Eigenvalues of the Hessian matrix are used in this example embodiment to detect ridge-like and blob-like structures. Although such eigenvalues are used in this example because of their invariance to rigid transformations, other known feature detection algorithms may be used. The Hessian of an image I(x, y) is defined as










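$$H(I(x,y)) = \begin{bmatrix} \dfrac{\partial^2 I(x,y)}{\partial x^2} & \dfrac{\partial^2 I(x,y)}{\partial x\,\partial y} \\[2ex] \dfrac{\partial^2 I(x,y)}{\partial y\,\partial x} & \dfrac{\partial^2 I(x,y)}{\partial y^2} \end{bmatrix}. \tag{1}$$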







The eigenvalues (λ1(x, y)≦λ2(x, y)) of the Hessian matrix can either be numerically calculated or analytically written in terms of the elements of the Hessian matrix:










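$$\lambda_{1,2}(x,y) = \frac{1}{2}\left\{ \frac{\partial^2 I(x,y)}{\partial x^2} + \frac{\partial^2 I(x,y)}{\partial y^2} \mp \sqrt{\left( \frac{\partial^2 I(x,y)}{\partial x^2} - \frac{\partial^2 I(x,y)}{\partial y^2} \right)^2 + 4\left( \frac{\partial^2 I(x,y)}{\partial x\,\partial y} \right)^2} \right\}. \tag{2}$$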
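Equations 1 and 2 can be illustrated with a minimal NumPy sketch. It assumes plain finite differences (`np.gradient`) for the second derivatives; the text does not say how the derivatives are computed, and in practice some smoothing is usually applied first. The function name `hessian_eigenvalues` is hypothetical.

```python
import numpy as np

def hessian_eigenvalues(image):
    """Return (lam1, lam2) with lam1 <= lam2 at every pixel (Equation 2)."""
    img = np.asarray(image, dtype=float)
    # np.gradient returns the derivative along axis 0 (y, rows) first,
    # then along axis 1 (x, columns).
    iy, ix = np.gradient(img)
    iyy, iyx = np.gradient(iy)
    ixy, ixx = np.gradient(ix)
    # Assumption: average the two mixed derivatives so the Hessian is symmetric.
    ixy = 0.5 * (ixy + iyx)
    trace = ixx + iyy
    root = np.sqrt((ixx - iyy) ** 2 + 4.0 * ixy ** 2)
    return 0.5 * (trace - root), 0.5 * (trace + root)
```

For a bright blob both eigenvalues are negative at the center, while for a straight bright ridge one eigenvalue is negative and the other is zero, which is the geometric cue the later steps rely on.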







The eigenvalues encode the curvature information of the image, and provide useful cues for detecting ridge type membrane structures, or blob type nuclei structures. However the eigenvalues depend on image brightness. Below are two examples of curvature based features that are independent of image brightness;











$$\theta(x,y) = \tan^{-1}\left( \frac{\lambda_1(x,y)}{\lambda_2(x,y)} \right), \tag{3}$$

$$\phi(x,y) = \tan^{-1}\left( \frac{\left( \lambda_1(x,y)^2 + \lambda_2(x,y)^2 \right)^{1/2}}{I(x,y)} \right), \tag{4}$$







which are referred to as the shape index and the normalized-curvature index, respectively. This is essentially the same as expressing the eigenvalues in a polar coordinate system (see FIG. 1). This transformation also results in bounded features, −3π/4 ≦ θ(x, y) ≦ π/4 and 0 ≦ φ(x, y) ≦ π/2.
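The two features can be sketched as follows. This is only an illustration: it assumes the arctangents of Equations 3 and 4 are evaluated with the two-argument `np.arctan2`, which is one way to obtain the stated bounds when λ1 ≦ λ2 and the intensity is non-negative. The function name `curvature_features` is hypothetical.

```python
import numpy as np

def curvature_features(lam1, lam2, intensity):
    """Shape index theta (Eq. 3) and normalized-curvature index phi (Eq. 4)."""
    lam1 = np.asarray(lam1, dtype=float)
    lam2 = np.asarray(lam2, dtype=float)
    intensity = np.asarray(intensity, dtype=float)
    # Assumption: two-argument arctangent. With lam1 <= lam2 the angle
    # lies in [-3*pi/4, pi/4].
    theta = np.arctan2(lam1, lam2)
    # Curvature magnitude relative to intensity; in [0, pi/2] for intensity >= 0.
    phi = np.arctan2(np.hypot(lam1, lam2), intensity)
    return theta, phi
```

Because both features are ratios, multiplying the image by a positive constant scales the eigenvalues and the intensity by the same factor and leaves θ and φ unchanged.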


The image intensity I(x, y) is a significant information source. However, due to brightness variations across different images and within the same image, it is difficult to determine the right intensity thresholds and the parameters needed to adjust for these variations. An intensity histogram of a retina image (FIG. 2a) is plotted in FIG. 3a (solid line). Due to large variations of the intensity, the histogram is far from a clear bimodal distribution. A simple thresholding test reveals such intensity variations. FIG. 4b shows segmented pixels that have an intensity value above a certain threshold. FIGS. 4c and 4d show the dramatic change in the segmentation results when this threshold value is decreased or increased by 5%.


Using known geometric cues, an initial segmentation based on the shape index and the normalized-curvature index separates the image pixels into three subsets: background, foreground, and indeterminate. The indeterminate subset comprises all the pixels that are not included in the background or foreground subsets. From these subsets, the background and foreground intensity distributions, as well as the intensity log-likelihood functions, are estimated. The example algorithm used in this embodiment continues iterating by using two out of the three features at a time to estimate the distribution of the feature that is left out. Three iterations are usually sufficient for convergence. As described below, these log-likelihood functions are combined in this embodiment to determine the overall likelihood function. A probability map that represents the probability of a pixel being foreground may then be calculated.


The log-likelihood functions are estimated based on the assumption that the intensity and the feature vectors defined in Equations 3 and 4 are independent. Notice that these equations are normalized such that they measure a ratio rather than absolute values. The arctangent operation in these equations maps these measures onto a bounded space. If the overall image brightness is increased or decreased, these metrics stay unchanged. Starting with initial log-likelihoods determined based on the known geometry of the ridge-like or blob-like structures, the algorithm uses two out of these three feature sets to estimate the class membership of each pixel (foreground, background, or indeterminate), and uses the pixel classes to estimate the class conditional probability and the log-likelihood of the third feature. This procedure is repeated either for a fixed number of iterations or until the log-likelihood functions converge.


The following table illustrates example embodiments of algorithms that may be used in the methods and systems. In Step-A, the class memberships are determined based on two of the three features. Note that the union of the foreground pixels, SF, and the background pixels, SB, is a subset of all the pixels. In other words, subsamples are taken from the dataset in which there is a higher confidence that class membership may be determined. In this embodiment, only these points are then used to estimate the log-likelihood function of the other feature. In Step-B, the decision boundary is estimated along the direction of the feature that is not used in Step-A. Although not necessary for the estimation of the log-likelihood functions, the decision boundaries can be used for enforcing monotonicity constraints for some of the log-likelihood functions. Step-C estimates the log-likelihood functions as a function of the class conditional functions. For the intensity and the normalized-curvature index, the monotonicity constraints are enforced. In this embodiment, this implies that, for example for the intensity feature, the brighter a pixel is, the more likely it is to be in the foreground.














Define f1(x, y) = I(x, y), f2(x, y) = φ(x, y), f3(x, y) = θ(x, y)
Compute initial log-likelihood functions L(f2(x, y)) and L(f3(x, y))
do
    for k = 1:3
        A. Estimate the foreground and background sets using two sets of features
            SF = {(x, y): L(fi(x, y)) ≧ εi, L(fj(x, y)) ≧ εj}
            SB = {(x, y): L(fi(x, y)) ≦ −εi, L(fj(x, y)) ≦ −εj}
            where (i, j) ∈ {1, 2, 3}, i ≠ j ≠ k
        B. Estimate the decision boundaries T̂k
        C. Estimate the log-likelihood function

            $$L(f_k(x,y)) = \log\left( \frac{P\big((x,y)\in S_F \mid f_k(x,y)\big)}{P\big((x,y)\in S_B \mid f_k(x,y)\big)} \right) \approx \log\left( \frac{P\big(f_k(x,y) \mid (x,y)\in S_F\big)}{P\big(f_k(x,y) \mid (x,y)\in S_B\big)} \right)$$

            Enforce monotonic increasing constraint for the intensity and the normalized-curvature index
    end for
until stopping criteria met
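Steps A and C of the table can be sketched as below. This is a minimal illustration, not the patented implementation: it assumes the class conditional distributions are estimated with fixed-width histograms, and it adds a small `floor` constant (an assumption, not in the text) so that empty bins do not produce an undefined logarithm. The function names are hypothetical.

```python
import numpy as np

def select_sets(log_lik_i, log_lik_j, eps_i, eps_j):
    """Step A: confident foreground and background masks from two features."""
    foreground = (log_lik_i >= eps_i) & (log_lik_j >= eps_j)
    background = (log_lik_i <= -eps_i) & (log_lik_j <= -eps_j)
    return foreground, background

def empirical_log_likelihood(feature, fg_mask, bg_mask, bins=32,
                             value_range=None, floor=1e-6):
    """Step C: per-bin log ratio of the class conditional histograms."""
    feature = np.asarray(feature, dtype=float)
    if value_range is None:
        value_range = (feature.min(), feature.max())
    fg_hist, edges = np.histogram(feature[fg_mask], bins=bins, range=value_range)
    bg_hist, _ = np.histogram(feature[bg_mask], bins=bins, range=value_range)
    # Normalize the counts to empirical class conditional distributions.
    p_fg = fg_hist / max(fg_hist.sum(), 1)
    p_bg = bg_hist / max(bg_hist.sum(), 1)
    # Assumption: a small floor keeps the ratio finite for empty bins.
    return np.log((p_fg + floor) / (p_bg + floor)), edges
```

Pixels that fall in neither mask are the indeterminate subset and are simply left out of both histograms.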










The initial log-likelihood functions are defined in this embodiment as





L(f2(x,y))=2ε2(U(φ(x,y)−φM)−0.5).  (5)






L(f3(x,y))=ε3(U(θ(x,y)−θL)−U(θ(x,y)−θU)−U(θ(x,y))),  (6)


where U is the unit step function, and εi are the likelihood thresholds for each feature. Now using these initial log-likelihoods, the sets in Step-A would be equivalent to the following sets,





SF={(x,y):θL≦θ(x,y)≦θU,φ(x,y)>φM}  (7)





SB={(x,y):θ(x,y)≧0,φ(x,y)≦φM},  (8)


where θL=−3π/4, θU=−π/2 for blobs, and θL=−π/2−Δ1, θU=−π/2+Δ2 for ridges. These parameters can be easily derived for different geometric structures. For example, for bright blobs on a dark background, both eigenvalues are negative, hence the angle between them is less than −π/2. Since the angle is relative to the larger eigenvalue, it is bounded by −3π/4, which is the lower bound of θ(x, y). The ridge margins are at small angles, Δ1 and Δ2; for straight ridges they are equal. For the initial sets, subsamples are taken from θ≧0 to observe background pixels. Note that due to noise, the background pixels can have any curvature index. However, in this embodiment only a subset with positive polar curvature is sufficient to estimate the intensity distribution for the background pixels. An initial threshold for the normalized-curvature index, φM, is set to the median value of all the normalized-curvature index values.
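The initial sets of Equations 7 and 8 can be sketched directly as boolean masks. The function name `initial_sets` and the ridge margins used in the test values are illustrative assumptions; the threshold φM follows the text and is taken as the median of φ.

```python
import numpy as np

def initial_sets(theta, phi, theta_lo, theta_hi):
    """Initial foreground, background and indeterminate masks (Eqs. 7 and 8)."""
    theta = np.asarray(theta, dtype=float)
    phi = np.asarray(phi, dtype=float)
    phi_m = np.median(phi)  # initial normalized-curvature threshold
    foreground = (theta >= theta_lo) & (theta <= theta_hi) & (phi > phi_m)
    background = (theta >= 0) & (phi <= phi_m)
    indeterminate = ~(foreground | background)
    return foreground, background, indeterminate
```

For ridges, `theta_lo` and `theta_hi` would be set slightly below and above −π/2, with the margins chosen by the user.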



FIG. 2b shows the initial background (black), foreground (white), and indeterminate (gray) subsets computed using the shape index and the normalized-curvature index for the image shown in FIG. 2a. These initial subsets are typically not complete (they have many false negatives), but they typically have very few false positives. As such, they provide enough information to estimate the distribution of the feature (intensity) that is left out. From these subsets, the class conditional distribution functions and the log-likelihood functions of the intensity for the background and the foreground are estimated and shown in FIG. 3a (dashed plot and dotted plot, respectively). Given the estimated initial sets, SF and SB, the class conditional intensity distribution of the foreground, P(I(x, y) | (x, y) ∈ SF), and of the background, P(I(x, y) | (x, y) ∈ SB), are estimated. The dotted and dashed plots in FIGS. 3a, 6a, and 8a show the estimated intensity distributions of blob-like and ridge-like images from the initial subsets shown in FIGS. 2b, 5b, and 7b, respectively.


Next, given the initial log-likelihood function of the shape index, and the estimated log-likelihood function of the intensity, the background/foreground subsets may be recomputed, as shown in FIG. 2c. The class conditional distribution functions are estimated using these subsets (FIG. 3c), as well as the log-likelihood function (FIG. 3d) for the normalized-curvature index. The monotonicity constraint is imposed in this example for the log-likelihood function of the normalized-curvature index, implying that the foreground has a higher curvature for a given intensity value than the background. FIGS. 5c and 7c show the subsets derived from intensity and shape index for membrane and nuclei structures, shown in FIG. 5a and FIG. 7a, respectively. The class conditional density functions are shown in FIGS. 6c and 8c; and the log-likelihood functions are shown in FIGS. 6d and 8d.


In one iterative embodiment, the same procedure is repeated for the shape index. The estimated log-likelihood functions for the intensity and the normalized-curvature index are used to form the background/foreground subsets, FIG. 2d. Then, based on these subsets, the class conditional functions and log-likelihood functions are estimated as shown in FIGS. 3e and 3f, respectively. The significant peak at −π/2 in FIG. 3e for the vessel pixels is as expected, because in this example for vessels, one eigenvalue is zero and one eigenvalue is negative. The small peak at zero is due to the valley type structures in between two vessels that are close by in the bright regions of the image. FIGS. 5d and 7d show the subsets for the membrane and nuclei images. The estimated functions using these subsets are shown in FIGS. 6e-f and 8e-f. In the membrane class conditional functions (FIG. 6e), the foreground peak is at an angle slightly less than −π/2, and the foreground class conditional function is significantly higher than the background class conditional function for all values smaller than −π/2. Although initialized differently, the nuclei class conditional functions in this example converge to functions similar to the membrane distribution functions. This is due to the significant amount of ridge-like structures in the epithelial nuclei. Although the proportion of the mixtures between blobs and ridges is different, the algorithm used in this example learns the likelihood densities from the data. The monotonicity constraint is used to stabilize the convergence of the algorithm.


The monotonicity constraint is imposed by first estimating the decision boundaries. Optimal decision thresholds for the intensity and the normalized-curvature index are estimated by maximizing the a posteriori probabilities (MAP),












$$\hat{T}_k = \arg\max_{T}\; P\big(f_k(x,y) \ge T \mid (x,y)\in S_F\big) + P\big(f_k(x,y) < T \mid (x,y)\in S_B\big) \quad \text{for } k = 1, 2. \tag{9}$$







In this example, the goal is to minimize the overall error criterion when the a priori distributions for the background and the foreground are equal. Since estimates of the class conditional distributions are available in this example, the value of the decision threshold is determined by a one-dimensional exhaustive search rather than by a parametric approximation. While there is only one decision boundary along the intensity and normalized-curvature index dimensions, there can be multiple boundaries along the shape index feature. Therefore, a monotonicity constraint is not imposed on the log-likelihood function of the shape index in this example.
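The one-dimensional exhaustive search of Equation 9 can be sketched as follows. The function name `map_threshold` and the explicit list of candidate thresholds are assumptions made for illustration; the empirical probabilities are simply the fractions of foreground samples at or above the candidate and of background samples below it.

```python
import numpy as np

def map_threshold(fg_values, bg_values, candidates):
    """Exhaustive search for the threshold that maximizes Equation 9."""
    fg_values = np.asarray(fg_values, dtype=float)
    bg_values = np.asarray(bg_values, dtype=float)
    candidates = np.asarray(candidates, dtype=float)
    # Score = P(f >= T | foreground) + P(f < T | background), equal priors.
    scores = [np.mean(fg_values >= t) + np.mean(bg_values < t)
              for t in candidates]
    return float(candidates[int(np.argmax(scores))])
```

With tied scores, `np.argmax` returns the first maximizer, so the smallest tied candidate is chosen; a different tie-breaking rule could be used equally well.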


Although the log-likelihood functions are estimated in this example in Step-C, for small values of numerator and denominator, this expression can become undefined or unstable. Therefore, a modified empirical log-likelihood function is defined by imposing the non-decreasing constraint as follows,











$$L^{*}(f_k(x,y)) = \begin{cases} \sup\big( L(f_k(x,y)),\; L^{*}(f_k(x,y) - \Delta) \big) & f_k(x,y) > \hat{T}_k \\ L(f_k(x,y)) & f_k(x,y) = \hat{T}_k \\ \inf\big( L(f_k(x,y)),\; L^{*}(f_k(x,y) + \Delta) \big) & f_k(x,y) < \hat{T}_k \end{cases} \quad \text{for } k = 1, 2, \tag{10}$$







where Δ is the bin size of the histogram used to estimate the intensity distributions. Equation 10 is calculated recursively starting from T̂k estimated by Equation 9. This is used in this example to ensure that the estimated empirical log-likelihood function does not change the decision boundary when the log-likelihood function (L*(fk(x, y))=0) is used for decision. In the above example equation, the index, k, is defined for the first two features, not for all of them, therefore excluding the shape index. Example empirical non-decreasing log-likelihood functions are shown in FIGS. 3b, 3d, 6b, 6d, 8b, and 8d for vessel, membrane, and nuclei structures. The algorithm used in this example repeats Steps A-C for all features until a stopping criterion is met. The methods and systems are not limited to these criteria. Different stopping criteria can be defined, such as, but not limited to, the rate of change in the estimated decision boundaries. This example algorithm tends to converge in three iterations when used in connection with images of membranes and nuclei. Therefore, in this example three iterations were used.
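The recursion of Equation 10 amounts to a running maximum above the decision boundary and a running minimum below it. A minimal sketch on a binned log-likelihood follows; the function name `enforce_non_decreasing` and the use of a bin index for the boundary are assumptions.

```python
import numpy as np

def enforce_non_decreasing(log_lik, boundary_bin):
    """Apply Equation 10 to a binned log-likelihood, starting at boundary_bin."""
    out = np.array(log_lik, dtype=float)
    t = int(boundary_bin)
    # Above the boundary: each bin takes the max of itself and the bin below.
    out[t:] = np.maximum.accumulate(out[t:])
    # Below the boundary: each bin takes the min of itself and the bin above,
    # computed by sweeping downward from the boundary bin.
    out[:t + 1] = np.minimum.accumulate(out[:t + 1][::-1])[::-1]
    return out
```

The value at the boundary bin itself is left unchanged, so the decision boundary is preserved.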


The methods and systems described may be used to process and analyze many different kinds of images for any number and type of purposes depending on the analytical tools desired for a given application. The methods and systems are particularly useful for analyzing images that comprise blob-like and/or ridge-like structures, or other similar structures that can be differentiated from one another based at least in part on shape, geographical and/or topographical features. For example, such images may include, but are not limited to, images of biological structures and tissues. For example, the methods and systems are useful for differentiating structures and tissues comprising vascular features, neural features, cellular and subcellular features.


Building again upon the assumption that the features are independent, the joint log-likelihood function can be computed from the individual log-likelihood functions,













$$\begin{aligned} L(x,y) &= \log\left( \frac{P\big((x,y)\in S_F \mid I(x,y), \phi(x,y), \theta(x,y)\big)}{P\big((x,y)\in S_B \mid I(x,y), \phi(x,y), \theta(x,y)\big)} \right) \\ &= \log\left( \frac{P\big((x,y)\in S_F \mid I(x,y)\big)\, P\big((x,y)\in S_F \mid \phi(x,y)\big)\, P\big((x,y)\in S_F \mid \theta(x,y)\big)}{P\big((x,y)\in S_B \mid I(x,y)\big)\, P\big((x,y)\in S_B \mid \phi(x,y)\big)\, P\big((x,y)\in S_B \mid \theta(x,y)\big)} \right) \\ &= L(I) + L(\phi) + L(\theta) \end{aligned} \tag{11}$$







A probability map representing the probability of a pixel being a foreground may be calculated from the joint log-likelihood functions as follows,









$$P\big((x,y)\in S_F \mid I(x,y), \phi(x,y), \theta(x,y)\big) = \frac{e^{L(x,y)}}{1 + e^{L(x,y)}}. \tag{12}$$








FIGS. 2e, 5e, and 7e show the estimated probability maps for vessel, membrane, and nuclei images, respectively. In this example, as shown in FIG. 3b, the previously determined threshold is used as the optimal decision boundary. A binary decision map may then be computed by thresholding this probability map, such as using 0.5 as the decision criterion (FIGS. 2f, 5f).
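The combination of the per-feature log-likelihoods into a probability map can be sketched as below. It assumes the per-pixel log-likelihood images have already been looked up for each feature; the function name `probability_map` is hypothetical. The logistic function is written with `tanh`, which is algebraically the same as e^L/(1 + e^L) and avoids overflow for large |L|.

```python
import numpy as np

def probability_map(*log_likelihoods):
    """Sum independent log-likelihoods (Eq. 11) and map to probability (Eq. 12)."""
    joint = np.sum(log_likelihoods, axis=0)
    # exp(L) / (1 + exp(L)) rewritten as 0.5 * (1 + tanh(L / 2)).
    return 0.5 * (1.0 + np.tanh(0.5 * joint))
```

A binary decision map is then `probability_map(...) > 0.5`, which is equivalent to testing whether the joint log-likelihood is positive.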


While the estimated binary decision maps for the vessel and membrane structures comprise accurate segmentation boundaries, the nuclei decision map tends to result in over-segmented regions. This is due to the large amount of light scattering around the nuclei, particularly in between compactly located epithelial nuclei, and inside the ring-shaped epithelial nuclei where the scattered light makes relatively bright regions. Since the regions in between nearby nuclei and inside ring nuclei have high curvature and high intensity, these regions adversely contribute to the class conditional estimation of the shape index. A model-based likelihood function that deemphasizes the unexpected geometric structures is fitted to the nuclei log-likelihood functions. The dashed line in FIG. 8f shows such a function, modeled by the sum of two Gaussian functions whose parameters are estimated from the data values less than −π/2, with a fixed lower bound set to e^−5. The resulting probability map is shown in FIG. 7f. A connected component analysis and hole filling algorithm fills in the hollow epithelial nuclei centers. The probability value is set to 0.5 (gray color in FIG. 7f) for the filled-in values, so that they are defined as nuclei pixels in the binary decision map. Depending on the quantitation task, the empirical likelihood functions or the model-based likelihood functions may be used. In this example, the model-based function is used because it results in isolated nuclei segments that may be used to differentiate the epithelial nuclei from the stromal nuclei for use in the following example for detecting epithelial nuclei.


Many molecular markers target either epithelial nuclei or stromal nuclei. Current practice in molecular imaging uses biomarkers such as keratin to differentiate the epithelial tissue from the stromal tissue. However, in this example, the curvature based methods obviate the need for markers to differentiate epithelial tissue from stromal tissue. As a result, the staining process is less complex and makes the biological and optical resources available for multiplexing other targets. The example computational algorithms used in one or more of the example embodiments, exploit the knowledge that epithelial nuclei have membrane structures surrounding them. The nuclei in the epithelial tissue are larger and more densely populated than nuclei in the stromal tissue.


The morphological differences between epithelial and stromal nuclei may be defined in this example, which is for illustration only, by identifying a superset of the nuclei, cytoplasm, and membrane set. For example, S(x, y), when used to denote this superset, may be defined as the union of the detected compartments,






S(x,y)=C(x,y)∪M(x,y) ∪N(x,y),  (13)


where C(x, y), M(x, y), and N(x, y) denote cytoplasm, membrane, and nuclei pixels. Cytoplasm, in this example, is defined as the union of set of small regions circumscribed by membrane and nuclei pixels. Since the stromal nuclei are not connected through membrane structures, and are sparsely distributed, they can be detected by a connected component analysis of S(x, y). An epithelial mask, E(x, y), may be generated as a union of large connected components of S(x, y). For the sample images in this example, any connected component larger than 800 pixels is accepted as a part of the epithelial mask. The nuclei set is then separated into epithelial nuclei (Ne(x, y)) and stromal nuclei (Ns(x, y)) by masking,






Ne(x,y)=N(x,y)·E(x,y),  (14a)





Ns(x,y)=N(x,y)·(1−E(x,y)).  (14b)
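Equations 13, 14a and 14b can be sketched with boolean masks and a small connected component search. This is an illustration under stated assumptions: 4-connectivity is assumed (the text does not specify the connectivity), the component search is a plain breadth-first flood fill, and the function names are hypothetical. The default size of 800 pixels is the value quoted for the sample images.

```python
from collections import deque
import numpy as np

def large_component_mask(binary, min_size):
    """Mask of 4-connected components with more than min_size pixels."""
    binary = np.asarray(binary, dtype=bool)
    visited = np.zeros_like(binary)
    mask = np.zeros_like(binary)
    rows, cols = binary.shape
    for r0, c0 in zip(*np.nonzero(binary)):
        if visited[r0, c0]:
            continue
        visited[r0, c0] = True
        queue, component = deque([(r0, c0)]), []
        while queue:
            r, c = queue.popleft()
            component.append((r, c))
            for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                if (0 <= nr < rows and 0 <= nc < cols
                        and binary[nr, nc] and not visited[nr, nc]):
                    visited[nr, nc] = True
                    queue.append((nr, nc))
        if len(component) > min_size:
            comp = np.array(component)
            mask[comp[:, 0], comp[:, 1]] = True
    return mask

def split_nuclei(nuclei, cytoplasm, membrane, min_size=800):
    """Separate epithelial and stromal nuclei (Eqs. 13, 14a, 14b)."""
    superset = nuclei | cytoplasm | membrane             # Eq. 13
    epithelial = large_component_mask(superset, min_size)
    return nuclei & epithelial, nuclei & ~epithelial     # Eqs. 14a, 14b
```

Nuclei that are connected to a large membrane and cytoplasm network end up in the epithelial set, while isolated nuclei fall into the stromal set.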



FIG. 9b shows the computed different regions: membrane, epithelial nuclei, stromal nuclei, and cytoplasm. The epithelial nuclei shown in FIG. 9b are clearly differentiated from the stromal nuclei.


As noted, the methods and systems may be used in a variety of applications. Segmenting digital images of tissue microarrays is an example of one such application. In this example, multiple channel digital images are segmented into multiple regions (segments/compartments) as one of the steps for quantifying one or more biomarkers. In this example, the quantitation is accomplished without having to make definite decisions for each pixel, but rather by determining the likelihood that a given pixel belongs to a region. For example, instead of identifying membrane pixels, the likelihood of a pixel being a membrane can be computed. This likelihood represents the probability that a given pixel belongs to a membrane region. Probability maps of these regions may be computed using the intensity and geometry information derived from each channel. For example, FIG. 9a shows the measured intensities of a multiple channel image, showing the nuclear stain (Dapi), the membrane stain (Pan-cadherin), and a target protein (cMet). The probability maps computed for the nuclei and membrane are shown in FIGS. 7f and 5e, respectively. The brightness on these images represents the probability value: white represents a probability value of one, black represents a probability value of zero, and any shade of gray is proportional to the probability value. A definite decision for each pixel can be easily determined by thresholding the probability maps. Such decisions are used to separate the epithelial nuclei from the stromal nuclei, and to detect the cytoplasm. The cytoplasm is also represented as a probability map of ones and zeros. FIG. 9b shows the computed different regions for membrane, epithelial nuclei, stromal nuclei, and cytoplasm. The background and the extracellular matrix are shown as black.


Translocation of a target protein between different regions can be quantified based on the probability maps. The distribution of a target protein (cMet) on each of the regions can be represented by a probability distribution function (PDF). For example, the PDF of the cMet on the membrane is the weighted empirical distribution of the cMet, where the membrane probability map determines the weights. A translocation score may then be generated based on one or more pairs of regions. In this example, there are five regions (membrane, epithelial nuclei, stromal nuclei, cytoplasm, and extracellular matrix). The translocation score is defined, in this example, as the normalized mean difference between the corresponding PDFs. These translocation scores may be used to reflect clinical outcome or to explore the association with life expectancy.
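The weighted empirical distribution and a translocation score can be sketched as follows. The text does not specify how the mean difference is normalized, so the normalization below (the square root of the summed weighted variances) is purely an assumption for illustration, as are the function names and the histogram settings.

```python
import numpy as np

def weighted_pdf(values, weights, bins=16, value_range=(0.0, 1.0)):
    """Weighted empirical distribution of a target protein over one region."""
    hist, edges = np.histogram(values, bins=bins, range=value_range,
                               weights=weights)
    total = hist.sum()
    return (hist / total if total > 0 else hist), edges

def weighted_mean_std(values, weights):
    """Mean and standard deviation under the region probability weights."""
    mean = np.average(values, weights=weights)
    var = np.average((values - mean) ** 2, weights=weights)
    return mean, np.sqrt(var)

def translocation_score(target, prob_a, prob_b):
    """Mean difference between two regions, with an ASSUMED normalization."""
    mean_a, std_a = weighted_mean_std(target, prob_a)
    mean_b, std_b = weighted_mean_std(target, prob_b)
    # Assumption: normalize by the pooled spread; the text leaves this open.
    pooled = np.sqrt(std_a ** 2 + std_b ** 2)
    return float((mean_a - mean_b) / pooled) if pooled > 0 else 0.0
```

Here `target` would be the cMet intensity image and `prob_a`, `prob_b` the probability maps of two regions, for example membrane and cytoplasm.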


As noted, the methods and systems may be used to analyze a variety of images. The microscopy images used in this example may be calibrated in advance by using fluorescent calibration targets. Such calibration may not be possible for some images, such as the retinal image. However, illumination correction techniques may be applied to correct such variations. A commonly used illumination correction technique is homomorphic filtering, defined as,






I′(x,y)=exp(log(I(x,y))−log(I(x,y)*G(x,y))),  (15)


where I′(x, y) is the new corrected image, G(x, y) is a Gaussian filter, and * is a convolution operation. By replacing the image with the corrected intensities, images with large intensity variations can be segmented more accurately using the same algorithms described above. To eliminate any artifacts introduced by the homomorphic filtering, the shape index and the normalized-curvature index are preferably calculated from the original intensity values. FIG. 10 shows the segmentation result of the retina image using the corrected intensity values.
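Equation (15) can be sketched as follows, using NumPy only. The separable Gaussian blur, the edge padding, the kernel radius of about three standard deviations, and the small constant `eps` that guards against log(0) are assumptions of this sketch; the text specifies only the form of the correction.

```python
import numpy as np

def gaussian_kernel_1d(sigma):
    """Normalized 1-D Gaussian kernel with a radius of about 3*sigma."""
    radius = max(1, int(3 * sigma + 0.5))
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-0.5 * (x / sigma) ** 2)
    return kernel / kernel.sum()

def gaussian_blur(image, sigma):
    """Separable Gaussian convolution I * G with edge padding (assumed)."""
    kernel = gaussian_kernel_1d(sigma)
    radius = len(kernel) // 2
    padded = np.pad(image, radius, mode="edge")
    # Convolve along rows, then along columns; "valid" removes the padding.
    rows = np.apply_along_axis(
        lambda r: np.convolve(r, kernel, mode="valid"), 1, padded)
    return np.apply_along_axis(
        lambda c: np.convolve(c, kernel, mode="valid"), 0, rows)

def homomorphic_correct(image, sigma, eps=1e-6):
    """I'(x,y) = exp(log I(x,y) - log((I*G)(x,y))), as in equation (15).

    `eps` is an assumed safeguard so that zero intensities do not
    produce log(0).
    """
    image = np.asarray(image, dtype=float)
    blurred = gaussian_blur(image, sigma)
    return np.exp(np.log(image + eps) - np.log(blurred + eps))

# A uniformly lit image has no illumination variation to remove,
# so the corrected image is one everywhere.
flat = np.full((8, 8), 5.0)
corrected = homomorphic_correct(flat, sigma=2.0)
```

Because the correction is a ratio of the image to its smoothed version, multiplying the whole image by a constant leaves the result essentially unchanged, which is why slowly varying illumination is suppressed.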


While only certain features of the invention have been illustrated and described herein, many modifications and changes will occur to those skilled in the art. It is, therefore, to be understood that the appended claims are intended to cover all such modifications and changes as fall within the true spirit of the invention.


The automated system 10 (FIG. 11) for carrying out the methods generally comprises: a storage device 12 for at least temporarily storing one or more images; and a processor 14 that categorizes the pixels into a plurality of subsets using one or more indexes, determines an intensity distribution and log-likelihood function of one or more of the subsets, and generates one or more maps based on the determination of the log-likelihood function of one or more of the subsets. The images may comprise, but are not limited to, blob-like and ridge-like structures. For example, one or more of the blob-like structures may comprise at least a portion of a nucleus and one or more of the ridge-like structures may comprise at least a portion of a membrane. One or more of the maps may be a probability map of one or more of the blob-like structures and the ridge-like structures. The image may comprise, but is not limited to, one or more structures selected from a group consisting of: cellular structures, vascular structures, and neural structures.


The storage device may comprise, but is not necessarily limited to, any suitable hard drive memory associated with the processor such as the ROM (read only memory), RAM (random access memory) or DRAM (dynamic random access memory) of a CPU (central processing unit), or any suitable disk drive memory device such as a DVD or CD, or a zip drive or memory card. The storage device may be remotely located from the processor or the means for displaying the images, and yet still be accessed through any suitable connection device or communications network including but not limited to local area networks, cable networks, satellite networks, and the Internet, regardless of whether hard wired or wireless. The processor or CPU may comprise a microprocessor, a microcontroller, or a digital signal processor (DSP).


In one of the embodiments, the storage device 12 and processor 14 may be incorporated as components of an analytical device such as an automated high-throughput system that stains and images tissue micro arrays (TMAs) in one system and still further analyzes the images. System 10 may further comprise a means for displaying 16 one or more of the images; an interactive viewer 18; a virtual microscope 20; and/or a means for transmitting 22 one or more of the images or any related data or analytical information over a communications network 24 to one or more remote locations 26.


The means for displaying 16 may comprise any suitable device capable of displaying a digital image such as, but not limited to, devices that incorporate an LCD or CRT. The means for transmitting 22 may comprise any suitable means for transmitting digital information over a communications network including but not limited to hardwired or wireless digital communications systems. The system may further comprise an automated device 28 for applying one or more of the stains and a digital imaging device 30 such as, but not limited to, an imaging microscope comprising an excitation source 32 and capable of capturing digital images of the TMAs. Such imaging devices are preferably capable of auto focusing and then maintaining and tracking the focus feature as needed throughout processing.

Claims
  • 1. A method for segmenting images, comprising the steps of, providing an image comprising a plurality of pixels; categorizing said pixels into a plurality of subsets using one or more indexes; determining a log-likelihood function of one or more of said indexes; and generating one or more maps based on said determination of said log-likelihood function of one or more of said indexes.
  • 2. The method of claim 1, wherein said subsets comprise background pixels, foreground pixels and indeterminate pixels.
  • 3. The method of claim 1, wherein one or more of said indexes comprise one or more features.
  • 4. The method of claim 3, wherein said indexes comprise at least a shape index, a normalized-curvature index, and an intensity value.
  • 5. The method of claim 1, wherein at least one of said indexes comprises a shape index.
  • 6. The method of claim 5, wherein said shape index comprises a curvature based feature comprising
  • 7. The method of claim 1, wherein at least one of said indexes comprises a normalized-curvature index.
  • 8. The method of claim 7, wherein said normalized-curvature index comprises a curvature based feature comprising
  • 9. The method of claim 1, wherein at least one of said indexes comprises an intensity value.
  • 10. The method of claim 1, wherein said step of determining comprises estimating said log-likelihood function of one or more of said indexes.
  • 11. The method of claim 10, wherein said pixels are categorized using at least three of said indexes and wherein said step of determining a log-likelihood function comprises using two out of said three indexes, for an iteration of said step of determining said log-likelihood function, to estimate said log-likelihood of said third index.
  • 12. The method of claim 11, wherein said three subsets comprise background, foreground and indeterminate pixels.
  • 13. The method of claim 11, wherein said log-likelihood is estimated for at least one of said subsets at least in part by estimating one or more decision boundaries.
  • 14. The method of claim 13, wherein one or more of said decision boundaries are used to apply one or more monotonicity constraints for one or more log-likelihood functions.
  • 15. The method of claim 1, wherein one or more of said maps comprise a probability map.
  • 16. The method of claim 1, wherein said image comprises an image of a biological material.
  • 17. The method of claim 16, wherein said biological material comprises a biological tissue.
  • 18. The method of claim 16, wherein said biological tissue comprises one or more cellular structures.
  • 19. The method of claim 18, wherein said cellular structures comprise one or more blob-like and ridge-like structures.
  • 20. A system that embodies the method of claim 1, comprising, a storage device for at least temporarily storing said image; and a processing device that categorizes said pixels into a plurality of subsets using one or more indexes, determines a log-likelihood function of one or more of said indexes, and generates one or more maps based on said determination of said log-likelihood function of one or more of said indexes.
  • 21. The system of claim 20, wherein said image comprises one or more cellular structures.
  • 22. The system of claim 20, wherein said cellular structures comprise blob-like and ridge-like structures.
  • 23. The system of claim 22, wherein one or more of said blob-like structures comprises at least a portion of a nucleus and one or more of said ridge-like structures comprise at least a portion of a membrane.
  • 24. The system of claim 21, wherein one or more of said maps is a map of one or more of said blob-like structures and said ridge-like structures.
  • 25. The system of claim 24, wherein said map of said blob-like and ridge-like structures is a probability map.
  • 26. The system of claim 20, wherein said image comprises vascular structures.
  • 27. A method for segmenting images, comprising the steps of, providing an image comprising a plurality of pixels; categorizing said pixels into a plurality of subsets using one or more indexes; determining a log-likelihood function of one or more of said indexes; and generating one or more probability maps based on said determination of said log-likelihood function of one or more of said indexes.
  • 28. The method of claim 27, wherein said subsets comprise background pixels, foreground pixels and indeterminate pixels.
  • 29. A system that embodies the method of claim 27, comprising, a storage device for at least temporarily storing said image; and a processing device that categorizes said pixels into a plurality of subsets using one or more indexes, determines a log-likelihood function of one or more of said indexes, and generates one or more probability maps based on said determination of said log-likelihood function of one or more of said indexes.
  • 30. The system of claim 29, wherein said image comprises one or more structures selected from a group consisting of: cellular structures, vascular structures, and neural structures.
CROSS-REFERENCE TO RELATED APPLICATIONS

This is a continuation-in-part of U.S. patent application Ser. No. 11/606,582, entitled “System and Methods for Scoring Images of a Tissue Micro Array,” filed on Nov. 30, 2006, which is herein incorporated by reference.

Continuation in Parts (2)
Number Date Country
Parent 11606582 Nov 2006 US
Child 11680063 US
Parent 11500028 Aug 2006 US
Child 11606582 US