1. Field of the Invention
The present invention relates to a method for separating a group of cells contained in a sample into individual cells, particularly, the present invention relates to a method for separating a cell group contained in an image of a sample into individual cells for a subsequent classification of the sample, wherein the cell group comprises a plurality of mutually overlapping cells.
2. Description of the Related Art
For the successful treatment of cancerous diseases, early detection and treatment is necessary. This can be achieved by regularly attending cancer screening tests. In such tests, smears of the tissue to be examined are taken, wherein, in the case of an examination for cervical cancer (cervical carcinoma), this is usually done by the “PAP test” named after the Greek physician Dr. George Papanicolaou, who introduced this method in 1942. This gynecological smear of the cervix, i.e. the neck of the uterus, or cell specimens obtained from other examinations, generally referred to as cytological specimen, must be classified. For this purpose, the cytological specimens are placed onto a slide, dyed and assessed under the microscope by a cytologist.
For the objective diagnostic assistance for such an expert, image processing programs have recently been used with which the cells of a specimen are automatically segmented and classified based on the morphometric properties of cell nucleus and cell plasma, such as extension of the nucleus and the plasma, form of the nucleus and the plasma, relative size of nucleus and plasma, etc., as well as based on the texture of the chromatin structure in the cell nucleus. The image processing programs employed here, however, only allow a reliable segmentation of the cells of a specimen, when these cells of the specimen occur individually.
While, due to their training, cytologists are capable of implicitly separating cell plasmas that overlap each other up to a certain extent to subsequently make a diagnosis into healthy, inflammatory, dysplastic or diseased cells, automatic methods for segmenting and separating such overlapping cells are not known.
Commonly, either only individual cells are used for automatized classification approaches or a manual cell separation is added in between the automatized steps.
The disadvantage of the first approach, using only individual cells, is that no further attention is paid to the overlapping cells, i.e. they are discarded, so that the important information for the classification of a sample contained also in the overlapping cells is lost.
Although this loss of information is avoided in the second approach, the disadvantage is that a fully automatic pre-assessment of a sample is not possible. As soon as an overlapping is found or likely, it is necessary to call on a human examiner for the separation. After the separation, the automatic method proceeds.
The presence of individual cells, however, is generally the exception in cytological specimens. Rather, depending on the type of specimen, up to 80% of all cell plasmas of a specimen will overlap each other in the cytological specimens/samples. Thus it is necessary for nearly every classification of cytological specimens to provide a manual cell separation or to abandon the information contained in these overlapping cells.
It is the object of the present invention to provide a method that detects an overlapping of cells or cell plasmas and segments and separates the individual overlapping cells in order to fulfil the requirements for an automatic classification/assessment of a sample.
The present invention provides a method for separating a cell group contained in an image of a sample into individual cells for a subsequent classification of the sample, wherein the cell group has a plurality of mutually overlapping cells, having the following steps: (a) selecting a cell nucleus of a first cell which is to be separated from the cell group, wherein the cell nucleus of the first cell is located adjacent to a cell nucleus of the second cell, wherein the cell plasma of the first cell and the cell plasma of the second cell overlap each other such that a common cell plasma is formed; (b) determining a contraction of the common cell plasma between the cell nucleus of the first cell and the cell nucleus of the second cell; (c) separating the common cell plasma at the contraction; (d) determining an area of the common cell plasma in which the overlapping of the cell plasmas of the first cell and the second cell is expected; (e) classifying the determined area to associate individual portions of the same with the cell plasma of the first cell and/or the cell plasma of the second cell; and (f) completing the cell plasma of the first cell obtained in step (c) based on the classified portions.
According to a preferred embodiment of the present invention, the method is based on an image which has been generated from a cytological specimen and/or a sample. The image was generated and digitalized, for example, by means of a microscope, wherein each image has a resolution dependent on the capturing device (camera, objective, etc.), such as 1000×700 pixel. The image was captured either in the transmitted light modality or in the fluorescence modality, wherein other known capturing modalities may also be used. According to another embodiment, a plurality of images is used instead of one image, which are registered with each other, and which were generated in different capturing modalities. The different capturing modalities include, for example, capturing an image in a transmitted light modality and capturing a further image in a fluorescence capturing modality. Alternatively, the images may both be generated in the fluorescence capturing modality, but with different parameters regarding the fluorescence.
On the basis of the images thus generated, an automatic segmentation or separation of cell groups into individual cells is performed using the inventive method, which may then form the basis for an automatic further processing for the classification of the cytological sample.
The advantage of the present invention is thus that adding a manual cell separation as well as the work and loss of time connected therewith can be avoided, while at the same time, the information for the classification of cytological specimens contained in the cell groups, also referred to as cell clusters, is no longer lost, but is used for their classification to allow putting the results of the classification on a broader basis of the cells contained in the cytological specimen. This results in the advantage of an improvement in the reliability of the classification of the specimens subsequently performed.
According to another preferred embodiment, the inventive method also includes the preparatory steps required to detect, from a picture (one image or several images) of the sample, one or more cell groups which are then separated into individual cells according to the invention. According to this embodiment, first a detection and segmentation of cell nuclei is performed based on an image of the sample to generate a list of cell nuclei. Next, a detection and segmentation of cell plasmas is performed based on the image of the sample to generate a list of the cell plasmas. The cell nuclei are associated with the pertinent cell plasmas, and, based on the number of cell nuclei associated with a cell plasma, the method detects whether the combination is a cell cluster and/or a cell group or a segmented individual cell.
Preferred embodiments of the present invention are defined in the dependent claims.
In the following, preferred embodiments of the present invention are explained in more detail with respect to the accompanying drawings, in which:
With respect to the following description, it is to be understood that, in the individual figures, elements that are similar or function in the same way are provided with the same reference numbers.
With respect to
In
After the cell group has been provided in step S100, the method proceeds to step S102, in which relevant neighbors, i.e. relevant adjacent cell nuclei, are detected for all pairs of cell nuclei. An embodiment for the selection or detection of relevant adjacent cells will be described in more detail in the following. In the general embodiment shown in
After the adjacent cell nuclei have been determined in step S102, the method proceeds to step S104, in which so-called contraction points are localized. Contraction points or contractions are portions of the common cell plasma formed by all cell plasmas ZP of all cells Z1 to Z5, i.e. portions of the common cell plasma, in which extension of the same compared to the usual extension is low or even minimal. The localization of the contraction points according to step S104 is performed based on an evaluation of the common plasma located between a cell to be separated and a cell adjacent to the cell to be separated. In the embodiment shown in
After the contraction points and/or contractions in the common plasma ZP have been determined, the method proceeds to step S106, in which a separation of the common cell plasma is performed based on the contraction points localized in step S104. As can be seen from
Although the now individual cells Z1 to Z5 are now separated, they do not correspond to the cells contained at the corresponding positions in the original cytological specimen, because there was an overlapping of the cell plasmas, which has not been taken account of by the simple separation in step S106. For this reason, in step S108, the inventive method determines an area for adjacent cells, in which an overlapping of the cell plasmas of the cells is expected. According to a preferred embodiment, for determining this area, a quadrilateral is subtended which extends from a cell nucleus to a first contraction point, thence to the second cell nucleus, thence to the second contraction point and back to the first cell nucleus. In this area, overlapping cell plasma is expected. In
In step S110, a binarization of the overlapping areas U1 to U5 is performed to associate the pixels of each overlapping area with one or both involved cells by means of a classification step.
In step S112, the cells Z1 to Z5 shown in step S106 are expanded by the overlapping areas associated with the respective cells, and are thus completed to individual cells corresponding to the cells contained in the original cytological sample. Alternatively, cleaning may then be performed in step S112. The thus obtained individual cells are moved a little apart in the picture to separate them clearly from each other.
With respect to the preferred embodiment, it is to be noted that the present invention is, of course, not limited thereto. The present invention allows the separation of a cell cluster or a cell group including two overlapping individual cells in general. Here, too, a determination is made for each cell nucleus which relevant adjacent cells are in the proximity (step S102). Next, the contraction points between the two adjacent cell nuclei are detected (S104), which serve as markers of the cells at which the overlapping ends. The cells are then first separated between the contraction points, and, subsequently, the overlapping area of the cells, subtended by the quadrilateral between the contraction points and the two cells, is determined. The pixels of the overlapping area are associated with one or both cells by means of a classification step. Subsequently, an optional cleaning step is performed, as also shown in
In the following, preferred embodiments for the implementation of the steps S102 to S112 described in detail with respect to
With respect to
According to a preferred embodiment, the question whether an adjacent cell nucleus has to be considered in separating an examined cell nucleus is answered based on a distance existing between the two cell nuclei and whether the two cell nuclei are located in a common cell plasma.
As an example, look at the cells Z1 and Z2 shown in
If another cell nucleus ZK2 meeting the requirements with respect to the distance is adjacent to the cell nucleus ZK1 to be separated, then, in addition, it is necessary to ensure that these two cell nuclei belong to the common cell plasma ZP. In order to determine this, a straight line G, that has to be completely within the common cell plasma of the cell cluster, is drawn between the cell nuclei ZK1 and ZK2 and/or between the gravity centers of the associated binary masks, as shown in
If this is not the case, the cell nucleus is not yet discarded, but first a so-called “indirect connection” is checked for. Checking for the indirect connection is described with respect to
If the examination of all points reveals that several of these indirect connections exist, the connection selected is the one which has the largest angle between the two straight lines G1 and G2, and/or which is as close as possible to the straight line G representing the closest connection between the cell nucleus, and/or the connection for which both straight lines G1 and G2 are as short as possible. The three conditions stated above are equivalent. The point chosen in the end is the break point K already mentioned with respect to
According to a preferred embodiment, what is further provided is that the break point K is only a predetermined distance away from the point A of the normal, in which the straight line L perpendicularly intersects the straight line G. In a preferred embodiment, this maximum distance should be about 50 pixels.
If both the distance condition and the connection condition of two cell nuclei are met, the two cell nuclei are considered to be adjacent to each other. If both or one of the conditions are not met, the cell nucleus ZK1 originally to be separated is not examined any further and is discarded.
Although an example has been described with respect to
In a situation in which there are more than two cell nuclei and in which there is an indirect connection for two adjacent cell nuclei, the break point K described above exists. Now the distances to other cell nuclei are examined for this break point K and a determination is made whether one of these distances is smaller than the distance of the break point to the cell nucleus to be separated, as shown in
In the following, a preferred embodiment for the localization of contraction points with respect to two adjacent cell nuclei is explained in more detail with respect to
Based on the cell nucleus to be separated and the adjacent cell nucleus, as well as based on the break point K, now there is a search for contraction points E1, E1′ (see
In order to find the contraction points, a straight line is drawn through the break point K running parallel to the straight line G between the two cell nuclei. If there is a direct connection between the cell nuclei, then it is the straight line G.
Subsequently, all boundary points of the plasma belonging to the cell group, the common cell plasma ZP, are examined.
For each boundary point, a perpendicular is dropped on the straight line drawn through the break point, and subsequently there is a search for the two boundary points that meet the following conditions.
The first condition is that the points must be “between” the cell nuclei, i.e. the intersection point of the normal must be on the line segment between the cell nuclei. According to another embodiment, this area is further limited by declaring a part of the length of the respective average distance of the boundary points Rn to the gravity center S of the cell nucleus from both ends of the line segment between the cell nuclei as “invalid”, as illustrated in
The second conditions is that one of the sought-for points must be “left” and one of the sought-for points must be “right” of the selected straight line G, as illustrated in
The last condition is that, on both sides of the straight line G, the point with the shortest perpendicular is chosen.
If all these conditions are met, the contraction points E1 and E1′ between the cell nucleus ZK1 to be separated and the adjacent cell nucleus ZK2 are determined. If no boundary points are found which satisfy the conditions stated above, the method for the examined cell nucleus ZK1 is stopped, because no appropriate position for a separation has been found.
After the contraction points have been found, now a preferred embodiment for the separation of the common cell plasma is explained in more detail with respect to
According to a preferred embodiment, the detected contraction points are added to the existing list of relevant cell nuclei. Based on the contraction points, a straight line is drawn between the same between each pair consisting of the cell nucleus to be separated and a cell nucleus which is filed in the list and which is relevant because it is adjacent, and the common cell plasma of the cell group is “cut off” at this straight line. This cut is performed to obtain a rough basis for the area of the common cell plasma to be separated.
According to a preferred embodiment containing the information regarding the common cell plasma and the cell nuclei in a binary mask containing “white” and “black” pixels, the cutting-off is performed by drawing a “black” line between the contraction points of a pair, which is performed for all contraction points.
In
Before a potential overlapping area is subsequently determined, the distance between the two contraction points is examined according to a preferred embodiment. If this distance is below a predetermined, empirically determined threshold value, for example 40 pixels, it is to be assumed that the cell plasmas of the adjacent cell nuclei only touched at this section, but did not really overlap. If such a situation is detected, no further processing is required, but the separated portion actually shows the cell that was in the original sample.
Further there is an examination whether the distance of the contraction points does not exceed a maximum value, such as 350 pixels. If exceeding of the maximum value is detected, it is to be assumed that the determination of the contraction points was not done correctly, because an overlapping of this length is unlikely or the cell cluster must be considered indivisible, at least at this place. In this situation, the separation of the cell nucleus of interest with cell plasma is then stopped.
After the rough basis of the cell plasma has been separated from the cell group, it is principally assumed that, at each of the sections, an overlapping of cell plasmas of various cells had occurred. Therefore, in a subsequent step, an area has to be determined in which this overlapping of the plasmas is to be looked for.
With respect to
As shown with respect to
It may happen in a cell group that several adjacent cell nuclei exist for the cell nucleus to be cut out. For each one of them, the contraction points are determined and potential overlapping areas are formed. If it happens that two or more of these potential overlapping areas intersect, they are combined to a single binary mask and treated together. With respect to
As was explained above, the overlapping of the cell plasmas of the individual cells contained in the original sample is expected within the overlapping area. This can generally be seen, for example, by a darker chrominance in a transmitted light image of the sample, because two overlapping plasmas appear darker than one plasma. The easiest way to solve this distinction is with a histogram and an appropriate threshold value determination.
According to a preferred embodiment, now a local histogram of the generated image, such as the transmitted light image, is established with respect to the determined area of overlapping. Subsequently, the histogram is examined in order to determine a threshold value and, with this value, binarize the generated image within the bit mask. This examination may, for example, be performed using the method of Otsu which is described in more detail by T. Lehmann, W. Oberschelp, E. Pelikan, and R. Repges in “Bildverarbeitung für die Medizin”, Springer, Berlin 1997. In this way, the darker pixels in the overlapping are represented white and the brighter pixels in the overlapping are represented black in the binary mask. This is illustrated in
This overlapping binary mask is combined with the binary mask of
In the manner described above, cell groups in a specimen may thus be split up into individual cells by means of the inventive method so that, by the automatization at this point, an overall automatization of the classification method for cytological specimens is achieved.
As has been mentioned above, the inventive method starts with a cell cluster and/or a cell group detected from a picture of a cytological sample. In the following, a block diagram of another preferred embodiment of the present invention is described with respect to
In this embodiment, the method starts with step S200, in which capturing an image of the cytological sample is performed in one or more modalities. As has already been mentioned above, capturing an image is either performed with a capturing modality, such as transmitted light or fluorescence. Alternatively, several multi-modal images registered with each other may be generated, for example by generating images of a sample in a first capturing modality and a second capturing modality. The first capturing modality may, for example, be a transmitted light capturing modality, and the second capturing modality may be a fluorescence capturing modality. Alternatively, fluorescence capturing modalities with different parameters may also be employed.
In the subsequent step S202, the cell nuclei in the picture are detected and segmented to generate a list of the cell nuclei contained in the image and/or the picture. In parallel, the detection of the cell plasmas contained in the picture and their segmentation are performed in step S204 to generate, in turn, a list containing the cell plasmas in the picture. It is to be noted that, when using several images, the segmentation of cell nuclei and the segmentation of cell plasmas does not have to be performed in the same images. Preferably, the segmentation of cell plasmas will be performed on the basis of transmitted light images, whereas the segmentation of cell nuclei may be performed on the basis of fluorescence images. After the cell nuclei and cell plasmas in the sample have been detected, the cell nuclei are associated with the plasmas in step S206, via the generated lists. Subsequently, there is an examination in step S208 whether a plasma is associated with only one single cell nucleus. If this is the case, then this is an individual cell that does not require further segmentation, and the method ends with step S210. If a plasma is detected to be associated with more than one cell nucleus, the method proceeds to step S212 in which the presence of a cell group is detected. This cell group is subsequently separated in step S214 so that, finally, there are the individual cells in the steps S216 and S218 for further processing. With respect to the steps performed in step S214, see the above description of the preferred embodiment for cell group separation.
In the following, a preferred embodiment for detection and segmentation of the cell plasmas and cell nuclei in the picture of a cytological sample will be described.
The cell plasma segmentation is optionally performed in a transmitted light image or in a fluorescence image of the sample. The cell plasma segmentation is performed using histograms. Here, a predetermined threshold value is calculated (e.g. by the method of Otsu mentioned above), with which the transmitted light image is binarized to thus separate cell plasmas from the brighter background. For forming the histograms and the threshold values, various methods well known in the art are implementable.
The binary image of the picture of the cell generated by the histogram-based approach is now examined to determine regions in the binary image which reproduce the plasma, including nucleus, of a cell or which reproduce a cell cluster of overlapping cells. Each independent area in the binary image represents a region of its own, and a sub-image, e.g. in the form of a binary mask, is associated with each individual region, i.e. with each plasma of the cell and/or each area of a cell cluster of overlapping cells.
The cell nucleus segmentation is performed in a similar manner to the segmentation of the cell plasmas, optionally in the transmitted light image or in the fluorescence image. Here, the known histogram-based approach for the detection of cell nuclei in the picture of the cytological sample is also used, so that sub-images, e.g. in the form of binary masks, result for individual cell nuclei.
The list of sub-images (binary masks) resulting from the segmentation of the cell plasmas, wherein each sub-image corresponds to the plasma of a cell and/or the area of a cell cluster of overlapping cells, and the sub-images of involved cell nuclei resulting from the segmentation of the cell nuclei are combined by means of a simple Boolean operation. If the intersection of the binary masks of the cell nucleus and the binary mask of a cell plasma is not empty, then the cell nucleus is associated with the cell plasma. If a cell plasma is detected to be associated with only one cell nucleus, then these are already completely segmented cells with a plasma and a cell nucleus. If a plasma is associated with two or more cell nuclei, then there is a cell cluster or a cell group that is to be separated according to the invention.
In an embodiment of the present invention, a classification of the cell nuclei is performed based on the sub-images associated with the detected cell nuclei, which includes a comparison of selected parameters of the cell nucleus with predetermined parameters in order to determine whether a detected cell nucleus is suitable for further processing.
Based on the picture of the cytological sample thus prepared and processed, the inventive method performs the division of the segmented cell clusters into individual cells.
The above description of the preferred embodiment has set forth that the overlapping area is formed by a quadrilateral. The present invention is not limited to this implementation, the overlapping area may rather be subtended by an area of any form between the contraction points E1, E1′ and the cell nuclei ZK1 and ZK2.
While this invention has been described in terms of several preferred embodiments, there are alterations, permutations, and equivalents which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations, and equivalents as fall within the true spirit and scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
10217858.5 | Apr 2002 | DE | national |
This application is a continuation of copending International Application No. PCT/EP02/10200, filed on Sep. 11, 2002, which designated the United States and was not published in English.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/EP02/10200 | Sep 2002 | US |
Child | 10970300 | Oct 2004 | US |