The present invention relates to medical image analysis, particularly a method and system for verifying detection of a lung nodule in a chest radiographic image.
Typical chest X-ray (CXR) data is a two-dimensional (2D) projection image; each pixel represents a volumetric integration. Due to human anatomy there are many lung structures which can be falsely identified as nodules in projection data. One such structure is a rib-cross, which appears when the anterior and posterior ribs overlap each other. These rib-cross areas are often falsely identified as nodules and pose problems for Computer Aided Diagnosis (CAD) systems.
The lungs are situated within the rib cage, which is basically a bone structure. The bones absorb X-rays to a greater extent than do soft tissues, and hence these appear as white stripes across the X-ray radiograph. The ribs curl around the body and consequently the rib shadow on the X-ray radiograph appear to cross each other. The cross-over points appear as a pattern. This pattern may disguise the shadows cast by nodules which may be aligned with the crossover points.
For effective diagnostics, the regions in the X-ray image which are identified as possible nodules should be verified to ascertain whether they are actually nodules or false positives. The occurrence of false positives, where some other feature such as a rib crossing shadow is mistakenly identified as being attributed to a nodule, should be minimized. When scanning the lung region of posterior/anterior X-ray images to look for regions of varying contrast or edges, a large number of candidate regions turn out to be rib crossings and the like. Hence there should be a mechanism to cross-check or verify the identified nodules.
In view of the foregoing, an embodiment herein includes a method for verifying detection of a lung nodule in a chest radiographic image, comprising:
In another embodiment the said object is achieved by providing a method for verifying detection of a lung nodule in a chest radiographic image, comprising:
Another embodiment includes a system for verifying detection of a lung nodule in a chest radiographic image, comprising:
The present invention is further described hereinafter with reference to illustrated embodiments shown in the accompanying drawings, in which:
a represents the histogram based on orientation values of an image with a rib-cross, like the one as shown in
b represent the histogram based on confidence values of an image with a rib-cross, like the one as shown in
c represent the histogram based on curvature values of an image with a rib-cross, like the one as shown in
a represents the histogram based on orientation values of an image without a rib-cross, like the one as shown in
b represents the histogram based on confidence values of an image without a rib-cross, like the one as shown in
c represents the histogram based on curvature values of an image without a rib-cross, like the one as shown in
a represents plots for rib-cross and non rib-cross images for a statistical measure, which is the skewness for orientation values based on a straight line model,
b represents plots for rib-cross and non rib-cross images for a statistical measure, which is the skewness for orientation values based on a parabolic curvature model
a represents plots for rib-cross and non rib-cross images for a statistical measure, which is the skewness for confidence values based on a straight line model,
b represents plots for rib-cross and non rib-cross images for a statistical measure, which is the skewness for confidence values based on a parabolic curvature model, and
Chest X-ray images, typically being projection data, are difficult to analyze; it is often difficult to distinguish between rib-cross and non rib-cross regions. An embodiment of the invention makes use of the appearance of ribs or/and rib-crosses in the X-ray image. The ribs have a certain shape which can locally be matched with a curve. Rib-crosses show up in a close pattern within a region which can easily be distinguished in the image. By using a mask based on this geometric pattern one can focus image analysis on the relevant areas. A histogram of an image with values characteristic for the matching of the curve then provides a reliable basis to detect a rib-cross and hence false positives.
The main structure we are interested in our example verification is a rib-cross, even though the same methodology could be extended to other misrepresented parts. For a further description of the method, two images are considered.
At step 104, a curve is matched to each pixel of the sub-image. The matching of the curve with each pixel is carried out based on two types of model functions, i.e. straight line model and the parabolic curvature model, even though other models could also be used for the characterization of these types of oriented patterns. The said matching is done between a curve and individual pixels, together with the neighborhood of each pixel. To perform the said matching, the sub-image may be converted to a gradient image if required to have a clear differentiation in intensities of the neighboring pixels. When the resemblance between a model and a local image is high, the corresponding model parameters, for example orientation and curvature, yield a reliable description of the local image. The matching of the curve based on the said models with the pixels would yield statistical features for example, confidence values which could be represented using feature maps. By comparing these statistical features of various models, one can find out which model describes the local pattern best. The straight-line model requires less computation whereas the parabolic model more closely corresponds to the actual shape of the ribs and hence allows a more accurate matching of the curve.
At step 106, the method involves determining a confidence value for every pixel of the sub-image. The confidence value represents how well the curve is matched to each pixel in the sub-image. For a better understanding, the concepts of oriented pattern, straight line model, and parabolic curvature model are described here.
Consider an oriented pattern f(x, y) representing a local image described in terms of a model function u(x y, a). The confidence with which f(x, y) is described by the model function u(x, y, a) is measured by decomposing the derivative (gradient) energy of f(x, y) into two contributions, one parallel and one perpendicular to the normalized differential field (gradient) of u(x, y, a). This results in the fit energy Ef(a) and residual energy Er(a)
Subscript ⊥ represents 90° rotation of the vector. Comparison of the fit energy with the residual energy gives a measure of the model fit.
In absence of any a priori knowledge to interpret the difference between the fit and the residual energy, the difference energy normalized with the total gradient energy is described as a quality measure (confidence) c(a).
The value of c(a) varies from −1 for a pattern of which the isophotes are exactly perpendicular to those of the model function u(x, y, a) and +1 for a pattern which is exactly described by the model function.
When considering a straight line model, the straight line model function is given by
u(x,y,φ)=xcoxφ+y sin φ (3)
φ is the orientation perpendicular to the model isophotes (tangential).
The fit energy Ef and confidence c is described as
The gradient based orientation estimator is given by
A bar (
When considering a parabolic curvature model, the parabolic curvature model is represented by
Minimization of the residual energy Er gives the curvature. To speed up the process of calculation of curvature approximate Ef and Er are defined as
Curvature κ and confidence c are defined as
A hat above a quantity indicates approximation.
At step 108, a confidence image is generated consisting of the confidence values at the position of the respective pixel of the sub-image.
At step 110, separated regions of pixels are identified within the confidence image which has a confidence value greater than a threshold confidence value. The threshold value could be set as a fixed percentage of a maximum confidence value. One way of arriving at this confidence value might be by initial training using a training set. The training set might include an equal number of rib-cross and non rib-cross images which are analyzed. Since there is a higher possibility of the rib-cross regions in the image to show a higher confidence value, the threshold level could be a higher percentage value for example, it may be 50%, 80%, 90% or 95% based on the training.
At step 112, a filtered confidence image is generated consisting of the separated regions of the confidence image which are larger than a threshold area. The threshold value could be set at a fixed value or could be a percentage of the largest area in the original image. One way of arriving at this confidence value might be by initial training using a training set. The training set might include an equal number of rib-cross and non rib-cross images which are analyzed. For identifying separated regions of pixels within the confidence image, one of the methods that could be employed is the connected component labeling on the chosen pixels. Connected components labeling scans an image and groups its pixels into components based on pixel connectivity, i.e. all pixels in a connected component share similar pixel intensity values. Once all groups have been determined, each pixel is labeled with a gray level or a color labeling according to the component it was assigned to. Then the threshold value of the area, which has been already determined, could be used to filter the connected area to obtain a filtered confidence image from the confidence image.
At step 114, a histogram is determined based on values characteristic for the matching of the curve. The values characteristic for the matching of the curve could be confidence value, orientation value or a curvature value. As per the example models applied for the matching, the curve used could be a straight-line or a parabolic curve. When using the straight-line model, it is possible to determine an orientation value based on the previously explained equations. An orientation value is determined for every pixel of the sub-image as the orientation of the straight line, which is matched to the respective pixel. Hence in this case the values characteristic for the matching of the curve are orientation values.
When using the parabolic curvature model, the curve used is a parabolic curve and it is possible to determine an orientation value and a curvature value of every pixel of the sub-image respectively as the orientation and curvature of the parabolic curve, which is matched to the respective pixel. Hence in this case, the values characteristic for the matching of the curve are the orientation values or the curvature values.
In the present embodiment, the filtered confidence image is used as a mask. The mask defines which parts of the image it is applied to are taken into consideration and which parts are left out. In this case only those parts are relevant which correspond to connected areas of high confidence as described above.
Only those values are considered for the histogram, which correspond to the separated regions of the filtered confidence image.
At step 116, the method involves determining a statistical measure of the histogram. The statistical measure determined is for example a mean or standard deviation or skewness from the values characteristic for the matching, e.g. from confidence values or orientation values or the curvature values. Skewness is a measure of the asymmetry of the probability distribution and standard deviation is a measure of the variability of a probability distribution and mean is the average value.
At step 118, the probable lung nodule is verified to be a cross-rib (or not) based on at least one of these statistical measures. The values of the statistical measures show a wide difference for a rib-cross or non rib-cross based sub-images. Practically if the histogram values for each of the statistical measures of a plurality of images having an equal mixture of images having rib-cross or non rib-cross are plotted in a graph, then the features exhibit a good discrimination between rib-cross and non rib-cross values.
a and
a and
The system comprises of a generator 1208 for generating a confidence image consisting of the confidence values at the position of the respective pixel of the sub-image. The confidence image is fed into an identifier 1210 for identifying separated regions of pixels within the confidence image which have a confidence value greater than a threshold confidence value. A filter 1212 is used for filtering and generating a filtered confidence image consisting of the separated regions of the confidence image which are larger than a threshold area.
An analyzer 1214 then determines a histogram of values characteristic for the matching of the curve wherein the filtered confidence image is used as a mask, such that only values are considered for the histogram which corresponds to the separated regions of the filtered confidence image. A determining module 1216 is used for determining a statistical measure of the histogram. As cited earlier, the statistical measure can include, but in not limited to, mean or standard deviation or skewness. The system further has a verifier 1218 for verifying the detection of the lung nodule based on at least one statistical measure of the histogram.
Although the invention has been described with reference to specific embodiments, this description is not meant to be construed in a limiting sense. Various modifications of the disclosed embodiments, as well as alternate embodiments of the invention, will become apparent to persons skilled in the art upon reference to the description of the invention. It is therefore contemplated that such modifications can be made without departing from the embodiments of the present invention as defined.
The present application claims the benefit of U.S. provisional application No. 61/089,126 filed Aug. 15, 2008, the entire contents of which are herein incorporated by reference
Number | Name | Date | Kind |
---|---|---|---|
5668888 | Doi et al. | Sep 1997 | A |
6694046 | Doi et al. | Feb 2004 | B2 |
7274810 | Reeves et al. | Sep 2007 | B2 |
8045770 | Reeves et al. | Oct 2011 | B2 |
8050481 | Reeves et al. | Nov 2011 | B2 |
20090080748 | Reeves et al. | Mar 2009 | A1 |
Number | Date | Country | |
---|---|---|---|
20100040269 A1 | Feb 2010 | US |
Number | Date | Country | |
---|---|---|---|
61089126 | Aug 2008 | US |