The invention relates to medical image analysis, and more particularly, to a method for normalizing images obtained from different sources as a preparatory step for computer-aided diagnosis.
Some benefits of computer-aided diagnosis in radiology, and particularly in mammography, have been recognized. To date, there has been efforts directed toward computer-aided methods that assist the diagnostician to efficiently identify problem areas detected in a mammography image and to improve the accuracy with which diagnoses are made using this information.
One challenge faced in the design of computer-aided diagnostic (CAD) systems relates to a lack of standardization in results from equipment used to provide images. In mammography, for example, different vendors provide imaging systems that capture and process the images of breast tissue. Some systems, such as Full-Field Digital Mammography (FFDM) systems, obtain digital image data directly. Other Screen-Film Mammography (SFM) systems acquire images on film, which is then scanned to provide digital image data. While there are some standard practices that are followed from site to site for mammography imaging, such as obtaining a specific sequence of views, for example, there can be differences in the characteristics of images that are obtained. Mammography images may be acquired at different locations, using different equipment, at different times, or by different operators. As a result, image characteristics such as contrast and noise levels may vary according to the image source and imaging conditions. Moreover, once images are acquired, follow-on pre-processing can affect the way the image data is stored and presented. Depending on the type of mammography system, for example, images may be optimized for CAD processing or may be pre-processed for display, such as on a display monitor. Mammography CAD images are analyzed for various types of mass and spot/spot cluster feature characteristics that can be relatively subtle to detect.
It would be desirable for flexible CAD system design to adapt to mammographic images from any source. For example, a CAD system should be able to accept an FFDM image or an SFM image from any of a number of sources for processing, and to automatically adjust its processing to suit image characteristics.
U.S. Patent Application No. 2005/0008211 entitled “Lung Contrast Normalization on Direct Digital and Digitized Chest Images for Computer-Aided Detection (CAD) of Early-Stage Lung Cancer” by Xu et al. describes a method for conditioning diagnostic image data for diagnostic imaging using normalization factors for pixel size and intensity value.
Contrast adjustment by means of “contrast stretching” is described in U.S. Pat. No. 5,357,549 entitled “Method Of Dynamic Range Compression Of An X-Ray Image And Apparatus Effectuating The Method” to Maack et al.
U.S. Pat. No. 5,835,618 entitled “Uniform And Non-Uniform Dynamic Range Remapping For Optimum Image Display” to Fang et al. describes dynamic range remapping for enhancing an image in both dark and bright intensity areas by smoothing the data, such as through a low-pass filter.
None of the approaches of these references are well suited to the particular needs of mammography, for which images that have been obtained on different types of imaging apparatus can exhibit different characteristics in contrast and dynamic range and for which CAD utilities must have consistent image treatment in order to properly detect both mass and spot features.
Thus, there is a need for improved normalization methods, particularly for CAD mammography images that have been obtained from different sources.
The present invention provides a method for normalizing an image of tissue from a patient comprising: determining an image type; distinguishing background content values in the image data from tissue content values according to the image type; reducing noise in the background content values; forming a look-up table for remapping tissue content values to a predetermined range; and remapping tissue content values according to the look-up table, providing a normalized image thereby.
The present invention provides normalization of image content so that images from different sources can be processed in a suitable manner by a CAD system. The present invention provides image normalization techniques that preserve the image content needed for accurate CAD assessment, whether the original image is obtained by scanning film or from an FFDM system.
These and other objects, features, and advantages of the present invention will become apparent to those skilled in the art upon a reading of the following detailed description when taken in conjunction with the drawings wherein there is shown and described an illustrative embodiment of the invention.
While the specification concludes with claims particularly pointing out and distinctly claiming the subject matter of the present invention, it is believed that the invention will be better understood from the following description when taken in conjunction with the accompanying drawings.
The following description is directed in particular to elements forming part of, or cooperating more directly with, apparatus in accordance with the invention. It is to be understood that elements not specifically shown or described may take various forms well known to those skilled in the art.
The method of the present invention provides image normalization without compromising the data that is analyzed by mammography CAD routines. Some methods, such as histogram equalization or distribution matching, can cause loss of image data. The method of the present invention adjusts image data characteristics to a standard in order to obtain usable input image content for detailed CAD analysis and more accurate detection of lesions and other abnormal conditions. The processing method of the present invention normalizes an image according to a set of standard characteristics, particularly with respect to the overall dynamic range of the imaged tissue and characteristics of features identified within the imaged tissue. The standard itself is generated using values obtained from a database of mammography images used for CAD analysis.
In the MCC workflow, spot microcalcification candidates are identified in an MCC candidate selection step 64. A spot feature extraction step 66 follows. Once MCC spot features have been identified, a spot feature normalization step 68 is executed, applying the feature normalization processing of the present invention, described subsequently. A spot neural network step 70 applies logic processing for further refinement of the MCC structures. A cluster feature extraction step 72 and an MCC classification step 74 complete the processing path used for MCC detection.
In the mass detection workflow, candidates are initially identified in a mass selection step 80. A mass features step 82 extracts features. A mass feature normalization step 84 is executed, as described subsequently. A mass classification step 86 completes the processing path for mass detection. Results from both MCC classification step 74 and mass classification step 86 are then combined and can be provided as CAD processing results.
The graphs of
As shown in pixel value profile 20 of
A set of working assumptions make it possible to reduce noise levels in the background data, including the following:
The background threshold value can be determined using different methods. In one embodiment, histogram shape is used to identify background content and to distinguish it from image content of interest. Referring to histogram 22 in
Conventionally, CAD mammography employs a defined set of images. Typically, two views of each breast are taken, along cranio-caudal (CC) and mediolateral oblique (MLO) planes. In the mammography image, breast tissue is on one side of the image or the other. Referring, for example, to
At least a portion of the section that contains breast tissue is identified as the tissue area or tissue ROI (region of interest). To identify a threshold value for background identification, the mean of this tissue area is defined as Gtissue. The mean of the background ROI is defined as Gbkg. Therefore, the background level lies within the range [Gbkg, Gtissue]. To obtain an optimal threshold value, the histogram of each of the partial images given in left and right section 28 and 26 is derived. Based on this histogram, a threshold t is then calculated to maximize the following objective function:
wherein:
Using this type of calculation, the intensity threshold t that distinguishes background from tissue content can be identified for a given image. Once the background level can be determined, background pixels can be set to a standard value or otherwise conditioned, thereby reducing noise levels for this image data. With respect to the histogram curves of
Digital images exhibit a generally uniform contrast level for tissue components. The uniform contrast results in a relatively minor distinction between nearby tissue structures. As may be inferred from histograms 22 and 24 in
In this process, the pixel intensities in the image are stretched with two exponential functions and the results averaged out. The exponential functions used in one embodiment are:
g
i
1
=a
1
e
b
ƒ
−G
offset and gi2=a2eb
wherein:
g
i
=g
i
1
+g
i
2
As can be seen from histograms 22 and 24 in
The graph of
In one embodiment, a consideration is to limit the maximum shifting value (where the Limit>0):
wherein:
In addition to a difference in tissue mean, there can be a difference in the tissue dynamic range as well. It may be desirable to expand the dynamic range of image data to a desired dynamic range. To reduce the loss of information by the transformation, a scaling up or “zoom out” can be used (with a scale>1). Consideration should be given to correctly mapping the dark tissue and retaining the pixels for this tissue within the proper dynamic range for tissue, not in the dynamic range where the background lies.
The method of the present invention maps intensity in different ranges. For pixels that are within a range established for tissue, a simple linear remapping can be performed, with interpolation to fix the remaining pixels that may lie outside these values. Referring to the graph of
wherein MaxValue, as shown in
Piecewise linear transformation can be used, in which different ranges of pixels are handled differently, with linear transformations differing in slope, as in the example of
The values for tissue lower bound 42 and tissue upper bound 46 can be determined in a number of ways. In one embodiment, these values are at ±2σ, where σ is the standard deviation. Alternately, a percentage of the tissue cumulative sum is used, so that tissue lower bound level 42 is the value nearest 1% of the tissue cumulative sum and tissue upper bound 46 is the value nearest 99% of the cumulative sum. Background values are suppressed and values above tissue upper bound 46 are flattened.
For the example shown in
wherein:
g represents an input pixel value;
k provides a slope value;
T1, T2, T3, and T4 are remapped pixel values over each respective linear segment;
O1 is the background level of the image;
O2 is the tissue lower bound;
O3 is the tissue mean; and
O4 is the tissue upper bound.
It is noted that these linear calculations can be used to transform and remap each value from the original image data. In practice, a Look-Up Table (LUT) can be generated for remapping tissue content values to a predetermined range and applicable to log scale digital images based on these calculations, allowing a computationally fast remapping of pixel values. Some smoothing of values may be applied, for example, near transition points.
Where linear transformation are not used for one or more portions of an original image, other possible transforms for histogram alteration can alternately be used. This method typically suppresses intensities that are infrequently used and may have some utility for various mammography imaging applications. One known histogram adjustment method is histogram equalization which maps each intensity to form a flattened histogram that provides a more uniform distribution. The operation effectively stretches densely populated values, and suppresses intensities infrequently used.
A general solution is histogram specification, which maps the histogram of the image to a desired curve given by the user. This can be accomplished using a pair of images from the same object: one as the original image, the other as the desired image with desired dynamic range. With histogram specification, a look up table (LUT) can be generated to map the similar original images to the desired ones. However, histogram equalization can expand intensity values most often used but can suppress values that are particularly useful for cancer detection. Thus, while this imaging method may be used for generating an LUT according to the present invention, it may not be optimal as a general solution for normalization.
The flow chart of
A detect background level step 100 identifies image background values using known distribution characteristics from this type of digital imaging equipment, as was described with reference to
The flow chart of
The flow chart of
Referring again to the logic flow shown in
A normalization step 440 is applied if desired. Normalization methods can be linear or non-linear. In one non-linear method, the feature distribution from the image source can be mapped to the standard distribution, allowing the same basic transform to apply for all images from a particular source. A linear method, on the other hand, operates on the assumption of a linear relationship between the digital feature space and analog feature space:
ƒ′=A׃+B.
E[ƒ′]=A×E[ƒ]+B, Var[ƒ′]=A
2
×Var[ƒ]
E(x)=∫xƒ(x)dx
A=Sqrt(Var[ƒ′]/Var[ƒ]),
B=E[ƒ′]−A×E[ƒ].
The method of the present invention allows CAD software to work with mammography image data that has been obtained from any of a number of different imaging systems. Using this capability, images obtained from equipment of various manufacturers can be processed and used on a single CAD system, providing cost savings over alternative methods.
The invention has been described in detail with particular reference to certain preferred embodiments thereof, but it will be understood that variations and modifications can be effected within the scope of the invention as described above, and as noted in the appended claims, by a person of ordinary skill in the art without departing from the scope of the invention. For example, any of a number of methods could be used for deriving a background threshold value or minimizing or eliminating background noise. A number of statistical tests could be used to determine the need for features normalization.
What is provided is an apparatus and method for normalizing images obtained from different sources as a preparatory utility for computer-aided diagnosis.