METHOD FOR DISCRIMINATING BETWEEN MALIGNANT AND BENIGN TISSUE LESIONS

Abstract
An embodiment of the present invention includes a method for discriminating between benign and malignant tissue lesions. The method includes the steps of using a plurality of maps of physiology and morphology parameters generated from reflectance measurements and pure morphology parameters generated from reflectance measurements. The method also includes calculating entropies and cross entropies of the plurality of maps, and calculating a plurality of pure morphology parameters. Further, the method includes assigning a weight to each entropy and a weight to a logarithm of each entropy, a weight to each cross entropy and a weight to a logarithm of each cross entropy, and a weight to each pure morphology parameter and a weight to a logarithm of each pure morphoiogy parameter. The method further includes computing a diagnostic index, defining a cost function, defining a proper threshold value for a diagnostic index and solving an optimization problem to determine a set of weights from the assigned weights to maximize specificity for 100% sensitivity. Further, the method uses calculations, the cost function and the diagnostic index to determine whether the tissue lesion is benign or malignant.
Description
BACKGROUND

1. Technical Field


The present disclosure relates to a method for discriminating between different types of tissue lesions. In particular, the present disclosure is directed to a method for discriminating between malignant and benign tissue lesions.


2. Description of the Related Art


Malignant melanoma is one of the most rapidly increasing cancers in the world. In the United States alone, the estimated incidence for 2008 is 62,480, which leads to an estimated total of 8,420 deaths per year. Successful treatment of melanoma depends on early detection by clinicians with subsequent surgical removal of tumors. Visual detection has its limitations, even when augmented with dermoscopy, especially with less experienced users. Attempts have thus been made to develop automated devices to assist in the screening of pigmented skin lesions for likelihood of melanoma. Several of these devices have digitalized dermoscopy-related features analyzed by artificial neural networks or support vector machine learning systems.


SUMMARY

An embodiment of the present invention includes a method for discriminating between benign and malignant tissue lesions. The method includes the steps of using a plurality of maps of physiology and morphology parameters generated from reflectance measurements and pure morphology parameters generated from reflectance measurements. The method also includes calculating entropies and cross entropies of the plurality of maps, and calculating a plurality of pure morphology parameters. Further, the method includes assigning a weight to each entropy and a weight to a logarithm of each entropy, a weight to each cross entropy and a weight to a logarithm of each cross entropy, and a weight to each pure morphology parameter and a weight to a logarithm of each pure morphology parameter. The method further includes computing a diagnostic index, defining a cost function, defining a proper threshold value for a diagnostic index and solving an optimization problem to determine a set of weights from the assigned weights to maximize specificity for 100% sensitivity. Further, the method uses calculations, the cost function and the diagnostic index to determine whether the tissue lesion is benign or malignant.





BRIEF DESCRIPTION OF THE DRAWINGS

The objects and features of the present disclosure, which are believed to be novel, are set forth with particularity in the appended claims. The present disclosure, both as to its organization and manner of operation, together with further objectives and advantages, may be best understood by reference to the following description, taken in connection with the accompanying drawings as set forth below:



FIG. 1 is a flow chart illustrating a method in accordance with an embodiment of the present invention; and



FIG. 2 is a flow chart illustrating another method in accordance with an embodiment of the present invention.





DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS

The following detailed description refers to the accompanying drawings. The same reference numbers in different drawings may identify the same or similar elements. In addition, the following detailed description does not limit the present disclosure.


The present invention relates to a method for discriminating between different types of tissue lesions. In particular, the present invention is directed to a method for discriminating between malignant and benign tissue lesions. In the present invention, a novel melanoma detection method is taught. The method of the present invention uses morphologic and physiologic maps as well as pure morphological parameters generated from spectral reflectance images of melanocytic lesions, and which can be extended to discriminate between benign and malignant tissue lesions, in general.


The present invention uses maps of morphologic and physiologic tissue parameters as well as pure morphologic parameters, which are generated by the Optical Transfer Diagnosis (OTD) method, as described in the patent entitled “Method and an Arrangement for the Determination of the Optical Properties of Multi-Layered Tissue”, PCT WO 02/069792 A1.


The OTD device used records 30 spectral reflectance images of a lesion under examination. These 30 reflectance images, which constitute one image set, are recorded at 10 different wavelengths (365-1000 nm) from multiple angles of illumination and detection. One version of the OTD device is a spectral reflectance meter consisting of a measurement head with 12 fixed light-emitting diode (LED) lamps and 3 IEEE (Institute of Electrical and Electronics Engineers) 1394 FireWire cameras. Each LED is placed at a different angle relative to the skin to enhance the ability to retrieve information about the depth of the lesion. The polar angles of the LEDs vary between 30 and 45 degrees, and the relative azimuth angles between 34 and 145 degrees. The polar angles of the detectors vary between 0 and 45 degrees, and the relative azimuth angles between 0 and 180 degrees. In the current OTD practice, an alcohol-based gel is used as an interface between the measurement probe and the skin, and a selected area of the skin is illuminated and imaged through a 2.2 cm diameter circular sapphire plate. The imaging time is approximately 5 seconds. On the basis of established absorption and transmission spectra for known skin chromophores and mathematical modeling of skin reflectance, the images from each set are used to derive physiology-morphology maps of the lesions for the following 7 parameters (i) percentage of hemoglobin; (ii) percentage of hemoglobin oxygenation; (iii) upper epidermal thickness; (iv) lower epidermal thickness; (v) percentage of upper melanosome concentration; (vi) percentage of lower melanosome concentration; and (vii) percentage of keratin concentration. These parameters vary and are different for normal and malignant tissues.


From each physiology-morphology map, an entropy value is calculated and the cross entropy values are calculated among different pairs of 2 maps. For example, from the spatial distribution of the melanosome concentration, we compute the entropy of this parameter as the sum of the melanosome concentration multiplied by its logarithm and integrated over the area of the lesion. Simply stated, the entropy provides a measure of the disorder in any one of the maps, and the cross entropy provides a measure of the correlation between two different maps.


In the development of a robust diagnostic procedure, it is important to keep in mind that the final diagnosis to the largest degree possible should be independent of the following:

    • The robustness of the measurement probe
    • Clinical measurement errors due to: bad contact between the probe and skin, too high or too low pressure between the probe and the skin, or too much or too little gel or oil applied between the probe and the skin
    • Azimuth angle between the probe and the lesion (for asymmetric lesions)
    • Incorrect exposure time (which may be fixed independent of skin type)
    • Location of lesion in the picture frame
    • Skin type
    • Lesion location on the patient's body
    • Changes in the physiological state of the patient due to, for example, but not limited to, jumps in the blood pressure or blood oxygenation


Further, a final diagnosis should be insensitive to errors and assumptions made in the pre-processing of the data.


From a set of digital reflectance images of a lesion generated by the OTD device, the optical properties of a lesion can be determined by the methods of PCT WO 02/069792 A1, and maps of physiology-morphology parameters and pure morphology parameters, can be made in a number of ways, including by the method of U.S. Provisional Patent Application No. 61/037,503.


In an embodiment of the present invention, a method 100, as shown in FIG. 1, includes considering a set of physiology-morphology maps, as well as a number of pure morphology parameters, as given quantities. A diagnosis may be made using the following steps. At step 102, for each available measurement, process the measurements in order to obtain a diagnosis. This diagnosis is obtained by using a set of physiology-morphology maps as well as a set of pure morphology parameters derived from reflectance images of the lesion generated by the OTD device. At step 104, compare the diagnosis obtained in step 102 with the pathology results obtained from clinical data. Patients with suspicious lesions are referred for scanning by the OTD device. After scanning, lesions are biopsied for histopathologic examination by dermatopathologists, and a diagnosis is provided. At step 106, a cost function consisting of the following is defined: a master term (including pathology), constraints, regularization, and Occam's rule. The master term is specifically designed to discriminate between malignant and benign lesions. Constraints are used to take advantage of a priori information about the covariance of the measurements, while regularization is used to suppress variations in the measurements, which do not contribute to the correct diagnosis. Some of the physiology-morphology parameters and the pure morphology parameters may not contribute significantly to the diagnosis. Occam's rule is designed to exclude insignificant parameters. At step 108, an optimization procedure is used to derive optimal weights. The optimization is an iterative process in which the weights are adjusted until the number of false positives (diagnosing the lesion as malignant when it actually is benign) becomes as small as possible when at the same time the number of false negatives is zero (no malignant lesions missed by the diagnosis). At step 110, the quality of the optimal weights is analyzed by computing receiver operating characteristic (ROC) curves. A ROC curve is a graphical display of the sensitivity vs. specificity, where the sensitivity is the probability of correctly diagnosing a malignant lesion, and the specificity is the probability of correctly identifying a benign lesion. At step 112, a proper threshold value for the diagnostic index, D, is defined to differentiate between melanomas and non-melanomas. A proper threshold value refers to the desire to discriminate between benign and malignant lesions in such a way that the diagnostic index for malignant lesions is well enough separated from those of benign lesions that inevitable errors in the measurement procedure are properly taken into account. At step 114, the robustness of the diagnosis is analyzed and procedures for cross validation are developed. Given a set of data generated by the OTD device and a corresponding set of pathology results, the method may be used to find optimized weights for the entire data set. But the method may also be used to find optimized weights for a subset of the entire data set obtained by excluding some of the measurements. By creating several such subsets, the robustness of the method can be tested by assessing how well the method works on such subsets. This process is referred to as cross validation.


For the sake of clarity, details of the diagnostic procedure of the method 100 are further explained. In order to facilitate the solution of the global optimization problem involving our entire processing chain, which depends on a large number of processing parameters, measurements of the maps of the physiology-morphology parameters and the pure morphology parameters obtained using current processing codes should be considered. Further, the maps and pure morphology parameters will be noisy (irreproducible), reflecting both noisy initial digital images of the same lesion and noise arising from the current non-optimal processing.


The diagnostic procedure, when properly optimized, shall give a diagnosis that is insensitive to variations in the input parameters due to experimental factors, such as, but not limited to, camera-tissue interface pressure, lesion-camera angular orientation and positioning. Further, the factors may be dependent on data acquisition and processing. Properly optimized means that variations between different reflectance measurements of the same lesion and corresponding variations in the maps and pure morphology parameters automatically shall be suppressed during the processing going from the maps and pure morphology parameters to the diagnosis. After optimization of the weights, all maps and pure morphology parameters will be processed in an optimal manner.


In order to obtain an optimum diagnosis, another method is shown in FIG. 2. The following steps, method 200, are taken for each lesion measurement. At step 202, physiology and morphology maps are computed, including estimated uncertainties. At step 204, entropies and relative entropies are calculated for the probability density function associated with each of the maps. At step 206, pure morphology parameters are calculated from the compressed nadir images for the green and the Near Infrared (NIR) channel. At step 208, weights are assigned to each of the entropies and cross entropies and their logarithms and to each of the pure morphology parameters and their logarithms. For example, there may be 7 physiology and morphology maps, and 10 pure morphology parameters. Therefore, there would be 28 parameters for the entropies and cross entropies, 28 parameters for their logarithms and 10 parameters for the pure morphology parameters and 10 parameters for their logarithms. In this example, 76 weights, wi, would be assigned. At step 210, a diagnostic index, D, is computed associated with the diagnosis of melanoma in the following manner: D=wTp, where w is a column vector (the superscript T denotes transpose) consisting of 76 weights wi (to be optimized as explained below), p is a column vector of 76 elements or input parameters consisting of 28 entropies and cross entropies and their 28 logarithms together with 10 pure morphology parameters and their 10 logarithms, and D is a value of the diagnostic index computed for every lesion being examined.


In step 212, a cost function is determined. The cost function, as discussed in method 100, may be defined by a general expression as the following:






J(w)=J0(w)+α1J1(w)+α2J2(w)+α3J3(w),


where α1, α2 and α3 are coefficients to be determined, and J0(w) is called the master term of the cost function, J1(w) represents constraints of the cost function, J2(w) represents self-adaptive regularization of the cost function, and J3(w) is called Occam's rule.


For exemplary purposes, an example of how the diagnostic procedure may be implemented is shown using a special case of discrimination between benign pigmented lesions and malignant melanomas. The total set of obtained measurements under consideration is divided into two subsets. One subset consists of all non-melanomas, and the other subset consists of all melanomas, the total number of the latter being N. For any chosen weight vector w, the distribution of the diagnostic indices is calculated for the set of non-melanomas being considered. The average value of this distribution is μ(w), and the standard deviation squared is σ2(w). The master term of the cost function may be given by:








J
0



(
w
)


=


-

1
N







n
=
1

N











D
n
M



(
w
)


-

μ


(
w
)




σ


(
w
)





[

1
+

tanh


(


-
β

·




D
n
M



(
w
)


-

μ


(
w
)




σ


(
w
)




)



]








where DnM(w) is the diagnostic index for melanoma number n as determined by pathology, and β is a penalizing factor, which must be positive. Here, the difference DnM(w)−μ(w) should be positive and as large as possible in order to minimize the value of the master term J0(w). The master term of the cost function is designed to be quadratic around its minimum value so that it works well with standard optimization methods based on quadratic forms. It is also designed to penalize low DnM(w) values. Thus, a master term of the cost function is created that pushes all differences DnM(w)−μ(w) for melanomas towards higher values, while pushing corresponding differences for non-melanomas towards lower values.


Multiplication of the weight vector by any positive constant, will not change the distribution of the diagnostic indices. Thus, the length of w may be constrained by the second term in the following equation:






J(w)=J0(w)+α1(w*w−w0*w0)22J2(w)+α3J3(w),


where w0 is the initial guess for the weight vector. The second term in the equation above constrains the end points of the weight vectors w to lie on the surface of a hypersphere in 76 (per the earlier example) dimensions of radius w0. If the end point of the weight vector w lies outside or inside the surface of the hypersphere, its length will be longer or shorter than the length of the vector w0, which is the radius of hypersphere. The constraint term α1(w*w−w0*w0)2 will be non-negative, and will increase the further away from the surface of the hypersphere the end point of the weight vector w lies. Thus, by adding the constraint term with positive value of the coefficient α1 to the cost function, which is to be minimized, the optimization routine is constrained to look for solutions w that have end points close to the surface of the hypersphere of radius |w0|.


Probabilistic (soft) constraints on the weight vector w will be required to get a well-posed optimization problem. Such constraints should include a priori information about the covariance. A self-adaptive regularization, which exploits the use of a specific function associated with Fisher's information operator may be used. This kind of regularization allows for automatic suppression of variations in those elements of w, which do not contribute to the correct diagnosis.


It is possible that some of the 28 physiology-morphology parameters and the 10 pure morphology parameters may not be vital for the diagnosis. In order to exclude insignificant parameters, the fourth term J3(w) in the cost function equation is included. Thus, the fourth term of the cost function in terms of Shannon's entropy is:






J
3(w)=−s(wln s(w)


where s(w) is the probability density vector function of the weight vector w, with components given by








s
i

=




w
i



/




i
=
1

76









w
i






;










i
=
1

76



s
i


=
1




and where ln s(w) is a vector with components (ln s1, ln s2, . . . , ln s76).


Note that J3(w) is non-negative, and that its smallest value of zero occurs if all weights are zero, except for one of them. Since the goal is to minimize the cost function, the Shannon entropy should be as small as possible, i.e. as many weights as possible should be zero, so that the corresponding physiology/morphology or pure morphology parameters from the input parameter vector p can be excluded. For this reason the term involving J3(w) is called Occam's rule.


Occam's rule may not be sufficient in order to exclude insignificant parameters. Therefore, the 76-element weight vector may be a sum of the initial weight vector w0 and a superposition of certain basis vectors vj with coefficients αj:






w=w
0
+V a


where V is a 76×L matrix, composed of L<76 basis vectors (column vectors: v1, v2, v3, . . . , vL), each having 76 components, and a is a column vector with L components αj(j=1,2, . . . , L). Further, the number L is determined through the use of an information operator. Also, the basis vectors v1, v2, v3, . . . , vL, and the coefficients αj of interest may be determined. An information operator H0* is defined as follows:






H
0
*=H
0
C
{dot over (U)}
−1
C
p
C
p
C
{dot over (U)}
−1
H
0,


where H0 is the Hessian matrix associated with the master term of the cost function, i.e.








H
0



(
w
)


=



[




2




w
2






J
0



(
w
)



]


w
0


.





Here w0 is the initial value of w. Cp is the covariance matrix of the input parameter vectors p for all measurements:






C
p=custom-character(p−custom-characterpcustom-character)(p−custom-characterpcustom-character)Tcustom-character


and C{dot over (U)} is the covariance matrix of the measurement errors:






C
{dot over (U)}
=C
{dot over (U)}
*+kI


where I is the unity matrix, k is a regularization factor to ensure the invertibility of C{dot over (U)}, and C{dot over (U)}* is given by






C
{dot over (U)}*=custom-charactercustom-character(plesioncustom-characterplesioncustom-character)(plesioncustom-characterplesioncustom-character)Tcustom-charactercustom-character=custom-character(Cp)lesioncustom-character


where (Cp)lesion is the covariance matrix for all measurements performed on one lesion, and where the final averaging is over all lesions.


Note that the information operator contains all pathological information as well as information about measurement uncertainties and the ranges of variation of the input parameters. The expression for the information operator can be interpreted as follows. The larger the value of H0*, the smaller the measurement errors (represented by C{dot over (U)}), the wider the range of the input parameters (represented by Cp), the more information we can obtain for diagnostic purposes.


Next, the eigenvalue problem for the information operator





H0*v(0)(0)v(0)


where v(0) is an eigenvector that depends on the initial weight vector w0, and λ(0) is the eigenvalue. However, the tangential projection e of the eigenvector is






e=(I−w0w0T)v(0)


where w0 is the initial weight vector. The vectors e, which are not orthonormal, but which satisfy the constraint that the end point of the weight vector should lie on the hypersphere, will be used as basis vectors to represent w.


Next, the set of eigenvectors is ordered according to the magnitude of the product li of the eigenvalue λi the length squared of the tangential projection ei (li=|λi||ei|2), and then only those tangential projections ei of the eigenvector are considered that satisfy the inequality






l
i=|λi||ei|2>α{(|λ||e|)2}max


where α is a threshold value to be determined that is positive and less than a certain pre-selected value, currently set to 0.01. The larger the value of li, the larger the information content associated with the corresponding basis vector ei.


Now the weight vector is defined by replacing V by E in the previous expression for w:






w=w
0
+E a


where E is a 76×L matrix, composed of L basis vectors (column vectors: e1, e2, e3, . . . , eL), each having 76 components, and a is a column vector with L components, the number L being the maximum value of i. The vector a will be determined by the optimization.


Using the representation above for w, we redefine the cost function as follows:













J
*



(
a
)


=



J
0
*



(
a
)


+


α
1




J
1
*



(
a
)



+


α
2



?


+


α
3




J
3
*



(
a
)












?



indicates text missing or illegible when filed





where






J*(a)=J(w)=J(w0+E a); Jn*(a)=Jn(w)=Jn(w0+E a); (n=0,1,2,3).


The regularization term a·(Ra) contains the diagonal matrix R with elements (l1−1, l2−1, . . . , lL−1), where li=|λi||ei|2, and L is the maximum value of i.


The regularization term is constructed such that the smaller the information content in a certain direction, the shorter the step taken in that direction.


In order to investigate the accuracy and robustness of the method, an example is to apply the method to a clinical data set that had been performed on pigmented lesions on 100 different patients. The total number of lesions was 125, and three OTD measurements (each consisting of 30 images) were taken of each lesion, making the total number of measurements 125×3=375. But some of the measurements were discarded because of measurement errors, reducing the total number of useful measurements to 342.


A receiver operating characteristic (ROC) curve (as defined here) is a graphical plot of sensitivity vs. specificity. By applying the method of the present invention to the clinical data set described above, the sensitivity was found to be 1 (i.e. 100%) for any specificity below 0.914 (i.e. 91.4%). The area underneath the ROC curve should be equal to 1 in order to have a sensitivity of 100% for a specificity of 100%. The application to clinical data just mentioned showed that a reduction of the value of the master term of the cost function is highly correlated to an increase in the area underneath the ROC curve.


The optimization may be performed in accordance with the described procedure. The optimization is an iterative process, where the optimized weight vector from a given iteration is used as the input weight vector to the next iteration. For example, a total of 8 iterations may be used for each optimization, and the weights for each of the iterations may be stored. The optimization may be performed on any subset. In this example, the criterion used for accepting a weight vector was that it should give a specificity value larger than 90% at 100% sensitivity for all subsets.


The described method may be used to find optimized weights for a few different subsets of the entire data set. The subsets were created by starting from the entire set of all clinical measurements (342) and excluding some of the measurements. One subset was created by dividing the total set into three equal parts, each consisting of every third measurement, and then excluding one third of the total subset, so that the remaining subset contained two thirds of the entire data set. In the current data set there are a total of eleven melanomas. Nine different subsets were created by excluding all measurements that had been performed on one of the eleven lesions being a melanoma according to the pathology report.


Covariance matrices are used in the information operator and Occam's rule is used in the cost function, in order to suppress insignificant weights (and corresponding input parameters). In the clinical data described above, several weights were found to be very small and were set equal to zero. The new weight vector with fewer non-zero weights was tested for acceptance. With a correct selection of insignificant weights, the specificity at 100% sensitivity for all subsets did not change noticeably. With the present invention, weights optimized using several different subsets were found to be acceptable with as many as 26 of the 76 weights set equal to zero.


No element, act, or instruction used in the present disclosure should be construed as critical or essential unless explicitly described as such. In addition, as used herein, the article “a” is intended to include one or more items. Where only one item is intended, the term “one” or similar language is used.


It will be understood that various modifications may be made to the embodiments disclosed herein. Therefore, the above description should not be construed as limiting, but merely as exemplifications of the various embodiments of the present disclosure. Those skilled in the art will envision other modifications within the scope and spirit of the claims appended hereto.

Claims
  • 1. A method for discriminating between benign and malignant tissue lesions, the method comprising the steps of: using a plurality of maps of physiology and morphology parameters generated from reflectance measurements;using a plurality of pure morphology parameters generated from reflectance measurements;calculating entropies and cross entropies of said plurality of maps;calculating a plurality of pure morphology parameters;assigning a weight to each said entropy and a weight to a logarithm of each said entropy;assigning a weight to each said cross entropy and a weight to a logarithm of each said cross entropy;assigning a weight to each said pure morphology parameter and a weight to a logarithm of each said pure morphology parameter;computing a diagnostic index;defining a cost function;defining a proper threshold value for a diagnostic index;solving an optimization problem to determine a set of weights from said assigned weights to maximize specificity for 100% sensitivity; andusing calculations, said cost function and the diagnostic index to determine whether said tissue lesion is benign or malignant.
  • 2. The method of claim 1, wherein the diagnostic index is computed by using said entropies, cross entropies, said pure morphology parameters and said plurality of logarithms.
  • 3. The method of claim 1, wherein the cost function includes a master term with pathology information from a plurality of investigated lesions.
  • 4. The method of claim 1, wherein the proper threshold value for said diagnostic index is used to differentiate between benign and malignant tissue lesions.
  • 5. The method of claim 1, wherein the method is used to discriminate between benign pigmented lesions and malignant melanoma.
  • 6. The method of claim 1, wherein the method is used to discriminate between benign tissue and basal cell carcinoma.
  • 7. The method of claim 1, wherein the method is used to discriminate between benign tissue and squamous cell carcinoma.
  • 8. The method of claim 1, wherein the method is used for beauty care.
  • 9. The method of claim 1, wherein the method is used for forensic medicine.
  • 10. The method of claim 1, wherein the method is used to monitor efficacies of different kinds of treatment.
CROSS-REFERENCE TO RELATED APPLICATIONS

This patent application claims priority to U.S. Provisional Application Ser. No. 61/023,242, filed in the U.S. Patent and Trademark Office on Jan. 24, 2008 by Jakob J. Stamnes and Knut Stamnes, and U.S. Provisional Application Ser. No. 61/037,503, filed in the U.S. Patent and Trademark Office on Mar. 18, 2008 by Jakob J. Stamnes and Knut Stamnes, and is a continuation-in-part from U.S. patent application Ser. No. 10/471,111, filed in the U.S. Patent and Trademark Office on Oct. 23, 2003, by Jakob J. Stamnes and Knut Stamnes, the entire contents of these applications being incorporated herein by reference in their entirety.

PCT Information
Filing Document Filing Date Country Kind 371c Date
PCT/US09/31969 1/26/2009 WO 00 11/15/2010
Provisional Applications (2)
Number Date Country
61023242 Jan 2008 US
61037503 Mar 2008 US