Embodiments of the subject matter disclosed herein relate to medical imaging, and more particularly to breast cancer detection using medical imaging.
Breast cancer is the most common cancer in women worldwide, affecting an estimated 1.5 million women around the world each year. It is also a leading cause of cancer-related death in women. Early detection of breast cancer can reduce mortality and the intensity of treatment required. Among the breast cancer subtypes, invasive Triple negative breast cancer (TNBC) (which is stage 1+) is the most aggressive and heterogeneous breast cancer subtype and accounts for 10-20% of newly diagnosed early breast cancers. The lack of hormone receptors and human epidermal growth factor receptor 2 (HER2) prevent TNBCs from being treated with therapies against these targets. Recurrences of TNBC occurs in about 25% of patients and is observed within the first few years after diagnosis. Early detection of recurrence from routinely collected mammograms would allow early intervention and a better treatment procedure.
In many cases, the cancer tumor in a breast is detected by a medical imaging procedure such as a Mammography. In digital mammography, a scout or pre-shot image may be taken of a patient to determine an x-ray technique (e.g., x-ray tube current and voltage, exposure time) to acquire images of the patient having a sufficient brightness. Upon determination of the x-ray technique, one or more x-ray images of the patient may be acquired. In some examples, multiple x-ray images may be acquired at different view angles and/or at different energy levels.
Although, an existing cancer tumor may be detected using the mammography technique, predicting recurrence in TNBC is difficult from routinely collected clinical data including biopsy samples, clinical information, and mammograms. Therefore, there is a need for an improved system and method to determine recurrence for triple negative breast cancer patients.
In addition, ductal carcinoma in-situ (DCIS) is an early stage 0 breast cancer and accounts for 20% of newly diagnosed breast cancer cases. Diagnostically, DCIS needs to be distinguished from other intraductal proliferations (such as usual ductal hyperplasia and atypical hyperplasia) as well as normal ducts and lobules. Local recurrences and/or progression to invasive cancer is a major concern. This results in patients being treated with surgery (e.g., lumpectomy or mastectomy) without or without additional radiotherapy. System anti-estrogenic therapy in the form of tamoxifen is also often recommended. Recurrence and/or progression is observed in 30% of patients with surgery alone. This can be reduced to approximately 10% with the addition of radiotherapy and endocrine therapy.
DCIS patients have an excellent overall survival rate and over-treatment is always a cause for concern due to potential side-effects. Standard clinicopathological factors (e.g. age, growth pattern, tumor size, margin status, grade, etc.) have been shown to have limited value in predicting recurrence and segregation of high and low risk patients. In addition, current tests to determine risk stratification of patients require a significant amount of material for genomic analysis (e.g., Oncotype Dx). However, getting enough sample is difficult since DCIS is often small, thus, it is difficult to perform genomic analysis on DCIS. Therefore, there is a need for early and accurate recurrence prediction to facilitate a more aggressive treatment policy for high-risk patients (e.g., mastectomy or adjuvant radiation therapy), while simultaneously reducing over-treatment of low-risk patients.
In accordance with an embodiment of the present technique, a method for determining a recurrence of a disease in a patient is presented. The method includes generating a plurality of medical images of an organ of the patient and determining a plurality of recurrence probabilities from the plurality of medical images. The method further includes determining a recurrence of the disease based on the plurality of recurrence probabilities and clinicopathological data of the patient using a Bayesian network.
In accordance with another embodiment of the present technique, a system including a memory, a display device and a processor communicable coupled to the memory is presented. The processor is configured to generate a plurality of medical images of an organ of the patient and determine a plurality of recurrence probabilities from the plurality of medical images. The processor is further configured to determine a recurrence of the disease based on the plurality of recurrence probabilities and clinicopathological data of the patient using a Bayesian network.
In accordance with yet another embodiment of the present technique, a method for determining a recurrence of a disease in a patient is presented. The method includes generating Hematoxylin and Eosin (H&E) biopsy sample images of the patient and extracting fixed image patches of predefined pixels from the H&E biopsy sample images. The method further includes automatically mapping each of the fixed image patches to an initial latent space and refining the latent space by encoding patches generated by a generative adversarial network (GAN) model that captures features of aggressive cancers. Finally, the method includes using a deep learning (DL) network to predict the first recurrence probability based on the refined latent space.
In accordance with a further embodiment of the present technique, a method for determining a recurrence of ductal carcinoma in situ (DCIS) in a patient is presented. The method includes generating a plurality of medical images of breast tissue of the patient and determining a plurality of recurrence probabilities from the plurality of medical images. The method further includes determining a recurrence of the DCIS based on the plurality of recurrence probabilities and clinicopathological data of the patient using a Bayesian network.
In accordance with a still further embodiment of the present technique, a system including a memory, a display device and a processor communicable coupled to the memory is presented. The processor is configured to generate Hematoxylin and Eosin (H&E) biopsy sample images of breast tissue of the patient and to extract fixed image patches of predefined pixels from the H&E biopsy sample images. The processor is also configured to automatically map each of the fixed image patches to an initial latent space and refine the latent space by encoding patches generated by a generative adversarial network (GAN) model that captures features of aggressive ductal carcinoma in situ (DCIS). Finally, the processor is further configured to use a deep learning (DL) network to predict a recurrence probability of the DCIS based on the refined latent space.
In accordance with an even further embodiment of the present technique, a method for determining a recurrence of ductal carcinoma in situ (DCIS) in a patient is presented. The method includes generating Hematoxylin and Eosin (H&E) biopsy sample images of breast tissue of the patient and extracting fixed image patches of predefined pixels from the H&E biopsy sample images. The method further includes automatically mapping each of the fixed image patches to an initial latent space and refining the latent space by encoding patches generated by a generative adversarial network (GAN) model that captures features of aggressive DCIS. Finally, the method includes using a deep learning (DL) network to predict a recurrence probability of the DCIS based on the refined latent space.
It should be understood that the brief description above is provided to introduce in simplified form a selection of concepts that are further described in the detailed description. It is not meant to identify key or essential features of the claimed subject matter, the scope of which is defined uniquely by the claims that follow the detailed description. Furthermore, the claimed subject matter is not limited to implementations that solve any disadvantages noted above or in any part of this disclosure.
The present disclosure will be better understood from reading the following description of non-limiting embodiments, with reference to the attached drawings, wherein below:
One or more specific embodiments of the present disclosure are described below. These described embodiments are only examples of the systems and methods for locally enhancing a medical image. The skilled artisan will understand that specific details described in the embodiments can be modified when being placed into practice without deviating from the spirit of the present disclosure.
As used herein, an element or step recited in the singular and proceeded with the word “a” or “an” should be understood as not excluding plural of said elements or steps, unless such exclusion is explicitly stated. Moreover, the terms “first,” “second,” and “third,” etc. are used merely as labels, and are not intended to impose numerical requirements or a particular positional order on their objects. Furthermore, unless explicitly stated to the contrary, embodiments “comprising,” “including,” or “having” an element or a plurality of elements having a particular property may include additional such elements not having that property. The terms “including” and “in which” are used as the plain-language equivalents of the respective terms “comprising” and “wherein.” As the terms “connected to,” “coupled to,” etc. are used herein, one object (i.e., a material, element, structure, number, etc.) can be connected to or coupled to another object regardless of whether the one object is directly connected or coupled to the other object or whether there are one or more intervening objects between the one object and the other object. In addition, it should be understood that references to “one embodiment” or “an embodiment” of the present disclosure are not intended to be interpreted as excluding the existence of additional embodiments that also incorporate the recited features.
The malignancy of breast cancer is unclear and no dominant cause has emerged; however, early detection and treatment may generate a good prognosis for the patient. In recent years, radiomics and deep learning (DL) models have made substantial progress in improving the accuracy and throughput of pathology workflow for diagnostic and prognostic prediction. These DL models are often combined with clinical data for disease risk stratification and prognosis prediction. However, fusion of information coming from different source of data is difficult. Often features from different data sources including imaging and clinical records are concatenated before prognosis prediction. Often such prediction models are not clinically interpretable and suffers from data scaling issues as features are combined from different data sources.
As explained earlier, ductal carcinoma in situ (DCIS) or stage 0 account for a significant percentage of newly diagnosed cancer cases (approximately 20-25%). Also, as explained earlier, among the breast cancer subtypes, invasive Triple negative breast cancer (TNBC) (e.g., stage 1+) is the most aggressive and heterogeneous breast cancer subtype. In the present technique, a TNBC or a DCIS recurrence prediction model is used that combines information from different data sources including mammogram, histopathology images and clinic-pathological variables to predict prognosis. In contrast to directly combining features for prediction as in conventional methods, in the present technique, the recurrence probabilities from different sources of data are used for prognosis prediction. For example, the recurrence probabilities are combined in a Bayesian learning framework to create a clinically explainable artificial intelligence (AI) model.
The Pathomics model 102 is used to detect a first recurrence probability of the disease from whole slide Hematoxylin and Eosin (H&E) biopsy samples of the patient. In one embodiment, the model 102 extracts fixed image patches of predefined pixels (e.g., 224×224 or 256×256) from the whole slide H&E biopsy samples. The pathomics model further includes a pre-trained deep learning encoder to automatically map each of the fixed image patches to an initial latent space. The initial Latent space is further refined by encoding patches generated by a generative adversarial network (GAN) model that captures features of aggressive cancers. Such refinement of latent space helps identify representative features that are used in a neural-network architecture to predict the first recurrence probability as will be explained in more detail subsequently.
The radiomics model 104 detects a second recurrence probability of the disease from routine mammogram of the patient. The radiomics model 104 may detect irregular infiltration of a surrounding normal tissue around a tumor region in the form greater heterogeneity in intensity distribution by radiomic methods. For example, a plurality of radiomics features is extracted from invasive edge of breast tumors as observed in routine mammograms. The plurality of radiomics features is then analyzed to determine the second recurrence probability. The details of determining the second recurrence probability from the plurality of radiomics features has been explained in co-pending patent application filed on Mar. 25, 2022, Ser. No. 17/704,531, which is incorporated herein by reference in its entirety.
The disease prediction model 110 may further receive other plurality of disease recurrence probabilities from various other data sources such as in situ imaging model for example transcriptomics or immunohistochemistry model 106 on a small set of tissue images of the patient to determine the disease recurrence probability. The disease prediction model 110 may further receive clinicopathological data 108 related to the patient. The clinicopathological data 108 related of the patient may include age and size of a tumor and/or Lymph node positivity for example. The disease prediction model 110 will be explained in more detail subsequently.
Referring back to
Referring back to
Further, the Bayesian network 600 clearly shows conditional dependencies between a disease grade 603 and laterality 605 and between lymph node 606, patient age 602 and laterality 605. Moreover, there is another conditional dependency between lymph node 606, size 604 and node type 607. Based on the Bayesian network 600, the clinician can easily interpret that grade and size of the tumor are playing major part in the calculated disease recurrence probability 614.
At step 704, the method includes determining a plurality of recurrence probabilities of the disease based on the plurality of medical images. As explained above a first recurrence probability of a disease may be determined by a pathomics model, a second recurrence probability may be determined by a radiomics model and third recurrence probability may be determined by in situ imaging model. As described earlier, determining the first recurrence probability includes extracting fixed image patches of predefined pixels (e.g., 224×224 or 256×256) from the whole slide H&E biopsy samples and automatically mapping each of the fixed image patches to an initial latent space. The initial Latent space is further refined by encoding patches generated by a generative adversarial network (GAN) model that captures features of aggressive cancers. Such refinement of latent space helps in identifying representative features that are used in a neural-network architecture to predict the first recurrence probability. For example, the latent space may be utilized to help in automatically identifying lesions as ductal carcinoma in situ (DCIS) as opposed to benign intraductal proliferations. In addition, if the lesion is DCIS, it may be stratified based on the risk of recurrence and/or progression to invasive cancer as determined by the neural-network architecture. The second recurrence probability is determined from routine mammogram of the patient. In one embodiment, the method includes detecting irregular infiltration of a surrounding normal tissue around a tumor region in the form greater heterogeneity in intensity distribution using radiomic methods. For example, a plurality of radiomics features is extracted from invasive edge of breast tumors as observed in routine mammograms. The plurality of radiomics features is then analyzed to determine the second recurrence probability.
The third disease recurrence probability may be determined based on in situ imaging on a small set of tissue images of the patient. Finally, at step 706, the method includes using a Bayesian network for determining a recurrence of the disease in the patient based on the plurality of recurrence probabilities and clinicopathological data of the patient. The clinicopathological data of the patient may include age and size of a tumor and/or lymph node positivity, for example. The plurality of recurrence probabilities and clinicopathological data values represent nodes in the Bayesian network. Thus, the Bayesian network determines the final disease recurrence, depending on all the node probability values and conditional probabilities between the nodes.
As an example, the system 100 in
The DL classification model trained with both TMA (H&E) images and GAN generated patches predicted recurrence with a mean area under curve (AUC) of 0.87, a sensitivity of 0.83, and a specificity of 0.91 in the validation set. The DL classification model trained from TMA (H&E) images only predicted recurrence with an AUC of 0.81. The utilization of the GAN model to generate H&E images circumvents the need for a large cohort and accurate labor-intensive manual annotation of histopathological images, which is often required for training a deep learning classification model. The use of GAN generate aggressive image patches during training significantly improves recurrence prediction accuracy of the deep learning classification model.
The radiomics model 104, pathomics model 102, in situ imaging model 106, disease prediction model 110 and some of the steps of method 700 described herein may be implemented on any suitable computing device 800 and software implemented therein.
Moreover, in the exemplary embodiment, computing device 800 includes a display interface 817 that presents information, such as input events and/or validation results, to the user. The display interface 817 may also include a display adapter 808 that is coupled to at least one display device 810. More specifically, in the exemplary embodiment, the display device 810 may be a visual display device, such as a cathode ray tube (CRT), a liquid crystal display (LCD), a light-emitting diode (LED) display, and/or an “electronic ink” display. Alternatively, the display interface 817 may include an audio output device (e.g., an audio adapter and/or a speaker) and/or a printer.
The computing device 800 also includes a processor 814 and a memory device 818. The processor 814 is coupled to the user interface 804, the display interface 817, and the memory device 818 via a system bus 820. In the exemplary embodiment, the processor 814 communicates with the user, such as by prompting the user via the display interface 817 and/or by receiving user inputs via the user interface 804. The term “processor” refers generally to any programmable system including systems and microcontrollers, reduced instruction set computers (RISC), complex instruction set computers (CISC), application specific integrated circuits (ASIC), programmable logic circuits (PLC), and any other circuit or processor capable of executing the functions described herein. The above examples are exemplary only, and thus are not intended to limit in any way the definition and/or meaning of the term “processor.”
In the exemplary embodiment, the memory device 818 includes one or more devices that enable information, such as executable instructions and/or other data, to be stored and retrieved. Moreover, the memory device 818 includes one or more computer readable media, such as, without limitation, dynamic random access memory (DRAM), static random access memory (SRAM), a solid state disk, and/or a hard disk. In the exemplary embodiment, the memory device 818 stores, without limitation, application source code, application object code, configuration data, additional input events, application states, assertion statements, validation results, and/or any other type of data. The computing device 800, in the exemplary embodiment, may also include a communication interface 830 that is coupled to the processor 814 via the system bus 820. Moreover, the communication interface 830 is communicatively coupled to data acquisition devices.
In the exemplary embodiment, the processor 814 may be programmed by encoding an operation using one or more executable instructions and providing the executable instructions in the memory device 818. In the exemplary embodiment, the processor 814 is programmed to select a plurality of measurements that are received from data acquisition devices.
In operation, a computer executes computer-executable instructions embodied in one or more computer-executable components stored on one or more computer-readable media to implement aspects of the invention described and/or illustrated herein. The order of execution or performance of the operations in embodiments of the invention illustrated and described herein is not essential, unless otherwise specified. That is, the operations may be performed in any order, unless otherwise specified, and embodiments of the invention may include additional or fewer operations than those disclosed herein. For example, it is contemplated that executing or performing a particular operation before, contemporaneously with, or after another operation is within the scope of aspects of the invention.
One of the advantages of the present technique is that it can predict TNBC and/or DCIS recurrence from routine mammograms, biopsy sample and clinic-pathological variables. The technique be also used for cancer risk stratification for other cancers including prostate, lung and glioblastoma.
It is to be understood that the above description is intended to be illustrative, and not restrictive. For example, the above-described embodiments (and/or aspects thereof) may be used in combination with each other. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the subject matter set forth herein without departing from its scope. While the dimensions and types of materials described herein are intended to define the parameters of the disclosed subject matter, they are by no means limiting and are exemplary embodiments. Many other embodiments will be apparent to those of skill in the art upon reviewing the above description. The scope of the subject matter described herein should, therefore, be determined with reference to the appended claims, along with the full scope of equivalents to which such claims are entitled. In the appended claims, the terms “including” and “in which” are used as the plain-English equivalents of the respective terms “comprising” and “wherein.” Further, the limitations of the following claims are not written in means-plus-function format and are not intended to be interpreted based on 35 U.S.C. § 112(f), unless and until such claim limitations expressly use the phrase “means for” followed by a statement of function void of further structure.
This written description uses examples to disclose the invention, including the best mode, and also to enable a person of ordinary skill in the relevant art to practice the invention, including making and using any devices or systems and performing any incorporated methods. The patentable scope of the invention is defined by the claims, and may include other examples that occur to those of ordinary skill in the art. Such other examples are intended to be within the scope of the claims if they have structural elements that do not differ from the literal language of the claims, or if they include equivalent structural elements with insubstantial differences from the literal languages of the claims.
This application is a continuation-in-part Application of U.S. patent application Ser. No. 17/710,326, entitled “SYSTEM AND METHOD FOR DETECTING RECURRENCE OF A DISEASE”, filed Mar. 31, 2022, which is herein incorporated.
Number | Date | Country | |
---|---|---|---|
Parent | 17710326 | Mar 2022 | US |
Child | 17981913 | US |